* 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
@ 2007-07-31 21:44 Zan Lynx
2007-07-31 22:02 ` Randy Dunlap
0 siblings, 1 reply; 11+ messages in thread
From: Zan Lynx @ 2007-07-31 21:44 UTC (permalink / raw)
To: Linux Kernel
[-- Attachment #1: Type: text/plain, Size: 1467 bytes --]
I was playing with huge pages and libhugetlbfs. Small programs like
"ls" work fine. I tried running Evolution through libhugetlbfs and the
system slowly stops running. One interesting thing is the "ps" command,
it gets stuck like this:
ps D ffff81001e57ed40 0 103558 103483
ffff81001f061dc8 0000000000000096 ffff81003d8586e8 ffff81001cbadc00
0000000000000006 ffffffff80537009 0000000000000030 ffffffff807ff700
ffffffff807ff700 ffffffff807ff700 ffffffff807ff700 ffffffff807ff700
Call Trace:
[<ffffffff80537009>] _spin_unlock+0x29/0x50
[<ffffffff80536425>] __down_read+0x75/0xaf
[<ffffffff80295f49>] access_process_vm+0x49/0x190
[<ffffffff802f3003>] proc_pid_cmdline+0xa3/0x130
[<ffffffff802f4cea>] proc_info_read+0xba/0x100
[<ffffffff802b0085>] vfs_read+0xc5/0x180
[<ffffffff802b0583>] sys_read+0x53/0x90
[<ffffffff8020c1de>] system_call+0x7e/0x83
and nothing will touch it after that.
Here's my kernel command line:
root=/dev/sda2 rootfstype=reiser4 rootflags=no_write_barrier ro
i8042.nomux elevator=cfq resume=/dev/sda3 panic=5 nmi_watchdog=2,panic
debug hugepages=32
Here's the "huge" script I was using to run programs:
#!/bin/sh
export LD_PRELOAD=libhugetlbfs.so
export HUGETLB_MORECORE=yes
export HUGETLB_PATH=/mnt/huge
export HUGETLB_VERBOSE=1
exec "$@"
I don't have any more info than that at the moment but I could reproduce
it with whatever, on request.
--
Zan Lynx <zlynx@acm.org>
[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-07-31 22:02 ` Randy Dunlap
@ 2007-07-31 22:01 ` Zan Lynx
2007-08-01 15:47 ` Nish Aravamudan
` (2 more replies)
2007-08-01 15:45 ` Nish Aravamudan
1 sibling, 3 replies; 11+ messages in thread
From: Zan Lynx @ 2007-07-31 22:01 UTC (permalink / raw)
To: Randy Dunlap; +Cc: Linux Kernel
[-- Attachment #1: Type: text/plain, Size: 675 bytes --]
On Tue, 2007-07-31 at 15:02 -0700, Randy Dunlap wrote:
> On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
>
> > I was playing with huge pages and libhugetlbfs. Small programs like
> > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > system slowly stops running. One interesting thing is the "ps" command,
> > it gets stuck like this:
>
> Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
D'oh! I mean 2.6.23-rc1-mm1, the 22 was a typo. Cut & paste to be
sure:
Linux zephyr 2.6.23-rc1-mm1 #1 SMP PREEMPT Wed Jul 25 17:33:04 MDT 2007
x86_64 AMD Athlon(tm) 64 Processor 3400+ AuthenticAMD GNU/Linux
--
Zan Lynx <zlynx@acm.org>
[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-07-31 21:44 2.6.22-rc1-mm1 huge pages VM freeze (maybe?) Zan Lynx
@ 2007-07-31 22:02 ` Randy Dunlap
2007-07-31 22:01 ` Zan Lynx
2007-08-01 15:45 ` Nish Aravamudan
0 siblings, 2 replies; 11+ messages in thread
From: Randy Dunlap @ 2007-07-31 22:02 UTC (permalink / raw)
To: Zan Lynx; +Cc: Linux Kernel
On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> I was playing with huge pages and libhugetlbfs. Small programs like
> "ls" work fine. I tried running Evolution through libhugetlbfs and the
> system slowly stops running. One interesting thing is the "ps" command,
> it gets stuck like this:
Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
There was a hugepage problem fixed very recently, in 2.6.23-rc1 IIRC.
> ps D ffff81001e57ed40 0 103558 103483
> ffff81001f061dc8 0000000000000096 ffff81003d8586e8 ffff81001cbadc00
> 0000000000000006 ffffffff80537009 0000000000000030 ffffffff807ff700
> ffffffff807ff700 ffffffff807ff700 ffffffff807ff700 ffffffff807ff700
> Call Trace:
> [<ffffffff80537009>] _spin_unlock+0x29/0x50
> [<ffffffff80536425>] __down_read+0x75/0xaf
> [<ffffffff80295f49>] access_process_vm+0x49/0x190
> [<ffffffff802f3003>] proc_pid_cmdline+0xa3/0x130
> [<ffffffff802f4cea>] proc_info_read+0xba/0x100
> [<ffffffff802b0085>] vfs_read+0xc5/0x180
> [<ffffffff802b0583>] sys_read+0x53/0x90
> [<ffffffff8020c1de>] system_call+0x7e/0x83
>
> and nothing will touch it after that.
>
> Here's my kernel command line:
> root=/dev/sda2 rootfstype=reiser4 rootflags=no_write_barrier ro
> i8042.nomux elevator=cfq resume=/dev/sda3 panic=5 nmi_watchdog=2,panic
> debug hugepages=32
>
> Here's the "huge" script I was using to run programs:
> #!/bin/sh
> export LD_PRELOAD=libhugetlbfs.so
> export HUGETLB_MORECORE=yes
> export HUGETLB_PATH=/mnt/huge
> export HUGETLB_VERBOSE=1
> exec "$@"
>
> I don't have any more info than that at the moment but I could reproduce
> it with whatever, on request.
---
~Randy
*** Remember to use Documentation/SubmitChecklist when testing your code ***
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-07-31 22:02 ` Randy Dunlap
2007-07-31 22:01 ` Zan Lynx
@ 2007-08-01 15:45 ` Nish Aravamudan
2007-08-01 15:54 ` Randy Dunlap
1 sibling, 1 reply; 11+ messages in thread
From: Nish Aravamudan @ 2007-08-01 15:45 UTC (permalink / raw)
To: Randy Dunlap; +Cc: Zan Lynx, Linux Kernel
On 7/31/07, Randy Dunlap <randy.dunlap@oracle.com> wrote:
> On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
>
> > I was playing with huge pages and libhugetlbfs. Small programs like
> > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > system slowly stops running. One interesting thing is the "ps" command,
> > it gets stuck like this:
>
> Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
>
> There was a hugepage problem fixed very recently, in 2.6.23-rc1 IIRC.
Actually fixed just after 2.6.23-rc1:
git describe 5ab3ee7b1cd5c91eb2272764f9d7d1fe4749681e
v2.6.23-rc1-14-g5ab3ee7
Thanks,
Nish
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-07-31 22:01 ` Zan Lynx
@ 2007-08-01 15:47 ` Nish Aravamudan
2007-08-01 15:52 ` Nish Aravamudan
2007-08-01 21:41 ` Nish Aravamudan
2 siblings, 0 replies; 11+ messages in thread
From: Nish Aravamudan @ 2007-08-01 15:47 UTC (permalink / raw)
To: Zan Lynx; +Cc: Randy Dunlap, Linux Kernel
On 7/31/07, Zan Lynx <zlynx@acm.org> wrote:
> On Tue, 2007-07-31 at 15:02 -0700, Randy Dunlap wrote:
> > On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> >
> > > I was playing with huge pages and libhugetlbfs. Small programs like
> > > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > > system slowly stops running. One interesting thing is the "ps" command,
> > > it gets stuck like this:
> >
> > Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
>
> D'oh! I mean 2.6.23-rc1-mm1, the 22 was a typo. Cut & paste to be
> sure:
> Linux zephyr 2.6.23-rc1-mm1 #1 SMP PREEMPT Wed Jul 25 17:33:04 MDT 2007
> x86_64 AMD Athlon(tm) 64 Processor 3400+ AuthenticAMD GNU/Linux
Hrm -- if you kill Evolution does the system come back? Or is it
unkillable/unusable. I guess you were able to run ps at the same time.
What is Evolution doing (sysrq+t)? For that matter, what was the
output from libhuge?
Thanks,
Nish
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-08-01 15:54 ` Randy Dunlap
@ 2007-08-01 15:52 ` Nish Aravamudan
0 siblings, 0 replies; 11+ messages in thread
From: Nish Aravamudan @ 2007-08-01 15:52 UTC (permalink / raw)
To: Randy Dunlap; +Cc: Zan Lynx, Linux Kernel
On 8/1/07, Randy Dunlap <randy.dunlap@oracle.com> wrote:
> Nish Aravamudan wrote:
> > On 7/31/07, Randy Dunlap <randy.dunlap@oracle.com> wrote:
> >> On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> >>
> >>> I was playing with huge pages and libhugetlbfs. Small programs like
> >>> "ls" work fine. I tried running Evolution through libhugetlbfs and the
> >>> system slowly stops running. One interesting thing is the "ps" command,
> >>> it gets stuck like this:
> >> Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
> >>
> >> There was a hugepage problem fixed very recently, in 2.6.23-rc1 IIRC.
> >
> > Actually fixed just after 2.6.23-rc1:
> >
> > git describe 5ab3ee7b1cd5c91eb2272764f9d7d1fe4749681e
> > v2.6.23-rc1-14-g5ab3ee7
>
> Looks to me like Andrew included Ken's patch in his rc1-mm1 anyway,
> so that shouldn't be the issue. Or did I not read mm/hugetlb.c correctly?
Yeah you're right, the -mm tree has that bug fixed.
Thanks,
Nish
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-07-31 22:01 ` Zan Lynx
2007-08-01 15:47 ` Nish Aravamudan
@ 2007-08-01 15:52 ` Nish Aravamudan
2007-08-02 1:40 ` Zan Lynx
2007-08-01 21:41 ` Nish Aravamudan
2 siblings, 1 reply; 11+ messages in thread
From: Nish Aravamudan @ 2007-08-01 15:52 UTC (permalink / raw)
To: Zan Lynx; +Cc: Randy Dunlap, Linux Kernel
On 7/31/07, Zan Lynx <zlynx@acm.org> wrote:
> On Tue, 2007-07-31 at 15:02 -0700, Randy Dunlap wrote:
> > On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> >
> > > I was playing with huge pages and libhugetlbfs. Small programs like
> > > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > > system slowly stops running. One interesting thing is the "ps" command,
> > > it gets stuck like this:
> >
> > Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
>
> D'oh! I mean 2.6.23-rc1-mm1, the 22 was a typo. Cut & paste to be
> sure:
> Linux zephyr 2.6.23-rc1-mm1 #1 SMP PREEMPT Wed Jul 25 17:33:04 MDT 2007
> x86_64 AMD Athlon(tm) 64 Processor 3400+ AuthenticAMD GNU/Linux
Just to confirm, still happens with -mm2?
Thanks,
Nish
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-08-01 15:45 ` Nish Aravamudan
@ 2007-08-01 15:54 ` Randy Dunlap
2007-08-01 15:52 ` Nish Aravamudan
0 siblings, 1 reply; 11+ messages in thread
From: Randy Dunlap @ 2007-08-01 15:54 UTC (permalink / raw)
To: Nish Aravamudan; +Cc: Zan Lynx, Linux Kernel
Nish Aravamudan wrote:
> On 7/31/07, Randy Dunlap <randy.dunlap@oracle.com> wrote:
>> On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
>>
>>> I was playing with huge pages and libhugetlbfs. Small programs like
>>> "ls" work fine. I tried running Evolution through libhugetlbfs and the
>>> system slowly stops running. One interesting thing is the "ps" command,
>>> it gets stuck like this:
>> Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
>>
>> There was a hugepage problem fixed very recently, in 2.6.23-rc1 IIRC.
>
> Actually fixed just after 2.6.23-rc1:
>
> git describe 5ab3ee7b1cd5c91eb2272764f9d7d1fe4749681e
> v2.6.23-rc1-14-g5ab3ee7
Looks to me like Andrew included Ken's patch in his rc1-mm1 anyway,
so that shouldn't be the issue. Or did I not read mm/hugetlb.c correctly?
--
~Randy
*** Remember to use Documentation/SubmitChecklist when testing your code ***
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-07-31 22:01 ` Zan Lynx
2007-08-01 15:47 ` Nish Aravamudan
2007-08-01 15:52 ` Nish Aravamudan
@ 2007-08-01 21:41 ` Nish Aravamudan
2 siblings, 0 replies; 11+ messages in thread
From: Nish Aravamudan @ 2007-08-01 21:41 UTC (permalink / raw)
To: Zan Lynx; +Cc: Randy Dunlap, Linux Kernel
On 7/31/07, Zan Lynx <zlynx@acm.org> wrote:
> On Tue, 2007-07-31 at 15:02 -0700, Randy Dunlap wrote:
> > On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> >
> > > I was playing with huge pages and libhugetlbfs. Small programs like
> > > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > > system slowly stops running. One interesting thing is the "ps" command,
> > > it gets stuck like this:
> >
> > Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
>
> D'oh! I mean 2.6.23-rc1-mm1, the 22 was a typo. Cut & paste to be
> sure:
> Linux zephyr 2.6.23-rc1-mm1 #1 SMP PREEMPT Wed Jul 25 17:33:04 MDT 2007
> x86_64 AMD Athlon(tm) 64 Processor 3400+ AuthenticAMD GNU/Linux
Also, are we at all sure this isn't a reiser4 issue? I assume you're
able to use Evolution w/o libhuge on rc1-mm1 ok? Any chance to remove
reiser4 from the picture? Have you been using libhuge this way
regularly? Any chance you know it worked ok with some recent kernel
(say 2.6.23-rc1?).
Thanks,
Nish
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-08-01 15:52 ` Nish Aravamudan
@ 2007-08-02 1:40 ` Zan Lynx
2007-08-03 3:51 ` Nish Aravamudan
0 siblings, 1 reply; 11+ messages in thread
From: Zan Lynx @ 2007-08-02 1:40 UTC (permalink / raw)
To: Nish Aravamudan; +Cc: Randy Dunlap, Linux Kernel
[-- Attachment #1: Type: text/plain, Size: 1669 bytes --]
On Wed, 2007-08-01 at 08:52 -0700, Nish Aravamudan wrote:
> On 7/31/07, Zan Lynx <zlynx@acm.org> wrote:
> > On Tue, 2007-07-31 at 15:02 -0700, Randy Dunlap wrote:
> > > On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> > >
> > > > I was playing with huge pages and libhugetlbfs. Small programs like
> > > > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > > > system slowly stops running. One interesting thing is the "ps" command,
> > > > it gets stuck like this:
> > >
> > > Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
> >
> > D'oh! I mean 2.6.23-rc1-mm1, the 22 was a typo. Cut & paste to be
> > sure:
> > Linux zephyr 2.6.23-rc1-mm1 #1 SMP PREEMPT Wed Jul 25 17:33:04 MDT 2007
> > x86_64 AMD Athlon(tm) 64 Processor 3400+ AuthenticAMD GNU/Linux
>
> Just to confirm, still happens with -mm2?
No, it does not seem to. Evolution runs OK. ps, top, pmap all work
fine.
However, a couple of other things happened. Could be unrelated or only
loosely related.
Evolution launches spamd (spamassassin) to filter junk mail. spamd died
and I have this in dmesg to show for it:
VM: killing process spamd
spamd would have inherited the libhugetlbfs.so environment variables.
There are no other clues as to why it died though.
Also, immediately after launching evolution with libhugetlbfs, I got
that USB bug where the mouse starts creating keyboard input. I got some
of these in dmesg:
keyboard.c: can't emulate rawmode for keycode 240
That could be pure coincidence, although I had been using the system
almost all day before that, and it hadn't happened.
--
Zan Lynx <zlynx@acm.org>
[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: 2.6.22-rc1-mm1 huge pages VM freeze (maybe?)
2007-08-02 1:40 ` Zan Lynx
@ 2007-08-03 3:51 ` Nish Aravamudan
0 siblings, 0 replies; 11+ messages in thread
From: Nish Aravamudan @ 2007-08-03 3:51 UTC (permalink / raw)
To: Zan Lynx; +Cc: Randy Dunlap, Linux Kernel
On 8/1/07, Zan Lynx <zlynx@acm.org> wrote:
> On Wed, 2007-08-01 at 08:52 -0700, Nish Aravamudan wrote:
> > On 7/31/07, Zan Lynx <zlynx@acm.org> wrote:
> > > On Tue, 2007-07-31 at 15:02 -0700, Randy Dunlap wrote:
> > > > On Tue, 31 Jul 2007 15:44:21 -0600 Zan Lynx wrote:
> > > >
> > > > > I was playing with huge pages and libhugetlbfs. Small programs like
> > > > > "ls" work fine. I tried running Evolution through libhugetlbfs and the
> > > > > system slowly stops running. One interesting thing is the "ps" command,
> > > > > it gets stuck like this:
> > > >
> > > > Do you mean 2.6.22-rc1-mm1 or 2.6.23-rc1-mm1?
> > >
> > > D'oh! I mean 2.6.23-rc1-mm1, the 22 was a typo. Cut & paste to be
> > > sure:
> > > Linux zephyr 2.6.23-rc1-mm1 #1 SMP PREEMPT Wed Jul 25 17:33:04 MDT 2007
> > > x86_64 AMD Athlon(tm) 64 Processor 3400+ AuthenticAMD GNU/Linux
> >
> > Just to confirm, still happens with -mm2?
>
> No, it does not seem to. Evolution runs OK. ps, top, pmap all work
> fine.
Interesting.
> However, a couple of other things happened. Could be unrelated or only
> loosely related.
>
> Evolution launches spamd (spamassassin) to filter junk mail. spamd died
> and I have this in dmesg to show for it:
>
> VM: killing process spamd
>
> spamd would have inherited the libhugetlbfs.so environment variables.
> There are no other clues as to why it died though.
Interesting. Any chance spamd can be run manually with those env
variables, but with HUGETLB_VERBOSE=99 (and/or in gdb) to see what
happens to it?
> Also, immediately after launching evolution with libhugetlbfs, I got
> that USB bug where the mouse starts creating keyboard input. I got some
> of these in dmesg:
> keyboard.c: can't emulate rawmode for keycode 240
>
> That could be pure coincidence, although I had been using the system
> almost all day before that, and it hadn't happened.
Had you started evolution w/o libhugetlbs at all before that?
It does seem like that would be coincidence.
Thanks,
Nish
^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2007-08-03 3:51 UTC | newest]
Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-07-31 21:44 2.6.22-rc1-mm1 huge pages VM freeze (maybe?) Zan Lynx
2007-07-31 22:02 ` Randy Dunlap
2007-07-31 22:01 ` Zan Lynx
2007-08-01 15:47 ` Nish Aravamudan
2007-08-01 15:52 ` Nish Aravamudan
2007-08-02 1:40 ` Zan Lynx
2007-08-03 3:51 ` Nish Aravamudan
2007-08-01 21:41 ` Nish Aravamudan
2007-08-01 15:45 ` Nish Aravamudan
2007-08-01 15:54 ` Randy Dunlap
2007-08-01 15:52 ` Nish Aravamudan
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox