All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jesse Barnes <jbarnes@sgi.com>
To: linux-kernel@vger.kernel.org
Cc: Andrew Morton <akpm@osdl.org>
Subject: Re: 2.6.5-rc1-mm2
Date: Wed, 31 Mar 2004 11:02:58 -0800	[thread overview]
Message-ID: <200403311102.58136.jbarnes@sgi.com> (raw)
In-Reply-To: <200403301144.26050.jbarnes@sgi.com>

On Tuesday 30 March 2004 11:44 am, Jesse Barnes wrote:
> It looks like there's a bug in the sysrq implementation in the sn_serial
> driver.  Once the initial console is opened, sysrq no longer works.  All
> I've determined so far is that both CPUs in my box are in cpu_idle
> somewhere... Anyway, I'll keep looking.

Ah, now sysrq is working (just had to configure it correctly).  I've seen two
backtraces in the hangs I've seen.  The one I just reproduced looks like this:

Enabling local filesystem quotas:  [  OK  ]
Enabling swap space:  [  OK  ]
INIT: Entering runlevel: 3
Entering non-interactive startup
Starting sysstat:  [  OK  ]
Setting network parameters:  ^[SYSSysRq : Show State
[ bunch of kernel daemon traces ]
...
S10network    S a0000001000d8cf0     0  1143   1104  1156               (NOTLB)

Call Trace:
 [<a0000001000c4200>] schedule+0xda0/0x1360
                                sp=e00000387a27fdc0 bsp=e00000387a2791b8
 [<a0000001000d8cf0>] sys_wait4+0x450/0x660
                                sp=e00000387a27fdd0 bsp=e00000387a2790f0
 [<a000000100011a60>] ia64_ret_from_syscall+0x0/0x20
                                sp=e00000387a27fe30 bsp=e00000387a2790b8
initlog       S a0000001000e8650     0  1156   1143  1157               (NOTLB)

Call Trace:
 [<a0000001000c4200>] schedule+0xda0/0x1360
                                sp=e00000387af47ce0 bsp=e00000387af411a0
 [<a0000001000e8650>] schedule_timeout+0x190/0x1a0
                                sp=e00000387af47cf0 bsp=e00000387af41168
 [<a00000010072eb70>] unix_wait_for_peer+0x210/0x220
                                sp=e00000387af47d30 bsp=e00000387af41130
 [<a00000010072ee30>] unix_stream_connect+0x2b0/0xd00
                                sp=e00000387af47d90 bsp=e00000387af41098
 [<a0000001006285f0>] sys_connect+0xf0/0x140
                                sp=e00000387af47da0 bsp=e00000387af41020
 [<a000000100011a60>] ia64_ret_from_syscall+0x0/0x20
                                sp=e00000387af47e30 bsp=e00000387af41020
sysctl        Z a0000001000d7330     0  1157   1156                     (L-TLB)

Call Trace:
 [<a0000001000c4200>] schedule+0xda0/0x1360
                                sp=e00000347a5a7e20 bsp=e00000347a5a1078
 [<a0000001000d7330>] do_exit+0x490/0x500
                                sp=e00000347a5a7e30 bsp=e00000347a5a1018
 [<a0000001000d77b0>] do_group_exit+0x290/0x360
                                sp=e00000347a5a7e30 bsp=e00000347a5a0fe0
 [<a000000100011a60>] ia64_ret_from_syscall+0x0/0x20
                                sp=e00000347a5a7e30 bsp=e00000347a5a0fc8

and the CPU is in cpu_idle (somewhere, either default_idle or somewhere
along that call path).  The other failure was also a hang, and it looked
like an infinite number of page faults was being generated, something
like

...
 [<a0000001001233c0>] __free_pages+0x60/0x140
                                sp=e0000030148ebb80 bsp=e0000030148e5388
 [<a00000010012b670>] slab_destroy+0x2f0/0x3e0
                                sp=e0000030148ebb80 bsp=e0000030148e5338
 [<a000000100130120>] reap_timer_fnc+0x480/0x680
                                sp=e0000030148ebb80 bsp=e0000030148e5268
 [<a0000001000e7ee0>] run_timer_softirq+0x380/0x5c0
                                sp=e0000030148ebb90 bsp=e0000030148e51e0
 [<a0000001000dbd10>] __do_softirq+0x1d0/0x1e0
                                sp=e0000030148ebbb0 bsp=e0000030148e5160
 [<a0000001000dbda0>] do_softirq+0x80/0xe0
                                sp=e0000030148ebbb0 bsp=e0000030148e5100
 [<a000000100018300>] ia64_handle_irq+0x180/0x1c0
                                sp=e0000030148ebbb0 bsp=e0000030148e50c0
 [<a000000100011c00>] ia64_leave_kernel+0x0/0x280
                                sp=e0000030148ebbb0 bsp=e0000030148e50c0
 [<a000000100019d20>] default_idle+0xe0/0x180

or

...
 [<a00000010005de40>] mapped_kernel_page_is_present+0x100/0x120
                                sp=e0000030148eb920 bsp=e0000030148e5438
 [<a00000010005dfd0>] ia64_do_page_fault+0x170/0x960
                                sp=e0000030148eb920 bsp=e0000030148e53c8
 [<a000000100011c00>] ia64_leave_kernel+0x0/0x280
                                sp=e0000030148eb9b0 bsp=e0000030148e53c8
 [<a0000001001233c0>] __free_pages+0x60/0x140
                                sp=e0000030148ebb80 bsp=e0000030148e5388
 [<a00000010012b670>] slab_destroy+0x2f0/0x3e0
                                sp=e0000030148ebb80 bsp=e0000030148e5338
 [<a000000100130120>] reap_timer_fnc+0x480/0x680
                                sp=e0000030148ebb80 bsp=e0000030148e5268
 [<a0000001000e7ee0>] run_timer_softirq+0x380/0x5c0
                                sp=e0000030148ebb90 bsp=e0000030148e51e0

Jesse

  reply	other threads:[~2004-03-31 19:03 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-03-18  4:14 2.6.5-rc1-mm2 Andrew Morton
2004-03-18 16:01 ` 2.6.5-rc1-mm2 John Cherry
2004-03-18 20:31 ` USB: gphoto2 hangs, device disconnection oddity (was Re: 2.6.5-rc1-mm2) Sean Neakums
2004-03-19  9:27 ` 2.6.5-rc1-mm2 Marc-Christian Petersen
2004-03-30 19:27 ` 2.6.5-rc1-mm2 Jesse Barnes
2004-03-30 19:36   ` 2.6.5-rc1-mm2 Andrew Morton
2004-03-30 19:44     ` 2.6.5-rc1-mm2 Jesse Barnes
2004-03-31 19:02       ` Jesse Barnes [this message]
2004-03-31 20:06         ` 2.6.5-rc1-mm2 Andrew Morton
2004-03-31 23:15           ` 2.6.5-rc1-mm2 Jesse Barnes
2004-03-31 23:56             ` 2.6.5-rc1-mm2 Andrew Morton
2004-03-31 23:58               ` 2.6.5-rc1-mm2 Jesse Barnes
2004-04-01  0:16                 ` 2.6.5-rc1-mm2 Jesse Barnes
2004-04-01 19:28               ` 2.6.5-rc1-mm2 Jesse Barnes

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200403311102.58136.jbarnes@sgi.com \
    --to=jbarnes@sgi.com \
    --cc=akpm@osdl.org \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.