From: "Randy.Dunlap" <rddunlap@osdl.org>
To: Marc Bevand <bevand_m@epita.fr>
Cc: linux-kernel@vger.kernel.org, ak@suse.de
Subject: Re: NMI watchdog detected lockup
Date: Mon, 18 Oct 2004 10:13:11 -0700 [thread overview]
Message-ID: <4173F9A7.2090504@osdl.org> (raw)
In-Reply-To: <ckv123$pcs$1@sea.gmane.org>
Marc Bevand wrote:
> On 2004-10-17, Randy.Dunlap <rddunlap@osdl.org> wrote:
> |
> | I'm seeing this often during a kernel build on AIC79xx.
> | I did one kernel build on SATA without seeing this.
> | This is on a dual-Opteron IBM Workstation A with
> | 2 GB RAM, SATA, & SCSI.
> | [...]
> | NMI Watchdog detected LOCKUP on CPU0, registers:
> | [...]
>
> You are not the first one to observe frequent watchdog timeout
> lockup on dual Opteron systems during intense I/O operations,
> see this thread:
>
> http://thread.gmane.org/gmane.linux.ide/1933
>
> Note: this does *not* seem to be SATA-related.
Hi,
Zwane suspected NMI spikes and advised me to disable nmi_watchdog
(nmi_watchdog=0). After doing that, a kernel build completes
successfully, although with many messages like these:
Uhhuh. NMI received for unknown reason 21.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Uhhuh. NMI received for unknown reason 31.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Uhhuh. NMI received for unknown reason 31.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Uhhuh. NMI received for unknown reason 31.
Uhhuh. NMI received for unknown reason 31.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Uhhuh. NMI received for unknown reason 21.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Uhhuh. NMI received for unknown reason 21.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
Uhhuh. NMI received for unknown reason 21.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
I've also seen reason == 20.
This is on 2.6.9-rc4.
Andi, any ideas?
I've had several hundred of these messages, with only 1 dazed &
confused that did not continue OK.
Adding show_registers(regs); in the NMI handler points to
default_idle():
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?
CPU 0
Modules linked in: aic79xx usbserial aic7xxx ohci1394 ieee1394
Pid: 0, comm: swapper Not tainted 2.6.9-rc4
RIP: 0010:[<ffffffff8010f5f0>] <ffffffff8010f5f0>{default_idle+32}
RSP: 0018:ffffffff805e3fb8 EFLAGS: 00000246
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000018
RDX: ffffffff8010f5d0 RSI: ffffffff80472b00 RDI: 0000010001e11b20
RBP: 0000000000000000 R08: 00000000ffffffff R09: 0000000000000001
R10: 0000000000000080 R11: ffffffff80562ae0 R12: 0000000000000000
R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
FS: 0000002a95b2e4c0(0000) GS:ffffffff805de800(0000)
knlGS:0000000000000000
CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
CR2: 0000002a955a6000 CR3: 0000000000101000 CR4: 00000000000006e0
Process swapper (pid: 0, threadinfo ffffffff805e2000, task
ffffffff80472b00)
Stack: ffffffff8010f9fd 0000000000000000 ffffffff805e56e5 0000000000000000
ffffffff8055fbe0 0000000000000800 ffffffff805e51e0
0000000000000404
0000000000000000
Call Trace:<ffffffff8010f9fd>{cpu_idle+29}
<ffffffff805e56e5>{start_kernel+421}
<ffffffff805e51e0>{_sinittext+480}
Code: c3 fb f3 c3 66 66 66 90 66 66 66 90 66 66 66 90 48 83 ec 38
--
~Randy
next prev parent reply other threads:[~2004-10-18 17:22 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-10-17 22:58 NMI watchdog detected lockup Randy.Dunlap
2004-10-18 0:00 ` Marc Bevand
2004-10-18 17:13 ` Randy.Dunlap [this message]
2004-10-18 18:00 ` Andi Kleen
2004-10-18 17:58 ` Randy.Dunlap
2004-10-18 18:16 ` Andi Kleen
2004-10-18 18:37 ` Phil Oester
2004-10-21 4:54 ` Randy.Dunlap
-- strict thread matches above, loose matches on Subject: below --
2006-12-28 9:10 NMI Watchdog detected LOCKUP Jesper Juhl
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4173F9A7.2090504@osdl.org \
--to=rddunlap@osdl.org \
--cc=ak@suse.de \
--cc=bevand_m@epita.fr \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.