From: Ingo Molnar <mingo@elte.hu>
To: Gene Heskett <gene.heskett@verizon.net>
Cc: linux-kernel@vger.kernel.org, Matt Heler <lkml@lpbproductions.com>
Subject: Re: 2.6.9-rc2-mm4
Date: Tue, 28 Sep 2004 13:05:41 +0200 [thread overview]
Message-ID: <20040928110541.GA22436@elte.hu> (raw)
In-Reply-To: <200409280701.06932.gene.heskett@verizon.net>
* Gene Heskett <gene.heskett@verizon.net> wrote:
> That would I assume need a null modem cable, and what do I run on the
> firewall? Minicom? Or is there something better that can just grab
> and log without being interactive? Its a rh7.3 box with a 2.4.18 era
> kernel. I'd update that, but its not broken. :)
here's a mini-howto:
to set up serial logging:
-------------------------
install a null modem cable to one of the serial ports of the server,
connect the cable to another box, run a terminal program on that other
box (e.g. "minicom -m" - do Alt-L to switch on logging after starting it
up) and set up the server's kernel to do serial logging: enable
CONFIG_SERIAL_8250_CONSOLE and CONFIG_SERIAL_CORE_CONSOLE, recompile &
reinstall the kernel, add "console=ttyS0,38400 console=tty0" to your
/etc/grub.conf or /etc/lilo.conf kernel boot line, reboot the server
with the new kernel command line - and configure minicom to run with
that speed (Alt-S).
e.g. my /etc/grub.conf has:
title test-2.6 (test-2.6)
root (hd0,0)
kernel /boot/bzImage root=/dev/sda1 console=ttyS0,38400 console=tty0 nmi_watchdog=1 kernel_preempt=1
if everything is set up correctly then you should see kernel messages
showing up in the minicom session when you boot up.
When the messages do not show up then typical errors are mismatch
between the serial port (or speed) and the device names used - if it's
COM2 then use ttyS1, and dont forget to set up the serial speed option
of minicom, etc. You can test the serial connection by doing:
echo x > /dev/ttyS0
and that should show up in the minicom session on the other box.
to set up the NMI watchdog:
---------------------------
add nmi_watchdog=1 to your boot parameters and reboot - that should be
all to get it active. If all CPU's NMI count increases in
/proc/interrupts then it's working fine. If the counts do not increase
(or only one CPU increases it) then try nmi_watchdog=2 - this is another
type of NMI that might work better. (Very rarely there are boxes that
dont have reliable NMI counts with 1 and 2 either - but i dont think
your box is one of those.)
once the NMI watchdog is up and running it should catch all hard lockups
and print backtraces to the serial console - even if you are within X
while the lockup happens. You can test hard lockups by running the
attached 'lockupcli' userspace code as root - it turns off interrupts
and goes into an infinite loop => instant lockup. The NMI watchdog
should notice this condition after a couple of seconds and should abort
the task, printing a kernel trace as well. Your box should be back in
working order after that point.
now for the real lockup your box wont be 'fixed' by the NMI watchdog, it
will likely stay locked up, but you should get messages on the serial
console, giving us an idea where the kernel locked up and why. (Very
rarely it happens that not even the NMI watchdog prints anything for a
hard lockup - this is often the sign of hardware problems.)
Ingo
--- lockupcli.c
main ()
{
iopl(3);
for (;;) asm("cli");
}
next prev parent reply other threads:[~2004-09-28 11:04 UTC|newest]
Thread overview: 45+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-09-27 1:10 2.6.9-rc2-mm4 Andrew Morton
2004-09-27 4:53 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-27 5:13 ` 2.6.9-rc2-mm4 Andrew Morton
2004-09-27 6:31 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-27 7:02 ` 2.6.9-rc2-mm4 Gene Heskett
[not found] ` <200409270706.21661.lkml@lpbproductions.com>
2004-09-27 15:31 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-27 15:52 ` 2.6.9-rc2-mm4 Matt Heler
2004-09-27 16:06 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-27 16:27 ` 2.6.9-rc2-mm4 Geert Uytterhoeven
2004-09-27 16:40 ` 2.6.9-rc2-mm4 Matt Heler
2004-09-27 20:17 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-28 1:42 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-28 7:01 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-28 10:26 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-28 10:33 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-28 11:01 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-28 11:05 ` Ingo Molnar [this message]
2004-09-28 11:36 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-29 8:08 ` 2.6.9-rc2-mm4 Geert Uytterhoeven
2004-09-29 8:43 ` 2.6.9-rc2-mm4 Andreas Schwab
2004-09-30 16:49 ` 2.6.9-rc2-mm4 Bill Davidsen
2004-09-27 15:32 ` 2.6.9-rc2-mm4 Matt Heler
2004-09-27 20:19 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-27 20:34 ` 2.6.9-rc2-mm4 Paul Fulghum
2004-09-27 20:45 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-27 23:01 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-28 2:12 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-28 13:07 ` 2.6.9-rc2-mm4 Paul Fulghum
2004-09-27 21:14 ` 2.6.9-rc2-mm4 Ingo Molnar
2004-09-27 22:00 ` 2.6.9-rc2-mm4 Paul Fulghum
2004-09-27 21:52 ` 2.6.9-rc2-mm4 Felipe Alfaro Solana
2004-09-28 1:45 ` 2.6.9-rc2-mm4 Gene Heskett
2004-09-27 23:04 ` 2.6.9-rc2-mm4 J.A. Magallon
2004-09-29 10:43 ` 2.6.9-rc2-mm4 Borislav Petkov
2004-09-30 23:26 ` 2.6.9-rc2-mm4 J.A. Magallon
2004-10-01 0:05 ` 2.6.9-rc2-mm4 Andrew Morton
2004-10-01 0:17 ` 2.6.9-rc2-mm4 J.A. Magallon
2004-10-01 0:22 ` 2.6.9-rc2-mm4 J.A. Magallon
2004-10-01 0:31 ` 2.6.9-rc2-mm4 Randy.Dunlap
2004-10-01 5:51 ` 2.6.9-rc2-mm4 Dmitry Torokhov
2004-10-01 5:30 ` 2.6.9-rc2-mm4 Dmitry Torokhov
2004-10-01 18:01 ` 2.6.9-rc2-mm4 Greg KH
2004-10-01 18:26 ` 2.6.9-rc2-mm4 Vojtech Pavlik
2004-10-01 23:41 ` 2.6.9-rc2-mm4 Greg KH
2004-10-02 5:31 ` 2.6.9-rc2-mm4 Vojtech Pavlik
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20040928110541.GA22436@elte.hu \
--to=mingo@elte.hu \
--cc=gene.heskett@verizon.net \
--cc=linux-kernel@vger.kernel.org \
--cc=lkml@lpbproductions.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox