From: Jean Delvare <khali@linux-fr.org>
To: Ryan Richter <ryan@tau.solarneutrino.net>
Cc: Erik Mouw <erik@harddisk-recovery.com>,
Nick Warne <nick@linicks.net>,
linux-kernel@vger.kernel.org
Subject: Re: Random reboots
Date: Tue, 14 Feb 2006 23:22:22 +0100 [thread overview]
Message-ID: <20060214232222.5d4384a8.khali@linux-fr.org> (raw)
In-Reply-To: <20060214132904.GI16566@tau.solarneutrino.net>
Hi Ryan,
> > We recently had such an issue with a dual AMD64 machine rebooting at
> > mke2fs. It turned out it was a faulty power supply. After we changed
> > the power supply, everything ran smooth again.
> >
> > You could start to test by powering your drives from an old AT-style
> > power supply leaving more "juice" for the main board and CPUs.
>
> It's possible, but I doubt it. More often than not, the reboot happens
> when the machine is completely idle - in fact I can't remember a single
> time when it wasn't idle. I just spent a couple months debugging a
> SCSI-tape crash, and I ran the backups a lot and had lots of RAID
> resyncs and it *never* rebooted during either of these events. Anyway
> it has quite a large 2+1 redundant power supply, and, like I said, we
> routinely had 3+ months of uptime with older kernels.
You seem to have hardware monitoring drivers loaded on the system, so
I'd suggest that you watch the returned values over time. If the
hardware is going wrong it might show there. Your system could be
overheating for some reason (stuck fan...)
The fact that older kernels were seemingly working better doesn't prove
much. You were running these kernels before, not now, and hardware
*does* age, contrary to what people seem to think. If you want to make
certain that older kernels were indeed working better for purely
software reasons, you should switch back to such an old kernel and see
if things actually improve or not.
A wild guess while I'm at it... Is the machine behind a KVM switch by
any chance? I have a fun (old) motherboard here which reboots when I
unplug the keyboard and plug it again. Never seen that before...
> During the years I've had this machine, I've experienced at least 10-15
> strange kernel bugs that only happened on this machine. Each and every
> time I was *convinced* that the hardware was at fault (and people on the
> mailing list suggested it) until either a kernel came out that fixed the
> problem or a kernel developer positively identified it as a kernel
> problem and eventually fixed it. This machine just seems to be a magnet
> for kernel bugs.
Note that the first case ("a kernel came out that fixed the problem")
doesn't mean that the hardware was not at fault. There are quite a few
quirks in the Linux kernel code which are just there to workaround
known hardware or BIOS bugs.
--
Jean Delvare
next prev parent reply other threads:[~2006-02-14 22:22 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-02-13 21:04 Random reboots Ryan Richter
2006-02-13 21:10 ` anders
2006-02-13 21:22 ` Ryan Richter
[not found] ` <7c3341450602131332x2fcd7d8co@mail.gmail.com>
2006-02-13 21:39 ` Ryan Richter
2006-02-13 21:49 ` Ryan Richter
2006-02-14 8:54 ` Erik Mouw
2006-02-14 13:29 ` Ryan Richter
2006-02-14 14:47 ` Nick Warne
2006-02-14 22:22 ` Jean Delvare [this message]
2006-02-15 14:28 ` Ryan Richter
2006-02-15 15:11 ` linux-os (Dick Johnson)
2006-02-15 15:13 ` Ryan Richter
2006-02-15 15:41 ` Jean Delvare
2006-02-15 16:00 ` Ryan Richter
2006-02-15 16:20 ` Jean Delvare
2006-02-15 16:30 ` Ryan Richter
2006-02-15 18:46 ` Ryan Richter
2006-02-27 20:35 ` Ryan Richter
-- strict thread matches above, loose matches on Subject: below --
2001-04-24 16:18 random reboots Nathan Walp
2001-04-26 1:36 ` Petr Vandrovec
2001-04-26 1:52 ` Nathan Walp
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20060214232222.5d4384a8.khali@linux-fr.org \
--to=khali@linux-fr.org \
--cc=erik@harddisk-recovery.com \
--cc=linux-kernel@vger.kernel.org \
--cc=nick@linicks.net \
--cc=ryan@tau.solarneutrino.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox