From: Rick Warner <rick@microway.com>
To: linux-kernel@vger.kernel.org
Subject: very strange issue with sata,<4G Ram, and ext3
Date: Thu, 28 Apr 2005 12:16:07 -0400 [thread overview]
Message-ID: <200504281216.08026.rick@microway.com> (raw)
Hello,
We are having a very strange issue on some 64bit systems. We have a 32 node
cluster of EM64T's (supermicro boards). We are using our node restore
software to propagate a linux install onto them. We do a pxe boot to a
kernel and initrd image. The initrd has some config info, a basic root
filesystem, and a restore script. The kernel is passed init=/restore (the
restore script itself). The script runs dhcp, gets an ip, then nfs mounts
the master node of the cluster. The backup image is stored on the master
node's nfs mount. The script then applies a backed up partition table and
then mkfs's the partitions, mounts them, untars a backup tar to the drive,
and then makes it bootable with grub.
On these systems, we are getting ext2 errors from the initrd during the
untarring. Soon after, we start getting seg faults on random things (looks
like stuff caused by the still running dhcp client), and then a continuous
stream of segfaults on the restore script itself (restore[1]).
The systems being restored are dual em64t's with 2G of ram and 200G sata
drives. If we up the memory to 4G, the restores complete without error. If
we reduce down to 512M, the segfaults start at the mkfs stage instead of the
untar stage. We've tried different sata drives and controllers without
change. Switching to ide drives works. Switching to reiserfs instead of
ext3 for the destination drives works too. We've tried enabling the scsi
debug stuff as well as the jbd debug stuff for ext3 without getting any more
info. We also enabled the kernel debug options too. We've also tried using
the deprecated ide based sata drivers instead of the scsi based ones without
success. We have tried restoring to Intel's Jarell EM64T systems as well as
an Arima HDAMA opteron with the same errors. We've also tried adding swap
space ASAP in the inird image.
This problem is really baffling us and we're not quite sure what to check
into next. Any ideas?
--
Richard Warner
Lead Systems Integrator
Microway, Inc
(508)732-5517
next reply other threads:[~2005-04-28 16:16 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-04-28 16:16 Rick Warner [this message]
2005-04-28 17:32 ` very strange issue with sata,<4G Ram, and ext3 Rick Warner
2005-04-28 22:48 ` Alan Cox
2005-04-29 14:45 ` Rick Warner
2005-05-04 19:29 ` Rick Warner
2005-05-05 15:00 ` Alan Cox
2005-05-05 21:37 ` Krzysztof Halasa
2005-05-06 13:39 ` Rick Warner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=200504281216.08026.rick@microway.com \
--to=rick@microway.com \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox