From: Francois-Rene Rideau <fare@tunes.org>
To: reiserfs-list@namesys.com
Subject: Crash: the problem was DMA!
Date: Fri, 24 Jan 2003 23:28:23 +0100
Message-ID: <20030124222823.GA15643@hell.mine.nu>
In-Reply-To: <20030122100342.GA14100@hell.mine.nu>
Dear reiserfs developers,
here's an update on the trouble I had lately (the second disk crash):
the culprit was IDE DMA!
Here's the story:
* I installed more RAM, and it failed, so I upgraded the BIOS to see it
(machine was dead after I upgraded the BIOS, until I was told to also
short the NVRAM to reset the BIOS parameters, but that's another story).
* After I rebooted, swsusp failed (oops, I had suspended the machine
instead of halting it), and I originally attributed my troubles to that.
The machine was thereupon blocked right after mounting its reiserfs rootfs.
* On another machine, I could reiserfsck the disk, mount it and backup
its contents. Phew!
* Windows and debian's linux-2.4.18-bf2.4 were working fine on the machine,
and memtest86 reported no error, so the hardware looks OK.
* However, using any of my custom-compiled Linux kernels would result in
miserable kernel panics while accessing the big 100G reiserfs partition:
I ksymoops'ed a few panics, and they all showed a NULL-pointer dereference
while accessing the big reiserfs partition: open() system call or
disk interrupt handler.
The panic came quick enough (while in init, in rcS or getty),
though not at a fully predictable place.
* I could put some support files in my tiny /boot, and I can successfully boot
with root=/dev/hda1 init=/bin/sash -- even with the kernels that fail
on the big partition. Then, if I try to reiserfsck the disk,
I get a panic or lack thereof depending on my using a custom kernel
or the debian-supplied one. I suppose the partition was small enough
that timing problems didn't have the opportunity to occur.
* I started suspecting DMA thanks to a reiserfsck message telling me
while using debian's kernel that lack of dma was detrimental to performance.
Indeed, compiling a custom kernel w/o DMA caused the system to boot,
while using hdparm to enable dma caused it to hang.
* The reiserfs partition seems to have survived quite well all those
reboots and crashes that happened while sorting out this mess.
Congratulations!
* All in all, it looks like my BIOS update hosed the way the IDE chipset
is configured, as far as having Linux use the DMA is concerned. Darn.
This message is to tell you what took me quite some time to figure out,
so that if the problem occurs again, you (yes, you who are asked to solve
problems, but also YOU, who have a problem and are Googling in search of
a solution to your problem, and found this message in a mailing-list archive)
can find the answer readily available.
PS: YES, my old kern.log's from before the BIOS update do show
hda: 240121728 sectors (122942 MB) w/2048KiB Cache, CHS=14946/255/63, UDMA(66)
while the newer ones lack the UDMA(66).
VP_IDE: VIA vt82c686a (rev 1b) IDE UDMA66 controller on pci00:04.1
In case anyone cares, that's an ASUS K7M motherboard.
Darn. I still have this performance bug - but at least, the computer works.
Advice welcome, though it's becoming off-topic (so a private message might
be more suitable).
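
For readers who want to run the same before/after log comparison described
in the PS, here is a minimal Python sketch. It assumes the disk detection
line always has the shape of the single sample quoted above ("hdX: N sectors
... , UDMA(66)"); the function name dma_modes and the two regular expressions
are illustrative, not part of any kernel or reiserfs tool, and the "after"
line in the usage example is an assumed reconstruction of a line without the
mode tag.

```python
import re

# Assumption: disk detection lines look like the sample from the message,
# optionally preceded by a syslog timestamp/prefix.
DISK = re.compile(r"\b(hd[a-z]): \d+ sectors\b(.*)$")
# The transfer mode, when present, is the last comma-separated field.
MODE = re.compile(r",\s*(U?DMA(?:\(\d+\))?)\s*$")

def dma_modes(log_lines):
    """Map each IDE disk name to the DMA mode in its detection line, or None."""
    modes = {}
    for line in log_lines:
        disk = DISK.search(line.rstrip())
        if disk is None:
            continue  # not a disk detection line
        mode = MODE.search(disk.group(2))
        modes[disk.group(1)] = mode.group(1) if mode else None
    return modes

if __name__ == "__main__":
    before = ["hda: 240121728 sectors (122942 MB) w/2048KiB Cache, "
              "CHS=14946/255/63, UDMA(66)"]
    # Hypothetical post-update line: same as above, minus the mode tag.
    after = ["hda: 240121728 sectors (122942 MB) w/2048KiB Cache, "
             "CHS=14946/255/63"]
    print("before:", dma_modes(before))
    print("after: ", dma_modes(after))
```

A disk that maps to None in the newer log but to a UDMA mode in the older
one matches the symptom reported here.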
[ François-René ÐVB Rideau | Reflection&Cybernethics | http://fare.tunes.org ]
[ TUNES project for a Free Reflective Computing System | http://tunes.org ]
Premature optimization is the root of all evil.
-- D.E. Knuth