* Western Digital hard drive and DMA
@ 2002-09-08 20:46 Adam Jaskiewicz
2002-09-08 21:12 ` jbradford
` (2 more replies)
0 siblings, 3 replies; 13+ messages in thread
From: Adam Jaskiewicz @ 2002-09-08 20:46 UTC (permalink / raw)
To: linux-kernel
OK, I have heard that other people have been having this problem for a while
now, but I havent been able to find much about what causes it. I have a
Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just
fine with no DMA, but it runs much faster with DMA. However, when I use DMA,
all my data is slowly corrupted, and I begin having to re-install packages
all the time. After about a month, my system deteriorates to the point where
I have to reinstall slackware. I have no idea why this is happening, but I
know some people who have had the same experience under Linux with Western
Digital hard drives, but not with other brands. I am assuming this is a
problem with Western Digital's implimentation of DMA, but shouldnt it do
something to prevent errors?
Thanks.
--
Adam Jaskiewicz
adamjaskie@yahoo.com
http://middlearth.d2g.com:31415
talk: adam@middlearth.d2g.com
--
Never tell a lie unless it is absolutely convenient.
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 20:46 Adam Jaskiewicz
@ 2002-09-08 21:12 ` jbradford
2002-09-09 2:19 ` Horst von Brand
2002-09-08 22:19 ` Barry K. Nathan
2002-09-10 21:36 ` Maxwell Spangler
2 siblings, 1 reply; 13+ messages in thread
From: jbradford @ 2002-09-08 21:12 UTC (permalink / raw)
To: adamjaskie; +Cc: linux-kernel
> OK, I have heard that other people have been having this problem for a while
> now, but I havent been able to find much about what causes it. I have a
> Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just
> fine with no DMA, but it runs much faster with DMA. However, when I use DMA,
> all my data is slowly corrupted, and I begin having to re-install packages
> all the time. After about a month, my system deteriorates to the point where
> I have to reinstall slackware. I have no idea why this is happening, but I
> know some people who have had the same experience under Linux with Western
> Digital hard drives, but not with other brands. I am assuming this is a
> problem with Western Digital's implimentation of DMA, but shouldnt it do
> something to prevent errors?
What is the chipset of the interface it's on?
John.
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
[not found] <Pine.LNX.4.33.0209081714500.18967-100000@coffee.psychology.mcmaster.ca>
@ 2002-09-08 21:21 ` Adam Jaskiewicz
2002-09-08 21:39 ` jbradford
2002-09-08 22:18 ` Mark Hahn
0 siblings, 2 replies; 13+ messages in thread
From: Adam Jaskiewicz @ 2002-09-08 21:21 UTC (permalink / raw)
To: Mark Hahn; +Cc: linux-kernel
> first, what controller is it plugged into, and which kernel are you
> running, and what are the ide-related boot messages?
Well, ATM its 2.4.17, but ive had the problem all through since 2.4.5, which
was the first kernel installed on this machine. The chipset is Intel 440BX.
These are the IDE boot messages:
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller on PCI bus 00 dev 39
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0x1420-0x1427, BIOS settings: hda:pio, hdb:DMA
ide1: BM-DMA at 0x1428-0x142f, BIOS settings: hdc:DMA, hdd:pio
hda: WDC WD600AB-00BVA0, ATA DISK drive
hdb: WDC AC313600D, ATA DISK drive
hdc: TOSHIBA DVD-ROM SD-M1212, ATAPI CD/DVD-ROM drive
hdd: PCRW804, ATAPI CD/DVD-ROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hda: 117231408 sectors (60022 MB) w/2048KiB Cache, CHS=7297/255/63, UDMA(33)
hdb: 26712000 sectors (13677 MB) w/1966KiB Cache, CHS=1662/255/63, UDMA(33)
hdc: ATAPI 32X DVD-ROM drive, 256kB Cache, UDMA(33)
hdd is running though ide-scsi, as it is a cd-rw. hda and hdb both have dma
turned off later in the boot process by hdparm. Could it be that I wasnt
using those 80 conductor cables, and was getting crosstalk? I guess i could
buy some to test that theory out...
--
Adam Jaskiewicz
adamjaskie@yahoo.com
http://middlearth.d2g.com:31415
talk: adam@middlearth.d2g.com
--
... But we've only fondled the surface of that subject.
-- Virginia Masters
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 21:21 ` Western Digital hard drive and DMA Adam Jaskiewicz
@ 2002-09-08 21:39 ` jbradford
2002-09-08 22:18 ` Mark Hahn
1 sibling, 0 replies; 13+ messages in thread
From: jbradford @ 2002-09-08 21:39 UTC (permalink / raw)
To: adamjaskie; +Cc: linux-kernel
> turned off later in the boot process by hdparm. Could it be that I wasnt
> using those 80 conductor cables, and was getting crosstalk? I guess i could
> buy some to test that theory out...
Have a look at the number of UDMA CRC errors reported by
smartctl -a /dev/hda?
John
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 21:21 ` Western Digital hard drive and DMA Adam Jaskiewicz
2002-09-08 21:39 ` jbradford
@ 2002-09-08 22:18 ` Mark Hahn
2002-09-08 22:28 ` Adam Jaskiewicz
1 sibling, 1 reply; 13+ messages in thread
From: Mark Hahn @ 2002-09-08 22:18 UTC (permalink / raw)
To: Adam Jaskiewicz; +Cc: linux-kernel
> hdd is running though ide-scsi, as it is a cd-rw. hda and hdb both have dma
> turned off later in the boot process by hdparm. Could it be that I wasnt
> using those 80 conductor cables, and was getting crosstalk? I guess i could
> buy some to test that theory out...
if you have noisy cables and someone turns off udma,
yes, you could certainly see corruption. if you can
possibly ever use udma, it's a very good idea to do so;
only with it are transfers checksummed. 80-conductor
cables are always advantageous as well, though only
required over udma33. (remember that valid IDE cables
are always <= 18" long, with no stubs...)
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 20:46 Adam Jaskiewicz
2002-09-08 21:12 ` jbradford
@ 2002-09-08 22:19 ` Barry K. Nathan
2002-09-10 21:36 ` Maxwell Spangler
2 siblings, 0 replies; 13+ messages in thread
From: Barry K. Nathan @ 2002-09-08 22:19 UTC (permalink / raw)
To: Adam Jaskiewicz; +Cc: linux-kernel
On Sun, Sep 08, 2002 at 04:46:37PM -0400, Adam Jaskiewicz wrote:
> OK, I have heard that other people have been having this problem for a while
> now, but I havent been able to find much about what causes it. I have a
> Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just
[snip]
What brand of IDE controller does your computer have? WD drives often
don't get along with VIA IDE controllers. (I think very recent WD drives
might have fixed this, but I'm not sure.)
Also, I'm pretty sure that a 60GB WD drive is too new to be affected by
the DMA problems that their older drives had. I'd look at other factors
like the cables (they are 80-conductor, and 18" or shorter, right?), the
IDE controller (bad controllers can do this), or the power supply (that
was the cause for the case of data corruption that I most recently
investigated).
-Barry K. Nathan <barryn@pobox.com>
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 22:18 ` Mark Hahn
@ 2002-09-08 22:28 ` Adam Jaskiewicz
2002-09-09 21:06 ` Alan Cox
0 siblings, 1 reply; 13+ messages in thread
From: Adam Jaskiewicz @ 2002-09-08 22:28 UTC (permalink / raw)
To: Mark Hahn; +Cc: linux-kernel
> if you have noisy cables and someone turns off udma,
> yes, you could certainly see corruption. if you can
> possibly ever use udma, it's a very good idea to do so;
How do I enable UDMA as opposed to just DMA? I was having trouble with DMA,
but no trouble (other than VERY slow access) without DMA. I have an 80
conductor cable SOMEWHERE (probably in the bottom of a box in the basement
lol) but im almost certain the cables in there now arent more than 18 inches,
as its a fairly stock Dell system, and has the original ribbon cables.
--
Adam Jaskiewicz
adamjaskie@yahoo.com
http://middlearth.d2g.com:31415
talk: adam@middlearth.d2g.com
--
"I'd love to go out with you, but there are important world issues that
need worrying about."
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 21:12 ` jbradford
@ 2002-09-09 2:19 ` Horst von Brand
2002-09-09 21:08 ` Alan Cox
0 siblings, 1 reply; 13+ messages in thread
From: Horst von Brand @ 2002-09-09 2:19 UTC (permalink / raw)
To: jbradford; +Cc: adamjaskie, linux-kernel
jbradford@dial.pipex.com said:
> > OK, I have heard that other people have been having this problem for a while
> > now, but I havent been able to find much about what causes it. I have a
> > Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just
> > fine with no DMA, but it runs much faster with DMA. However, when I use DMA,
> > all my data is slowly corrupted, and I begin having to re-install packages
> > all the time. After about a month, my system deteriorates to the point where
> > I have to reinstall slackware. I have no idea why this is happening, but I
> > know some people who have had the same experience under Linux with Western
> > Digital hard drives, but not with other brands. I am assuming this is a
> > problem with Western Digital's implimentation of DMA, but shouldnt it do
> > something to prevent errors?
>
> What is the chipset of the interface it's on?
Use DMA for a week or so, and / is mangled beyond recognition (seems to
happen with read-only access too). Chipset is intel (sr440bx board, PIIX4E
IDE). Have heard of similar problems with DMA on WD drives, but got no
details.
--
Dr. Horst H. von Brand User #22616 counter.li.org
Departamento de Informatica Fono: +56 32 654431
Universidad Tecnica Federico Santa Maria +56 32 654239
Casilla 110-V, Valparaiso, Chile Fax: +56 32 797513
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 22:28 ` Adam Jaskiewicz
@ 2002-09-09 21:06 ` Alan Cox
0 siblings, 0 replies; 13+ messages in thread
From: Alan Cox @ 2002-09-09 21:06 UTC (permalink / raw)
To: adamjaskie; +Cc: Mark Hahn, linux-kernel
On Sun, 2002-09-08 at 23:28, Adam Jaskiewicz wrote:
> > if you have noisy cables and someone turns off udma,
> > yes, you could certainly see corruption. if you can
> > possibly ever use udma, it's a very good idea to do so;
>
> How do I enable UDMA as opposed to just DMA? I was having trouble with DMA,
> but no trouble (other than VERY slow access) without DMA. I have an 80
> conductor cable SOMEWHERE (probably in the bottom of a box in the basement
> lol) but im almost certain the cables in there now arent more than 18 inches,
> as its a fairly stock Dell system, and has the original ribbon cables.
UDMA is on so that doesn't explain what is happening at all
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-09 2:19 ` Horst von Brand
@ 2002-09-09 21:08 ` Alan Cox
2002-09-10 15:55 ` john slee
0 siblings, 1 reply; 13+ messages in thread
From: Alan Cox @ 2002-09-09 21:08 UTC (permalink / raw)
To: Horst von Brand; +Cc: jbradford, adamjaskie, linux-kernel
On Mon, 2002-09-09 at 03:19, Horst von Brand wrote:
> Use DMA for a week or so, and / is mangled beyond recognition (seems to
> happen with read-only access too). Chipset is intel (sr440bx board, PIIX4E
> IDE). Have heard of similar problems with DMA on WD drives, but got no
> details.
Old old (we are talking 340MB era here) WD had some DMA problems in a
few cases. We know about it and blacklist such drives. I'm aware of a
few "UDMA doesnt work" type incompatibilities with WD drives but not
with PIIX and always UDMA crc errors
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-09 21:08 ` Alan Cox
@ 2002-09-10 15:55 ` john slee
0 siblings, 0 replies; 13+ messages in thread
From: john slee @ 2002-09-10 15:55 UTC (permalink / raw)
To: Alan Cox; +Cc: linux-kernel
On Mon, Sep 09, 2002 at 10:08:14PM +0100, Alan Cox wrote:
> Old old (we are talking 340MB era here) WD had some DMA problems in a
> few cases. We know about it and blacklist such drives. I'm aware of a
shouldn't these drives be _way_ past their mtbf by now and into the "wow
its still working" stage. not that its an argument against
blacklisting... (far from it!)
j.
--
toyota power: http://indigoid.net/
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-08 20:46 Adam Jaskiewicz
2002-09-08 21:12 ` jbradford
2002-09-08 22:19 ` Barry K. Nathan
@ 2002-09-10 21:36 ` Maxwell Spangler
2002-09-12 1:52 ` Horst von Brand
2 siblings, 1 reply; 13+ messages in thread
From: Maxwell Spangler @ 2002-09-10 21:36 UTC (permalink / raw)
To: Adam Jaskiewicz; +Cc: linux-kernel
On Sun, 8 Sep 2002, Adam Jaskiewicz wrote:
> OK, I have heard that other people have been having this problem for a while
> now, but I havent been able to find much about what causes it. I have a
> Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just
> fine with no DMA, but it runs much faster with DMA. However, when I use DMA,
> all my data is slowly corrupted, and I begin having to re-install packages
> all the time. After about a month, my system deteriorates to the point where
> I have to reinstall slackware. I have no idea why this is happening, but I
> know some people who have had the same experience under Linux with Western
> Digital hard drives, but not with other brands. I am assuming this is a
> problem with Western Digital's implimentation of DMA, but shouldnt it do
> something to prevent errors?
This _could_ be your power supply.
I had problems with two IBM 22GXP drives attached to a Tyan dual slot1 board
based on the BX chipset. The system seemed fine other than drives would spin
down and spin up occasionally and the /var/log/messages output would indicate
DMA "drive not ready" errors.
I can't remember why I finally did it, but after replacing the power supply,
the system operated fine. The original was 3 years old, a generic to begin
with. (I've become a big fan of Antec power supplies and cases now that I've
upgraded to Athlon class CPUs.)
Andre's drivers (which your output shows you are using on 2.4.17) have been
very good on this type of equipment, IMHO. But software can't overcome
failing or poor quality hardware and the age of the BX chipset suggests this
is probably and older computer as well.
This is just a guess, but something you can try and investigate on your own
right now..
-- ----------------------------------------------------------------------------
Maxwell Spangler
Program Writer
Greenbelt, Maryland, U.S.A.
Washington D.C. Metropolitan Area
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: Western Digital hard drive and DMA
2002-09-10 21:36 ` Maxwell Spangler
@ 2002-09-12 1:52 ` Horst von Brand
0 siblings, 0 replies; 13+ messages in thread
From: Horst von Brand @ 2002-09-12 1:52 UTC (permalink / raw)
To: Maxwell Spangler; +Cc: Adam Jaskiewicz, linux-kernel
Maxwell Spangler <maxwax@speakeasy.net> said:
[...]
> I can't remember why I finally did it, but after replacing the power
> supply, the system operated fine. The original was 3 years old, a
> generic to begin with. (I've become a big fan of Antec power supplies
> and cases now that I've upgraded to Athlon class CPUs.)
This is the second power supply here (original died on me), corruption with
the old and the new one. I very much doubt both are broken the same
way. Problems started with 2.4 or thereabouts.
WDC WD135AA, 15GiB; PIIX4E (intel sr440bx mobo, updated bios)
--
Dr. Horst H. von Brand User #22616 counter.li.org
Departamento de Informatica Fono: +56 32 654431
Universidad Tecnica Federico Santa Maria +56 32 654239
Casilla 110-V, Valparaiso, Chile Fax: +56 32 797513
^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2002-09-12 1:47 UTC | newest]
Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <Pine.LNX.4.33.0209081714500.18967-100000@coffee.psychology.mcmaster.ca>
2002-09-08 21:21 ` Western Digital hard drive and DMA Adam Jaskiewicz
2002-09-08 21:39 ` jbradford
2002-09-08 22:18 ` Mark Hahn
2002-09-08 22:28 ` Adam Jaskiewicz
2002-09-09 21:06 ` Alan Cox
2002-09-08 20:46 Adam Jaskiewicz
2002-09-08 21:12 ` jbradford
2002-09-09 2:19 ` Horst von Brand
2002-09-09 21:08 ` Alan Cox
2002-09-10 15:55 ` john slee
2002-09-08 22:19 ` Barry K. Nathan
2002-09-10 21:36 ` Maxwell Spangler
2002-09-12 1:52 ` Horst von Brand
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox