public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Western Digital hard drive and DMA
@ 2002-09-08 20:46 Adam Jaskiewicz
  2002-09-08 21:12 ` jbradford
                   ` (2 more replies)
  0 siblings, 3 replies; 13+ messages in thread
From: Adam Jaskiewicz @ 2002-09-08 20:46 UTC (permalink / raw)
  To: linux-kernel

OK, I have heard that other people have been having this problem for a while 
now, but I havent been able to find much about what causes it. I have a 
Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just 
fine with no DMA, but it runs much faster with DMA. However, when I use DMA, 
all my data is slowly corrupted, and I begin having to re-install packages 
all the time. After about a month, my system deteriorates to the point where 
I have to reinstall slackware. I have no idea why this is happening, but I 
know some people who have had the same experience under Linux with Western 
Digital hard drives, but not with other brands. I am assuming this is a 
problem with Western Digital's implimentation of DMA, but shouldnt it do 
something to prevent errors?

Thanks.

-- 
Adam Jaskiewicz
adamjaskie@yahoo.com
http://middlearth.d2g.com:31415
talk:  adam@middlearth.d2g.com
--
Never tell a lie unless it is absolutely convenient.

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 20:46 Adam Jaskiewicz
@ 2002-09-08 21:12 ` jbradford
  2002-09-09  2:19   ` Horst von Brand
  2002-09-08 22:19 ` Barry K. Nathan
  2002-09-10 21:36 ` Maxwell Spangler
  2 siblings, 1 reply; 13+ messages in thread
From: jbradford @ 2002-09-08 21:12 UTC (permalink / raw)
  To: adamjaskie; +Cc: linux-kernel

> OK, I have heard that other people have been having this problem for a while 
> now, but I havent been able to find much about what causes it. I have a 
> Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just 
> fine with no DMA, but it runs much faster with DMA. However, when I use DMA, 
> all my data is slowly corrupted, and I begin having to re-install packages 
> all the time. After about a month, my system deteriorates to the point where 
> I have to reinstall slackware. I have no idea why this is happening, but I 
> know some people who have had the same experience under Linux with Western 
> Digital hard drives, but not with other brands. I am assuming this is a 
> problem with Western Digital's implimentation of DMA, but shouldnt it do 
> something to prevent errors?

What is the chipset of the interface it's on?

John.

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
       [not found] <Pine.LNX.4.33.0209081714500.18967-100000@coffee.psychology.mcmaster.ca>
@ 2002-09-08 21:21 ` Adam Jaskiewicz
  2002-09-08 21:39   ` jbradford
  2002-09-08 22:18   ` Mark Hahn
  0 siblings, 2 replies; 13+ messages in thread
From: Adam Jaskiewicz @ 2002-09-08 21:21 UTC (permalink / raw)
  To: Mark Hahn; +Cc: linux-kernel

> first, what controller is it plugged into, and which kernel are you
> running, and what are the ide-related boot messages?

Well, ATM its 2.4.17, but ive had the problem all through since 2.4.5, which 
was the first kernel installed on this machine. The chipset is Intel 440BX. 

These are the IDE boot messages:

Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
PIIX4: IDE controller on PCI bus 00 dev 39
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0x1420-0x1427, BIOS settings: hda:pio, hdb:DMA
    ide1: BM-DMA at 0x1428-0x142f, BIOS settings: hdc:DMA, hdd:pio
hda: WDC WD600AB-00BVA0, ATA DISK drive
hdb: WDC AC313600D, ATA DISK drive
hdc: TOSHIBA DVD-ROM SD-M1212, ATAPI CD/DVD-ROM drive
hdd: PCRW804, ATAPI CD/DVD-ROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hda: 117231408 sectors (60022 MB) w/2048KiB Cache, CHS=7297/255/63, UDMA(33)
hdb: 26712000 sectors (13677 MB) w/1966KiB Cache, CHS=1662/255/63, UDMA(33)
hdc: ATAPI 32X DVD-ROM drive, 256kB Cache, UDMA(33)

hdd is running though ide-scsi, as it is a cd-rw. hda and hdb both have dma 
turned off later in the boot process by hdparm. Could it be that I wasnt 
using those 80 conductor cables, and was getting crosstalk? I guess i could 
buy some to test that theory out...

-- 
Adam Jaskiewicz
adamjaskie@yahoo.com
http://middlearth.d2g.com:31415
talk:  adam@middlearth.d2g.com
--
... But we've only fondled the surface of that subject.
		-- Virginia Masters

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 21:21 ` Western Digital hard drive and DMA Adam Jaskiewicz
@ 2002-09-08 21:39   ` jbradford
  2002-09-08 22:18   ` Mark Hahn
  1 sibling, 0 replies; 13+ messages in thread
From: jbradford @ 2002-09-08 21:39 UTC (permalink / raw)
  To: adamjaskie; +Cc: linux-kernel

> turned off later in the boot process by hdparm. Could it be that I wasnt 
> using those 80 conductor cables, and was getting crosstalk? I guess i could 
> buy some to test that theory out...

Have a look at the number of UDMA CRC errors reported by

smartctl -a /dev/hda?

John

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 21:21 ` Western Digital hard drive and DMA Adam Jaskiewicz
  2002-09-08 21:39   ` jbradford
@ 2002-09-08 22:18   ` Mark Hahn
  2002-09-08 22:28     ` Adam Jaskiewicz
  1 sibling, 1 reply; 13+ messages in thread
From: Mark Hahn @ 2002-09-08 22:18 UTC (permalink / raw)
  To: Adam Jaskiewicz; +Cc: linux-kernel

> hdd is running though ide-scsi, as it is a cd-rw. hda and hdb both have dma 
> turned off later in the boot process by hdparm. Could it be that I wasnt 
> using those 80 conductor cables, and was getting crosstalk? I guess i could 
> buy some to test that theory out...

if you have noisy cables and someone turns off udma,
yes, you could certainly see corruption.  if you can
possibly ever use udma, it's a very good idea to do so;
only with it are transfers checksummed.  80-conductor 
cables are always advantageous as well, though only 
required over udma33.  (remember that valid IDE cables 
are always <= 18" long, with no stubs...)


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 20:46 Adam Jaskiewicz
  2002-09-08 21:12 ` jbradford
@ 2002-09-08 22:19 ` Barry K. Nathan
  2002-09-10 21:36 ` Maxwell Spangler
  2 siblings, 0 replies; 13+ messages in thread
From: Barry K. Nathan @ 2002-09-08 22:19 UTC (permalink / raw)
  To: Adam Jaskiewicz; +Cc: linux-kernel

On Sun, Sep 08, 2002 at 04:46:37PM -0400, Adam Jaskiewicz wrote:
> OK, I have heard that other people have been having this problem for a while
> now, but I havent been able to find much about what causes it. I have a
> Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just
[snip]

What brand of IDE controller does your computer have? WD drives often
don't get along with VIA IDE controllers. (I think very recent WD drives
might have fixed this, but I'm not sure.)

Also, I'm pretty sure that a 60GB WD drive is too new to be affected by
the DMA problems that their older drives had. I'd look at other factors
like the cables (they are 80-conductor, and 18" or shorter, right?), the
IDE controller (bad controllers can do this), or the power supply (that
was the cause for the case of data corruption that I most recently
investigated).

-Barry K. Nathan <barryn@pobox.com>

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 22:18   ` Mark Hahn
@ 2002-09-08 22:28     ` Adam Jaskiewicz
  2002-09-09 21:06       ` Alan Cox
  0 siblings, 1 reply; 13+ messages in thread
From: Adam Jaskiewicz @ 2002-09-08 22:28 UTC (permalink / raw)
  To: Mark Hahn; +Cc: linux-kernel

> if you have noisy cables and someone turns off udma,
> yes, you could certainly see corruption.  if you can
> possibly ever use udma, it's a very good idea to do so;

How do I enable UDMA as opposed to just DMA? I was having trouble with DMA, 
but no trouble (other than VERY slow access) without DMA. I have an 80 
conductor cable SOMEWHERE (probably in the bottom of a box in the basement 
lol) but im almost certain the cables in there now arent more than 18 inches, 
as its a fairly stock Dell system, and has the original ribbon cables. 

-- 
Adam Jaskiewicz
adamjaskie@yahoo.com
http://middlearth.d2g.com:31415
talk:  adam@middlearth.d2g.com
--
"I'd love to go out with you, but there are important world issues that
need worrying about."

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 21:12 ` jbradford
@ 2002-09-09  2:19   ` Horst von Brand
  2002-09-09 21:08     ` Alan Cox
  0 siblings, 1 reply; 13+ messages in thread
From: Horst von Brand @ 2002-09-09  2:19 UTC (permalink / raw)
  To: jbradford; +Cc: adamjaskie, linux-kernel

jbradford@dial.pipex.com said:
> > OK, I have heard that other people have been having this problem for a while 
> > now, but I havent been able to find much about what causes it. I have a 
> > Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just 
> > fine with no DMA, but it runs much faster with DMA. However, when I use DMA, 
> > all my data is slowly corrupted, and I begin having to re-install packages 
> > all the time. After about a month, my system deteriorates to the point where 
> > I have to reinstall slackware. I have no idea why this is happening, but I 
> > know some people who have had the same experience under Linux with Western 
> > Digital hard drives, but not with other brands. I am assuming this is a 
> > problem with Western Digital's implimentation of DMA, but shouldnt it do 
> > something to prevent errors?
> 
> What is the chipset of the interface it's on?

Use DMA for a week or so, and / is mangled beyond recognition (seems to
happen with read-only access too). Chipset is intel (sr440bx board, PIIX4E
IDE). Have heard of similar problems with DMA on WD drives, but got no
details.
-- 
Dr. Horst H. von Brand                   User #22616 counter.li.org
Departamento de Informatica                     Fono: +56 32 654431
Universidad Tecnica Federico Santa Maria              +56 32 654239
Casilla 110-V, Valparaiso, Chile                Fax:  +56 32 797513

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 22:28     ` Adam Jaskiewicz
@ 2002-09-09 21:06       ` Alan Cox
  0 siblings, 0 replies; 13+ messages in thread
From: Alan Cox @ 2002-09-09 21:06 UTC (permalink / raw)
  To: adamjaskie; +Cc: Mark Hahn, linux-kernel

On Sun, 2002-09-08 at 23:28, Adam Jaskiewicz wrote:
> > if you have noisy cables and someone turns off udma,
> > yes, you could certainly see corruption.  if you can
> > possibly ever use udma, it's a very good idea to do so;
> 
> How do I enable UDMA as opposed to just DMA? I was having trouble with DMA, 
> but no trouble (other than VERY slow access) without DMA. I have an 80 
> conductor cable SOMEWHERE (probably in the bottom of a box in the basement 
> lol) but im almost certain the cables in there now arent more than 18 inches, 
> as its a fairly stock Dell system, and has the original ribbon cables. 

UDMA is on so that doesn't explain what is happening at all


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-09  2:19   ` Horst von Brand
@ 2002-09-09 21:08     ` Alan Cox
  2002-09-10 15:55       ` john slee
  0 siblings, 1 reply; 13+ messages in thread
From: Alan Cox @ 2002-09-09 21:08 UTC (permalink / raw)
  To: Horst von Brand; +Cc: jbradford, adamjaskie, linux-kernel

On Mon, 2002-09-09 at 03:19, Horst von Brand wrote:
> Use DMA for a week or so, and / is mangled beyond recognition (seems to
> happen with read-only access too). Chipset is intel (sr440bx board, PIIX4E
> IDE). Have heard of similar problems with DMA on WD drives, but got no
> details.

Old old (we are talking 340MB era here) WD had some DMA problems in a
few cases. We know about it and blacklist such drives. I'm aware of a
few "UDMA doesnt work" type incompatibilities with WD drives but not
with PIIX and always UDMA crc errors


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-09 21:08     ` Alan Cox
@ 2002-09-10 15:55       ` john slee
  0 siblings, 0 replies; 13+ messages in thread
From: john slee @ 2002-09-10 15:55 UTC (permalink / raw)
  To: Alan Cox; +Cc: linux-kernel

On Mon, Sep 09, 2002 at 10:08:14PM +0100, Alan Cox wrote:
> Old old (we are talking 340MB era here) WD had some DMA problems in a
> few cases. We know about it and blacklist such drives. I'm aware of a

shouldn't these drives be _way_ past their mtbf by now and into the "wow
its still working" stage.  not that its an argument against
blacklisting... (far from it!)

j.

-- 
toyota power: http://indigoid.net/

^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-08 20:46 Adam Jaskiewicz
  2002-09-08 21:12 ` jbradford
  2002-09-08 22:19 ` Barry K. Nathan
@ 2002-09-10 21:36 ` Maxwell Spangler
  2002-09-12  1:52   ` Horst von Brand
  2 siblings, 1 reply; 13+ messages in thread
From: Maxwell Spangler @ 2002-09-10 21:36 UTC (permalink / raw)
  To: Adam Jaskiewicz; +Cc: linux-kernel

On Sun, 8 Sep 2002, Adam Jaskiewicz wrote:

> OK, I have heard that other people have been having this problem for a while 
> now, but I havent been able to find much about what causes it. I have a 
> Western Digital hard drive in my computer (60GB, 5400 RPM) I can use it just 
> fine with no DMA, but it runs much faster with DMA. However, when I use DMA, 
> all my data is slowly corrupted, and I begin having to re-install packages 
> all the time. After about a month, my system deteriorates to the point where 
> I have to reinstall slackware. I have no idea why this is happening, but I 
> know some people who have had the same experience under Linux with Western 
> Digital hard drives, but not with other brands. I am assuming this is a 
> problem with Western Digital's implimentation of DMA, but shouldnt it do 
> something to prevent errors?

This _could_ be your power supply.

I had problems with two IBM 22GXP drives attached to a Tyan dual slot1 board 
based on the BX chipset.  The system seemed fine other than drives would spin 
down and spin up occasionally and the /var/log/messages output would indicate 
DMA "drive not ready" errors.

I can't remember why I finally did it, but after replacing the power supply, 
the system operated fine.  The original was 3 years old, a generic to begin 
with.  (I've become a big fan of Antec power supplies and cases now that I've 
upgraded to Athlon class CPUs.)

Andre's drivers (which your output shows you are using on 2.4.17) have been 
very good on this type of equipment, IMHO.  But software can't overcome 
failing or poor quality hardware and the age of the BX chipset suggests this 
is probably and older computer as well.

This is just a guess, but something you can try and investigate on your own 
right now..

-- ----------------------------------------------------------------------------
Maxwell Spangler                                                
Program Writer                                              
Greenbelt, Maryland, U.S.A.                         
Washington D.C. Metropolitan Area 


^ permalink raw reply	[flat|nested] 13+ messages in thread

* Re: Western Digital hard drive and DMA
  2002-09-10 21:36 ` Maxwell Spangler
@ 2002-09-12  1:52   ` Horst von Brand
  0 siblings, 0 replies; 13+ messages in thread
From: Horst von Brand @ 2002-09-12  1:52 UTC (permalink / raw)
  To: Maxwell Spangler; +Cc: Adam Jaskiewicz, linux-kernel

Maxwell Spangler <maxwax@speakeasy.net> said:

[...]

> I can't remember why I finally did it, but after replacing the power
> supply, the system operated fine.  The original was 3 years old, a
> generic to begin with.  (I've become a big fan of Antec power supplies
> and cases now that I've upgraded to Athlon class CPUs.)

This is the second power supply here (original died on me), corruption with
the old and the new one. I very much doubt both are broken the same
way. Problems started with 2.4 or thereabouts.

WDC WD135AA, 15GiB; PIIX4E (intel sr440bx mobo, updated bios)
-- 
Dr. Horst H. von Brand                   User #22616 counter.li.org
Departamento de Informatica                     Fono: +56 32 654431
Universidad Tecnica Federico Santa Maria              +56 32 654239
Casilla 110-V, Valparaiso, Chile                Fax:  +56 32 797513

^ permalink raw reply	[flat|nested] 13+ messages in thread

end of thread, other threads:[~2002-09-12  1:47 UTC | newest]

Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <Pine.LNX.4.33.0209081714500.18967-100000@coffee.psychology.mcmaster.ca>
2002-09-08 21:21 ` Western Digital hard drive and DMA Adam Jaskiewicz
2002-09-08 21:39   ` jbradford
2002-09-08 22:18   ` Mark Hahn
2002-09-08 22:28     ` Adam Jaskiewicz
2002-09-09 21:06       ` Alan Cox
2002-09-08 20:46 Adam Jaskiewicz
2002-09-08 21:12 ` jbradford
2002-09-09  2:19   ` Horst von Brand
2002-09-09 21:08     ` Alan Cox
2002-09-10 15:55       ` john slee
2002-09-08 22:19 ` Barry K. Nathan
2002-09-10 21:36 ` Maxwell Spangler
2002-09-12  1:52   ` Horst von Brand

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox