public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Random host bus errors with pata_it821x, filesystem corruption
@ 2008-12-04 20:01 Ondrej Zary
  2008-12-04 21:37 ` Alan Cox
  0 siblings, 1 reply; 2+ messages in thread
From: Ondrej Zary @ 2008-12-04 20:01 UTC (permalink / raw)
  To: Alan Cox; +Cc: linux-ide, linux-kernel

Hello,
I'm using IT8212F-based card with two Samsung HD400LD drives in cards' RAID1 mode.
It works fine most of the time, but sometimes the machine stalls for a while with HDD LED on.
The log contains errors like this then:

(syslog)
Nov 29 11:25:38 pentium kernel: ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Nov 29 11:25:38 pentium kernel: ata3.00: BMDMA stat 0x26
Nov 29 11:25:38 pentium kernel: ata3.00: cmd ca/00:08:ca:74:d0/00:00:00:00:00/e1 tag 0 dma 4096 out
Nov 29 11:25:38 pentium kernel:          res 51/00:01:01:00:00/00:00:00:00:00/00 Emask 0x21 (host bus error)
Nov 29 11:25:38 pentium kernel: ata3.00: status: { DRDY ERR }
Nov 29 16:09:40 pentium kernel: ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Nov 29 16:09:41 pentium kernel: ata3.00: BMDMA stat 0x26
Nov 29 16:09:41 pentium kernel: ata3.00: cmd ca/00:08:c2:f5:a7/00:00:00:00:00/e2 tag 0 dma 4096 out
Nov 29 16:09:41 pentium kernel:          res 51/04:01:01:00:00/00:00:00:00:00/00 Emask 0x21 (host bus error)
Nov 29 16:09:41 pentium kernel: ata3.00: status: { DRDY ERR }
Nov 29 16:09:41 pentium kernel: ata3.00: error: { ABRT }

(messages)
Nov 29 11:25:39 pentium kernel: ata3.00: RAID1 volume.
Nov 29 11:25:39 pentium kernel: ata3.00: configured for DMA
Nov 29 11:25:39 pentium kernel: ata3: EH complete
Nov 29 11:25:39 pentium kernel: sd 2:0:0:0: [sdb] 781422766 512-byte hardware sectors (400088 MB)
Nov 29 11:25:39 pentium kernel: sd 2:0:0:0: [sdb] Write Protect is off
Nov 29 11:25:39 pentium kernel: sd 2:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
Nov 29 16:09:41 pentium kernel: ata3.00: RAID1 volume.
Nov 29 16:09:41 pentium kernel: ata3.00: configured for DMA
Nov 29 16:09:41 pentium kernel: ata3: EH complete
Nov 29 16:09:41 pentium kernel: sd 2:0:0:0: [sdb] 781422766 512-byte hardware sectors (400088 MB)
Nov 29 16:09:41 pentium kernel: sd 2:0:0:0: [sdb] Write Protect is off
Nov 29 16:09:41 pentium kernel: sd 2:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA


or this:

(syslog)
Dec  3 21:30:41 pentium kernel: ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Dec  3 21:30:42 pentium kernel: ata3.00: BMDMA stat 0x26
Dec  3 21:30:42 pentium kernel: ata3.00: cmd ca/00:08:fa:0f:68/00:00:00:00:00/e1 tag 0 dma 4096 out
Dec  3 21:30:42 pentium kernel:          res 51/10:01:01:00:00/00:00:00:00:00/00 Emask 0xa1 (host bus error)
Dec  3 21:30:42 pentium kernel: ata3.00: status: { DRDY ERR }
Dec  3 21:30:42 pentium kernel: ata3.00: error: { IDNF }
Dec  3 21:30:42 pentium kernel: Descriptor sense data with sense descriptors (in hex):
Dec  3 21:30:42 pentium kernel: end_request: I/O error, dev sdb, sector 23597050
Dec  3 21:30:42 pentium kernel: Buffer I/O error on device sdb2, logical block 1901390
Dec  3 21:30:42 pentium kernel: lost page write due to I/O error on sdb2
Dec  3 21:30:49 pentium kernel: ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x0
Dec  3 21:30:49 pentium kernel: ata3.00: BMDMA stat 0x26
Dec  3 21:30:49 pentium kernel: ata3.00: cmd ca/00:08:aa:41:d0/00:00:00:00:00/e1 tag 0 dma 4096 out
Dec  3 21:30:49 pentium kernel:          res 51/04:01:01:00:00/00:00:00:00:00/00 Emask 0x21 (host bus error)
Dec  3 21:30:49 pentium kernel: ata3.00: status: { DRDY ERR }
Dec  3 21:30:49 pentium kernel: ata3.00: error: { ABRT }
Dec  3 21:40:41 pentium kernel: ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Dec  3 21:40:41 pentium kernel: ata3.00: cmd ca/00:18:c2:2f:1e/00:00:00:00:00/e1 tag 0 dma 12288 out
Dec  3 21:40:41 pentium kernel:          res 40/00:01:01:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Dec  3 21:40:41 pentium kernel: ata3.00: status: { DRDY }

(messages)
Dec  3 21:30:42 pentium kernel: ata3.00: RAID1 volume.
Dec  3 21:30:42 pentium kernel: ata3.00: configured for DMA
Dec  3 21:30:42 pentium kernel: sd 2:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08
Dec  3 21:30:42 pentium kernel: sd 2:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor]
Dec  3 21:30:42 pentium kernel:         72 0b 14 00 00 00 00 0c 00 0a 80 00 00 00 00 00
Dec  3 21:30:42 pentium kernel:         00 00 00 01
Dec  3 21:30:42 pentium kernel: sd 2:0:0:0: [sdb] ASC=0x14 ASCQ=0x0
Dec  3 21:30:42 pentium kernel: ata3: EH complete
Dec  3 21:30:42 pentium kernel: sd 2:0:0:0: [sdb] 781422766 512-byte hardware sectors (400088 MB)
Dec  3 21:30:42 pentium kernel: sd 2:0:0:0: [sdb] Write Protect is off
Dec  3 21:30:42 pentium kernel: sd 2:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
Dec  3 21:30:50 pentium kernel: ata3.00: RAID1 volume.
Dec  3 21:30:50 pentium kernel: ata3.00: configured for DMA
Dec  3 21:30:50 pentium kernel: ata3: EH complete
Dec  3 21:30:50 pentium kernel: sd 2:0:0:0: [sdb] 781422766 512-byte hardware sectors (400088 MB)
Dec  3 21:30:50 pentium kernel: sd 2:0:0:0: [sdb] Write Protect is off
Dec  3 21:30:50 pentium kernel: sd 2:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
Dec  3 21:40:41 pentium kernel: ata3: soft resetting link
Dec  3 21:40:42 pentium kernel: ata3.00: RAID1 volume.
Dec  3 21:40:42 pentium kernel: ata3.00: configured for DMA
Dec  3 21:40:42 pentium kernel: ata3: EH complete
Dec  3 21:40:42 pentium kernel: sd 2:0:0:0: [sdb] 781422766 512-byte hardware sectors (400088 MB)
Dec  3 21:40:42 pentium kernel: sd 2:0:0:0: [sdb] Write Protect is off
Dec  3 21:40:42 pentium kernel: sd 2:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA



It even resulted in filesystem corruption once:

(syslog)
Dec  2 17:37:55 pentium kernel: ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
Dec  2 17:37:56 pentium kernel: ata3.00: cmd ca/00:08:1a:f6:13/00:00:00:00:00/e1 tag 0 dma 4096 out
Dec  2 17:37:56 pentium kernel:          res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Dec  2 17:37:56 pentium kernel: ata3.00: status: { DRDY }
Dec  2 17:39:32 pentium kernel: EXT3-fs error (device sdb2): htree_dirblock_to_tree: bad entry in directory #945280: rec_len %% 4 != 0 - 
offset=12288, inode=862741792, rec_len=29535, name_len=116

(messages)
Dec  2 17:37:56 pentium kernel: ata3: soft resetting link
Dec  2 17:37:56 pentium kernel: ata3.00: RAID1 volume.
Dec  2 17:37:56 pentium kernel: ata3.00: configured for DMA
Dec  2 17:37:56 pentium kernel: ata3: EH complete
Dec  2 17:37:56 pentium kernel: sd 2:0:0:0: [sdb] 781422766 512-byte hardware sectors (400088 MB)
Dec  2 17:37:56 pentium kernel: sd 2:0:0:0: [sdb] Write Protect is off
Dec  2 17:37:56 pentium kernel: sd 2:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA

Next day, fsck started on boot and found some errors.

I wonder what might cause these errors? I switched the controller into non-RAID mode and read SMART - both drives
are 100% OK (no pending or reallocated sectors, no errors logged).

-- 
Ondrej Zary

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: Random host bus errors with pata_it821x, filesystem corruption
  2008-12-04 20:01 Random host bus errors with pata_it821x, filesystem corruption Ondrej Zary
@ 2008-12-04 21:37 ` Alan Cox
  0 siblings, 0 replies; 2+ messages in thread
From: Alan Cox @ 2008-12-04 21:37 UTC (permalink / raw)
  To: Ondrej Zary; +Cc: linux-ide, linux-kernel

> Dec  3 21:30:42 pentium kernel: ata3.00: status: { DRDY ERR }
> Dec  3 21:30:42 pentium kernel: ata3.00: error: { IDNF }

The drive firmware is reporting an unreadable block (or invalid block
number)

> I wonder what might cause these errors? I switched the controller into non-RAID mode and read SMART - both drives
> are 100% OK (no pending or reallocated sectors, no errors logged).

I would keep the controller in that mode anyway to be honest - the RAID
mode firmware is a bit variable and some versions blow up for reasons
I've never figured.

Alan

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2008-12-04 21:38 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-12-04 20:01 Random host bus errors with pata_it821x, filesystem corruption Ondrej Zary
2008-12-04 21:37 ` Alan Cox

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox