* [linux-lvm] LVM snapshots causing SATA disk failures
@ 2005-05-31 16:33 Joerg Sommrey
2005-09-06 16:46 ` [linux-lvm] " Joerg Sommrey
0 siblings, 1 reply; 2+ messages in thread
From: Joerg Sommrey @ 2005-05-31 16:33 UTC (permalink / raw)
To: linux-lvm
Hello,
the subject sounds strange - I know. However, all the tests I did lead
me to this conclusion.
The setup consist of:
- two identical SATA disk /dev/sdb and /dev/sdc connected to a Promise
SATA150 TX4 controller
- both disks are identically partitioned into /dev/sd[bc][1-4]
- each pair of corresponding partitions are members of a MD-RAID0 or
MD-RAID1 device /dev/md[0-3]
- each MD-RAID device is the only physical volume of a LVM volume group
- there are several logical volumes within these volume groups
containing reiser filesystems.
So far everything works fine.
Now I create a snapshot of one of the logical volumes in a RAID-1 vg
(and mount it) and do some "heavy" memory-mapped i/o on either the
original or the snapshot filesystem (e.g. starting mutt on a large maildir
folder). This causes the system to fail in a absolutely repeatable way.
Syslog gets flooded with errors and the RAID-1 continues operation on one
partition. Need to reboot (reset!) at this point. I am able to
raidhotadd the failed partition without problems after reboot.
This problem appeared between kernel 2.6.10 and 2.6.11 and persists
until recent 2.6.12-rc5.
Does anybody have an idea what's going on here?
TIA
-jo
syslog:
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: SCSI error : <1 0 0 0> return code = 0x8000002
May 31 17:08:13 bear kernel: sdb: Current: sense key: Medium Error
May 31 17:08:13 bear kernel: Additional sense: Unrecovered read error - auto reallocate failed
May 31 17:08:13 bear kernel: end_request: I/O error, dev sdb, sector 80684237
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: SCSI error : <1 0 0 0> return code = 0x8000002
May 31 17:08:13 bear kernel: sdb: Current: sense key: Medium Error
May 31 17:08:13 bear kernel: Additional sense: Unrecovered read error - auto reallocate failed
May 31 17:08:13 bear kernel: end_request: I/O error, dev sdb, sector 80684245
May 31 17:08:13 bear kernel: raid1: Disk failure on sdb3, disabling device.
May 31 17:08:13 bear kernel: ^IOperation continuing on 1 devices
May 31 17:08:13 bear kernel: raid1: sdb3: rescheduling sector 278912
May 31 17:08:13 bear kernel: ata1: status=0x51 { DriveReady SeekComplete Error }May 31 17:08:13 bear kernel: ata1: called with no error (51)!
May 31 17:08:13 bear kernel: SCSI error : <1 0 0 0> return code = 0x8000002
May 31 17:08:13 bear kernel: sdb: Current: sense key: Medium Error
May 31 17:08:13 bear kernel: Additional sense: Unrecovered read error - auto reallocate failed
May 31 17:08:13 bear kernel: end_request: I/O error, dev sdb, sector 80684253
May 31 17:08:13 bear kernel: raid1: sdb3: rescheduling sector 278928
May 31 17:08:13 bear kernel: RAID1 conf printout:
May 31 17:08:13 bear kernel: --- wd:1 rd:2
May 31 17:08:13 bear kernel: disk 0, wo:1, o:0, dev:sdb3
May 31 17:08:13 bear kernel: disk 1, wo:0, o:1, dev:sdc3
May 31 17:08:13 bear kernel: RAID1 conf printout:
May 31 17:08:13 bear kernel: --- wd:1 rd:2
May 31 17:08:13 bear kernel: disk 1, wo:0, o:1, dev:sdc3
May 31 17:08:13 bear kernel: raid1: sdc3: redirecting sector 278912 to another mirror
May 31 17:08:13 bear kernel: raid1: sdc3: redirecting sector 278928 to another mirror
[etc. until reboot]
^ permalink raw reply [flat|nested] 2+ messages in thread
* [linux-lvm] Re: LVM snapshots causing SATA disk failures
2005-05-31 16:33 [linux-lvm] LVM snapshots causing SATA disk failures Joerg Sommrey
@ 2005-09-06 16:46 ` Joerg Sommrey
0 siblings, 0 replies; 2+ messages in thread
From: Joerg Sommrey @ 2005-09-06 16:46 UTC (permalink / raw)
To: linux-lvm
Update:
Seems to work now with 2.6.13.
-jo
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2005-09-06 16:47 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-05-31 16:33 [linux-lvm] LVM snapshots causing SATA disk failures Joerg Sommrey
2005-09-06 16:46 ` [linux-lvm] " Joerg Sommrey
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).