* sector size mismatch a problem for RAID 1?
From: Boylan, Ross @ 2017-02-03 0:17 UTC
To: linux-raid@vger.kernel.org
I got a replacement for a failing disk, but the two drives seem to have slightly different formatting:
For the failing one
GNU Parted 2.3
Using /dev/sdb
Welcome to GNU Parted! Type 'help' to view a list of commands.
(parted) p
Model: WDC WD40 01FFSX-68JNUN0 (scsi)
Disk /dev/sdb: 4001GB
Sector size (logical/physical): 512B/4096B <<<------
Partition Table: gpt
And the replacement (sdk)
Model: WDC WD40 01FFSX-68JNUN0 (scsi)
Disk /dev/sdk: 7814037168s
Sector size (logical/physical): 512B/512B <<<<-----
Partition Table: gpt
So the first reports 4 KiB physical sectors and the second reports 512-byte physical sectors (I assume that's really a fib, but it's what the drive says).
Could the mismatch in the physical sizes cause trouble?
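To cross-check what parted prints, the kernel exposes the same two values in sysfs. This is a minimal sketch, assuming the drives show up as sdb and sdk as above; the `sector_sizes` function name and its optional sysfs-root argument are made up here for illustration.

```shell
# Print the logical and physical sector sizes the kernel reports for a disk.
# Usage: sector_sizes DEVICE [SYSFS_ROOT]   (SYSFS_ROOT defaults to /sys)
# The function name and the SYSFS_ROOT argument are illustrative only.
sector_sizes() {
    dev=$1
    root=${2:-/sys}
    q="$root/block/$dev/queue"
    printf '%s logical=%s physical=%s\n' "$dev" \
        "$(cat "$q/logical_block_size")" \
        "$(cat "$q/physical_block_size")"
}
```

Running `sector_sizes sdb; sector_sizes sdk` prints one line per drive. Both drives above report the same model string, so if one of them is attached through a USB enclosure it may be worth repeating the check with it on a SATA port, in case the enclosure changes what gets reported.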
I ask because I did this:
mdadm --grow /dev/md/media4 --add /dev/sdk2 --raid-devices=2
which resulted in the following in kern.log:
Feb 2 15:26:46 tempserver kernel: [ 8583.545160] md: bind<sdk2>
Feb 2 15:26:46 tempserver kernel: [ 8583.957047] RAID1 conf printout:
Feb 2 15:26:46 tempserver kernel: [ 8583.957051] --- wd:1 rd:2
Feb 2 15:26:46 tempserver kernel: [ 8583.957052] disk 0, wo:0, o:1, dev:sdb2
Feb 2 15:26:46 tempserver kernel: [ 8583.957054] disk 1, wo:1, o:1, dev:sdk2
Feb 2 15:26:46 tempserver kernel: [ 8583.957108] md: recovery of RAID array md126
Feb 2 15:26:46 tempserver kernel: [ 8583.957111] md: minimum _guaranteed_ speed: 1000 KB/sec/disk.
Feb 2 15:26:46 tempserver kernel: [ 8583.957112] md: using maximum available idle IO bandwidth (but not more than 200000 KB/sec) for recovery.
Feb 2 15:26:46 tempserver kernel: [ 8583.957115] md: using 128k window, over a total of 1953376064k.
Feb 2 15:26:50 tempserver kernel: [ 8587.312261] usb 2-3: USB disconnect, device number 4
Feb 2 15:26:50 tempserver kernel: [ 8587.315908] scsi 8:0:0:3: rejecting I/O to offline device
Feb 2 15:26:50 tempserver kernel: [ 8587.315912] scsi 8:0:0:3: [sdk] killing request
Feb 2 15:26:50 tempserver kernel: [ 8587.315934] scsi 8:0:0:3: [sdk] Unhandled error code
Feb 2 15:26:50 tempserver kernel: [ 8587.315936] scsi 8:0:0:3: [sdk]
Feb 2 15:26:50 tempserver kernel: [ 8587.315939] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
Feb 2 15:26:50 tempserver kernel: [ 8587.315941] scsi 8:0:0:3: [sdk] CDB:
Feb 2 15:26:50 tempserver kernel: [ 8587.315943] Write(16): 8a 00 00 00 00 00 00 0c 3e 80 00 00 00 80 00 00
Feb 2 15:26:50 tempserver kernel: [ 8587.315951] end_request: I/O error, dev sdk, sector 802432
Feb 2 15:26:50 tempserver kernel: [ 8587.315963] md/raid1:md126: Disk failure on sdk2, disabling device.
Feb 2 15:26:50 tempserver kernel: [ 8587.315963] md/raid1:md126: Operation continuing on 1 devices.
I then tried
dd if=/dev/disk/by-id/ata-WDC_WD4001FFSX-68JNUN0_WD-WMC130FACU91 of=/dev/null skip=802500 count=1000
which produced no error. The input is from the device previously known as sdk. So I can at least read from the area near the sector that was associated with the error message above.
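The CDB bytes in the log can be decoded to see exactly which sectors the failed write covered. This is a rough sketch, assuming the standard WRITE(16) layout (opcode 0x8a, an 8-byte big-endian LBA in bytes 2-9, a 4-byte transfer length in bytes 10-13); `decode_write16` is a hypothetical helper name, not an existing tool.

```shell
# Decode the LBA and transfer length from a WRITE(16) CDB.
# Usage: decode_write16 B0 B1 ... B15   (16 hex bytes, lowercase, no 0x prefix)
# Hypothetical helper; assumes the standard WRITE(16) field layout.
decode_write16() {
    [ "$#" -eq 16 ] || { echo "expected 16 CDB bytes" >&2; return 1; }
    [ "$1" = "8a" ] || { echo "not a WRITE(16) opcode: $1" >&2; return 1; }
    # Bytes 2-9 (arguments 3-10): starting LBA, big-endian.
    lba=$(( 0x$3$4$5$6$7$8$9${10} ))
    # Bytes 10-13 (arguments 11-14): number of logical blocks to write.
    len=$(( 0x${11}${12}${13}${14} ))
    echo "lba=$lba sectors=$len last=$(( lba + len - 1 ))"
}

decode_write16 8a 00 00 00 00 00 00 0c 3e 80 00 00 00 80 00 00
```

For the bytes logged above this gives LBA 802432 and 128 sectors, so the write covered sectors 802432 through 802559. With dd's default 512-byte block size, skip=802500 count=1000 reads sectors 802500 through 803499, which overlaps only the tail of that range. Something like `dd if=/dev/disk/by-id/ata-WDC_WD4001FFSX-68JNUN0_WD-WMC130FACU91 of=/dev/null bs=512 skip=802432 count=128` would read the whole of it. Since the logged error is hostbyte=DID_NO_CONNECT right after a USB disconnect, a clean read later may not say much about the media either way.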
BTW, the array I'm trying to mirror is on a part of the disk that I think is still good.
* RE: sector size mismatch a problem for RAID 1?
From: Boylan, Ross @ 2017-02-03 6:18 UTC
To: linux-raid@vger.kernel.org
When I moved the new disk to an internal drive bay (after reading lvm-raid wiki's statements that RAID shouldn't be used over a USB link) the problems went away--at least so far.
I wonder if this is the source of my original problem: I was using a single disk RAID1 where the single disk had a USB connection. However, I did get a failure off that drive doing a dd off the raw device.
Ross
* Re: sector size mismatch a problem for RAID 1?
From: Wols Lists @ 2017-02-03 11:20 UTC
To: Boylan, Ross, linux-raid@vger.kernel.org
On 03/02/17 06:18, Boylan, Ross wrote:
> When I moved the new disk to an internal drive bay (after reading lvm-raid wiki's statements that RAID shouldn't be used over a USB link) the problems went away--at least so far.
>
As I understand it, the problem is that the USB interface goes to sleep.
So when you try to write to it, it may not wake up quickly enough, causing
havoc ...
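One way to check whether a disk sits behind USB at all, as a sketch only. It assumes the usual sysfs layout, where the resolved /sys/block path of a USB-attached disk passes through a `usbN` directory; the `is_usb_disk` name and its optional sysfs-root argument are made up for illustration. Whether autosuspend is really what caused the disconnect in this thread is not established.

```shell
# Succeed if the block device's resolved sysfs path runs through a USB bus.
# Usage: is_usb_disk DEVICE [SYSFS_ROOT]   (SYSFS_ROOT defaults to /sys)
# Hypothetical helper; relies on the usual sysfs device path layout.
is_usb_disk() {
    root=${2:-/sys}
    target=$(readlink -f "$root/block/$1") || return 1
    case "$target" in
        */usb[0-9]*) return 0 ;;   # path contains a usbN component
        *) return 1 ;;
    esac
}
```

For example, `is_usb_disk sdk && cat /sys/bus/usb/devices/2-3/power/control` would print the runtime power setting for the USB device named in the disconnect message (`usb 2-3`), where `auto` lets the kernel autosuspend the device and `on` keeps it awake. The `2-3` path is taken from the log above and will differ on other systems.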
> I wonder if this is the source of my original problem: I was using a single disk RAID1 where the single disk had a USB connection. However, I did get a failure off that drive doing a dd off the raw device.
>
Quite likely. But I'm puzzled as to why the dd would fail, as this would
keep the USB interface active. Maybe others who know more will chime in
and explain.
>
> Ross
Cheers,
Wol