* how to recover from "medium format corrupted"?
@ 2009-12-14 10:19 Christian Kujau
2009-12-14 14:32 ` Douglas Gilbert
0 siblings, 1 reply; 4+ messages in thread
From: Christian Kujau @ 2009-12-14 10:19 UTC (permalink / raw)
To: linux-scsi
Hi there,
I noticed that I cannot access one of the two 73GB SCSI disks any more
(connected to a 53C1030 LSI/Fusion MPT controller); during bootup the
message "READ CAPACITY failed" appears. Searching the net I've seen two
very interesting posts [0] about this and I think this is exactly what
happened to my disk:
I was booting the same system into Solaris 10 earlier
and wanted to erase the partition table via "dd if=/dev/zero ..." but as
this wasn't working (the table was still in place) I "played around" with
format[1] and may have issued the "FORMAT" command - but as it estimated
to run ~100 minutes I interrupted the command and booted into Linux
(2.6.32) only to see that sdb now has a different geometry:
# hdparm -g /dev/sd[ab]
/dev/sda: geometry = 8924/255/63, sectors = 143374744, start = 0
/dev/sdb: geometry = 0/64/32, sectors = 0, start = 0
Unfortunately, sg_format (as suggested by Douglas in the posts) were not
able to recover the disk.
I wonder if the disk is still OK (it was working earlier, SCSI FORMAT
commands won't destroy the disk, right?) and if anyone knows some magic
SCSI commands to issue so that the disk returns to a normal state again.
I've the lsiutil[2] tool on my hands, just in case some more details are
needed.
Any ideas?
Thanks,
Christian.
[0] http://www.mail-archive.com/linux-scsi@vger.kernel.org/msg09553.html
http://www.mail-archive.com/linux-scsi@vger.kernel.org/msg07629.html
[1] http://www.sun.com/bigadmin/content/submitted/format_utility.jsp
[2] http://www.lsi.com/DistributionSystem/AssetDocument/LSIUtil_1.62.zip
v40z1# dmesg | grep sdb
sd 2:0:1:0: [sdb] READ CAPACITY failed
sd 2:0:1:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08
sd 2:0:1:0: [sdb] Sense Key : 0x3 [current]
sd 2:0:1:0: [sdb] ASC=0x31 ASCQ=0x0
sd 2:0:1:0: [sdb] Write Protect is off
sd 2:0:1:0: [sdb] Mode Sense: ab 00 10 08
sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, supports DPO and FUA
sd 2:0:1:0: [sdb] READ CAPACITY failed
sd 2:0:1:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08
sd 2:0:1:0: [sdb] Sense Key : 0x3 [current]
sd 2:0:1:0: [sdb] ASC=0x31 ASCQ=0x0
sd 2:0:1:0: [sdb] Attached SCSI disk
v40z1# sg_format --format -vvv /dev/sdb
open /dev/sdb with flags=0x802
inquiry cdb: 12 00 00 00 24 00
duration=0 ms
SEAGATE ST373307LC 0007 peripheral_type: disk [0x0]
PROTECT=0
mode sense (10) cdb: 5a 00 01 00 00 00 00 00 fc 00
duration=0 ms
mode sense (10): requested 252 bytes but got 28 bytes
mode sense (10): response
00 1a 00 10 00 00 00 08 08 8b b9 98 00 00 02 00
81 0a c0 0b ff 00 00 00 05 00 ff ff
Mode Sense (block descriptor) data, prior to changes:
Number of blocks=143374744 [0x88bb998]
Block size=512 [0x200]
A FORMAT will commence in 10 seconds
ALL data on /dev/sdb will be DESTROYED
Press control-C to abort
A FORMAT will commence in 5 seconds
ALL data on /dev/sdb will be DESTROYED
Press control-C to abort
format cdb: 04 18 00 00 00 00
format parameter list:
00 02 00 00
duration=280 ms
Format has started
test unit ready cdb: 00 00 00 00 00 00
sense buffer empty
FORMAT Complete
v40z1# echo $?
0
--
BOFH excuse #449:
greenpeace free'd the mallocs
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: how to recover from "medium format corrupted"?
2009-12-14 10:19 how to recover from "medium format corrupted"? Christian Kujau
@ 2009-12-14 14:32 ` Douglas Gilbert
2009-12-14 21:24 ` Christian Kujau
2009-12-16 5:18 ` Christian Kujau
0 siblings, 2 replies; 4+ messages in thread
From: Douglas Gilbert @ 2009-12-14 14:32 UTC (permalink / raw)
To: Christian Kujau; +Cc: linux-scsi
Christian Kujau wrote:
> Hi there,
>
> I noticed that I cannot access one of the two 73GB SCSI disks any more
> (connected to a 53C1030 LSI/Fusion MPT controller); during bootup the
> message "READ CAPACITY failed" appears. Searching the net I've seen two
> very interesting posts [0] about this and I think this is exactly what
> happened to my disk:
>
> I was booting the same system into Solaris 10 earlier
> and wanted to erase the partition table via "dd if=/dev/zero ..." but as
> this wasn't working (the table was still in place) I "played around" with
> format[1] and may have issued the "FORMAT" command - but as it estimated
> to run ~100 minutes I interrupted the command and booted into Linux
> (2.6.32) only to see that sdb now has a different geometry:
>
> # hdparm -g /dev/sd[ab]
> /dev/sda: geometry = 8924/255/63, sectors = 143374744, start = 0
> /dev/sdb: geometry = 0/64/32, sectors = 0, start = 0
>
> Unfortunately, sg_format (as suggested by Douglas in the posts) were not
> able to recover the disk.
Christian,
Could you try the '--wait' option on the sg_format? The disk
does not seem to respond to the TEST UNIT READY poll. If
it is a modern disk it might respond to a REQUEST SENSE poll
but sg_format doesn't support that yet.
The '--wait' option will leave sg_format hanging until the
FORMAT UNIT finishes or 4 hours expires.
MEDIUM FORMAT CORRUPTED [asc/ascq=0x31,0x0] is caused by an
interrupted FORMAT UNIT.
Doug Gilbert
> I wonder if the disk is still OK (it was working earlier, SCSI FORMAT
> commands won't destroy the disk, right?) and if anyone knows some magic
> SCSI commands to issue so that the disk returns to a normal state again.
>
> I've the lsiutil[2] tool on my hands, just in case some more details are
> needed.
>
> Any ideas?
>
> Thanks,
> Christian.
>
> [0] http://www.mail-archive.com/linux-scsi@vger.kernel.org/msg09553.html
> http://www.mail-archive.com/linux-scsi@vger.kernel.org/msg07629.html
> [1] http://www.sun.com/bigadmin/content/submitted/format_utility.jsp
> [2] http://www.lsi.com/DistributionSystem/AssetDocument/LSIUtil_1.62.zip
>
>
> v40z1# dmesg | grep sdb
> sd 2:0:1:0: [sdb] READ CAPACITY failed
> sd 2:0:1:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08
> sd 2:0:1:0: [sdb] Sense Key : 0x3 [current]
> sd 2:0:1:0: [sdb] ASC=0x31 ASCQ=0x0
> sd 2:0:1:0: [sdb] Write Protect is off
> sd 2:0:1:0: [sdb] Mode Sense: ab 00 10 08
> sd 2:0:1:0: [sdb] Write cache: enabled, read cache: enabled, supports DPO and FUA
> sd 2:0:1:0: [sdb] READ CAPACITY failed
> sd 2:0:1:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08
> sd 2:0:1:0: [sdb] Sense Key : 0x3 [current]
> sd 2:0:1:0: [sdb] ASC=0x31 ASCQ=0x0
> sd 2:0:1:0: [sdb] Attached SCSI disk
>
> v40z1# sg_format --format -vvv /dev/sdb
> open /dev/sdb with flags=0x802
> inquiry cdb: 12 00 00 00 24 00
> duration=0 ms
> SEAGATE ST373307LC 0007 peripheral_type: disk [0x0]
> PROTECT=0
> mode sense (10) cdb: 5a 00 01 00 00 00 00 00 fc 00
> duration=0 ms
> mode sense (10): requested 252 bytes but got 28 bytes
> mode sense (10): response
> 00 1a 00 10 00 00 00 08 08 8b b9 98 00 00 02 00
> 81 0a c0 0b ff 00 00 00 05 00 ff ff
> Mode Sense (block descriptor) data, prior to changes:
> Number of blocks=143374744 [0x88bb998]
> Block size=512 [0x200]
>
> A FORMAT will commence in 10 seconds
> ALL data on /dev/sdb will be DESTROYED
> Press control-C to abort
> A FORMAT will commence in 5 seconds
> ALL data on /dev/sdb will be DESTROYED
> Press control-C to abort
> format cdb: 04 18 00 00 00 00
> format parameter list:
> 00 02 00 00
> duration=280 ms
>
> Format has started
> test unit ready cdb: 00 00 00 00 00 00
> sense buffer empty
> FORMAT Complete
> v40z1# echo $?
> 0
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: how to recover from "medium format corrupted"?
2009-12-14 14:32 ` Douglas Gilbert
@ 2009-12-14 21:24 ` Christian Kujau
2009-12-16 5:18 ` Christian Kujau
1 sibling, 0 replies; 4+ messages in thread
From: Christian Kujau @ 2009-12-14 21:24 UTC (permalink / raw)
To: Douglas Gilbert; +Cc: linux-scsi
On Mon, 14 Dec 2009 at 15:32, Douglas Gilbert wrote:
> Could you try the '--wait' option on the sg_format?
After sending out my mail to linux-scsi (as a last resort, I thought) I
decided to boot into Solaris again. Although "format" was not too happy
accessing the disk (and was even dumping core quite often), it was able to
get hold of the disk, somehow. I issued to FORMAT command again but this
time let it run and it completed to 97% - when it exited with a segfault.
But that must've been enough, as I'm now able to access the disk again,
under Linux too.
> The disk
> does not seem to respond to the TEST UNIT READY poll. If
> it is a modern disk it might respond to a REQUEST SENSE poll
> but sg_format doesn't support that yet.
It's a ST373307LC (Cheetah 10K.6), I think they were made 2003 or
something. I'll see if I can use lsiutil to send those commands directly
from the HBA.
> The '--wait' option will leave sg_format hanging until the
> FORMAT UNIT finishes or 4 hours expires.
I'm tempted to try that again - the box is a testsystem anyway and I'm
curious if I can reprodue this "media format corrupted" state to see if
the --wait option does anything.
Thanks for the prompt reply!
Christian.
--
BOFH excuse #287:
Telecommunications is downshifting.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: how to recover from "medium format corrupted"?
2009-12-14 14:32 ` Douglas Gilbert
2009-12-14 21:24 ` Christian Kujau
@ 2009-12-16 5:18 ` Christian Kujau
1 sibling, 0 replies; 4+ messages in thread
From: Christian Kujau @ 2009-12-16 5:18 UTC (permalink / raw)
To: Douglas Gilbert; +Cc: linux-scsi
On Mon, 14 Dec 2009 at 15:32, Douglas Gilbert wrote:
> Could you try the '--wait' option on the sg_format?
OK, I was curious enough to try this again: interrupt a FORMAT command,
now the disk responds with ASC=0x31 ASCQ=0x0 again. However, the --wait
option (sg_format version: 1.17 20090324) doesn't do anything:
v40z1# sg_format --format --wait -vvv /dev/sdb
open /dev/sdb with flags=0x802
inquiry cdb: 12 00 00 00 24 00
duration=0 ms
SEAGATE ST373307LC 0007 peripheral_type: disk [0x0]
PROTECT=0
mode sense (10) cdb: 5a 00 01 00 00 00 00 00 fc 00
duration=0 ms
sense buffer empty
requested 252 bytes but got 0 bytes
MODE SENSE (10) command failed
I can leave the disk in this state if you want to play around some more
and will bring back the disk in a usable state in a few days then.
Thanks,
Christian.
--
BOFH excuse #403:
Sysadmin didn't hear pager go off due to loud music from bar-room speakers.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2009-12-16 5:18 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-12-14 10:19 how to recover from "medium format corrupted"? Christian Kujau
2009-12-14 14:32 ` Douglas Gilbert
2009-12-14 21:24 ` Christian Kujau
2009-12-16 5:18 ` Christian Kujau
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox