* Problem with large devices >2TB
@ 2006-05-10 11:04 Jim Klimov
2006-05-14 20:10 ` H. Peter Anvin
0 siblings, 1 reply; 3+ messages in thread
From: Jim Klimov @ 2006-05-10 11:04 UTC (permalink / raw)
To: linux-raid
Hello linux-raid,
We have a problem with large devices (over 2TB).
While this is not exactly a software RAID issue, I saw quite a few
references to hardware RAID in this list and hope that my question
isn't an extreme offtopic.
I have seen this sort of problem in numerous mailing lists, and now
during an upgrade I encountered it myself. And I have not yet found
any well-working solutions, only hanging questions...
The server I maintain now has 2 3Ware 9550SX-12 cards with 10*320
and 9*400gb RAID5 arrays (addressable space is thus sized 2880gb
and 3200gb). These arrays are partitioned in 9*320gb and 8*400gb
parts (so we can defragment via a hotspare drive sometimes).
The Linux kernel is 2.6.16.12 with CONFIG_LBD=y.
The only tool that agreed to partition these arrays is a recent
GNU parted (1.7.0rc5), both fdisk and cfdisk failed to see it.
Fdisk from util-linux-2.12r suggested:
[root@storage bofh]# fdisk /dev/sdb
You must set cylinders.
You can do this from the extra functions menu.
Command (m for help): p
Disk /dev/sdb: 0 MB, 0 bytes
255 heads, 63 sectors/track, 0 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Blockdev also can not address the device:
[root@storage bofh]# blockdev --getsize /dev/sdc
BLKGETSIZE: File too large
[root@storage bofh]# blockdev -V
blockdev from util-linux-2.12r
Since the new parted worked ok (older one didn't), we were happy
until we tried a reboot. During the device initialization and after
it the system only recognises the 6 or 7 partitions which start
before the 2000Gb limit:
[ 5.168239] 3ware Storage Controller device driver for Linux v1.26.02.001.
[ 5.175294] 3ware 9000 Storage Controller device driver for Linux v2.26.02.007.
[ 5.438278] 3w-9xxx: scsi0: AEN: WARNING (0x04:0x0008): Unclean shutdown detected:unit=0.
[ 5.548104] 3w-9xxx: scsi0: AEN: WARNING (0x04:0x0008): Unclean shutdown detected:unit=1.
[ 5.657930] scsi0 : 3ware 9000 Storage Controller
[ 5.662833] 3w-9xxx: scsi0: Found a 3ware 9000 Storage Controller at 0xf6200000, IRQ: 201.
[ 5.976432] 3w-9xxx: scsi0: Firmware FE9X 3.02.00.004, BIOS BE9X 3.01.00.024, Ports: 12.
[ 5.985100] Vendor: AMCC Model: 9550SX-12 DISK Rev: 3.02
[ 5.993543] Type: Direct-Access ANSI SCSI revision: 03
[ 6.002020] Vendor: AMCC Model: 9550SX-12 DISK Rev: 3.02
[ 6.010487] Type: Direct-Access ANSI SCSI revision: 03
[ 6.278964] 3w-9xxx: scsi1: AEN: WARNING (0x04:0x0008): Unclean shutdown detected:unit=1.
[ 6.388788] scsi1 : 3ware 9000 Storage Controller
[ 6.393684] 3w-9xxx: scsi1: Found a 3ware 9000 Storage Controller at 0xf6340000, IRQ: 177.
[ 6.707289] 3w-9xxx: scsi1: Firmware FE9X 3.02.00.004, BIOS BE9X 3.01.00.024, Ports: 12.
[ 6.715910] Vendor: AMCC Model: 9550SX-12 DISK Rev: 3.02
[ 6.724362] Type: Direct-Access ANSI SCSI revision: 03
[ 6.732850] Vendor: AMCC Model: 9550SX-12 DISK Rev: 3.02
[ 6.741298] Type: Direct-Access ANSI SCSI revision: 03
[ 6.755016] SCSI device sda: 273395712 512-byte hdwr sectors (139979 MB)
[ 6.761971] sda: Write Protect is off
[ 6.765712] sda: Mode Sense: 23 00 00 00
[ 6.766029] SCSI device sda: drive cache: none
[ 6.770834] SCSI device sda: 273395712 512-byte hdwr sectors (139979 MB)
[ 6.777764] sda: Write Protect is off
[ 6.781507] sda: Mode Sense: 23 00 00 00
[ 6.781824] SCSI device sda: drive cache: none
[ 6.786343] sda: sda1
[ 6.789283] sd 0:0:0:0: Attached scsi disk sda
[ 6.793993] sdb : very big device. try to use READ CAPACITY(16).
[ 6.800164] SCSI device sdb: 5624801280 512-byte hdwr sectors (2879898 MB)
[ 6.807363] sdb: Write Protect is off
[ 6.811098] sdb: Mode Sense: 23 00 00 00
[ 6.811589] SCSI device sdb: drive cache: write back, no read (daft)
[ 6.818288] sdb : very big device. try to use READ CAPACITY(16).
[ 6.824454] SCSI device sdb: 5624801280 512-byte hdwr sectors (2879898 MB)
[ 6.831640] sdb: Write Protect is off
[ 6.835397] sdb: Mode Sense: 23 00 00 00
[ 6.835903] SCSI device sdb: drive cache: write back, no read (daft)
[ 6.842334] sdb: sdb1 sdb2 sdb3 sdb4 < sdb5 sdb6 sdb7 sdb8 >
<< here we lost sdb9 and sdb10
[ 6.936783] sd 0:0:1:0: Attached scsi disk sdb
[ 6.941515] sdc : very big device. try to use READ CAPACITY(16).
[ 6.947680] SCSI device sdc: 6249824256 512-byte hdwr sectors (3199910 MB)
[ 6.954870] sdc: Write Protect is off
[ 6.958611] sdc: Mode Sense: 23 00 00 00
[ 6.959085] SCSI device sdc: drive cache: write back, no read (daft)
[ 6.965784] sdc : very big device. try to use READ CAPACITY(16).
[ 6.947680] SCSI device sdc: 6249824256 512-byte hdwr sectors (3199910 MB)
[ 6.954870] sdc: Write Protect is off
[ 6.958611] sdc: Mode Sense: 23 00 00 00
[ 6.959085] SCSI device sdc: drive cache: write back, no read (daft)
[ 6.965784] sdc : very big device. try to use READ CAPACITY(16).
[ 6.971953] SCSI device sdc: 6249824256 512-byte hdwr sectors (3199910 MB)
[ 6.979131] sdc: Write Protect is off
[ 6.982869] sdc: Mode Sense: 23 00 00 00
[ 6.983341] SCSI device sdc: drive cache: write back, no read (daft)
[ 6.989769] sdc: sdc1 sdc2 sdc3 sdc4 < sdc5 sdc6 sdc7 >
<< here we lost sdc8 and sdc9
[ 7.078365] sd 1:0:0:0: Attached scsi disk sdc
[ 7.083087] SCSI device sdd: 144486400 512-byte hdwr sectors (73977 MB)
[ 7.089944] sdd: Write Protect is off
[ 7.093693] sdd: Mode Sense: 23 00 00 00
[ 7.094008] SCSI device sdd: drive cache: write back, no read (daft)
[ 7.100701] SCSI device sdd: 144486400 512-byte hdwr sectors (73977 MB)
[ 7.107546] sdd: Write Protect is off
[ 7.111292] sdd: Mode Sense: 23 00 00 00
[ 7.111608] SCSI device sdd: drive cache: write back, no read (daft)
[ 7.118032] sdd: sdd1
[ 7.120944] sd 1:0:1:0: Attached scsi disk sdd
The "last" few partitions are invisible to mount, mkfs, fsck and
other tools until a trick is done: we run parted, remove a partition
and remake it again with same start-end parameters. This probably
causes some re-read of the partition table and our data is
accessible again (until a reboot). Nothing appears in dmesg, though.
While this workaround works, it is unpleasant enough to put it in
some startup scripts and forget about it in a few years :)
Other workarounds seem to include remaking the arrays (so they are
under 2Tb in size) or remaking the last partition so it's larger and
starts before the 2Tb limit. So far we don't want to resort to these
solutions and are willing to experiment with whatever ideas or
patches the List People can suggest :)
While googling around I saw references that standard SCSI devices
can't address over 2Tb due to a 32-bit protocol, and several
proposed extensions are not standardized and buggy. Perhaps this
problem starts in the sd SCSI layer driver?
Can anybody on the list suggest where the problem can be rooted and
perhaps forward me and my message to the more appropriate list?
--
Best regards,
Jim Klimov mailto:klimov@2ka.mipt.ru
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Problem with large devices >2TB
2006-05-10 11:04 Problem with large devices >2TB Jim Klimov
@ 2006-05-14 20:10 ` H. Peter Anvin
2006-05-15 13:41 ` Re[2]: " Jim Klimov
0 siblings, 1 reply; 3+ messages in thread
From: H. Peter Anvin @ 2006-05-14 20:10 UTC (permalink / raw)
To: linux-raid
Followup to: <0376848.20060510150440@2ka.mipt.ru>
By author: Jim Klimov <klimov@2ka.mipt.ru>
In newsgroup: linux.dev.raid
>
> Since the new parted worked ok (older one didn't), we were happy
> until we tried a reboot. During the device initialization and after
> it the system only recognises the 6 or 7 partitions which start
> before the 2000Gb limit:
>
For a DOS partition table, there is no such thing as a partition
starting beyond 2 TB. You need to use a GPT or other more
sophisticated partition table.
-hpa
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re[2]: Problem with large devices >2TB
2006-05-14 20:10 ` H. Peter Anvin
@ 2006-05-15 13:41 ` Jim Klimov
0 siblings, 0 replies; 3+ messages in thread
From: Jim Klimov @ 2006-05-15 13:41 UTC (permalink / raw)
To: linux-raid
Hello H.,
Monday, May 15, 2006, 12:10:06 AM, you wrote:
HPA> Followup to: <0376848.20060510150440@2ka.mipt.ru>
HPA> By author: Jim Klimov <klimov@2ka.mipt.ru>
HPA> In newsgroup: linux.dev.raid
>>
>> Since the new parted worked ok (older one didn't), we were happy
>> until we tried a reboot. During the device initialization and after
>> it the system only recognises the 6 or 7 partitions which start
>> before the 2000Gb limit:
>>
HPA> For a DOS partition table, there is no such thing as a partition
HPA> starting beyond 2 TB. You need to use a GPT or other more
HPA> sophisticated partition table.
Thank you for the suggestion, we'll research it. Haven't tried any
other partitioning schemes before :)
A couple of short yes-no questions, if I may:
1) Are the GPT tables a feature of Itanium platforms, or can they be
used on usual 32-bit (Xeon) servers with Linux inside?
2) Is it possible to convert the partitioning schemes "on-the-fly",
or should we backup one array, repartition it and format the new
partitions, and restore the data? I believe this is a safer way
in any case, but several terabytes are not so easy and quick to
back up :)
And a longer question:
3) Are there any other features I should know of before relying on
GPT? Device naming? Addressing? Driver updates? Boot-up issues?
I'll use Google to find more FAQs now... ;)
--
Best regards,
Jim Klimov mailto:klimov@2ka.mipt.ru
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2006-05-15 13:41 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-05-10 11:04 Problem with large devices >2TB Jim Klimov
2006-05-14 20:10 ` H. Peter Anvin
2006-05-15 13:41 ` Re[2]: " Jim Klimov
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).