* multipath and StorageWorks HSG80
From: Steve Feehan @ 2006-04-20 19:43 UTC
  To: dm-devel

Hello. I'm trying to figure out multipath on a StorageWorks HSG80
controller. I'm using Ubuntu "Dapper" with the following software
versions:

  kernel:           2.6.15-20-server
  multipath-tools:  0.4.7-1ubuntu4

From what I've been reading[1], I gather that I need to use the
dm-hp-sw kernel module. So I applied the patches[2,3,4,5] to
the kernel and set up /etc/multipath.conf as follows:

devices {

        device {
                vendor                  "DEC"
                product                 "HSG80"
                path_grouping_policy    group_by_serial
                path_checker            hp_sw
                features                "1 queue_if_no_path"
                hardware_handler        "1 hp_sw"
        }
}

multipaths {
        multipath {
                wwid    60001fe10015bf500009947159810015
                alias   red
        }
}

From what I can tell, it is basically working:

sfeehan@dogwood:~$ sudo multipath -ll
red (360001fe10015bf500009947159810015)
[size=203 GB][features=1 queue_if_no_path][hwhandler=1 hp_sw]
\_ round-robin 0 [prio=2][active]
 \_ 0:0:0:1 sda 8:0   [active][ready]
 \_ 0:0:1:1 sdb 8:16  [active][ready]
\_ round-robin 0 [prio=2][enabled]
 \_ 0:0:2:1 sdc 8:32  [active][ghost]
 \_ 0:0:3:1 sdd 8:48  [active][ghost]


And now the problem: I see quite a few errors in dmesg:


[42949392.230000] sd 0:0:2:1: Device not ready.
[42949392.230000] end_request: I/O error, dev sdc, sector 426583424
[42949392.230000] printk: 515 messages suppressed.
[42949392.230000] Buffer I/O error on device sdc, logical block 213291712
[42949392.230000] sd 0:0:2:1: Device not ready.
[42949392.230000] end_request: I/O error, dev sdc, sector 426583424
[42949392.230000] sd 0:0:3:1: Device not ready.
[42949392.230000] end_request: I/O error, dev sdd, sector 426583424
[42949392.230000] sd 0:0:3:1: Device not ready.
[42949392.230000] end_request: I/O error, dev sdd, sector 426583424


And when I put LVM on top of the multipath device I get lots
of errors:

sfeehan@dogwood:~$ sudo pvcreate /dev/mapper/red
  Physical volume "/dev/mapper/red" successfully created
sfeehan@dogwood:~$ sudo vgcreate vg0 /dev/mapper/red
  Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sdb1 not /dev/sda1
  /dev/sdc: read failed after 0 of 2048 at 0: Input/output error
  /dev/sdd: read failed after 0 of 2048 at 0: Input/output error
  Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sda1 not /dev/sdb1
  Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sdb1 not /dev/sda1
  /dev/sdc: read failed after 0 of 1024 at 0: Input/output error
  /dev/sdc: read failed after 0 of 2048 at 0: Input/output error
  /dev/sdd: read failed after 0 of 1024 at 0: Input/output error
  /dev/sdd: read failed after 0 of 2048 at 0: Input/output error
  Volume group "vg0" successfully created
sfeehan@dogwood:~$
sfeehan@dogwood:~$
sfeehan@dogwood:~$ sudo lvcreate -L 10G -n test vg0
  Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sdb1 not /dev/sda1
  /dev/sdc: read failed after 0 of 2048 at 0: Input/output error
  /dev/sdd: read failed after 0 of 2048 at 0: Input/output error
  Logical volume "test" created
sfeehan@dogwood:~$


So... what are the errors about? Are they significant? What
if anything can I do to resolve them?

Thanks.

Steve


[1] [dm-devel] StorageWorks multipath support
    http://www.redhat.com/archives/dm-devel/2005-June/msg00105.html

[2] [dm-devel] [PATCH RFT/RFC 1/4] export block layer helpers
    http://www.redhat.com/archives/dm-devel/2005-September/msg00097.html

[3] [dm-devel] [PATCH RFT/RFC 2/4] add dm_scsi helpers
    http://www.redhat.com/archives/dm-devel/2005-September/msg00098.html

[4] [dm-devel] [PATCH RFT/RFC 3/4] convert dm emc
    http://www.redhat.com/archives/dm-devel/2005-September/msg00099.html

[5] [dm-devel] [PATCH RFT/RFC 4/4] add dm-hp-sw hw handler
    http://www.redhat.com/archives/dm-devel/2005-September/msg00100.html

-- 
Steve Feehan

* Re: multipath and StorageWorks HSG80
From: Bernd Zeimetz @ 2006-04-20 20:02 UTC
  To: device-mapper development

Steve Feehan wrote:
> Hello. I'm trying to figure out multipath on a StorageWorks HSG80
> controller. I'm using Ubuntu "Dapper" with the following software
>   
There was a thread about the HSG80s a few days ago:
http://www.redhat.com/archives/dm-devel/2006-April/msg00025.html
That probably answers all your questions.


Actually, I'd really like to have them well supported; chances are good
that we'll get some of them soon :\


Best regards,


Bernd

* Re: multipath and StorageWorks HSG80
From: Christophe Varoqui @ 2006-04-20 20:07 UTC
  To: device-mapper development

Steve Feehan wrote:
> Hello. I'm trying to figure out multipath on a StorageWorks HSG80
> controller. I'm using Ubuntu "Dapper" with the following software
> versions:
>
>   kernel:           2.6.15-20-server
>   multipath-tools:  0.4.7-1ubuntu4
>
> From what I've been reading[1], I gather that I need to use the
> dm-hp-sw kernel module. So I applied the patches[2,3,4,5] to
> the kernel and set up /etc/multipath.conf as follows:
>
> devices {
>
>         device {
>                 vendor                  "DEC"
>                 product                 "HSG80"
>                 path_grouping_policy    group_by_serial
>                 path_checker            hp_sw
>                 features                "1 queue_if_no_path"
>                 hardware_handler        "1 hp_sw"
>         }
> }
>
> multipaths {
>         multipath {
>                 wwid    60001fe10015bf500009947159810015
>                 alias   red
>         }
> }
>
> From what I can tell, it is basically working:
>
> sfeehan@dogwood:~$ sudo multipath -ll
> red (360001fe10015bf500009947159810015)
> [size=203 GB][features=1 queue_if_no_path][hwhandler=1 hp_sw]
> \_ round-robin 0 [prio=2][active]
>  \_ 0:0:0:1 sda 8:0   [active][ready]
>  \_ 0:0:1:1 sdb 8:16  [active][ready]
> \_ round-robin 0 [prio=2][enabled]
>  \_ 0:0:2:1 sdc 8:32  [active][ghost]
>  \_ 0:0:3:1 sdd 8:48  [active][ghost]
>
>
> And now the problem: I see quite a few errors in dmesg:
>
>
> [42949392.230000] sd 0:0:2:1: Device not ready.
> [42949392.230000] end_request: I/O error, dev sdc, sector 426583424
> [42949392.230000] printk: 515 messages suppressed.
> [42949392.230000] Buffer I/O error on device sdc, logical block 213291712
> [42949392.230000] sd 0:0:2:1: Device not ready.
> [42949392.230000] end_request: I/O error, dev sdc, sector 426583424
> [42949392.230000] sd 0:0:3:1: Device not ready.
> [42949392.230000] end_request: I/O error, dev sdd, sector 426583424
> [42949392.230000] sd 0:0:3:1: Device not ready.
> [42949392.230000] end_request: I/O error, dev sdd, sector 426583424
>
>
> And when I put LVM on top of the multipath device I get lots
> of errors:
>
> sfeehan@dogwood:~$ sudo pvcreate /dev/mapper/red
>   Physical volume "/dev/mapper/red" successfully created
> sfeehan@dogwood:~$ sudo vgcreate vg0 /dev/mapper/red
>   Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sdb1 not /dev/sda1
>   /dev/sdc: read failed after 0 of 2048 at 0: Input/output error
>   /dev/sdd: read failed after 0 of 2048 at 0: Input/output error
>   Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sda1 not /dev/sdb1
>   Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sdb1 not /dev/sda1
>   /dev/sdc: read failed after 0 of 1024 at 0: Input/output error
>   /dev/sdc: read failed after 0 of 2048 at 0: Input/output error
>   /dev/sdd: read failed after 0 of 1024 at 0: Input/output error
>   /dev/sdd: read failed after 0 of 2048 at 0: Input/output error
>   Volume group "vg0" successfully created
> sfeehan@dogwood:~$
> sfeehan@dogwood:~$
> sfeehan@dogwood:~$ sudo lvcreate -L 10G -n test vg0
>   Found duplicate PV qDhK4gPTGtzXAm7jPQf0uf07a0FQs2qL: using /dev/sdb1 not /dev/sda1
>   /dev/sdc: read failed after 0 of 2048 at 0: Input/output error
>   /dev/sdd: read failed after 0 of 2048 at 0: Input/output error
>   Logical volume "test" created
> sfeehan@dogwood:~$
>
>
> So... what are the errors about? Are they significant? What
> if anything can I do to resolve them?
>
> Thanks.
>
> Steve
>
>
> [1] [dm-devel] StorageWorks multipath support
>     http://www.redhat.com/archives/dm-devel/2005-June/msg00105.html
>
> [2] [dm-devel] [PATCH RFT/RFC 1/4] export block layer helpers
>     http://www.redhat.com/archives/dm-devel/2005-September/msg00097.html
>
> [3] [dm-devel] [PATCH RFT/RFC 2/4] add dm_scsi helpers
>     http://www.redhat.com/archives/dm-devel/2005-September/msg00098.html
>
> [4] [dm-devel] [PATCH RFT/RFC 3/4] convert dm emc
>     http://www.redhat.com/archives/dm-devel/2005-September/msg00099.html
>
> [5] [dm-devel] [PATCH RFT/RFC 4/4] add dm-hp-sw hw handler
>     http://www.redhat.com/archives/dm-devel/2005-September/msg00100.html
>
>   
Overall, that's very good howto material, if you care to feed the wiki.

The I/O errors come from the LVM tools scanning the individual paths,
including the unusable ones, when they should only look at the
multipath devices.

You should blacklist the individual paths in the LVM config.


Regards,
cvaroqui

* Re: multipath and StorageWorks HSG80
From: Steve Feehan @ 2006-04-20 20:44 UTC
  To: device-mapper development

On Thu, Apr 20, 2006 at 10:07:50PM +0200, Christophe Varoqui wrote:

> Overall, that's very good howto material, if you care to feed the wiki.

OK, I will see what I can do.

> The IO errors are due to lvm tools trying to scan the individual paths, 
> including those not usable, where they should only care about the 
> multipath devices.
>
> You should blacklist the individual path in the lvm config.

Yes, I put the following in /etc/lvm/lvm.conf:

filter = [ "r|/dev/cdrom|", "r|/dev/sd|" ]

And that makes the errors go away for the LVM case. But I still 
get the kernel errors during bootup.
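
For illustration, here is a rough sketch of how such a filter is
evaluated, assuming LVM's documented behaviour: patterns are tried in
order, the first match decides, and a device that matches nothing is
accepted. The helper name lvm_filter_verdict is made up for this
example and is not part of LVM; it only mimics the matching so the
effect of the two patterns above can be checked by hand.

```shell
#!/bin/sh
# Hypothetical helper (not an LVM tool): emulate first-match-wins
# evaluation of lvm.conf filter patterns for a single device path.
# Usage: lvm_filter_verdict DEVICE 'r|regex|' 'a|regex|' ...
lvm_filter_verdict() {
    dev=$1
    shift
    for pat in "$@"; do
        action=${pat%%|*}      # leading 'a' (accept) or 'r' (reject)
        regex=${pat#?|}        # strip the action letter and first '|'
        regex=${regex%|}       # strip the trailing '|'
        if printf '%s\n' "$dev" | grep -Eq -- "$regex"; then
            if [ "$action" = a ]; then echo accept; else echo reject; fi
            return 0
        fi
    done
    echo accept                # no pattern matched: accepted by default
}

# The two patterns from the filter line above.
lvm_filter_verdict /dev/sdc        'r|/dev/cdrom|' 'r|/dev/sd|'
lvm_filter_verdict /dev/mapper/red 'r|/dev/cdrom|' 'r|/dev/sd|'
```

Under those assumptions /dev/sdc is rejected, while /dev/mapper/red
matches neither pattern and falls through to the default accept, which
is why LVM still finds the multipath device.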

It's too soon to say for sure, but so far I don't seem to be
getting any errors once the system has booted. Perhaps it's just
during the early stages of booting and not an issue after that?

If anyone would like to see the full dmesg output for a boot
I'd be glad to send it.

And I haven't even begun to test actually failing over the
controllers. I imagine I'll have more problems/questions shortly. :)

Thanks.

-- 
Steve Feehan

* Re: multipath and StorageWorks HSG80
From: Steve Feehan @ 2006-04-20 20:47 UTC
  To: device-mapper development

On Thu, Apr 20, 2006 at 10:02:07PM +0200, Bernd Zeimetz wrote:
> Steve Feehan wrote:
> > Hello. I'm trying to figure out multipath on a StorageWorks HSG80
> > controller. I'm using Ubuntu "Dapper" with the following software
> >   
> there was a thread about the HSG80s few days ago.
> http://www.redhat.com/archives/dm-devel/2006-April/msg00025.html
> probably that answers all your questions.

Yes, I saw that. The particular problem that I saw there
was the misreporting of unit size. The Ubuntu init script
takes care of this and all of my units report the same 
size.

> Actually I'd really like to have them supported well, chances are good
> that we'll get some of them soon :\
> 
> 
> Best regards,
> 
> 
> Bernd
> 
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel

-- 
Steve Feehan

* Re: multipath and StorageWorks HSG80
From: Christophe Varoqui @ 2006-04-20 20:53 UTC
  To: device-mapper development

Steve Feehan wrote:
> On Thu, Apr 20, 2006 at 10:02:07PM +0200, Bernd Zeimetz wrote:
>   
>> Steve Feehan wrote:
>>     
>>> Hello. I'm trying to figure out multipath on a StorageWorks HSG80
>>> controller. I'm using Ubuntu "Dapper" with the following software
>>>   
>>>       
>> there was a thread about the HSG80s few days ago.
>> http://www.redhat.com/archives/dm-devel/2006-April/msg00025.html
>> probably that answers all your questions.
>>     
>
> Yes, I saw that. The particular problem that I saw there
> was the misreporting of unit size. The Ubuntu init script
> takes care of this and all of my units report the same 
> size.
>
>   
That's certainly where the I/O errors are triggered at bootup.
Can you send the code snippet?

Regards,
cvaroqui

* Re: multipath and StorageWorks HSG80
From: Steve Feehan @ 2006-04-20 21:05 UTC
  To: device-mapper development

On Thu, Apr 20, 2006 at 10:53:00PM +0200, Christophe Varoqui wrote:
> Steve Feehan wrote:
> >On Thu, Apr 20, 2006 at 10:02:07PM +0200, Bernd Zeimetz wrote:
> >  
> >>Steve Feehan wrote:
> >>    
> >>>Hello. I'm trying to figure out multipath on a StorageWorks HSG80
> >>>controller. I'm using Ubuntu "Dapper" with the following software
> >>>  
> >>>      
> >>there was a thread about the HSG80s few days ago.
> >>http://www.redhat.com/archives/dm-devel/2006-April/msg00025.html
> >>probably that answers all your questions.
> >>    
> >
> >Yes, I saw that. The particular problem that I saw there
> >was the misreporting of unit size. The Ubuntu init script
> >takes care of this and all of my units report the same 
> >size.
> >
> >  
> That's certainly where the IO errors are triggered at bootup.
> Can you send the code snippet ?

You mean from the init script? Here it is:

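hsg80_init() {
        # Size (in sectors) that the not-yet-started units appear to report
        dummy_capa=2097152

        for i in $(grep -rl "$dummy_capa" /sys/block/sd*/size | awk -F/ '{print $4}')
        do
                echo
                echo "##################################"
                echo "FIXING HSG80 device: $i"
                echo
                sg_start -start /dev/$i
                sleep 1
                # Space before '>' so that "1" is written, not parsed as fd 1
                echo 1 > /sys/block/$i/device/rescan
        done
}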

I put the echo's in so that I could see if and for which devices
it was being called. And yes, it's called for the two "ghost"
devices (sdc and sdd).
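
The selection step of that function can be sketched on its own. This
is a hypothetical variant, not the Ubuntu script: it compares the
contents of each size file exactly instead of grepping for a
substring, and it takes the sysfs root as a parameter (an assumption
added here so the function can be tried against a scratch directory).

```shell
#!/bin/sh
# Hypothetical helper: list sd* block devices whose reported size
# equals the placeholder capacity used by the init script above.
# $1: sysfs root (defaults to /sys); $2: size in sectors to look for.
find_dummy_capacity_disks() {
    sysroot=${1:-/sys}
    dummy_capa=${2:-2097152}
    for f in "$sysroot"/block/sd*/size; do
        [ -r "$f" ] || continue            # glob matched nothing readable
        if [ "$(cat "$f")" = "$dummy_capa" ]; then
            basename "$(dirname "$f")"     # e.g. .../block/sdc/size -> sdc
        fi
    done
}
```

On the system described here, calling find_dummy_capacity_disks with
no arguments before the fix runs would be expected to print the two
ghost devices, but I have not tested this variant on the HSG80.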


But if you look at boot output, the errors begin well before
the multipath-tools init script runs. 

Here is a trimmed excerpt:


[42949387.600000] qla2200 0000:03:01.0: Found an ISP2200, irq 169, iobase 0xf8826000
[42949387.700000] qla2200 0000:03:01.0: Configuring PCI space...
[42949387.780000] qla2200 0000:03:01.0: Configure NVRAM parameters...
[42949387.940000] qla2200 0000:03:01.0: Verifying loaded RISC code...
[42949388.200000] qla2200 0000:03:01.0: LIP reset occured (0).
[42949388.270000] qla2200 0000:03:01.0: Waiting for LIP to complete...
[42949390.910000] qla2200 0000:03:01.0: LOOP UP detected (1 Gbps).
[42949390.990000] qla2200 0000:03:01.0: Topology - (F_Port), Host Loop address 0xffff
[42949391.170000] scsi0 : qla2xxx
[42949391.210000] qla2200 0000:03:01.0:
[42949391.210000]  QLogic Fibre Channel HBA Driver: 8.01.03-k
[42949391.210000]   QLogic QLA22xx -
[42949391.210000]   ISP2200: PCI (66 MHz) @ 0000:03:01.0 hdma-, host#=0, fw=2.02.08 TP
[42949391.480000] scsi: unknown device type 12
[42949391.540000]   Vendor: DEC       Model: HSG80CCL          Rev: V87F
[42949391.620000]   Type:   RAID                               ANSI SCSI revision: 02
[42949391.730000]   Vendor: DEC       Model: HSG80             Rev: V87F
[42949391.810000]   Type:   Direct-Access                      ANSI SCSI revision: 02
[42949391.910000] usbcore: registered new driver usbfs
[42949391.910000] scsi: unknown device type 12
[42949391.910000]   Vendor: DEC       Model: HSG80CCL          Rev: V87F
[42949391.910000]   Type:   RAID                               ANSI SCSI revision: 02
[42949391.920000]   Vendor: DEC       Model: HSG80             Rev: V87F
[42949391.920000]   Type:   Direct-Access                      ANSI SCSI revision: 02
[42949392.400000] usbcore: registered new driver hub
[42949392.470000] scsi: unknown device type 12
[42949392.520000]   Vendor: DEC       Model: HSG80CCL          Rev: V87F
[42949392.610000]   Type:   RAID                               ANSI SCSI revision: 02
[42949392.710000] ACPI: PCI Interrupt 0000:01:00.0[D] -> GSI 19 (level, low) -> IRQ 177
[42949394.820000] SCSI device sda: 426583554 512-byte hdwr sectors (218411 MB)
[42949394.910000] SCSI device sda: drive cache: write through
[42949394.980000]  sda: unknown partition table
[42949395.050000] sd 0:0:0:1: Attached scsi disk sda
[42949395.120000] SCSI device sdb: 426583554 512-byte hdwr sectors (218411 MB)
[42949395.210000] SCSI device sdb: drive cache: write through
[42949395.290000] SCSI device sdb: 426583554 512-byte hdwr sectors (218411 MB)
[42949395.380000] SCSI device sdb: drive cache: write through
[42949395.450000]  sdb: unknown partition table
[42949395.510000] sd 0:0:1:1: Attached scsi disk sdb
[42949395.580000] sdc : READ CAPACITY failed.
[42949395.580000] sdc : status=1, message=00, host=0, driver=08
[42949395.710000] sd: Current: sense key: Not Ready
[42949395.770000]     Additional sense: Logical unit not ready, initializing cmd. required
[42949395.870000] sdc: asking for cache data failed
[42949395.930000] sdc: assuming drive cache: write through
[42949396.000000] sdc : READ CAPACITY failed.
[42949396.000000] sdc : status=1, message=00, host=0, driver=08
[42949396.130000] sd: Current: sense key: Not Ready
[42949396.190000]     Additional sense: Logical unit not ready, initializing cmd. required
[42949396.300000] sdc: asking for cache data failed
[42949396.360000] sdc: assuming drive cache: write through
[42949396.430000]  sdc:<6>sd 0:0:2:1: Device not ready.
[42949396.490000] end_request: I/O error, dev sdc, sector 0
[42949396.560000] Buffer I/O error on device sdc, logical block 0
[42949396.640000] sd 0:0:2:1: Device not ready.
[42949396.700000] end_request: I/O error, dev sdc, sector 0
[42949396.770000] Buffer I/O error on device sdc, logical block 0
[42949396.840000]  unable to read partition table
[42949396.900000] sd 0:0:2:1: Attached scsi disk sdc
[42949396.970000] sdd : READ CAPACITY failed.
[42949396.970000] sdd : status=1, message=00, host=0, driver=08
[42949397.100000] sd 0:0:2:1: Device not ready.
[42949397.100000] end_request: I/O error, dev sdc, sector 2097024
[42949397.100000] Buffer I/O error on device sdc, logical block 262128
[42949397.310000] sd 0:0:2:1: Device not ready.


.... and then hundreds of lines of such errors ....

Begin: Running /scripts/local-bottom ...
Done.
Done.
Begin: Running /scripts/init-bottom ...
Done.
 * version 2.86 booting
Starting multipath
##################################
FIXING HSG80 device: sdc

[42949413.900000] SCSI device sdc: 426583554 512-byte hdwr sectors (218411 MB)
[42949413.990000] SCSI device sdc: drive cache: write through

##################################
FIXING HSG80 device: sdd

[42949415.090000] SCSI device sdd: 426583554 512-byte hdwr sectors (218411 MB)
[42949415.180000] SCSI device sdd: drive cache: write through
[42949415.440000] device-mapper: hp_sw version 0.4 loaded
[42949415.520000] device-mapper: dm-round-robin version 1.0.0 loaded
 done.
 * Starting basic networking...
 * [ ok ]
 * Starting kernel event manager...
 * [ ok ]
 * Loading hardware drivers... [42949416.810000] device-mapper: hp_sw:
 * queueing START_STOP command on 8:16
[42949417.030000]  0:0:0:0: Attached scsi generic sg0 type 12
[42949417.100000] sd 0:0:0:1: Attached scsi generic sg1 type 0
[42949417.170000]  0:0:1:0: Attached scsi generic sg2 type 12
[42949417.240000] sd 0:0:1:1: Attached scsi generic sg3 type 0
[42949417.320000] sd 0:0:3:1: Device not ready.
[42949417.320000] end_request: I/O error, dev sdd, sector 426583424
[42949417.320000] printk: 40 messages suppressed.


.... and the errors continue for the rest of the boot process.
Once the file system is mounted, no more errors. At least not
for the short period that the system has been up.

-- 
Steve Feehan

* Re: multipath and StorageWorks HSG80
From: Eddie Williams @ 2006-04-20 21:10 UTC
  To: device-mapper development

On Thu, 2006-04-20 at 16:44 -0400, Steve Feehan wrote:
> On Thu, Apr 20, 2006 at 10:07:50PM +0200, Christophe Varoqui wrote:

> And that makes the errors go away for the LVM case. But I still 
> get the kernel errors during bootup.
> 

Did you rebuild your RAM disk?  If not it will scan LVM using the conf
file from the last build.

* Re: multipath and StorageWorks HSG80
From: Christophe Varoqui @ 2006-04-20 21:22 UTC
  To: device-mapper development


>> That's certainly where the IO errors are triggered at bootup.
>> Can you send the code snippet ?
>>     
>
> You mean from the init script? Here it is:
>
> hsg80_init() {
>         dummy_capa=2097152
>
>         for i in $(grep -rl "$dummy_capa" /sys/block/sd*/size | awk -F/ '{print $4}')
>         do
>                 echo
>                 echo "##################################"
>                 echo "FIXING HSG80 device: $i"
>                 echo
>                 sg_start -start /dev/$i
>                 sleep 1
>                 echo 1 > /sys/block/$i/device/rescan
>         done
> }
>
> I put the echo's in so that I could see if and for which devices
> it was being called. And yes, it's called for the two "ghost"
> devices (sdc and sdd).
>
>
> But if you look at boot output, the errors begin well before
> the multipath-tools init script runs. 
>
>   
Right, this script can't be the culprit.
> Here is a trimmed excerpt:
>
>
> [42949395.380000] SCSI device sdb: drive cache: write through
> [42949395.450000]  sdb: unknown partition table
> [42949395.510000] sd 0:0:1:1: Attached scsi disk sdb
> [42949395.580000] sdc : READ CAPACITY failed.
> [42949395.580000] sdc : status=1, message=00, host=0, driver=08
> [42949395.710000] sd: Current: sense key: Not Ready
> [42949395.770000]     Additional sense: Logical unit not ready, initializing cmd. required
> [42949395.870000] sdc: asking for cache data failed
> [42949395.930000] sdc: assuming drive cache: write through
> [42949396.000000] sdc : READ CAPACITY failed.
> [42949396.000000] sdc : status=1, message=00, host=0, driver=08
> [42949396.130000] sd: Current: sense key: Not Ready
> [42949396.190000]     Additional sense: Logical unit not ready, initializing cmd. required
> [42949396.300000] sdc: asking for cache data failed
> [42949396.360000] sdc: assuming drive cache: write through
> [42949396.430000]  sdc:<6>sd 0:0:2:1: Device not ready.
> [42949396.490000] end_request: I/O error, dev sdc, sector 0
>   
The SCSI driver's probing seems to fail with I/O errors on the ghost paths.
If so, you can consider them harmless.

Regards,
cvaroqui

* Re: multipath and StorageWorks HSG80
From: Steve Feehan @ 2006-04-21 13:40 UTC
  To: device-mapper development

On Thu, Apr 20, 2006 at 05:10:44PM -0400, Eddie Williams wrote:
> On Thu, 2006-04-20 at 16:44 -0400, Steve Feehan wrote:
> > On Thu, Apr 20, 2006 at 10:07:50PM +0200, Christophe Varoqui wrote:
> 
> > And that makes the errors go away for the LVM case. But I still 
> > get the kernel errors during bootup.
> > 
> 
> Did you rebuild your RAM disk?  If not it will scan LVM using the conf
> file from the last build.

No, I hadn't considered that. But looking at the content of the
initramfs there is not a copy of lvm.conf in there.

But I have since rebuilt the initramfs and it did not make a
difference.

I have not customized the default Ubuntu initramfs (other than
building it for the patched kernel that I needed to build to get
the dm-hp-sw module). Should I be doing anything special in the
initramfs? Note that I'm not trying to put the / file system on
a multipath device... I'm not nearly that ambitious. :)

Also, I noticed this when the device-mapper modules are loaded
from the initramfs:


Begin: Loading e[42949384.440000] SCSI subsystem initialized
ssential drivers[42949384.500000] device-mapper: 4.4.0-ioctl
(2005-01-12) initialised: dm-devel@redhat.com
... ...
[42949384.620000] device-mapper: dm-multipath version 1.0.4 loaded
[42949384.710000] dm_hp_sw: Unknown parameter `dm-hp-sw.c'
Done.


But looking at the boot text, there was something wrong with the
loading of dm-hp-sw from the initramfs. It appears that the
dm-hp-sw module was not successfully loaded until much later in
the boot sequence.

I tracked this down and fixed the problem. But still, even with
dm-hp-sw loading before the SCSI devices are probed, I get a ton
of SCSI errors trying to access the "ghost" devices.

-- 
Steve Feehan

* Re: multipath and StorageWorks HSG80
From: Philip R. Auld @ 2006-04-21 14:09 UTC
  To: device-mapper development

Rumor has it that on Fri, Apr 21, 2006 at 09:40:55AM -0400 Steve Feehan said:
> 
> I tracked this down and fixed the problem. But still, even with
> dm-hp-sw loading before the SCSI devices are probed, I get a ton
> of SCSI errors trying to access the "ghost" devices.

Those should be expected. The SCSI layers are trying to configure the
devices (read capacity, partition tables etc). The dm-hp-sw module
is not in that path so it can't do anything about that. They should be
harmless. 
 
This is an architectural artifact of doing the multipathing above the
SCSI layer.
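
To check that the errors really are confined to the ghost paths, the
end_request lines can be tallied per device. This is a minimal sketch,
assuming the kernel log format shown earlier in this thread;
count_io_errors_by_dev is a hypothetical helper name, not an existing
tool.

```shell
#!/bin/sh
# Hypothetical helper: read kernel log text on stdin and print
# "<device> <count>" for every "end_request: I/O error, dev X" line.
count_io_errors_by_dev() {
    awk '/end_request: I\/O error, dev / {
             for (i = 1; i < NF; i++)
                 if ($i == "dev") {
                     d = $(i + 1)
                     sub(/,$/, "", d)     # "sdc," -> "sdc"
                     n[d]++
                 }
         }
         END { for (d in n) print d, n[d] }' | sort
}

# Typical use: dmesg | count_io_errors_by_dev
```

If only the ghost devices (sdc and sdd in Steve's setup) show up in
the output, that matches the explanation above.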


Cheers,

Phil

-- 
Philip R. Auld, Ph.D.  	        	       Egenera, Inc.    
Software Architect                            165 Forest St.
(508) 858-2628                            Marlboro, MA 01752

* Re: multipath and StorageWorks HSG80
From: Steve Feehan @ 2006-04-21 14:39 UTC
  To: device-mapper development

On Fri, Apr 21, 2006 at 10:09:42AM -0400, Philip R. Auld wrote:
> Rumor has it that on Fri, Apr 21, 2006 at 09:40:55AM -0400 Steve Feehan said:
> > 
> > I tracked this down and fixed the problem. But still, even with
> > dm-hp-sw loading before the SCSI devices are probed, I get a ton
> > of SCSI errors trying to access the "ghost" devices.
> 
> Those should be expected. The SCSI layers are trying to configure the
> devices (read capacity, partition tables etc). The dm-hp-sw module
> is not in that path so it can't do anything about that. They should be
> harmless. 
>  
> This is an architectural artifact of doing the multipathing above the
> SCSI layer.

OK. Then as Christophe Varoqui also suggested, I will just /try/
to not worry too much. ;)

Thanks.

-- 
Steve Feehan
