All of lore.kernel.org
 help / color / mirror / Atom feed
* Question regarding failback option in multipath.conf
@ 2009-09-30 16:41 Eli Klein
  2009-09-30 18:51 ` Eli Klein
  2009-09-30 19:07 ` John Brier
  0 siblings, 2 replies; 11+ messages in thread
From: Eli Klein @ 2009-09-30 16:41 UTC (permalink / raw)
  To: dm-devel


[-- Attachment #1.1: Type: text/plain, Size: 2925 bytes --]

I'm running into an issue where the failback option in multipath.conf  
is being ignored.  I've tried to set this to a value > 0 to cause dm  
to pause before reinstating a recovered path, but it reinstates the  
path immediately.  The same is true when setting the value to  
"manual", the path is reinstated immediately.  The delay is most  
useful when a ethernet switch has failed or rebooted.  During the boot  
of the switch, the interface comes up and drops again multiple times.   
I'd love to be able to introduce a delay before DM marks the path as  
reinstated after a recovery.  This would save me the hassle of running  
into I/O errors (and occasional filesystem RO) as the path bounces  
while the switch is booting.

Thanks in advance for any help or suggestions!

-Eli

I've included my multipath.conf as well as output from multipath -ll  
below:

multipath.conf:

blacklist {
         devnode "sda$"
#        devnode "*"
}

## By default, devices with vendor = "IBM" and product = "S/390.*" are
## blacklisted. To enable mulitpathing on these devies, uncomment the
## following lines.
#blacklist_exceptions {
#	device {
#		vendor	"IBM"
#		product	"S/390.*"
#	}
#}

## Use user friendly names, instead of using WWIDs as names.
defaults {
	user_friendly_names yes
}
##
## Here is an example of how to configure some standard options.
##
#
defaults {
	udev_dir		/dev
	polling_interval 	10
	selector		"round-robin 0"
	path_grouping_policy	multibus
	getuid_callout		"/sbin/scsi_id -g -u -s /block/%n"
	prio_callout		/bin/true
	path_checker		readsector0
	rr_min_io		10
	rr_weight		priorities
	failback		120
	features		"1 queue_if_no_path"
	no_path_retry		1
	user_friendly_name	yes
}

multipath -ll:

mpath2 (36000eb3945b8703a00000000000000f4) dm-3 LEFTHAND,iSCSIDisk
[size=400G][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=0][active]
  \_ 192:0:0:0 sdl 8:176 [active][ready]
  \_ 191:0:0:0 sdf 8:80  [active][ready]
mpath152 (36000eb3945b8703a0000000000001431) dm-2 LEFTHAND,iSCSIDisk
[size=300G][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=0][active]
  \_ 186:0:0:0 sdh 8:112 [active][ready]
  \_ 185:0:0:0 sde 8:64  [active][ready]
mpath149 (36000eb3945b8703a0000000000001429) dm-5 LEFTHAND,iSCSIDisk
[size=300G][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=0][active]
  \_ 189:0:0:0 sdc 8:32  [active][ready]
  \_ 190:0:0:0 sdi 8:128 [active][ready]
mpath151 (36000eb3945b8703a000000000000142d) dm-6 LEFTHAND,iSCSIDisk
[size=300G][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=0][active]
  \_ 188:0:0:0 sdj 8:144 [active][ready]
  \_ 187:0:0:0 sdg 8:96  [active][ready]
mpath150 (36000eb3945b8703a0000000000001435) dm-4 LEFTHAND,iSCSIDisk
[size=200G][features=1 queue_if_no_path][hwhandler=0][rw]
\_ round-robin 0 [prio=0][active]
  \_ 184:0:0:0 sdk 8:160 [active][ready]
  \_ 183:0:0:0 sdd 8:48  [active][ready]

[-- Attachment #1.2: Type: text/html, Size: 14680 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-09-30 16:41 Question regarding failback option in multipath.conf Eli Klein
@ 2009-09-30 18:51 ` Eli Klein
  2009-09-30 19:07 ` John Brier
  1 sibling, 0 replies; 11+ messages in thread
From: Eli Klein @ 2009-09-30 18:51 UTC (permalink / raw)
  To: dm-devel


[-- Attachment #1.1: Type: text/plain, Size: 3350 bytes --]

Sorry, left off a few key details:

I'm running the following:

redhat 5.3 x86_64
device-mapper-1.02.28-2.el5
device-mapper-multipath-0.4.7-23.el5_3.4

Let me know if there are other details that would be helpful.

-Eli

On Sep 30, 2009, at 10:41 AM, Eli Klein wrote:

> I'm running into an issue where the failback option in  
> multipath.conf is being ignored.  I've tried to set this to a value  
> > 0 to cause dm to pause before reinstating a recovered path, but it  
> reinstates the path immediately.  The same is true when setting the  
> value to "manual", the path is reinstated immediately.  The delay is  
> most useful when a ethernet switch has failed or rebooted.  During  
> the boot of the switch, the interface comes up and drops again  
> multiple times.  I'd love to be able to introduce a delay before DM  
> marks the path as reinstated after a recovery.  This would save me  
> the hassle of running into I/O errors (and occasional filesystem RO)  
> as the path bounces while the switch is booting.
>
> Thanks in advance for any help or suggestions!
>
> -Eli
>
> I've included my multipath.conf as well as output from multipath -ll  
> below:
>
> multipath.conf:
>
> blacklist {
>         devnode "sda$"
> #        devnode "*"
> }
>
> ## By default, devices with vendor = "IBM" and product = "S/390.*" are
> ## blacklisted. To enable mulitpathing on these devies, uncomment the
> ## following lines.
> #blacklist_exceptions {
> #	device {
> #		vendor	"IBM"
> #		product	"S/390.*"
> #	}
> #}
>
> ## Use user friendly names, instead of using WWIDs as names.
> defaults {
> 	user_friendly_names yes
> }
> ##
> ## Here is an example of how to configure some standard options.
> ##
> #
> defaults {
> 	udev_dir		/dev
> 	polling_interval 	10
> 	selector		"round-robin 0"
> 	path_grouping_policy	multibus
> 	getuid_callout		"/sbin/scsi_id -g -u -s /block/%n"
> 	prio_callout		/bin/true
> 	path_checker		readsector0
> 	rr_min_io		10
> 	rr_weight		priorities
> 	failback		120
> 	features		"1 queue_if_no_path"
> 	no_path_retry		1
> 	user_friendly_name	yes
> }
>
> multipath -ll:
>
> mpath2 (36000eb3945b8703a00000000000000f4) dm-3 LEFTHAND,iSCSIDisk
> [size=400G][features=1 queue_if_no_path][hwhandler=0][rw]
> \_ round-robin 0 [prio=0][active]
>  \_ 192:0:0:0 sdl 8:176 [active][ready]
>  \_ 191:0:0:0 sdf 8:80  [active][ready]
> mpath152 (36000eb3945b8703a0000000000001431) dm-2 LEFTHAND,iSCSIDisk
> [size=300G][features=1 queue_if_no_path][hwhandler=0][rw]
> \_ round-robin 0 [prio=0][active]
>  \_ 186:0:0:0 sdh 8:112 [active][ready]
>  \_ 185:0:0:0 sde 8:64  [active][ready]
> mpath149 (36000eb3945b8703a0000000000001429) dm-5 LEFTHAND,iSCSIDisk
> [size=300G][features=1 queue_if_no_path][hwhandler=0][rw]
> \_ round-robin 0 [prio=0][active]
>  \_ 189:0:0:0 sdc 8:32  [active][ready]
>  \_ 190:0:0:0 sdi 8:128 [active][ready]
> mpath151 (36000eb3945b8703a000000000000142d) dm-6 LEFTHAND,iSCSIDisk
> [size=300G][features=1 queue_if_no_path][hwhandler=0][rw]
> \_ round-robin 0 [prio=0][active]
>  \_ 188:0:0:0 sdj 8:144 [active][ready]
>  \_ 187:0:0:0 sdg 8:96  [active][ready]
> mpath150 (36000eb3945b8703a0000000000001435) dm-4 LEFTHAND,iSCSIDisk
> [size=200G][features=1 queue_if_no_path][hwhandler=0][rw]
> \_ round-robin 0 [prio=0][active]
>  \_ 184:0:0:0 sdk 8:160 [active][ready]
>  \_ 183:0:0:0 sdd 8:48  [active][ready]


[-- Attachment #1.2: Type: text/html, Size: 15290 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-09-30 16:41 Question regarding failback option in multipath.conf Eli Klein
  2009-09-30 18:51 ` Eli Klein
@ 2009-09-30 19:07 ` John Brier
  2009-09-30 19:16   ` Eli Klein
  1 sibling, 1 reply; 11+ messages in thread
From: John Brier @ 2009-09-30 19:07 UTC (permalink / raw)
  To: device-mapper development

On 09/30/2009 12:41 PM, Eli Klein wrote:
> I'm running into an issue where the failback option in multipath.conf is
> being ignored. I've tried to set this to a value > 0 to cause dm to
> pause before reinstating a recovered path, but it reinstates the path
> immediately. The same is true when setting the value to "manual", the
> path is reinstated immediately. The delay is most useful when a ethernet
> switch has failed or rebooted. During the boot of the switch, the
> interface comes up and drops again multiple times. I'd love to be able
> to introduce a delay before DM marks the path as reinstated after a
> recovery. This would save me the hassle of running into I/O errors (and
> occasional filesystem RO) as the path bounces while the switch is booting.
>
> Thanks in advance for any help or suggestions!
>
> -Eli
>
> I've included my multipath.conf as well as output from multipath -ll below:
>
> multipath.conf:
>
> blacklist {
> devnode "sda$"
> # devnode "*"
> }
>
> ## By default, devices with vendor = "IBM" and product = "S/390.*" are
> ## blacklisted. To enable mulitpathing on these devies, uncomment the
> ## following lines.
> #blacklist_exceptions {
> # device {
> # vendor "IBM"
> # product "S/390.*"
> # }
> #}
>
> ## Use user friendly names, instead of using WWIDs as names.
> defaults {
> user_friendly_names yes
> }
> ##
> ## Here is an example of how to configure some standard options.
> ##
> #
> defaults {
> udev_dir /dev
> polling_interval 10
> selector "round-robin 0"
> path_grouping_policy multibus
> getuid_callout "/sbin/scsi_id -g -u -s /block/%n"
> prio_callout /bin/true
> path_checker readsector0
> rr_min_io 10
> rr_weight priorities
> failback 120
> features "1 queue_if_no_path"
> no_path_retry 1
> user_friendly_name yes
> }

It looks like you have two defaults {} sections. Is that allowed? 
Maybe multipath is only picking up the first one? I'd remove the first 
one/comment out and put user_friendly_names yes in the second one.

Also I think multipath -v3 -d will show you what settings would be 
applied so you don't have to test the behavior by cycling your 
ethernet switch.

For example on one of my systems multipath -v3 shows
mpath1: pgfailback = -2 (controller setting)


After you have it configured right run multipath -v3 without the -d 
(dry run)

Can someone confirm if pgfailback from multipath -v3 output indicates 
the current setting of failback that multipath is using?

John Brier

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-09-30 19:07 ` John Brier
@ 2009-09-30 19:16   ` Eli Klein
  2009-09-30 19:24     ` John Brier
  0 siblings, 1 reply; 11+ messages in thread
From: Eli Klein @ 2009-09-30 19:16 UTC (permalink / raw)
  To: device-mapper development


[-- Attachment #1.1: Type: text/plain, Size: 3338 bytes --]

On Sep 30, 2009, at 1:07 PM, John Brier wrote:

> On 09/30/2009 12:41 PM, Eli Klein wrote:
>> I'm running into an issue where the failback option in  
>> multipath.conf is
>> being ignored. I've tried to set this to a value > 0 to cause dm to
>> pause before reinstating a recovered path, but it reinstates the path
>> immediately. The same is true when setting the value to "manual", the
>> path is reinstated immediately. The delay is most useful when a  
>> ethernet
>> switch has failed or rebooted. During the boot of the switch, the
>> interface comes up and drops again multiple times. I'd love to be  
>> able
>> to introduce a delay before DM marks the path as reinstated after a
>> recovery. This would save me the hassle of running into I/O errors  
>> (and
>> occasional filesystem RO) as the path bounces while the switch is  
>> booting.
>>
>> Thanks in advance for any help or suggestions!
>>
>> -Eli
>>
>> I've included my multipath.conf as well as output from multipath - 
>> ll below:
>>
>> multipath.conf:
>>
>> blacklist {
>> devnode "sda$"
>> # devnode "*"
>> }
>>
>> ## By default, devices with vendor = "IBM" and product = "S/390.*"  
>> are
>> ## blacklisted. To enable mulitpathing on these devies, uncomment the
>> ## following lines.
>> #blacklist_exceptions {
>> # device {
>> # vendor "IBM"
>> # product "S/390.*"
>> # }
>> #}
>>
>> ## Use user friendly names, instead of using WWIDs as names.
>> defaults {
>> user_friendly_names yes
>> }
>> ##
>> ## Here is an example of how to configure some standard options.
>> ##
>> #
>> defaults {
>> udev_dir /dev
>> polling_interval 10
>> selector "round-robin 0"
>> path_grouping_policy multibus
>> getuid_callout "/sbin/scsi_id -g -u -s /block/%n"
>> prio_callout /bin/true
>> path_checker readsector0
>> rr_min_io 10
>> rr_weight priorities
>> failback 120
>> features "1 queue_if_no_path"
>> no_path_retry 1
>> user_friendly_name yes
>> }
>
> It looks like you have two defaults {} sections. Is that allowed?  
> Maybe multipath is only picking up the first one? I'd remove the  
> first one/comment out and put user_friendly_names yes in the second  
> one.
>
> Also I think multipath -v3 -d will show you what settings would be  
> applied so you don't have to test the behavior by cycling your  
> ethernet switch.
>
> For example on one of my systems multipath -v3 shows
> mpath1: pgfailback = -2 (controller setting)
>
>
> After you have it configured right run multipath -v3 without the -d  
> (dry run)
>
> Can someone confirm if pgfailback from multipath -v3 output  
> indicates the current setting of failback that multipath is using?
>
> John Brier

Here's the abbreviated output from multipath -v3:

mpath1: pgfailback = 120 (config file default)
mpath1: pgpolicy = multibus (config file default)
mpath1: selector = round-robin 0 (internal default)
mpath1: features = 1 queue_if_no_path (internal default)
mpath1: hwhandler = 0 (internal default)
mpath1: rr_weight = 2 (config file default)
mpath1: minio = 10 (config file default)
mpath1: no_path_retry = 1 (config file default)
pg_timeout = NONE (internal default)
mpath1: set ACT_NOTHING (map unchanged)

It's definitely picking up the settings correctly from multipath.conf,  
but the paths are still reinstated immediately once they're  
reconnected (even if it's just for a split second).

-Eli

[-- Attachment #1.2: Type: text/html, Size: 5949 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-09-30 19:16   ` Eli Klein
@ 2009-09-30 19:24     ` John Brier
  2009-09-30 19:36       ` Eli Klein
  0 siblings, 1 reply; 11+ messages in thread
From: John Brier @ 2009-09-30 19:24 UTC (permalink / raw)
  To: device-mapper development

On 09/30/2009 03:16 PM, Eli Klein wrote:
>
> Here's the abbreviated output from multipath -v3:
>
> mpath1: pgfailback = 120 (config file default)
> mpath1: pgpolicy = multibus (config file default)
> mpath1: selector = round-robin 0 (internal default)
> mpath1: features = 1 queue_if_no_path (internal default)
> mpath1: hwhandler = 0 (internal default)
> mpath1: rr_weight = 2 (config file default)
> mpath1: minio = 10 (config file default)
> mpath1: no_path_retry = 1 (config file default)
> pg_timeout = NONE (internal default)
> mpath1: set ACT_NOTHING (map unchanged)
>
> It's definitely picking up the settings correctly from multipath.conf,
> but the paths are still reinstated immediately once they're reconnected
> (even if it's just for a split second).
>

What does /var/log/messages look like when this happens? Since you 
changed failback to 120 did you restart multipathd or restart the box?

John Brier

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-09-30 19:24     ` John Brier
@ 2009-09-30 19:36       ` Eli Klein
  2009-10-01 21:16         ` Eli Klein
  0 siblings, 1 reply; 11+ messages in thread
From: Eli Klein @ 2009-09-30 19:36 UTC (permalink / raw)
  To: device-mapper development


[-- Attachment #1.1: Type: text/plain, Size: 6212 bytes --]

On Sep 30, 2009, at 1:24 PM, John Brier wrote:

> On 09/30/2009 03:16 PM, Eli Klein wrote:
>>
>> Here's the abbreviated output from multipath -v3:
>>
>> mpath1: pgfailback = 120 (config file default)
>> mpath1: pgpolicy = multibus (config file default)
>> mpath1: selector = round-robin 0 (internal default)
>> mpath1: features = 1 queue_if_no_path (internal default)
>> mpath1: hwhandler = 0 (internal default)
>> mpath1: rr_weight = 2 (config file default)
>> mpath1: minio = 10 (config file default)
>> mpath1: no_path_retry = 1 (config file default)
>> pg_timeout = NONE (internal default)
>> mpath1: set ACT_NOTHING (map unchanged)
>>
>> It's definitely picking up the settings correctly from  
>> multipath.conf,
>> but the paths are still reinstated immediately once they're  
>> reconnected
>> (even if it's just for a split second).
>>
>
> What does /var/log/messages look like when this happens? Since you  
> changed failback to 120 did you restart multipathd or restart the box?
>
> John Brier
>
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel

I've tried rebooting, restarting multipath (and iscsid), flushing  
multipath (-F), removing all the iscsi targets completely and then  
reinitializing the targets.  Nothing changes the behavior.  Here's  
output from /var/log/messages.. note the time between when the link  
becomes active (13:31:08) to when the path is reinstated (13:31:15):

Sep 30 13:30:28 testbox kernel:  connection7:0: ping timeout of 5 secs  
expired, recv timeout 5, last rx 4470061909, last ping 4470066909, now  
4470071909
Sep 30 13:30:28 testbox kernel:  connection7:0: detected conn error  
(1011)
Sep 30 13:30:28 testbox multipathd: sdf: readsector0 checker reports  
path is down
Sep 30 13:30:28 testbox multipathd: checker failed path 8:80 in map  
mpath0
Sep 30 13:30:28 testbox multipathd: mpath0: remaining active paths: 1
Sep 30 13:30:28 testbox kernel: device-mapper: multipath: Failing path  
8:80.
Sep 30 13:30:28 testbox iscsid: Kernel reported iSCSI connection 7:0  
error (1011) state (3)
Sep 30 13:30:29 testbox kernel:  connection9:0: ping timeout of 5 secs  
expired, recv timeout 5, last rx 4470063232, last ping 4470068232, now  
4470073232
Sep 30 13:30:29 testbox kernel:  connection9:0: detected conn error  
(1011)
Sep 30 13:30:29 testbox multipathd: sdg: readsector0 checker reports  
path is down
Sep 30 13:30:29 testbox multipathd: checker failed path 8:96 in map  
mpath1
Sep 30 13:30:29 testbox multipathd: mpath1: remaining active paths: 1
Sep 30 13:30:29 testbox kernel: device-mapper: multipath: Failing path  
8:96.
Sep 30 13:30:30 testbox kernel:  connection11:0: ping timeout of 5  
secs expired, recv timeout 5, last rx 4470064373, last ping  
4470069373, now 4470074373
Sep 30 13:30:30 testbox kernel:  connection11:0: detected conn error  
(1011)
Sep 30 13:30:30 testbox iscsid: Kernel reported iSCSI connection 9:0  
error (1011) state (3)
Sep 30 13:30:30 testbox multipathd: sdh: readsector0 checker reports  
path is down
Sep 30 13:30:30 testbox multipathd: checker failed path 8:112 in map  
mpath2
Sep 30 13:30:30 testbox kernel: device-mapper: multipath: Failing path  
8:112.
Sep 30 13:30:30 testbox multipathd: mpath2: remaining active paths: 1
Sep 30 13:30:30 testbox multipathd: dm-4: add map (uevent)
Sep 30 13:30:30 testbox multipathd: dm-4: devmap already registered
Sep 30 13:30:30 testbox multipathd: dm-2: add map (uevent)
Sep 30 13:30:30 testbox multipathd: dm-2: devmap already registered
Sep 30 13:30:30 testbox multipathd: dm-3: add map (uevent)
Sep 30 13:30:30 testbox multipathd: dm-3: devmap already registered
Sep 30 13:30:31 testbox iscsid: Kernel reported iSCSI connection 11:0  
error (1011) state (3)
Sep 30 13:30:43 testbox kernel:  session7: session recovery timed out  
after 15 secs
Sep 30 13:30:43 testbox multipathd: sdf: readsector0 checker reports  
path is down
Sep 30 13:30:44 testbox kernel:  session9: session recovery timed out  
after 15 secs
Sep 30 13:30:44 testbox multipathd: sdg: readsector0 checker reports  
path is down
Sep 30 13:30:45 testbox kernel:  session11: session recovery timed out  
after 15 secs
Sep 30 13:30:45 testbox kernel: sd 13:0:0:0: SCSI error: return code =  
0x000f0000
Sep 30 13:30:45 testbox kernel: end_request: I/O error, dev sdh,  
sector 407978223
Sep 30 13:30:45 testbox multipathd: sdh: readsector0 checker reports  
path is down
Sep 30 13:30:55 testbox multipathd: sdf: readsector0 checker reports  
path is down
Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker reports  
path is down
Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker reports  
path is down
Sep 30 13:31:05 testbox multipathd: sdf: readsector0 checker reports  
path is down
Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker reports  
path is down
Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker reports  
path is down
Sep 30 13:31:08 testbox iscsid: connection9:0 is operational after  
recovery (3 attempts)
Sep 30 13:31:08 testbox iscsid: connection11:0 is operational after  
recovery (3 attempts)
Sep 30 13:31:11 testbox iscsid: connection7:0 is operational after  
recovery (3 attempts)
Sep 30 13:31:15 testbox multipathd: sdf: readsector0 checker reports  
path is up
Sep 30 13:31:15 testbox multipathd: 8:80: reinstated
Sep 30 13:31:15 testbox multipathd: mpath0: remaining active paths: 2
Sep 30 13:31:15 testbox multipathd: sdg: readsector0 checker reports  
path is up
Sep 30 13:31:15 testbox multipathd: 8:96: reinstated
Sep 30 13:31:15 testbox multipathd: mpath1: remaining active paths: 2
Sep 30 13:31:15 testbox multipathd: sdh: readsector0 checker reports  
path is up
Sep 30 13:31:15 testbox multipathd: 8:112: reinstated
Sep 30 13:31:15 testbox multipathd: mpath2: remaining active paths: 2
Sep 30 13:31:15 testbox multipathd: dm-4: add map (uevent)
Sep 30 13:31:15 testbox multipathd: dm-4: devmap already registered
Sep 30 13:31:15 testbox multipathd: dm-2: add map (uevent)
Sep 30 13:31:15 testbox multipathd: dm-2: devmap already registered
Sep 30 13:31:15 testbox multipathd: dm-3: add map (uevent)
Sep 30 13:31:15 testbox multipathd: dm-3: devmap already registered


[-- Attachment #1.2: Type: text/html, Size: 15787 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-09-30 19:36       ` Eli Klein
@ 2009-10-01 21:16         ` Eli Klein
  2009-10-02  0:39           ` John A. Sullivan III
  0 siblings, 1 reply; 11+ messages in thread
From: Eli Klein @ 2009-10-01 21:16 UTC (permalink / raw)
  To: device-mapper development


[-- Attachment #1.1: Type: text/plain, Size: 6483 bytes --]

Any ideas on this? Not sure if this e-mail made it out to the list.
Thanks!

-Eli

On Wed, Sep 30, 2009 at 1:36 PM, Eli Klein <elijah@aclue.com> wrote:

> On Sep 30, 2009, at 1:24 PM, John Brier wrote:
>
> On 09/30/2009 03:16 PM, Eli Klein wrote:
>
>
> Here's the abbreviated output from multipath -v3:
>
>
> mpath1: pgfailback = 120 (config file default)
>
> mpath1: pgpolicy = multibus (config file default)
>
> mpath1: selector = round-robin 0 (internal default)
>
> mpath1: features = 1 queue_if_no_path (internal default)
>
> mpath1: hwhandler = 0 (internal default)
>
> mpath1: rr_weight = 2 (config file default)
>
> mpath1: minio = 10 (config file default)
>
> mpath1: no_path_retry = 1 (config file default)
>
> pg_timeout = NONE (internal default)
>
> mpath1: set ACT_NOTHING (map unchanged)
>
>
> It's definitely picking up the settings correctly from multipath.conf,
>
> but the paths are still reinstated immediately once they're reconnected
>
> (even if it's just for a split second).
>
>
>
> What does /var/log/messages look like when this happens? Since you changed
> failback to 120 did you restart multipathd or restart the box?
>
> John Brier
>
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
>
>
> I've tried rebooting, restarting multipath (and iscsid), flushing multipath
> (-F), removing all the iscsi targets completely and then reinitializing the
> targets.  Nothing changes the behavior.  Here's output from
> /var/log/messages.. note the time between when the link becomes active
> (13:31:08) to when the path is reinstated (13:31:15):
>
> Sep 30 13:30:28 testbox kernel:  connection7:0: ping timeout of 5 secs
> expired, recv timeout 5, last rx 4470061909, last ping 4470066909, now
> 4470071909
> Sep 30 13:30:28 testbox kernel:  connection7:0: detected conn error (1011)
> Sep 30 13:30:28 testbox multipathd: sdf: readsector0 checker reports path
> is down
> Sep 30 13:30:28 testbox multipathd: checker failed path 8:80 in map mpath0
> Sep 30 13:30:28 testbox multipathd: mpath0: remaining active paths: 1
> Sep 30 13:30:28 testbox kernel: device-mapper: multipath: Failing path
> 8:80.
> Sep 30 13:30:28 testbox iscsid: Kernel reported iSCSI connection 7:0 error
> (1011) state (3)
> Sep 30 13:30:29 testbox kernel:  connection9:0: ping timeout of 5 secs
> expired, recv timeout 5, last rx 4470063232, last ping 4470068232, now
> 4470073232
> Sep 30 13:30:29 testbox kernel:  connection9:0: detected conn error (1011)
> Sep 30 13:30:29 testbox multipathd: sdg: readsector0 checker reports path
> is down
> Sep 30 13:30:29 testbox multipathd: checker failed path 8:96 in map mpath1
> Sep 30 13:30:29 testbox multipathd: mpath1: remaining active paths: 1
> Sep 30 13:30:29 testbox kernel: device-mapper: multipath: Failing path
> 8:96.
> Sep 30 13:30:30 testbox kernel:  connection11:0: ping timeout of 5 secs
> expired, recv timeout 5, last rx 4470064373, last ping 4470069373, now
> 4470074373
> Sep 30 13:30:30 testbox kernel:  connection11:0: detected conn error (1011)
> Sep 30 13:30:30 testbox iscsid: Kernel reported iSCSI connection 9:0 error
> (1011) state (3)
> Sep 30 13:30:30 testbox multipathd: sdh: readsector0 checker reports path
> is down
> Sep 30 13:30:30 testbox multipathd: checker failed path 8:112 in map mpath2
> Sep 30 13:30:30 testbox kernel: device-mapper: multipath: Failing path
> 8:112.
> Sep 30 13:30:30 testbox multipathd: mpath2: remaining active paths: 1
> Sep 30 13:30:30 testbox multipathd: dm-4: add map (uevent)
> Sep 30 13:30:30 testbox multipathd: dm-4: devmap already registered
> Sep 30 13:30:30 testbox multipathd: dm-2: add map (uevent)
> Sep 30 13:30:30 testbox multipathd: dm-2: devmap already registered
> Sep 30 13:30:30 testbox multipathd: dm-3: add map (uevent)
> Sep 30 13:30:30 testbox multipathd: dm-3: devmap already registered
> Sep 30 13:30:31 testbox iscsid: Kernel reported iSCSI connection 11:0 error
> (1011) state (3)
> Sep 30 13:30:43 testbox kernel:  session7: session recovery timed out after
> 15 secs
> Sep 30 13:30:43 testbox multipathd: sdf: readsector0 checker reports path
> is down
> Sep 30 13:30:44 testbox kernel:  session9: session recovery timed out after
> 15 secs
> Sep 30 13:30:44 testbox multipathd: sdg: readsector0 checker reports path
> is down
> Sep 30 13:30:45 testbox kernel:  session11: session recovery timed out
> after 15 secs
> Sep 30 13:30:45 testbox kernel: sd 13:0:0:0: SCSI error: return code =
> 0x000f0000
> Sep 30 13:30:45 testbox kernel: end_request: I/O error, dev sdh, sector
> 407978223
> Sep 30 13:30:45 testbox multipathd: sdh: readsector0 checker reports path
> is down
> Sep 30 13:30:55 testbox multipathd: sdf: readsector0 checker reports path
> is down
> Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker reports path
> is down
> Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker reports path
> is down
> Sep 30 13:31:05 testbox multipathd: sdf: readsector0 checker reports path
> is down
> Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker reports path
> is down
> Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker reports path
> is down
> Sep 30 13:31:08 testbox iscsid: connection9:0 is operational after recovery
> (3 attempts)
> Sep 30 13:31:08 testbox iscsid: connection11:0 is operational after
> recovery (3 attempts)
> Sep 30 13:31:11 testbox iscsid: connection7:0 is operational after recovery
> (3 attempts)
> Sep 30 13:31:15 testbox multipathd: sdf: readsector0 checker reports path
> is up
> Sep 30 13:31:15 testbox multipathd: 8:80: reinstated
> Sep 30 13:31:15 testbox multipathd: mpath0: remaining active paths: 2
> Sep 30 13:31:15 testbox multipathd: sdg: readsector0 checker reports path
> is up
> Sep 30 13:31:15 testbox multipathd: 8:96: reinstated
> Sep 30 13:31:15 testbox multipathd: mpath1: remaining active paths: 2
> Sep 30 13:31:15 testbox multipathd: sdh: readsector0 checker reports path
> is up
> Sep 30 13:31:15 testbox multipathd: 8:112: reinstated
> Sep 30 13:31:15 testbox multipathd: mpath2: remaining active paths: 2
> Sep 30 13:31:15 testbox multipathd: dm-4: add map (uevent)
> Sep 30 13:31:15 testbox multipathd: dm-4: devmap already registered
> Sep 30 13:31:15 testbox multipathd: dm-2: add map (uevent)
> Sep 30 13:31:15 testbox multipathd: dm-2: devmap already registered
> Sep 30 13:31:15 testbox multipathd: dm-3: add map (uevent)
> Sep 30 13:31:15 testbox multipathd: dm-3: devmap already registered
>
>

[-- Attachment #1.2: Type: text/html, Size: 13668 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-10-01 21:16         ` Eli Klein
@ 2009-10-02  0:39           ` John A. Sullivan III
  2009-10-02  1:01             ` Eli Klein
  0 siblings, 1 reply; 11+ messages in thread
From: John A. Sullivan III @ 2009-10-02  0:39 UTC (permalink / raw)
  To: device-mapper development

I don't have the expertise to help very much but did you address the
previous comment about duplicate default blocks? Good luck - John

On Thu, 2009-10-01 at 15:16 -0600, Eli Klein wrote:
> Any ideas on this? Not sure if this e-mail made it out to the list.
> 
> 
> Thanks!
> 
> 
> -Eli
> 
> On Wed, Sep 30, 2009 at 1:36 PM, Eli Klein <elijah@aclue.com> wrote:
>         
>         On Sep 30, 2009, at 1:24 PM, John Brier wrote:
>         
>         > On 09/30/2009 03:16 PM, Eli Klein wrote:
>         > > 
>         > > Here's the abbreviated output from multipath -v3:
>         > > 
>         > > mpath1: pgfailback = 120 (config file default)
>         > > mpath1: pgpolicy = multibus (config file default)
>         > > mpath1: selector = round-robin 0 (internal default)
>         > > mpath1: features = 1 queue_if_no_path (internal default)
>         > > mpath1: hwhandler = 0 (internal default)
>         > > mpath1: rr_weight = 2 (config file default)
>         > > mpath1: minio = 10 (config file default)
>         > > mpath1: no_path_retry = 1 (config file default)
>         > > pg_timeout = NONE (internal default)
>         > > mpath1: set ACT_NOTHING (map unchanged)
>         > > 
>         > > It's definitely picking up the settings correctly from
>         > > multipath.conf,
>         > > but the paths are still reinstated immediately once
>         > > they're reconnected
>         > > (even if it's just for a split second).
>         > > 
>         > 
>         > What does /var/log/messages look like when this happens?
>         > Since you changed failback to 120 did you restart multipathd
>         > or restart the box?
>         > 
>         > John Brier
>         > 
>         > --
>         > dm-devel mailing list
>         > dm-devel@redhat.com
>         > https://www.redhat.com/mailman/listinfo/dm-devel
>         > 
>         
>         
>         I've tried rebooting, restarting multipath (and iscsid),
>         flushing multipath (-F), removing all the iscsi targets
>         completely and then reinitializing the targets.  Nothing
>         changes the behavior.  Here's output from /var/log/messages..
>         note the time between when the link becomes active (13:31:08)
>         to when the path is reinstated (13:31:15):
>         
>         
>         Sep 30 13:30:28 testbox kernel:  connection7:0: ping timeout
>         of 5 secs expired, recv timeout 5, last rx 4470061909, last
>         ping 4470066909, now 4470071909
>         Sep 30 13:30:28 testbox kernel:  connection7:0: detected conn
>         error (1011)
>         Sep 30 13:30:28 testbox multipathd: sdf: readsector0 checker
>         reports path is down
>         Sep 30 13:30:28 testbox multipathd: checker failed path 8:80
>         in map mpath0
>         Sep 30 13:30:28 testbox multipathd: mpath0: remaining active
>         paths: 1
>         Sep 30 13:30:28 testbox kernel: device-mapper: multipath:
>         Failing path 8:80.
>         Sep 30 13:30:28 testbox iscsid: Kernel reported iSCSI
>         connection 7:0 error (1011) state (3)
>         Sep 30 13:30:29 testbox kernel:  connection9:0: ping timeout
>         of 5 secs expired, recv timeout 5, last rx 4470063232, last
>         ping 4470068232, now 4470073232
>         Sep 30 13:30:29 testbox kernel:  connection9:0: detected conn
>         error (1011)
>         Sep 30 13:30:29 testbox multipathd: sdg: readsector0 checker
>         reports path is down
>         Sep 30 13:30:29 testbox multipathd: checker failed path 8:96
>         in map mpath1
>         Sep 30 13:30:29 testbox multipathd: mpath1: remaining active
>         paths: 1
>         Sep 30 13:30:29 testbox kernel: device-mapper: multipath:
>         Failing path 8:96.
>         Sep 30 13:30:30 testbox kernel:  connection11:0: ping timeout
>         of 5 secs expired, recv timeout 5, last rx 4470064373, last
>         ping 4470069373, now 4470074373
>         Sep 30 13:30:30 testbox kernel:  connection11:0: detected conn
>         error (1011)
>         Sep 30 13:30:30 testbox iscsid: Kernel reported iSCSI
>         connection 9:0 error (1011) state (3)
>         Sep 30 13:30:30 testbox multipathd: sdh: readsector0 checker
>         reports path is down
>         Sep 30 13:30:30 testbox multipathd: checker failed path 8:112
>         in map mpath2
>         Sep 30 13:30:30 testbox kernel: device-mapper: multipath:
>         Failing path 8:112.
>         Sep 30 13:30:30 testbox multipathd: mpath2: remaining active
>         paths: 1
>         Sep 30 13:30:30 testbox multipathd: dm-4: add map (uevent)
>         Sep 30 13:30:30 testbox multipathd: dm-4: devmap already
>         registered
>         Sep 30 13:30:30 testbox multipathd: dm-2: add map (uevent)
>         Sep 30 13:30:30 testbox multipathd: dm-2: devmap already
>         registered
>         Sep 30 13:30:30 testbox multipathd: dm-3: add map (uevent)
>         Sep 30 13:30:30 testbox multipathd: dm-3: devmap already
>         registered
>         Sep 30 13:30:31 testbox iscsid: Kernel reported iSCSI
>         connection 11:0 error (1011) state (3)
>         Sep 30 13:30:43 testbox kernel:  session7: session recovery
>         timed out after 15 secs
>         Sep 30 13:30:43 testbox multipathd: sdf: readsector0 checker
>         reports path is down
>         Sep 30 13:30:44 testbox kernel:  session9: session recovery
>         timed out after 15 secs
>         Sep 30 13:30:44 testbox multipathd: sdg: readsector0 checker
>         reports path is down
>         Sep 30 13:30:45 testbox kernel:  session11: session recovery
>         timed out after 15 secs
>         Sep 30 13:30:45 testbox kernel: sd 13:0:0:0: SCSI error:
>         return code = 0x000f0000
>         Sep 30 13:30:45 testbox kernel: end_request: I/O error, dev
>         sdh, sector 407978223
>         Sep 30 13:30:45 testbox multipathd: sdh: readsector0 checker
>         reports path is down
>         Sep 30 13:30:55 testbox multipathd: sdf: readsector0 checker
>         reports path is down
>         Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker
>         reports path is down
>         Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker
>         reports path is down
>         Sep 30 13:31:05 testbox multipathd: sdf: readsector0 checker
>         reports path is down
>         Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker
>         reports path is down
>         Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker
>         reports path is down
>         Sep 30 13:31:08 testbox iscsid: connection9:0 is operational
>         after recovery (3 attempts)
>         Sep 30 13:31:08 testbox iscsid: connection11:0 is operational
>         after recovery (3 attempts)
>         Sep 30 13:31:11 testbox iscsid: connection7:0 is operational
>         after recovery (3 attempts)
>         Sep 30 13:31:15 testbox multipathd: sdf: readsector0 checker
>         reports path is up
>         Sep 30 13:31:15 testbox multipathd: 8:80: reinstated
>         Sep 30 13:31:15 testbox multipathd: mpath0: remaining active
>         paths: 2
>         Sep 30 13:31:15 testbox multipathd: sdg: readsector0 checker
>         reports path is up
>         Sep 30 13:31:15 testbox multipathd: 8:96: reinstated
>         Sep 30 13:31:15 testbox multipathd: mpath1: remaining active
>         paths: 2
>         Sep 30 13:31:15 testbox multipathd: sdh: readsector0 checker
>         reports path is up
>         Sep 30 13:31:15 testbox multipathd: 8:112: reinstated
>         Sep 30 13:31:15 testbox multipathd: mpath2: remaining active
>         paths: 2
>         Sep 30 13:31:15 testbox multipathd: dm-4: add map (uevent)
>         Sep 30 13:31:15 testbox multipathd: dm-4: devmap already
>         registered
>         Sep 30 13:31:15 testbox multipathd: dm-2: add map (uevent)
>         Sep 30 13:31:15 testbox multipathd: dm-2: devmap already
>         registered
>         Sep 30 13:31:15 testbox multipathd: dm-3: add map (uevent)
>         Sep 30 13:31:15 testbox multipathd: dm-3: devmap already
>         registered
>         
>         
> 
> 
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
-- 
John A. Sullivan III
Open Source Development Corporation
+1 207-985-7880
jsullivan@opensourcedevel.com

http://www.spiritualoutreach.com
Making Christianity intelligible to secular society

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-10-02  0:39           ` John A. Sullivan III
@ 2009-10-02  1:01             ` Eli Klein
  2009-10-02 14:30               ` John Brier
  0 siblings, 1 reply; 11+ messages in thread
From: Eli Klein @ 2009-10-02  1:01 UTC (permalink / raw)
  To: device-mapper development


[-- Attachment #1.1: Type: text/plain, Size: 9107 bytes --]

Sorry, I forgot to mention that I did.. it didn't help.
In addition, from the output of multipath -v3, multipathd is picking up all
of the correct settings, even with the two separate default sections.

-Eli

On Thu, Oct 1, 2009 at 6:39 PM, John A. Sullivan III <
jsullivan@opensourcedevel.com> wrote:

> I don't have the expertise to help very much but did you address the
> previous comment about duplicate default blocks? Good luck - John
>
> On Thu, 2009-10-01 at 15:16 -0600, Eli Klein wrote:
> > Any ideas on this? Not sure if this e-mail made it out to the list.
> >
> >
> > Thanks!
> >
> >
> > -Eli
> >
> > On Wed, Sep 30, 2009 at 1:36 PM, Eli Klein <elijah@aclue.com> wrote:
> >
> >         On Sep 30, 2009, at 1:24 PM, John Brier wrote:
> >
> >         > On 09/30/2009 03:16 PM, Eli Klein wrote:
> >         > >
> >         > > Here's the abbreviated output from multipath -v3:
> >         > >
> >         > > mpath1: pgfailback = 120 (config file default)
> >         > > mpath1: pgpolicy = multibus (config file default)
> >         > > mpath1: selector = round-robin 0 (internal default)
> >         > > mpath1: features = 1 queue_if_no_path (internal default)
> >         > > mpath1: hwhandler = 0 (internal default)
> >         > > mpath1: rr_weight = 2 (config file default)
> >         > > mpath1: minio = 10 (config file default)
> >         > > mpath1: no_path_retry = 1 (config file default)
> >         > > pg_timeout = NONE (internal default)
> >         > > mpath1: set ACT_NOTHING (map unchanged)
> >         > >
> >         > > It's definitely picking up the settings correctly from
> >         > > multipath.conf,
> >         > > but the paths are still reinstated immediately once
> >         > > they're reconnected
> >         > > (even if it's just for a split second).
> >         > >
> >         >
> >         > What does /var/log/messages look like when this happens?
> >         > Since you changed failback to 120 did you restart multipathd
> >         > or restart the box?
> >         >
> >         > John Brier
> >         >
> >         > --
> >         > dm-devel mailing list
> >         > dm-devel@redhat.com
> >         > https://www.redhat.com/mailman/listinfo/dm-devel
> >         >
> >
> >
> >         I've tried rebooting, restarting multipath (and iscsid),
> >         flushing multipath (-F), removing all the iscsi targets
> >         completely and then reinitializing the targets.  Nothing
> >         changes the behavior.  Here's output from /var/log/messages..
> >         note the time between when the link becomes active (13:31:08)
> >         to when the path is reinstated (13:31:15):
> >
> >
> >         Sep 30 13:30:28 testbox kernel:  connection7:0: ping timeout
> >         of 5 secs expired, recv timeout 5, last rx 4470061909, last
> >         ping 4470066909, now 4470071909
> >         Sep 30 13:30:28 testbox kernel:  connection7:0: detected conn
> >         error (1011)
> >         Sep 30 13:30:28 testbox multipathd: sdf: readsector0 checker
> >         reports path is down
> >         Sep 30 13:30:28 testbox multipathd: checker failed path 8:80
> >         in map mpath0
> >         Sep 30 13:30:28 testbox multipathd: mpath0: remaining active
> >         paths: 1
> >         Sep 30 13:30:28 testbox kernel: device-mapper: multipath:
> >         Failing path 8:80.
> >         Sep 30 13:30:28 testbox iscsid: Kernel reported iSCSI
> >         connection 7:0 error (1011) state (3)
> >         Sep 30 13:30:29 testbox kernel:  connection9:0: ping timeout
> >         of 5 secs expired, recv timeout 5, last rx 4470063232, last
> >         ping 4470068232, now 4470073232
> >         Sep 30 13:30:29 testbox kernel:  connection9:0: detected conn
> >         error (1011)
> >         Sep 30 13:30:29 testbox multipathd: sdg: readsector0 checker
> >         reports path is down
> >         Sep 30 13:30:29 testbox multipathd: checker failed path 8:96
> >         in map mpath1
> >         Sep 30 13:30:29 testbox multipathd: mpath1: remaining active
> >         paths: 1
> >         Sep 30 13:30:29 testbox kernel: device-mapper: multipath:
> >         Failing path 8:96.
> >         Sep 30 13:30:30 testbox kernel:  connection11:0: ping timeout
> >         of 5 secs expired, recv timeout 5, last rx 4470064373, last
> >         ping 4470069373, now 4470074373
> >         Sep 30 13:30:30 testbox kernel:  connection11:0: detected conn
> >         error (1011)
> >         Sep 30 13:30:30 testbox iscsid: Kernel reported iSCSI
> >         connection 9:0 error (1011) state (3)
> >         Sep 30 13:30:30 testbox multipathd: sdh: readsector0 checker
> >         reports path is down
> >         Sep 30 13:30:30 testbox multipathd: checker failed path 8:112
> >         in map mpath2
> >         Sep 30 13:30:30 testbox kernel: device-mapper: multipath:
> >         Failing path 8:112.
> >         Sep 30 13:30:30 testbox multipathd: mpath2: remaining active
> >         paths: 1
> >         Sep 30 13:30:30 testbox multipathd: dm-4: add map (uevent)
> >         Sep 30 13:30:30 testbox multipathd: dm-4: devmap already
> >         registered
> >         Sep 30 13:30:30 testbox multipathd: dm-2: add map (uevent)
> >         Sep 30 13:30:30 testbox multipathd: dm-2: devmap already
> >         registered
> >         Sep 30 13:30:30 testbox multipathd: dm-3: add map (uevent)
> >         Sep 30 13:30:30 testbox multipathd: dm-3: devmap already
> >         registered
> >         Sep 30 13:30:31 testbox iscsid: Kernel reported iSCSI
> >         connection 11:0 error (1011) state (3)
> >         Sep 30 13:30:43 testbox kernel:  session7: session recovery
> >         timed out after 15 secs
> >         Sep 30 13:30:43 testbox multipathd: sdf: readsector0 checker
> >         reports path is down
> >         Sep 30 13:30:44 testbox kernel:  session9: session recovery
> >         timed out after 15 secs
> >         Sep 30 13:30:44 testbox multipathd: sdg: readsector0 checker
> >         reports path is down
> >         Sep 30 13:30:45 testbox kernel:  session11: session recovery
> >         timed out after 15 secs
> >         Sep 30 13:30:45 testbox kernel: sd 13:0:0:0: SCSI error:
> >         return code = 0x000f0000
> >         Sep 30 13:30:45 testbox kernel: end_request: I/O error, dev
> >         sdh, sector 407978223
> >         Sep 30 13:30:45 testbox multipathd: sdh: readsector0 checker
> >         reports path is down
> >         Sep 30 13:30:55 testbox multipathd: sdf: readsector0 checker
> >         reports path is down
> >         Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker
> >         reports path is down
> >         Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker
> >         reports path is down
> >         Sep 30 13:31:05 testbox multipathd: sdf: readsector0 checker
> >         reports path is down
> >         Sep 30 13:31:05 testbox multipathd: sdg: readsector0 checker
> >         reports path is down
> >         Sep 30 13:31:05 testbox multipathd: sdh: readsector0 checker
> >         reports path is down
> >         Sep 30 13:31:08 testbox iscsid: connection9:0 is operational
> >         after recovery (3 attempts)
> >         Sep 30 13:31:08 testbox iscsid: connection11:0 is operational
> >         after recovery (3 attempts)
> >         Sep 30 13:31:11 testbox iscsid: connection7:0 is operational
> >         after recovery (3 attempts)
> >         Sep 30 13:31:15 testbox multipathd: sdf: readsector0 checker
> >         reports path is up
> >         Sep 30 13:31:15 testbox multipathd: 8:80: reinstated
> >         Sep 30 13:31:15 testbox multipathd: mpath0: remaining active
> >         paths: 2
> >         Sep 30 13:31:15 testbox multipathd: sdg: readsector0 checker
> >         reports path is up
> >         Sep 30 13:31:15 testbox multipathd: 8:96: reinstated
> >         Sep 30 13:31:15 testbox multipathd: mpath1: remaining active
> >         paths: 2
> >         Sep 30 13:31:15 testbox multipathd: sdh: readsector0 checker
> >         reports path is up
> >         Sep 30 13:31:15 testbox multipathd: 8:112: reinstated
> >         Sep 30 13:31:15 testbox multipathd: mpath2: remaining active
> >         paths: 2
> >         Sep 30 13:31:15 testbox multipathd: dm-4: add map (uevent)
> >         Sep 30 13:31:15 testbox multipathd: dm-4: devmap already
> >         registered
> >         Sep 30 13:31:15 testbox multipathd: dm-2: add map (uevent)
> >         Sep 30 13:31:15 testbox multipathd: dm-2: devmap already
> >         registered
> >         Sep 30 13:31:15 testbox multipathd: dm-3: add map (uevent)
> >         Sep 30 13:31:15 testbox multipathd: dm-3: devmap already
> >         registered
> >
> >
> >
> >
> > --
> > dm-devel mailing list
> > dm-devel@redhat.com
> > https://www.redhat.com/mailman/listinfo/dm-devel
> --
> John A. Sullivan III
> Open Source Development Corporation
> +1 207-985-7880
> jsullivan@opensourcedevel.com
>
> http://www.spiritualoutreach.com
> Making Christianity intelligible to secular society
>
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
>

[-- Attachment #1.2: Type: text/html, Size: 11339 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-10-02  1:01             ` Eli Klein
@ 2009-10-02 14:30               ` John Brier
  2009-10-02 14:40                 ` Eli Klein
  0 siblings, 1 reply; 11+ messages in thread
From: John Brier @ 2009-10-02 14:30 UTC (permalink / raw)
  To: device-mapper development

On 10/01/2009 09:01 PM, Eli Klein wrote:
> Sorry, I forgot to mention that I did.. it didn't help.
> In addition, from the output of multipath -v3, multipathd is picking up all
> of the correct settings, even with the two separate default sections.
>

It seems like you set it all up right but failback isn't working like 
it *seems* it should. (I have never needed it) If you're interested in 
a hack..

Since you don't seem to care how long it takes to failover, you could 
do the *opposite* of what this kbase recommends:

How can I improve the failover time of a faulty path when using 
device-mapper-multipath over iSCSI?

http://kbase.redhat.com/faq/docs/DOC-2877


Aside from that I would suggest you open a support case with Red Hat 
if the box is covered under a support contract.

John Brier

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Question regarding failback option in multipath.conf
  2009-10-02 14:30               ` John Brier
@ 2009-10-02 14:40                 ` Eli Klein
  0 siblings, 0 replies; 11+ messages in thread
From: Eli Klein @ 2009-10-02 14:40 UTC (permalink / raw)
  To: device-mapper development


[-- Attachment #1.1: Type: text/plain, Size: 1645 bytes --]

On Fri, Oct 2, 2009 at 8:30 AM, John Brier <jbrier@redhat.com> wrote:

> On 10/01/2009 09:01 PM, Eli Klein wrote:
>
>> Sorry, I forgot to mention that I did.. it didn't help.
>> In addition, from the output of multipath -v3, multipathd is picking up
>> all
>> of the correct settings, even with the two separate default sections.
>>
>>
> It seems like you set it all up right but failback isn't working like it
> *seems* it should. (I have never needed it) If you're interested in a hack..
>
> Since you don't seem to care how long it takes to failover, you could do
> the *opposite* of what this kbase recommends:
>
> How can I improve the failover time of a faulty path when using
> device-mapper-multipath over iSCSI?
>
> http://kbase.redhat.com/faq/docs/DOC-2877
>
>
> Aside from that I would suggest you open a support case with Red Hat if the
> box is covered under a support contract.
>
>
> John Brier
>
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
>

I've seen that doc before (it was helpful!).  Unfortunately, I *do* care
about how long it takes for the initial failover.  I have this configured to
take 15 seconds at the moment.  One thing that's interesting is that even
when I had "node.session.timeo.replacement_timeout" set to 120 seconds, the
failback was still immediate. The initial failover would take at minimum 120
seconds.

All of my problems seem to be cause by what appears to be a failback
parameter that doesn't actually have any effect.  I'll have to open up a
support case to get to the bottom of the failback issues.

Thank you very much for your help!

-Eli

[-- Attachment #1.2: Type: text/html, Size: 3054 bytes --]

[-- Attachment #2: Type: text/plain, Size: 0 bytes --]



^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2009-10-02 14:40 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-09-30 16:41 Question regarding failback option in multipath.conf Eli Klein
2009-09-30 18:51 ` Eli Klein
2009-09-30 19:07 ` John Brier
2009-09-30 19:16   ` Eli Klein
2009-09-30 19:24     ` John Brier
2009-09-30 19:36       ` Eli Klein
2009-10-01 21:16         ` Eli Klein
2009-10-02  0:39           ` John A. Sullivan III
2009-10-02  1:01             ` Eli Klein
2009-10-02 14:30               ` John Brier
2009-10-02 14:40                 ` Eli Klein

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.