All of lore.kernel.org
 help / color / mirror / Atom feed
* FW: How to avoid lots of read ios to passive paths of active-pass ive storage devices?
@ 2005-09-02 14:25 goggin, edward
  2005-09-02 15:34 ` Steve Lord
  2005-09-02 21:17 ` christophe varoqui
  0 siblings, 2 replies; 3+ messages in thread
From: goggin, edward @ 2005-09-02 14:25 UTC (permalink / raw)
  To: 'dm-devel@redhat.com'

Reposting since it didn't get much response initially and the issue came up
again
in yesterday's multipath conference call.

> ______________________________________________ 
> From: 	goggin, edward  
> Sent:	Friday, July 08, 2005 12:21 AM
> To:	'dm-devel@redhat.com'
> Subject:	How to avoid lots of read ios to passive paths of
> active-passive storage devices?
> 
> How sould device scanning utilities like blkid, pvscan,
> vgscan, and lvscan (others?) be made to avoid issuing
> a significant number of read ios to passive paths of
> active-passive storage devices?
> 
> Possibly they could be changed to use device specific
> command interfaces like SCSI pass through ioctls to
> read the native device paths in order to avoid sending
> lots of ios down the passive paths to active-passive
> storage devices?
> 
> This way they could parse the sense key, ASC, and ASCQ
> returned via the SG_IO interface, see values like
> "Not Ready, Manual Intervention Required" and avoid
> sending subsequent reads to the same path.
> 
> Or should they be changed to scan the multipath devices
> if they exist instead of the path specific devices?
> 
> While this isn't an issue now, it could become one later
> when/if linux hosts are configured with hundreds/thousands
> of passive paths.
> 

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: FW: How to avoid lots of read ios to passive paths of active-pass ive storage devices?
  2005-09-02 14:25 FW: How to avoid lots of read ios to passive paths of active-pass ive storage devices? goggin, edward
@ 2005-09-02 15:34 ` Steve Lord
  2005-09-02 21:17 ` christophe varoqui
  1 sibling, 0 replies; 3+ messages in thread
From: Steve Lord @ 2005-09-02 15:34 UTC (permalink / raw)
  To: device-mapper development

goggin, edward wrote:
> Reposting since it didn't get much response initially and the issue came up
> again
> in yesterday's multipath conference call.
> 

>>
>>While this isn't an issue now, it could become one later
>>when/if linux hosts are configured with hundreds/thousands
>>of passive paths.
>>
> 

There are already issues out there, these were not directly with
multipath, but should serve as examples.

An SGI Altix box with 8 HBA ports connected via a fabric to 4 Engenio dual
controller raids (2 ports on each controller). From what I recall there were 4
LUNs assigned to each controller on each raid. I think there were 1024 paths
in the complete configuration. This was actually a very small version of
the planned production system which has multiple hosts and several thousand
LUNs.

Path ping pong during partition table scanning took several hours
to resolve itself (we gave up waiting and went home for the day).

The issue was made worth by attempts at parallelism and retries in
the logic. Multiple device reads were issued in parallel
via udev to all the different paths to devices, these reads
did retries on failures. Since a trespass (or automatic volume
transfer, depending on your terminology), causes a failure on
the active path on this raid, end result was it takes a lot of
I/O failures before one actually works.

Once all this completed, various volume manager components then
came along and tried to look for their metadata at the other
end of the LUNs. The same chaos ensues.

Engenio has actually added code to their raid firmware which
lets you turn off automatic transfers within the first few
blocks of the disk. This deals with partition scanning for the
most part. There is no code to deal with metadata scanning
at the end of luns, just don't do it.

There are Linux SANs in production where the reboot of a
single node in a fabric causes all the active nodes to suffer
major performance problems as paths get moved out from under
them.

In the RDAC mode of operation instead of the path ping pong
issue, you still end up with slow I/O failures on the
standby paths. Nowhere near as bad, but still painful
once you scale things up.

Steve

p.s. is anyone working on multipath modules for Engenio devices?

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: FW: How to avoid lots of read ios to passive paths of active-pass ive storage devices?
  2005-09-02 14:25 FW: How to avoid lots of read ios to passive paths of active-pass ive storage devices? goggin, edward
  2005-09-02 15:34 ` Steve Lord
@ 2005-09-02 21:17 ` christophe varoqui
  1 sibling, 0 replies; 3+ messages in thread
From: christophe varoqui @ 2005-09-02 21:17 UTC (permalink / raw)
  To: device-mapper development

What if we claim exclusively these passive paths from multipathd ?

Regards,

On ven, 2005-09-02 at 10:25 -0400, goggin, edward wrote:
> Reposting since it didn't get much response initially and the issue came up
> again
> in yesterday's multipath conference call.
> 
> > ______________________________________________ 
> > From: 	goggin, edward  
> > Sent:	Friday, July 08, 2005 12:21 AM
> > To:	'dm-devel@redhat.com'
> > Subject:	How to avoid lots of read ios to passive paths of
> > active-passive storage devices?
> > 
> > How sould device scanning utilities like blkid, pvscan,
> > vgscan, and lvscan (others?) be made to avoid issuing
> > a significant number of read ios to passive paths of
> > active-passive storage devices?
> > 
> > Possibly they could be changed to use device specific
> > command interfaces like SCSI pass through ioctls to
> > read the native device paths in order to avoid sending
> > lots of ios down the passive paths to active-passive
> > storage devices?
> > 
> > This way they could parse the sense key, ASC, and ASCQ
> > returned via the SG_IO interface, see values like
> > "Not Ready, Manual Intervention Required" and avoid
> > sending subsequent reads to the same path.
> > 
> > Or should they be changed to scan the multipath devices
> > if they exist instead of the path specific devices?
> > 
> > While this isn't an issue now, it could become one later
> > when/if linux hosts are configured with hundreds/thousands
> > of passive paths.
> > 
> 
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
-- 
christophe varoqui <christophe.varoqui@free.fr>

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2005-09-02 21:17 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-09-02 14:25 FW: How to avoid lots of read ios to passive paths of active-pass ive storage devices? goggin, edward
2005-09-02 15:34 ` Steve Lord
2005-09-02 21:17 ` christophe varoqui

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.