All of lore.kernel.org
 help / color / mirror / Atom feed
From: Hannes Reinecke <hare@suse.de>
To: SCSI Mailing List <linux-scsi@vger.kernel.org>
Cc: Andrew Vasquez <andrew.vasquez@qlogic.com>,
	Chad Dupuis <chad.dupuis@qlogic.com>,
	James Smart <james.smart@emulex.com>,
	James Bottomley <James.Bottomley@HansenPartnership.com>
Subject: Error handling on FC devices
Date: Mon, 19 Nov 2012 13:41:51 +0100	[thread overview]
Message-ID: <50AA290F.8000105@suse.de> (raw)

Hi all,

just when we thought we'd finally nailed the error handling on FC ...
A customer of ours recently hit this really nasty issue:
He had a 'drain' on the SAN, in the sense that the link was still 
intact, but no commands were coming back from the link.

This caused the FC HBA / driver to not detect a link down, and so 
the failing command was pushed onto the error handler.
Which of course resorted back to HBA reset, but by that time the 
cluster already had kicked out the machine.
And as all machines in the cluster were connected to the same switch 
this happened to all machines, resulting on a nice cluster shutdown. 
And a really unhappy customer.

Looking closer multipathing actually managed to detect and switch 
paths as desired, but as the initial failing command was pushed onto 
the error handler all applications had to wait for this command to 
finish before proceeding.

So the following questions:
- Why did the FC HBA not detect a 'link-down' scenario?
   (Incidentally, this happens with QLogic _and_ Emulex :-)
   I know this is not a typical link-down, but from my naive
   assumption the HBA should detect that commands are not
   making progress, and at least after RA TOV was expired
   it should try to reset the link.
- Can we speed up error handling for these cases?
   Currently we're waiting for eh to complete before returning
   the affected commands with a final state.
   However, after we've done a LUN reset there shouldn't be
   any command state left and we should be able to terminate
   outstanding commands directly, without having to wait for
   eh to finally complete. James?

Thanks.

Cheers,

Hannes
-- 
Dr. Hannes Reinecke		      zSeries & Storage
hare@suse.de			      +49 911 74053 688
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg
GF: J. Hawn, J. Guild, F. Imendörffer, HRB 16746 (AG Nürnberg)
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

             reply	other threads:[~2012-11-19 12:41 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-11-19 12:41 Hannes Reinecke [this message]
2012-11-26 22:32 ` Error handling on FC devices James Smart
2012-11-27 20:03   ` Ewan Milne
2012-11-27 20:29     ` Elliott, Robert (Server Storage)
2012-11-28  7:09       ` Hannes Reinecke
2012-11-29 16:02         ` James Smart
2012-11-30 11:44           ` Hannes Reinecke
2012-11-30 16:54             ` Mike Christie
2012-12-03  7:15               ` Hannes Reinecke
2012-12-03 17:19                 ` Jeremy Linton
2012-12-03 22:52                 ` Elliott, Robert (Server Storage)
2012-12-04 15:56                   ` Kipp Aldrich

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=50AA290F.8000105@suse.de \
    --to=hare@suse.de \
    --cc=James.Bottomley@HansenPartnership.com \
    --cc=andrew.vasquez@qlogic.com \
    --cc=chad.dupuis@qlogic.com \
    --cc=james.smart@emulex.com \
    --cc=linux-scsi@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.