blk_requeue_request BUG

public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed

* blk_requeue_request BUG_ON
@ 2015-05-13 22:54 Brian King
  2015-05-14  0:07 ` Rajat Jain
  2015-05-14  0:34 ` James Bottomley
  0 siblings, 2 replies; 3+ messages in thread
From: Brian King @ 2015-05-13 22:54 UTC (permalink / raw)
  To: linux-scsi; +Cc: Hannes Reinecke, lnx1138

I've been chasing a BUG_ON in blk_requeue_request which seems to occur
in scenarios where we are seeing lots of SCSI aborts. As I've been digging
through the completion paths and abort paths, I've noticed if the following
sequence occurs, we are likely to hit this issue:

1. scsi_cmd times out, async abort issued
2. LLDD aborts command, LLDD calls scsi_done for the aborted command from interrupt handler
   when aborted command comes back
3. If result of the aborted command is something like DID_ERROR and we allow retries,
   then in scsi_done processing, we'll call scsi_queue_insert which then calls blk_requeue_request
4. Returning from the LLDD's eh_abort handler, scsi_error sees the abort was successful,
   and then calls scsi_queue_insert for the aborted command, which also calls blk_requeue_request
   where we hit the BUG_ON because the command has been queued again.

This is occurring for the non blk_rq_tagged case, for reference, so blk_requeue_request
doesn't call blk_queue_end_tag which might cause this to not be hit...

Should a LLDD NOT call scsi_done for commands it aborts? We've seen the issue above on
both ibmvfc and mpt2sas, but I know there are other LLDDs that call scsi_done in this case,
but just do it before eh_abort returns. Or is it expected the LLDD will only ever return DID_ABORT
on the aborted command, which looks like it might prevent this issue as well, however, that seems
racy and then we'd also need to add some memory barriers around the checking / setting of scmd->eh_eflags
I would think.

Or am I missing something and headed down the wrong path?

Thanks,

Brian

-- 
Brian King
Power Linux I/O
IBM Linux Technology Center

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: blk_requeue_request BUG_ON
  2015-05-13 22:54 blk_requeue_request BUG_ON Brian King
@ 2015-05-14  0:07 ` Rajat Jain
  2015-05-14  0:34 ` James Bottomley
  1 sibling, 0 replies; 3+ messages in thread
From: Rajat Jain @ 2015-05-14  0:07 UTC (permalink / raw)
  To: Brian King; +Cc: linux-scsi, Hannes Reinecke, lnx1138

Hello,

On Wed, May 13, 2015 at 3:54 PM, Brian King <brking@linux.vnet.ibm.com> wrote:
> I've been chasing a BUG_ON in blk_requeue_request which seems to occur
> in scenarios where we are seeing lots of SCSI aborts. As I've been digging
> through the completion paths and abort paths, I've noticed if the following
> sequence occurs, we are likely to hit this issue:
>
> 1. scsi_cmd times out, async abort issued
> 2. LLDD aborts command, LLDD calls scsi_done for the aborted command from interrupt handler
>    when aborted command comes back
> 3. If result of the aborted command is something like DID_ERROR and we allow retries,
>    then in scsi_done processing, we'll call scsi_queue_insert which then calls blk_requeue_request
> 4. Returning from the LLDD's eh_abort handler, scsi_error sees the abort was successful,
>    and then calls scsi_queue_insert for the aborted command, which also calls blk_requeue_request
>    where we hit the BUG_ON because the command has been queued again.
>
> This is occurring for the non blk_rq_tagged case, for reference, so blk_requeue_request
> doesn't call blk_queue_end_tag which might cause this to not be hit...
>
> Should a LLDD NOT call scsi_done for commands it aborts?

I think the right statement would be that the LLDD should not call
scsi_done() AFTER the eh_abort_handler() has returned SUCCESS, because
once SUCCESS is returned from the eh_abort_handler, the expectation is
that the LLDD and the HW has completely forgotten about that command
(and hence would not call scsi_done()). Also, from the time the
eh_abort_handler returns, the mid level is free to do what it pleases
with that scsi_cmnd (requeue it or free it or finish it) - thus LLDD
cannot be holding a pointer to scsi_cmnd anymore as it could be
pointing elsewhere than what LLD thinks.

Thanks,

Rajat

> We've seen the issue above on
> both ibmvfc and mpt2sas, but I know there are other LLDDs that call scsi_done in this case,
> but just do it before eh_abort returns. Or is it expected the LLDD will only ever return DID_ABORT
> on the aborted command, which looks like it might prevent this issue as well, however, that seems
> racy and then we'd also need to add some memory barriers around the checking / setting of scmd->eh_eflags
> I would think.
>
> Or am I missing something and headed down the wrong path?
>
> Thanks,
>
> Brian
>
> --
> Brian King
> Power Linux I/O
> IBM Linux Technology Center
>
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: blk_requeue_request BUG_ON
  2015-05-13 22:54 blk_requeue_request BUG_ON Brian King
  2015-05-14  0:07 ` Rajat Jain
@ 2015-05-14  0:34 ` James Bottomley
  1 sibling, 0 replies; 3+ messages in thread
From: James Bottomley @ 2015-05-14  0:34 UTC (permalink / raw)
  To: Brian King; +Cc: linux-scsi, Hannes Reinecke, lnx1138

On Wed, 2015-05-13 at 17:54 -0500, Brian King wrote:
> I've been chasing a BUG_ON in blk_requeue_request which seems to occur
> in scenarios where we are seeing lots of SCSI aborts. As I've been digging
> through the completion paths and abort paths, I've noticed if the following
> sequence occurs, we are likely to hit this issue:
> 
> 1. scsi_cmd times out, async abort issued
> 2. LLDD aborts command, LLDD calls scsi_done for the aborted command from interrupt handler
>    when aborted command comes back
> 3. If result of the aborted command is something like DID_ERROR and we allow retries,
>    then in scsi_done processing, we'll call scsi_queue_insert which then calls blk_requeue_request
> 4. Returning from the LLDD's eh_abort handler, scsi_error sees the abort was successful,
>    and then calls scsi_queue_insert for the aborted command, which also calls blk_requeue_request
>    where we hit the BUG_ON because the command has been queued again.
> 
> This is occurring for the non blk_rq_tagged case, for reference, so blk_requeue_request
> doesn't call blk_queue_end_tag which might cause this to not be hit...
> 
> Should a LLDD NOT call scsi_done for commands it aborts? We've seen the issue above on
> both ibmvfc and mpt2sas, but I know there are other LLDDs that call scsi_done in this case,
> but just do it before eh_abort returns. Or is it expected the LLDD will only ever return DID_ABORT
> on the aborted command, which looks like it might prevent this issue as well, however, that seems
> racy and then we'd also need to add some memory barriers around the checking / setting of scmd->eh_eflags
> I would think.
> 
> Or am I missing something and headed down the wrong path?

This is happening because the new abort handler isn't seen by the block
layer as complete (meaning it ignores duplicate completion).

The principle we've always operated under is that LLDs may if they wish
complete commands under error handling.  However, all we do is throw
away the completion because we don't care about the status (we believe
the command needs to be retried as a result of the eh).

There are two options that I can see for restoring the behaviour.  One
is to tell the block layer that it needs to treat this command as
completed (so export blk_mark_rq_complete).  The other would be to add a
check to scsi_done() for SCSI_EH_ABORT_SCHEDULED and simply return if it
is set.  I lean towards the former but don't like exporting the private
interface.

I think the real solution would be to move the actual queueing interface
for our running aborts up to block, so it could be aware of what's going
on (so block handles the internals without having to export any
interfaces).  Plus it makes the running abort available to anything else
which might want to use it.

James

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2015-05-14  0:34 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-05-13 22:54 blk_requeue_request BUG_ON Brian King
2015-05-14  0:07 ` Rajat Jain
2015-05-14  0:34 ` James Bottomley

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox