From: Jeff Garzik <jgarzik@pobox.com>
To: Tejun Heo <htejun@gmail.com>
Cc: linux-ide@vger.kernel.org, albertcc@tw.ibm.com
Subject: Re: [PATCH 08/12] libata: fix handling of race between timeout and completion
Date: Sun, 22 Jan 2006 04:41:50 -0500
Message-ID: <43D3535E.80405@pobox.com>
In-Reply-To: <11379167113869-git-send-email-htejun@gmail.com>
Tejun Heo wrote:
> If a qc completes after SCSI timer expires but before libata EH kicks
> in, the qc gets completed but the scsicmd still gets passed to libata
> EH resulting in ->eng_timeout invocation with NULL qc. Currently none
> of ->eng_timeout callbacks handles this properly. This patch makes
> ata_scsi_error() bypass ->eng_timeout and handle this rare case.
>
> Signed-off-by: Tejun Heo <htejun@gmail.com>
>
> ---
>
> drivers/scsi/libata-scsi.c | 42 +++++++++++++++++++++++++++++++++++++++---
> 1 files changed, 39 insertions(+), 3 deletions(-)
>
> ba51311c2cc3177f9c5d33aee11be1488d783c7c
> diff --git a/drivers/scsi/libata-scsi.c b/drivers/scsi/libata-scsi.c
> index ab6b533..accb63a 100644
> --- a/drivers/scsi/libata-scsi.c
> +++ b/drivers/scsi/libata-scsi.c
> @@ -731,12 +731,48 @@ int ata_scsi_slave_config(struct scsi_de
>
>  int ata_scsi_error(struct Scsi_Host *host)
>  {
> -	struct ata_port *ap;
> +	struct ata_port *ap = (struct ata_port *) &host->hostdata[0];
> +	struct ata_queued_cmd *qc;
> +	unsigned long flags;
>
>  	DPRINTK("ENTER\n");
>
> -	ap = (struct ata_port *) &host->hostdata[0];
> -	ap->ops->eng_timeout(ap);
> +	spin_lock_irqsave(&ap->host_set->lock, flags);
> +	qc = ata_qc_from_tag(ap, ap->active_tag);
> +	spin_unlock_irqrestore(&ap->host_set->lock, flags);
> +
> +	if (qc) {
> +		ap->ops->eng_timeout(ap);
> +	} else {
> +		struct scsi_cmnd *scmd;
> +		unsigned char *sb;
> +
> +		/* The scmd had timed out but the corresponding qc
> +		 * completed successfully inbetween timer expiration
> +		 * and here. Retry if possible.
> +		 *
> +		 * It is better to enter eng_timeout and perform EH
> +		 * before retrying the command, but this case should
> +		 * be _very_ rare and eng_timeout isn't ready for
> +		 * NULL-qc case.
> +		 */
> +		scmd = list_entry(host->eh_cmd_q.next,
> +				  struct scsi_cmnd, eh_entry);
> +		sb = scmd->sense_buffer;
> +
> +		/* Timeout, fake parity for now */
> +		scmd->result = (DRIVER_SENSE << 24) | SAM_STAT_CHECK_CONDITION;
> +		sb[0] = 0x70;
> +		sb[7] = 0x0a;
> +		sb[2] = ABORTED_COMMAND;
> +		sb[12] = 0x47;
> +		sb[13] = 0x00;
> +
> +		printk(KERN_WARNING "ata%u: interrupt and timer raced for "
> +		       "scsicmd %p\n", ap->id, scmd);
> +
> +		scsi_eh_finish_cmd(scmd, &ap->eh_done_q);
Honestly, I'm not sure how to best solve this. I suppose this is ok for
now.
Jeff
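
For readers following the quoted hunk, the sense data it fakes in the NULL-qc branch can be sketched as a small user-space function. This is only an illustration, not part of the patch: fill_fake_timeout_sense() and SENSE_BUFFER_SIZE are hypothetical names, and the three constants are defined locally with the values the kernel's SCSI headers are assumed to use.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Assumed to match the kernel's SCSI headers; redefined here so the
 * sketch builds outside the kernel tree. */
#define SAM_STAT_CHECK_CONDITION 0x02
#define DRIVER_SENSE             0x08
#define ABORTED_COMMAND          0x0b
#define SENSE_BUFFER_SIZE        96   /* hypothetical name for this sketch */

/* Hypothetical helper: fill fixed-format sense data the same way the
 * quoted else-branch does, and return the value it stores in
 * scmd->result. */
uint32_t fill_fake_timeout_sense(uint8_t *sb, size_t len)
{
	assert(len >= 18);      /* 8 header bytes + 0x0a additional bytes */
	memset(sb, 0, len);

	sb[0]  = 0x70;            /* fixed format, current error */
	sb[2]  = ABORTED_COMMAND; /* sense key */
	sb[7]  = 0x0a;            /* additional sense length */
	sb[12] = 0x47;            /* ASC: parity error (the "fake parity") */
	sb[13] = 0x00;            /* ASCQ */

	/* Driver byte in bits 31..24, SCSI status in the low byte. */
	return ((uint32_t)DRIVER_SENSE << 24) | SAM_STAT_CHECK_CONDITION;
}
```

A caller would pass a zeroed buffer of SENSE_BUFFER_SIZE bytes and store the return value where the patch stores scmd->result. The CHECK CONDITION status together with the ABORTED COMMAND sense key is what the patch comment relies on for "Retry if possible".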