From: Niklas Cassel <cassel@kernel.org>
To: Damien Le Moal <dlemoal@kernel.org>
Cc: linux-ide@vger.kernel.org
Subject: Re: [PATCH 1/2] ata: libata-eh: correctly handle deferred qc timeouts
Date: Fri, 20 Feb 2026 17:37:12 +0100 [thread overview]
Message-ID: <aZiNsC8fguChYpn2@ryzen> (raw)
In-Reply-To: <20260220050053.390135-2-dlemoal@kernel.org>
On Fri, Feb 20, 2026 at 02:00:52PM +0900, Damien Le Moal wrote:
> A differed qc may timeout while waiting for the device queue to drain
Nit: s/differed/deferred/
> to be submitted. In such case, since the qc is not active,
> ata_scsi_cmd_error_handler() ends up calling scsi_eh_finish_cmd(),
> which frees the qc. But as the port deferred_qc field still references
> this finished/freed qc, the deferred qc work may eventually attempt to
> call ata_qc_issue() against this invalid qc, leading to errors such as
> reported by UBSAN (syzbot run):
>
> UBSAN: shift-out-of-bounds in drivers/ata/libata-core.c:5166:24
> shift exponent 4210818301 is too large for 64-bit type 'long long unsigned int'
> ...
> Call Trace:
> <TASK>
> __dump_stack lib/dump_stack.c:94 [inline]
> dump_stack_lvl+0x100/0x190 lib/dump_stack.c:120
> ubsan_epilogue+0xa/0x30 lib/ubsan.c:233
> __ubsan_handle_shift_out_of_bounds+0x279/0x2a0 lib/ubsan.c:494
> ata_qc_issue.cold+0x38/0x9f drivers/ata/libata-core.c:5166
> ata_scsi_deferred_qc_work+0x154/0x1f0 drivers/ata/libata-scsi.c:1679
> process_one_work+0x9d7/0x1920 kernel/workqueue.c:3275
> process_scheduled_works kernel/workqueue.c:3358 [inline]
> worker_thread+0x5da/0xe40 kernel/workqueue.c:3439
> kthread+0x370/0x450 kernel/kthread.c:467
> ret_from_fork+0x754/0xd80 arch/x86/kernel/process.c:158
> ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:245
> </TASK>
>
> Fix this by checking if the qc of a timed out command is a deferred one,
> and in such case, clear the port deferred_qc field and finish the scsi
> command with DID_TIME_OUT.
>
> Reported-by: syzbot+1f77b8ca15336fff21ff@syzkaller.appspotmail.com
> Fixes: 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation")
> Signed-off-by: Damien Le Moal <dlemoal@kernel.org>
> ---
> drivers/ata/libata-eh.c | 20 +++++++++++++++++---
> 1 file changed, 17 insertions(+), 3 deletions(-)
>
> diff --git a/drivers/ata/libata-eh.c b/drivers/ata/libata-eh.c
> index 72a22b6c9682..f86085f9b476 100644
> --- a/drivers/ata/libata-eh.c
> +++ b/drivers/ata/libata-eh.c
> @@ -640,12 +640,26 @@ void ata_scsi_cmd_error_handler(struct Scsi_Host *host, struct ata_port *ap,
> set_host_byte(scmd, DID_OK);
>
> ata_qc_for_each_raw(ap, qc, i) {
> - if (qc->flags & ATA_QCFLAG_ACTIVE &&
> - qc->scsicmd == scmd)
> + if (qc->scsicmd != scmd)
> + continue;
> + if ((qc->flags & ATA_QCFLAG_ACTIVE) ||
> + qc == ap->deferred_qc)
> break;
> }
>
> - if (i < ATA_MAX_QUEUE) {
> + if (qc == ap->deferred_qc) {
> + /*
> + * This is a deferred command that timedout while
s/timedout/timed out/
> + * waiting for the command queue to drain. Since the qc
> + * is not active yet, simply signal the timeout by
How do we know for sure that the QC is not active yet?
The answer appears to be that we always clear ap->deferred_qc before
issuing the deferred QC, thus ap->deferred_qc will never have flag
ATA_QCFLAG_ACTIVE set.
Perhaps we could somehow make this clearer in the comment.
Perhaps even a WARN_ON(qc->flags & ATA_QCFLAG_ACTIVE) ?
Otherwise, this patch looks good to me.
> + * finishing the SCSI command and clear the deferred qc
> + * to prevent the deferred qc work from issuing this
> + * qc.
> + */
> + ap->deferred_qc = NULL;
> + set_host_byte(scmd, DID_TIME_OUT);
> + scsi_eh_finish_cmd(scmd, &ap->eh_done_q);
> + } else if (i < ATA_MAX_QUEUE) {
> /* the scmd has an associated qc */
> if (!(qc->flags & ATA_QCFLAG_EH)) {
> /* which hasn't failed yet, timeout */
> --
> 2.53.0
>
next prev parent reply other threads:[~2026-02-20 16:37 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-02-20 5:00 [PATCH 0/2] ATA port deferred qc fixes Damien Le Moal
2026-02-20 5:00 ` [PATCH 1/2] ata: libata-eh: correctly handle deferred qc timeouts Damien Le Moal
2026-02-20 16:37 ` Niklas Cassel [this message]
2026-02-20 5:00 ` [PATCH 2/2] ata: libata-core: fix cancellation of a port deferred qc work Damien Le Moal
2026-02-20 17:33 ` Igor Pylypiv
2026-02-20 21:48 ` Damien Le Moal
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aZiNsC8fguChYpn2@ryzen \
--to=cassel@kernel.org \
--cc=dlemoal@kernel.org \
--cc=linux-ide@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox