Linux ATA/IDE development
From: Damien Le Moal <dlemoal@kernel.org>
To: Niklas Cassel <cassel@kernel.org>, Tommy Kelly <linux@tkel.ly>,
	"Martin K. Petersen" <martin.petersen@oracle.com>,
	John Garry <john.g.garry@oracle.com>
Cc: linux-ide@vger.kernel.org
Subject: Re: [PATCH v2 1/2] ata: libata-scsi: do not use the deferred QC feature on PMPs with CBS
Date: Tue, 12 May 2026 10:57:08 +0900
Message-ID: <5aeae334-3bcb-42c6-a779-f07589d49748@kernel.org>
In-Reply-To: <20260508193240.176735-5-cassel@kernel.org>

On 5/9/26 04:32, Niklas Cassel wrote:
> When using Port Multipliers (PMPs) with Command-Based Switching (CBS), you
> can only issue commands to one link at a time. For PMPs with CBS, there is
> already code to handle commands being sent to different links in
> sata_pmp_qc_defer_cmd_switch() using ap->excl_link.
> 
> A user on the list reported that commit 0ea84089dbf6 ("ata: libata-scsi:
> avoid Non-NCQ command starvation") broke PMPs with CBS. The commit
> introduced code that stores a deferred qc in ap->deferred_qc, to later be
> issued via a workqueue. It turns out that this change is incompatible with
> the existing ap->excl_link handling used by PMPs with CBS.
> 
> Thus, modify sata_pmp_qc_defer_cmd_switch() code to return
> ATA_DEFER_PORT_PMP_CBS and ATA_DEFER_LINK_PMP_CBS, and make sure that the
> deferred QC handling via workqueue is not used for these return values.
> 
> This way, PMPs with CBS will work once again. Note that the starvation
> referenced in commit 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ
> command starvation") can only happen on libsas ports, and libsas does not
> support Port Multipliers, thus there is no harm of reverting back to the
> previous way of deferring commands for PMPs with CBS.
> 
> Non-libsas ports connected to anything but a PMP with CBS (e.g. a normal
> drive or a PMP with FBS) will continue using the deferred workqueue, since
> it does result in lower completion latencies for non-NCQ commands, even
> though the workqueue is not strictly needed to avoid starvation for
> non-libsas ports.
> 
> If we want to modify the scope of the workqueue issuing to also handle
> PMPs with CBS, then we should ensure that we can save both NCQ and non-NCQ
> commands in ap->deferred_qc, while also removing the existing PMP CBS
> handling using ap->excl_link, such that we don't duplicate features.
> 
> While at it, also add a comment explaining how the ap->excl_link mechanism
> works.
> 
> Fixes: 0ea84089dbf6 ("ata: libata-scsi: avoid Non-NCQ command starvation")
> Signed-off-by: Niklas Cassel <cassel@kernel.org>

Looks good to me. See some nits below.

> ---
>  drivers/ata/libata-pmp.c  | 24 ++++++++++++++--
>  drivers/ata/libata-scsi.c | 58 ++++++++++++++++++++++++---------------
>  include/linux/libata.h    |  2 ++
>  3 files changed, 60 insertions(+), 24 deletions(-)
> 
> diff --git a/drivers/ata/libata-pmp.c b/drivers/ata/libata-pmp.c
> index e3adc008fed1..d847bdff6d0a 100644
> --- a/drivers/ata/libata-pmp.c
> +++ b/drivers/ata/libata-pmp.c
> @@ -113,14 +113,34 @@ int sata_pmp_qc_defer_cmd_switch(struct ata_queued_cmd *qc)
>  
>  	if (ap->excl_link == NULL || ap->excl_link == link) {
>  		if (ap->nr_active_links == 0 || ata_link_active(link)) {
> +			int ret;
> +
>  			qc->flags |= ATA_QCFLAG_CLEAR_EXCL;
> -			return ata_std_qc_defer(qc);
> +			ret = ata_std_qc_defer(qc);
> +			switch (ret) {
> +			case 0:
> +				return ret;
> +			case ATA_DEFER_LINK:
> +				return ATA_DEFER_LINK_PMP_CBS;
> +			case ATA_DEFER_PORT:
> +				return ATA_DEFER_PORT_PMP_CBS;
> +			default:
> +				WARN_ON_ONCE(1);
> +				return ATA_DEFER_PORT_PMP_CBS;
> +			}
>  		}
>  
> +		/*
> +		 * Note: ap->excl_link contains the link that is next in line,
> +		 * i.e. implicit round robin. If there is only one link
> +		 * dispatching, ap->excl_link will be left unclaimed, allowing
> +		 * other links to set ap->excl_link, ensuring that the currently
> +		 * active link cannot queue any more.
> +		 */
>  		ap->excl_link = link;
>  	}
>  
> -	return ATA_DEFER_PORT;
> +	return ATA_DEFER_PORT_PMP_CBS;
>  }
>  
>  /**
> diff --git a/drivers/ata/libata-scsi.c b/drivers/ata/libata-scsi.c
> index f44612e269a4..6f273c5d0cd3 100644
> --- a/drivers/ata/libata-scsi.c
> +++ b/drivers/ata/libata-scsi.c
> @@ -1767,7 +1767,7 @@ static int ata_scsi_qc_issue(struct ata_port *ap, struct ata_queued_cmd *qc)
>  	int ret;
>  
>  	if (!ap->ops->qc_defer)
> -		goto issue;
> +		goto issue_qc;
>  
>  	/*
>  	 * If we already have a deferred qc, then rely on the SCSI layer to
> @@ -1783,38 +1783,52 @@ static int ata_scsi_qc_issue(struct ata_port *ap, struct ata_queued_cmd *qc)
>  	ret = ap->ops->qc_defer(qc);
>  	switch (ret) {
>  	case 0:
> -		break;
> +		goto issue_qc;

Please keep the break here (see below).

>  	case ATA_DEFER_LINK:
>  		ret = SCSI_MLQUEUE_DEVICE_BUSY;
> -		break;
> +		goto store_qc;
>  	case ATA_DEFER_PORT:
>  		ret = SCSI_MLQUEUE_HOST_BUSY;
> -		break;
> +		goto store_qc;
> +	case ATA_DEFER_LINK_PMP_CBS:
> +		/*
> +		 * PMP in CBS mode has independent handling using ap->excl_link
> +		 * that is incompatible with ap->deferred_qc workqueue handling.
> +		 */
> +		ret = SCSI_MLQUEUE_DEVICE_BUSY;
> +		goto free_qc;
> +	case ATA_DEFER_PORT_PMP_CBS:
> +		/*
> +		 * PMP in CBS mode has independent handling using ap->excl_link
> +		 * that is incompatible with ap->deferred_qc workqueue handling.
> +		 */
> +		ret = SCSI_MLQUEUE_HOST_BUSY;
> +		goto free_qc;

The repeated comment is not great. How about merging the two cases like this?

+	case ATA_DEFER_LINK_PMP_CBS:
+	case ATA_DEFER_PORT_PMP_CBS:
+		/*
+		 * PMP in CBS mode has independent handling using ap->excl_link
+		 * that is incompatible with ap->deferred_qc workqueue handling.
+		 */
+		if (ret == ATA_DEFER_LINK_PMP_CBS)
+			ret = SCSI_MLQUEUE_DEVICE_BUSY;
+		else
+			ret = SCSI_MLQUEUE_HOST_BUSY;
+		goto free_qc;

>  	default:
>  		WARN_ON_ONCE(1);
>  		ret = SCSI_MLQUEUE_HOST_BUSY;
> -		break;
> +		goto free_qc;
>  	}
>  
> -	if (ret) {
> -		/*
> -		 * We must defer this qc: if this is not an NCQ command, keep
> -		 * this qc as a deferred one and report to the SCSI layer that
> -		 * we issued it so that it is not requeued. The deferred qc will
> -		 * be issued with the port deferred_qc_work once all on-going
> -		 * commands complete.
> -		 */
> -		if (!ata_is_ncq(qc->tf.protocol)) {
> -			ap->deferred_qc = qc;
> -			return 0;
> -		}
> -
> -		/* Force a requeue of the command to defer its execution. */
> -		ata_qc_free(qc);
> -		return ret;
> +store_qc:

defer_qc would be a better label name, since we do not necessarily "store" the
qc here (NCQ commands fall through and are freed).

> +	/*
> +	 * We must defer this qc: if this is not an NCQ command, keep
> +	 * this qc as a deferred one and report to the SCSI layer that
> +	 * we issued it so that it is not requeued. The deferred qc will
> +	 * be issued with the port deferred_qc_work once all on-going
> +	 * commands complete.
> +	 */
> +	if (!ata_is_ncq(qc->tf.protocol)) {
> +		ap->deferred_qc = qc;
> +		return 0;
>  	}
>  
> -issue:
> +free_qc:
> +	/* Force a requeue of the command to defer its execution. */
> +	ata_qc_free(qc);
> +	return ret;
> +
> +issue_qc:
>  	ata_qc_issue(qc);
>  
>  	return 0;

Since issuing the qc is the normal case most of the time, let's keep it at the
top, right after the switch/case, so that case 0 can simply break out of the
switch.
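
To make the suggested ordering concrete, here is a rough user-space sketch of
the control flow I have in mind. It is not the actual patch and it is untested
against the kernel: mock_qc_issue(), struct mock_result, enum mock_action and
the MOCK_* values are made-up stand-ins for ata_scsi_qc_issue(), the qc state
and the SCSI_MLQUEUE_* codes. Only the ATA_DEFER_* values are taken from the
libata.h hunk of this patch, and the WARN_ON_ONCE() of the default case is
left out.

```c
#include <stdbool.h>

/* Values copied from the include/linux/libata.h hunk of this patch. */
enum {
	ATA_DEFER_LINK		= 1,
	ATA_DEFER_PORT		= 2,
	ATA_DEFER_LINK_PMP_CBS	= 3,
	ATA_DEFER_PORT_PMP_CBS	= 4,
};

/* Hypothetical stand-ins for SCSI_MLQUEUE_DEVICE_BUSY / SCSI_MLQUEUE_HOST_BUSY. */
enum {
	MOCK_DEVICE_BUSY	= 100,
	MOCK_HOST_BUSY		= 101,
};

/* What the mock did with the command (hypothetical, for illustration only). */
enum mock_action {
	MOCK_ISSUED,	/* stands for ata_qc_issue(qc) */
	MOCK_DEFERRED,	/* stands for ap->deferred_qc = qc */
	MOCK_FREED,	/* stands for ata_qc_free(qc) */
};

struct mock_result {
	int ret;
	enum mock_action action;
};

/* 'defer' plays the role of the value returned by ap->ops->qc_defer(qc). */
static struct mock_result mock_qc_issue(int defer, bool is_ncq)
{
	struct mock_result res = { .ret = 0, .action = MOCK_ISSUED };
	int ret = defer;

	switch (ret) {
	case 0:
		break;
	case ATA_DEFER_LINK:
		ret = MOCK_DEVICE_BUSY;
		goto defer_qc;
	case ATA_DEFER_PORT:
		ret = MOCK_HOST_BUSY;
		goto defer_qc;
	case ATA_DEFER_LINK_PMP_CBS:
	case ATA_DEFER_PORT_PMP_CBS:
		/* PMP with CBS: never use the deferred qc, always requeue. */
		if (ret == ATA_DEFER_LINK_PMP_CBS)
			ret = MOCK_DEVICE_BUSY;
		else
			ret = MOCK_HOST_BUSY;
		goto free_qc;
	default:
		ret = MOCK_HOST_BUSY;
		goto free_qc;
	}

	/* Normal case: nothing to defer, issue the command. */
	return res;

defer_qc:
	/* Non-NCQ commands are kept as the deferred qc and reported as issued. */
	if (!is_ncq) {
		res.action = MOCK_DEFERRED;
		return res;
	}

free_qc:
	/* Force a requeue of the command to defer its execution. */
	res.ret = ret;
	res.action = MOCK_FREED;
	return res;
}
```

With this layout the issue path needs no label at all, and only the two
deferral paths (defer_qc falling through to free_qc) sit below it.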

> diff --git a/include/linux/libata.h b/include/linux/libata.h
> index 5c085ef4eda7..511cdf1a6650 100644
> --- a/include/linux/libata.h
> +++ b/include/linux/libata.h
> @@ -371,6 +371,8 @@ enum {
>  	/* return values for ->qc_defer */
>  	ATA_DEFER_LINK		= 1,
>  	ATA_DEFER_PORT		= 2,
> +	ATA_DEFER_LINK_PMP_CBS	= 3,
> +	ATA_DEFER_PORT_PMP_CBS	= 4,
>  
>  	/* desc_len for ata_eh_info and context */
>  	ATA_EH_DESC_LEN		= 80,


-- 
Damien Le Moal
Western Digital Research

Thread overview:
2026-05-08 19:32 [PATCH v2 0/2] ata: fix deferred QC handling for port multipliers Niklas Cassel
2026-05-08 19:32 ` [PATCH v2 1/2] ata: libata-scsi: do not use the deferred QC feature on PMPs with CBS Niklas Cassel
2026-05-12  1:57   ` Damien Le Moal [this message]
2026-05-08 19:32 ` [PATCH v2 2/2] ata: libata-scsi: do not needlessly defer commands when using PMP with FBS Niklas Cassel
