From: Nathan Lynch <nathan.lynch@amd.com>
To: <sashiko-reviews@lists.linux.dev>
Cc: <dmaengine@vger.kernel.org>, <linux-pci@vger.kernel.org>,
<vkoul@kernel.org>, <Frank.Li@kernel.org>
Subject: Re: [PATCH v2 04/23] dmaengine: sdxi: Feature discovery and initial configuration
Date: Fri, 5 Jun 2026 16:25:10 -0500 [thread overview]
Message-ID: <87v7bwd6ix.fsf@amd.com> (raw)
In-Reply-To: <20260513003352.8EDB2C2BCB0@smtp.kernel.org>
sashiko-bot@kernel.org writes:
> Thank you for your contribution! Sashiko AI review found 2 potential issue(s) to consider:
> - [High] Hardware state machine mismanagement and TOCTOU race during device initialization.
> - [Low] Use of 64-bit MMIO accessors without proper fallback causes build failures on 32-bit architectures.
> --
>
> commit 20635a3fda61c4d58fcd763198caf0840128ff21
> Author: Nathan Lynch <nathan.lynch@amd.com>
>
> dmaengine: sdxi: Feature discovery and initial configuration
>
> Discover via the capability registers the doorbell region stride, the
> maximum supported context ID, the operation groups implemented, and
> limits on buffer and control structure sizes. The driver has the
> option of writing more conservative limits to the ctl2 register, but
> it uses those supplied by the implementation for now.
>
>> diff --git a/drivers/dma/sdxi/device.c b/drivers/dma/sdxi/device.c
>> index b718ce04afa07..f9a9944ad8929 100644
>> --- a/drivers/dma/sdxi/device.c
>> +++ b/drivers/dma/sdxi/device.c
> [ ... ]
>> @@ -5,14 +5,180 @@
>> +/* Get the device to the GSV_STOP state. */
>> +static int sdxi_dev_stop(struct sdxi_dev *sdxi)
>> +{
>> + enum sdxi_fn_gsv status = sdxi_dev_gsv(sdxi);
>> + int ret;
>> +
>> + dev_dbg(sdxi->dev, "attempting stop, current state: %s\n",
>> + gsv_str(status));
>> +
>> + switch (status) {
>> + case SDXI_GSV_INIT:
>> + case SDXI_GSV_ACTIVE:
>> + sdxi_write_fn_gsr(sdxi, SDXI_GSRV_STOP_SF);
>> + break;
>> + case SDXI_GSV_STOPG_SF:
>> + sdxi_write_fn_gsr(sdxi, SDXI_GSRV_STOP_HD);
>> + break;
>> + case SDXI_GSV_STOPG_HD:
>> + case SDXI_GSV_ERROR:
>> + /*
>> + * If hard-stopping, there's nothing to do but wait.
>> + * If in error state, the reset is issued below.
>> + */
>> + break;
>> + default:
>> + /* Unrecognized state; try a reset. */
>> + sdxi_write_fn_gsr(sdxi, SDXI_GSRV_RESET);
>> + break;
>> + }
>
> If the device is already in the SDXI_GSV_STOP state, does the switch fall
> through to the default case and unintentionally issue a reset?
>
> The subsequent read_poll_timeout() check would then immediately return success
> because the state evaluates as STOP before the background reset actually begins.
> Could this allow the reset to proceed in the background while the driver
> continues with initialization, potentially wiping the configuration?
No, this is ok as written; when the function is in GSV_STOP, it ignores
any value written to fn_gsr except for SDXI_GSRV_ACTIVE.
>> +/*
>> + * See SDXI 1.0 4.1.8 Activation of the SDXI Function by Software.
>> + */
>> +static int sdxi_fn_activate(struct sdxi_dev *sdxi)
>> +{
>> + u64 version, cap0, cap1, ctl2;
>> + int err;
>> +
>> + /*
>> + * Clear any existing configuration from MMIO_CTL0 and ensure
>> + * the function is in GSV_STOP state.
>> + */
>> + sdxi_write64(sdxi, SDXI_MMIO_CTL0, 0);
>> + err = sdxi_dev_stop(sdxi);
>
> Does writing 0 to SDXI_MMIO_CTL0 implicitly write SDXI_GSRV_RESET (value 0)
> to the FN_GSR command field?
>
> If so, sdxi_dev_stop() might read a stale device state while the hardware is
> processing the reset, and overwrite the command field with a conflicting command
> like SDXI_GSRV_STOP_SF. Can this lead to a hardware state machine
> conflict?
I don't think there's a state machine conflict (see Figure 4-1: SDXI
Function States and State Transitions in the spec). But we probably
shouldn't change the fn_pasid_vl or fn_err_intr_en fields in the
register unless the function is already in stopped state.
Another reason to be more careful here is that we shouldn't clear
MMIO_CTL0's fn_grp_id field, which is software-assignable for function
grouping, and is read-only for VFs.
>> @@ -30,9 +34,24 @@ struct sdxi_dev {
>> +static inline u64 sdxi_read64(const struct sdxi_dev *sdxi, enum sdxi_reg reg)
>> +{
>> + return ioread64(sdxi->ctrl_regs + reg);
>> +}
>> +
>> +static inline void sdxi_write64(struct sdxi_dev *sdxi, enum sdxi_reg reg, u64 val)
>> +{
>> + iowrite64(val, sdxi->ctrl_regs + reg);
>> +}
>
> Can this cause build failures on 32-bit architectures?
>
> The functions use ioread64() and iowrite64(), which typically rely on readq()
> and writeq() that are not defined on 32-bit platforms.
>
> Although linux/io-64-nonatomic-lo-hi.h is included at the top of the file,
> the code does not use the macros actually provided by that header
> (e.g., ioread64_lo_hi()).
I decided to just make the driver depend on 64BIT and use readq/writeq.
We can take up 32-bit build coverage later.
next prev parent reply other threads:[~2026-06-05 21:25 UTC|newest]
Thread overview: 58+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-05-11 19:16 [PATCH v2 00/23] dmaengine: Smart Data Accelerator Interface (SDXI) basic support Nathan Lynch via B4 Relay
2026-05-11 19:16 ` [PATCH v2 01/23] PCI: Add SNIA SDXI accelerator sub-class Nathan Lynch via B4 Relay
2026-05-11 20:48 ` Frank Li
2026-05-12 23:50 ` sashiko-bot
2026-05-26 23:44 ` Nathan Lynch
2026-05-11 19:16 ` [PATCH v2 02/23] MAINTAINERS: Add entry for SDXI driver Nathan Lynch via B4 Relay
2026-05-11 19:16 ` [PATCH v2 03/23] dmaengine: sdxi: Add PCI initialization Nathan Lynch via B4 Relay
2026-05-11 21:22 ` Frank Li
2026-05-13 0:05 ` sashiko-bot
2026-05-26 23:28 ` Nathan Lynch
2026-05-11 19:16 ` [PATCH v2 04/23] dmaengine: sdxi: Feature discovery and initial configuration Nathan Lynch via B4 Relay
2026-05-11 21:30 ` Frank Li
2026-05-13 0:33 ` sashiko-bot
2026-06-05 21:25 ` Nathan Lynch [this message]
2026-05-11 19:16 ` [PATCH v2 05/23] dmaengine: sdxi: Configure context tables Nathan Lynch via B4 Relay
2026-05-13 1:12 ` sashiko-bot
2026-06-05 22:19 ` Nathan Lynch
2026-05-11 19:16 ` [PATCH v2 06/23] dmaengine: sdxi: Allocate DMA pools Nathan Lynch via B4 Relay
2026-05-13 1:30 ` sashiko-bot
2026-05-27 0:05 ` Nathan Lynch
2026-05-11 19:16 ` [PATCH v2 07/23] dmaengine: sdxi: Allocate administrative context Nathan Lynch via B4 Relay
2026-05-13 2:20 ` sashiko-bot
2026-05-27 0:07 ` Nathan Lynch
2026-05-11 19:16 ` [PATCH v2 08/23] dmaengine: sdxi: Install " Nathan Lynch via B4 Relay
2026-05-13 3:17 ` sashiko-bot
2026-06-05 23:26 ` Nathan Lynch
2026-05-11 19:16 ` [PATCH v2 09/23] dmaengine: sdxi: Start functions on probe, stop on remove Nathan Lynch via B4 Relay
2026-05-13 3:35 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 10/23] dmaengine: sdxi: Complete administrative context jump start Nathan Lynch via B4 Relay
2026-05-13 3:54 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 11/23] dmaengine: sdxi: Add client context alloc and release APIs Nathan Lynch via B4 Relay
2026-05-13 4:46 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 12/23] dmaengine: sdxi: Add descriptor ring management Nathan Lynch via B4 Relay
2026-05-13 5:21 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 13/23] dmaengine: sdxi: Add unit tests for descriptor ring reservations Nathan Lynch via B4 Relay
2026-05-13 5:48 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 14/23] dmaengine: sdxi: Attach descriptor ring state to contexts Nathan Lynch via B4 Relay
2026-05-13 19:31 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 15/23] dmaengine: sdxi: Per-context access key (AKey) table entry allocator Nathan Lynch via B4 Relay
2026-05-13 19:54 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 16/23] dmaengine: sdxi: Generic descriptor manipulation helpers Nathan Lynch via B4 Relay
2026-05-13 20:21 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 17/23] dmaengine: sdxi: Add completion status block API Nathan Lynch via B4 Relay
2026-05-13 20:38 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 18/23] dmaengine: sdxi: Encode context start, stop, and sync descriptors Nathan Lynch via B4 Relay
2026-05-11 19:16 ` [PATCH v2 19/23] dmaengine: sdxi: Provide context start and stop APIs Nathan Lynch via B4 Relay
2026-05-13 21:18 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 20/23] dmaengine: sdxi: Encode nop, copy, and interrupt descriptors Nathan Lynch via B4 Relay
2026-05-13 21:33 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 21/23] dmaengine: sdxi: Add unit tests for descriptor encoding Nathan Lynch via B4 Relay
2026-05-13 21:55 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 22/23] dmaengine: sdxi: MSI/MSI-X vector allocation and mapping Nathan Lynch via B4 Relay
2026-05-13 22:17 ` sashiko-bot
2026-05-11 19:16 ` [PATCH v2 23/23] dmaengine: sdxi: Add DMA engine provider Nathan Lynch via B4 Relay
2026-05-11 20:47 ` Frank Li
2026-05-11 22:28 ` Lynch, Nathan
2026-05-13 20:01 ` Frank Li
2026-05-13 22:57 ` sashiko-bot
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87v7bwd6ix.fsf@amd.com \
--to=nathan.lynch@amd.com \
--cc=Frank.Li@kernel.org \
--cc=dmaengine@vger.kernel.org \
--cc=linux-pci@vger.kernel.org \
--cc=sashiko-reviews@lists.linux.dev \
--cc=vkoul@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox