public inbox for rust-for-linux@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment
@ 2026-04-04 12:14 Alexandre Courbot
  2026-04-04 13:35 ` Gary Guo
  2026-04-04 22:24 ` John Hubbard
  0 siblings, 2 replies; 4+ messages in thread
From: Alexandre Courbot @ 2026-04-04 12:14 UTC (permalink / raw)
  To: Danilo Krummrich, Alice Ryhl, David Airlie, Simona Vetter,
	Gary Guo
  Cc: John Hubbard, Alistair Popple, Joel Fernandes, Timur Tabi,
	Zhi Wang, Eliot Courtney, dri-devel, rust-for-linux, linux-kernel,
	Alexandre Courbot

Commit a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
dropped the nova-local `DmaObject` device memory type for the
kernel-global `Coherent` one.

This switch had a side-effect: `DmaOject` always aligned the requested
size to `PAGE_SIZE`, and also reported that adjusted size when queried.
`Coherent`, on the other hand, does page-align allocation sizes but
only allows CPU access on the exact size provided by the caller.

This change runs into a limitation of falcon DMA copies, namely that DMA
accesses are done on blocks of exactly 256 bytes. If the provided data
does not have a length that is a multiple of 256, `dma_wr` returns
an error.

It was expected that all firmwares would present the proper adjusted
size, but this is not the case at least on my GA107:

    NovaCore 0000:08:00.0: DMA transfer goes beyond range of DMA object
    NovaCore 0000:08:00.0: Failed to load FWSEC firmware: EINVAL
    NovaCore 0000:08:00.0: probe with driver NovaCore failed with error -22

Fix this by aligning the `Coherent`'s size to `MEM_BLOCK_ALIGNMENT`
(i.e. 256) when allocating it and filling it with zeroes, before
copying the firmware on top of it.

Fixes: a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
---
 drivers/gpu/nova-core/falcon.rs | 21 +++++++++++++++++++--
 1 file changed, 19 insertions(+), 2 deletions(-)

diff --git a/drivers/gpu/nova-core/falcon.rs b/drivers/gpu/nova-core/falcon.rs
index e0315fda576b..eb0cf1f7329a 100644
--- a/drivers/gpu/nova-core/falcon.rs
+++ b/drivers/gpu/nova-core/falcon.rs
@@ -11,6 +11,7 @@
     },
     dma::{
         Coherent,
+        CoherentBox,
         DmaAddress,
         DmaMask, //
     },
@@ -613,8 +614,24 @@ fn dma_load<F: FalconFirmware<Target = E> + FalconDmaLoadable>(
         bar: &Bar0,
         fw: &F,
     ) -> Result {
-        // Create DMA object with firmware content as the source of the DMA engine.
-        let dma_obj = Coherent::from_slice(dev, fw.as_slice(), GFP_KERNEL)?;
+        // DMA object with firmware content as the source of the DMA engine.
+        let dma_obj = {
+            let fw_slice = fw.as_slice();
+
+            // DMA copies are done in chunks of `MEM_BLOCK_ALIGNMENT`, so align the length
+            // accordingly and fill with `0`.
+            let mut dma_obj = CoherentBox::zeroed_slice(
+                dev,
+                fw_slice.len().next_multiple_of(MEM_BLOCK_ALIGNMENT),
+                GFP_KERNEL,
+            )?;
+
+            // PANIC: `dma_obj` has been created with a length equal to or larger than
+            // `fw_slice.len()`, so the range `0..fw_slice.len()` is valid.
+            dma_obj[0..fw_slice.len()].copy_from_slice(fw_slice);
+
+            dma_obj.into()
+        };
 
         self.dma_reset(bar);
         bar.update(regs::NV_PFALCON_FBIF_TRANSCFG::of::<E>().at(0), |v| {

---
base-commit: 7c50d748b4a635bc39802ea3f6b120e66b1b9067
change-id: 20260404-falcon-dma-roundup-0edd764a9840

Best regards,
--  
Alexandre Courbot <acourbot@nvidia.com>


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment
  2026-04-04 12:14 [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment Alexandre Courbot
@ 2026-04-04 13:35 ` Gary Guo
  2026-04-04 22:24 ` John Hubbard
  1 sibling, 0 replies; 4+ messages in thread
From: Gary Guo @ 2026-04-04 13:35 UTC (permalink / raw)
  To: Alexandre Courbot, Danilo Krummrich, Alice Ryhl, David Airlie,
	Simona Vetter, Gary Guo
  Cc: John Hubbard, Alistair Popple, Joel Fernandes, Timur Tabi,
	Zhi Wang, Eliot Courtney, dri-devel, rust-for-linux, linux-kernel

On Sat Apr 4, 2026 at 1:14 PM BST, Alexandre Courbot wrote:
> Commit a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
> dropped the nova-local `DmaObject` device memory type for the
> kernel-global `Coherent` one.
>
> This switch had a side-effect: `DmaOject` always aligned the requested
> size to `PAGE_SIZE`, and also reported that adjusted size when queried.
> `Coherent`, on the other hand, does page-align allocation sizes but
> only allows CPU access on the exact size provided by the caller.
>
> This change runs into a limitation of falcon DMA copies, namely that DMA
> accesses are done on blocks of exactly 256 bytes. If the provided data
> does not have a length that is a multiple of 256, `dma_wr` returns
> an error.
>
> It was expected that all firmwares would present the proper adjusted
> size, but this is not the case at least on my GA107:
>
>     NovaCore 0000:08:00.0: DMA transfer goes beyond range of DMA object
>     NovaCore 0000:08:00.0: Failed to load FWSEC firmware: EINVAL
>     NovaCore 0000:08:00.0: probe with driver NovaCore failed with error -22
>
> Fix this by aligning the `Coherent`'s size to `MEM_BLOCK_ALIGNMENT`
> (i.e. 256) when allocating it and filling it with zeroes, before
> copying the firmware on top of it.
>
> Fixes: a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
> Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>

Reviewed-by: Gary Guo <gary@garyguo.net>

> ---
>  drivers/gpu/nova-core/falcon.rs | 21 +++++++++++++++++++--
>  1 file changed, 19 insertions(+), 2 deletions(-)
>
> diff --git a/drivers/gpu/nova-core/falcon.rs b/drivers/gpu/nova-core/falcon.rs
> index e0315fda576b..eb0cf1f7329a 100644
> --- a/drivers/gpu/nova-core/falcon.rs
> +++ b/drivers/gpu/nova-core/falcon.rs
> @@ -11,6 +11,7 @@
>      },
>      dma::{
>          Coherent,
> +        CoherentBox,
>          DmaAddress,
>          DmaMask, //
>      },
> @@ -613,8 +614,24 @@ fn dma_load<F: FalconFirmware<Target = E> + FalconDmaLoadable>(
>          bar: &Bar0,
>          fw: &F,
>      ) -> Result {
> -        // Create DMA object with firmware content as the source of the DMA engine.
> -        let dma_obj = Coherent::from_slice(dev, fw.as_slice(), GFP_KERNEL)?;
> +        // DMA object with firmware content as the source of the DMA engine.
> +        let dma_obj = {
> +            let fw_slice = fw.as_slice();
> +
> +            // DMA copies are done in chunks of `MEM_BLOCK_ALIGNMENT`, so align the length
> +            // accordingly and fill with `0`.
> +            let mut dma_obj = CoherentBox::zeroed_slice(
> +                dev,
> +                fw_slice.len().next_multiple_of(MEM_BLOCK_ALIGNMENT),
> +                GFP_KERNEL,
> +            )?;
> +
> +            // PANIC: `dma_obj` has been created with a length equal to or larger than
> +            // `fw_slice.len()`, so the range `0..fw_slice.len()` is valid.
> +            dma_obj[0..fw_slice.len()].copy_from_slice(fw_slice);

NIT: could be `dma_obj[..fw_slice.len()]`.

Best,
Gary

> +
> +            dma_obj.into()
> +        };
>  
>          self.dma_reset(bar);
>          bar.update(regs::NV_PFALCON_FBIF_TRANSCFG::of::<E>().at(0), |v| {
>
> ---
> base-commit: 7c50d748b4a635bc39802ea3f6b120e66b1b9067
> change-id: 20260404-falcon-dma-roundup-0edd764a9840
>
> Best regards,
> --  
> Alexandre Courbot <acourbot@nvidia.com>


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment
  2026-04-04 12:14 [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment Alexandre Courbot
  2026-04-04 13:35 ` Gary Guo
@ 2026-04-04 22:24 ` John Hubbard
  2026-04-05  2:12   ` Alexandre Courbot
  1 sibling, 1 reply; 4+ messages in thread
From: John Hubbard @ 2026-04-04 22:24 UTC (permalink / raw)
  To: Alexandre Courbot, Danilo Krummrich, Alice Ryhl, David Airlie,
	Simona Vetter, Gary Guo
  Cc: Alistair Popple, Joel Fernandes, Timur Tabi, Zhi Wang,
	Eliot Courtney, dri-devel, rust-for-linux, linux-kernel

On 4/4/26 5:14 AM, Alexandre Courbot wrote:
> Commit a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
> dropped the nova-local `DmaObject` device memory type for the
> kernel-global `Coherent` one.

Hi Alex,

The diffs look good, so:

Reviewed-by: John Hubbard <jhubbard@nvidia.com>


A small typo in 3 places changes the meaning enough to throw
things off:

The subject line should s/align/pad/. Alignment normally means
adjusting the starting address, whereas padding is what you are
doing here.

Same in the comments below:

> 
> This switch had a side-effect: `DmaOject` always aligned the requested
> size to `PAGE_SIZE`, and also reported that adjusted size when queried.
> `Coherent`, on the other hand, does page-align allocation sizes but
> only allows CPU access on the exact size provided by the caller.
> 
> This change runs into a limitation of falcon DMA copies, namely that DMA
> accesses are done on blocks of exactly 256 bytes. If the provided data
> does not have a length that is a multiple of 256, `dma_wr` returns
> an error.
> 
> It was expected that all firmwares would present the proper adjusted
> size, but this is not the case at least on my GA107:
> 
>     NovaCore 0000:08:00.0: DMA transfer goes beyond range of DMA object
>     NovaCore 0000:08:00.0: Failed to load FWSEC firmware: EINVAL
>     NovaCore 0000:08:00.0: probe with driver NovaCore failed with error -22
> 
> Fix this by aligning the `Coherent`'s size to `MEM_BLOCK_ALIGNMENT`

"Fix this by padding"

> (i.e. 256) when allocating it and filling it with zeroes, before
> copying the firmware on top of it.
> 
> Fixes: a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
> Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
> ---
>  drivers/gpu/nova-core/falcon.rs | 21 +++++++++++++++++++--
>  1 file changed, 19 insertions(+), 2 deletions(-)
> 
> diff --git a/drivers/gpu/nova-core/falcon.rs b/drivers/gpu/nova-core/falcon.rs
> index e0315fda576b..eb0cf1f7329a 100644
> --- a/drivers/gpu/nova-core/falcon.rs
> +++ b/drivers/gpu/nova-core/falcon.rs
> @@ -11,6 +11,7 @@
>      },
>      dma::{
>          Coherent,
> +        CoherentBox,
>          DmaAddress,
>          DmaMask, //
>      },
> @@ -613,8 +614,24 @@ fn dma_load<F: FalconFirmware<Target = E> + FalconDmaLoadable>(
>          bar: &Bar0,
>          fw: &F,
>      ) -> Result {
> -        // Create DMA object with firmware content as the source of the DMA engine.
> -        let dma_obj = Coherent::from_slice(dev, fw.as_slice(), GFP_KERNEL)?;
> +        // DMA object with firmware content as the source of the DMA engine.
> +        let dma_obj = {
> +            let fw_slice = fw.as_slice();
> +
> +            // DMA copies are done in chunks of `MEM_BLOCK_ALIGNMENT`, so align the length

"pad the length"

> +            // accordingly and fill with `0`.
> +            let mut dma_obj = CoherentBox::zeroed_slice(
> +                dev,
> +                fw_slice.len().next_multiple_of(MEM_BLOCK_ALIGNMENT),
> +                GFP_KERNEL,
> +            )?;
> +
> +            // PANIC: `dma_obj` has been created with a length equal to or larger than
> +            // `fw_slice.len()`, so the range `0..fw_slice.len()` is valid.
> +            dma_obj[0..fw_slice.len()].copy_from_slice(fw_slice);
> +
> +            dma_obj.into()
> +        };
>  
>          self.dma_reset(bar);
>          bar.update(regs::NV_PFALCON_FBIF_TRANSCFG::of::<E>().at(0), |v| {
> 
> ---
> base-commit: 7c50d748b4a635bc39802ea3f6b120e66b1b9067
> change-id: 20260404-falcon-dma-roundup-0edd764a9840
> 
> Best regards,
> --  
> Alexandre Courbot <acourbot@nvidia.com>
> 

thanks,
-- 
John Hubbard


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment
  2026-04-04 22:24 ` John Hubbard
@ 2026-04-05  2:12   ` Alexandre Courbot
  0 siblings, 0 replies; 4+ messages in thread
From: Alexandre Courbot @ 2026-04-05  2:12 UTC (permalink / raw)
  To: John Hubbard
  Cc: Danilo Krummrich, Alice Ryhl, David Airlie, Simona Vetter,
	Gary Guo, Alistair Popple, Joel Fernandes, Timur Tabi, Zhi Wang,
	Eliot Courtney, dri-devel, rust-for-linux, linux-kernel

On Sun Apr 5, 2026 at 7:24 AM JST, John Hubbard wrote:
> On 4/4/26 5:14 AM, Alexandre Courbot wrote:
>> Commit a88831502c8f ("gpu: nova-core: falcon: use dma::Coherent")
>> dropped the nova-local `DmaObject` device memory type for the
>> kernel-global `Coherent` one.
>
> Hi Alex,
>
> The diffs look good, so:
>
> Reviewed-by: John Hubbard <jhubbard@nvidia.com>
>
>
> A small typo in 3 places changes the meaning enough to throw
> things off:
>
> The subject line should s/align/pad/. Alignment normally means
> adjusting the starting address, whereas padding is what you are
> doing here.

Oooh indeed that's a bad one. Thanks for catching this - I'll send a v2
to make applying easier.

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2026-04-05  2:12 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-04-04 12:14 [PATCH] gpu: nova-core: falcon: align firmware DMA object size to required block alignment Alexandre Courbot
2026-04-04 13:35 ` Gary Guo
2026-04-04 22:24 ` John Hubbard
2026-04-05  2:12   ` Alexandre Courbot

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox