* [PATCH v10 0/3] Add dma coherent allocator abstraction
@ 2025-01-21 19:14 Abdiel Janulgue
2025-01-21 19:14 ` [PATCH v10 1/3] rust: error: Add EOVERFLOW Abdiel Janulgue
` (2 more replies)
0 siblings, 3 replies; 10+ messages in thread
From: Abdiel Janulgue @ 2025-01-21 19:14 UTC (permalink / raw)
To: rust-for-linux, daniel.almeida, dakr, robin.murphy, aliceryhl
Cc: Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo,
Björn Roy Baron, Benno Lossin, Andreas Hindborg,
Trevor Gross, Valentin Obst, open list, Christoph Hellwig,
Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS,
Abdiel Janulgue
Changes since v9:
- Use ARef<Device> in the constructor arguments, docs clarification avoid
manually dropping the refcount for the device in into_parts(), use
add() instead of wrapping_add() in the pointer arithmetic for performance
(Alice Ryhl).
- Link to v9: https://lore.kernel.org/all/20250121084756.1051758-1-abdiel.janulgue@gmail.com/
Changes since v8:
- Add MAINTAINERS entry
- Fix build issues due to switch from core::ffi to crate:ffi in bindgen.
- Ensure the wrapped attribute is non-pub in struct Attrs, declare it
#[repr(transparent)] as well (Daniel Sedlak)
Changes since v7:
- Remove cpu_buf() and cpu_buf_mut() as exporting a r/w interface via
a slice is undefined behaviour due to slice's requirement that the
underlying pointer should not be modified (Alice Ryhl, Robin Murphy).
- Reintroduce r/w helpers instead which includes proper safety
invariants (Daniel Almeida).
Changes since v6:
- Include the dma_attrs in the constructor, use alloc::Flags as inpiration
Changes since v5:
- Remove unnecessary lifetime annotation when returning the CPU buffer.
Changes since v4:
- Documentation and example fixes, use Markdown formatting (Miguel Ojeda).
- Discard read()/write() helpers to remove bound on Copy and fix overhead
(Daniel Almeida).
- Improve error-handling in the constructor block (Andreas Hindborg).
Changes since v3:
- Reject ZST types by checking the type size in the constructor in
addition to requiring FromBytes/AsBytes traits for the type (Alice Ryhl).
Changes since v2:
- Fixed missing header for generating the bindings.
Changes since v1:
- Fix missing info in commit log where EOVERFLOW is used.
- Restrict the dma coherent allocator to numeric types for now for valid
behaviour (Daniel Almeida).
- Build slice dynamically.
Abdiel Janulgue (3):
rust: error: Add EOVERFLOW
rust: add dma coherent allocator abstraction.
MAINTAINERS: add entry for Rust dma mapping helpers device driver API
MAINTAINERS | 10 ++
rust/bindings/bindings_helper.h | 1 +
rust/kernel/dma.rs | 281 ++++++++++++++++++++++++++++++++
rust/kernel/error.rs | 1 +
rust/kernel/lib.rs | 1 +
5 files changed, 294 insertions(+)
create mode 100644 rust/kernel/dma.rs
base-commit: ceff0757f5dafb5be5205988171809c877b1d3e3
--
2.43.0
^ permalink raw reply [flat|nested] 10+ messages in thread* [PATCH v10 1/3] rust: error: Add EOVERFLOW 2025-01-21 19:14 [PATCH v10 0/3] Add dma coherent allocator abstraction Abdiel Janulgue @ 2025-01-21 19:14 ` Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 2/3] rust: add dma coherent allocator abstraction Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API Abdiel Janulgue 2 siblings, 0 replies; 10+ messages in thread From: Abdiel Janulgue @ 2025-01-21 19:14 UTC (permalink / raw) To: rust-for-linux, daniel.almeida, dakr, robin.murphy, aliceryhl Cc: Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS, Abdiel Janulgue Trivial addition for missing EOVERFLOW error. This is used by a subsequent patch that might require returning EOVERFLOW as a result of `checked_mul`. Signed-off-by: Abdiel Janulgue <abdiel.janulgue@gmail.com> Reviewed-by: Alice Ryhl <aliceryhl@google.com> --- rust/kernel/error.rs | 1 + 1 file changed, 1 insertion(+) diff --git a/rust/kernel/error.rs b/rust/kernel/error.rs index f6ecf09cb65f..1e510181432c 100644 --- a/rust/kernel/error.rs +++ b/rust/kernel/error.rs @@ -64,6 +64,7 @@ macro_rules! declare_err { declare_err!(EPIPE, "Broken pipe."); declare_err!(EDOM, "Math argument out of domain of func."); declare_err!(ERANGE, "Math result not representable."); + declare_err!(EOVERFLOW, "Value too large for defined data type."); declare_err!(ERESTARTSYS, "Restart the system call."); declare_err!(ERESTARTNOINTR, "System call was interrupted by a signal and will be restarted."); declare_err!(ERESTARTNOHAND, "Restart if no handler."); -- 2.43.0 ^ permalink raw reply related [flat|nested] 10+ messages in thread
* [PATCH v10 2/3] rust: add dma coherent allocator abstraction. 2025-01-21 19:14 [PATCH v10 0/3] Add dma coherent allocator abstraction Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 1/3] rust: error: Add EOVERFLOW Abdiel Janulgue @ 2025-01-21 19:14 ` Abdiel Janulgue 2025-01-22 9:13 ` Alice Ryhl 2025-01-21 19:14 ` [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API Abdiel Janulgue 2 siblings, 1 reply; 10+ messages in thread From: Abdiel Janulgue @ 2025-01-21 19:14 UTC (permalink / raw) To: rust-for-linux, daniel.almeida, dakr, robin.murphy, aliceryhl Cc: Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS, Abdiel Janulgue Add a simple dma coherent allocator rust abstraction. Based on Andreas Hindborg's dma abstractions from the rnvme driver, which was also based on earlier work by Wedson Almeida Filho. Signed-off-by: Abdiel Janulgue <abdiel.janulgue@gmail.com> --- rust/bindings/bindings_helper.h | 1 + rust/kernel/dma.rs | 281 ++++++++++++++++++++++++++++++++ rust/kernel/lib.rs | 1 + 3 files changed, 283 insertions(+) create mode 100644 rust/kernel/dma.rs diff --git a/rust/bindings/bindings_helper.h b/rust/bindings/bindings_helper.h index 5c4dfe22f41a..49bf713b9bb6 100644 --- a/rust/bindings/bindings_helper.h +++ b/rust/bindings/bindings_helper.h @@ -11,6 +11,7 @@ #include <linux/blk_types.h> #include <linux/blkdev.h> #include <linux/cred.h> +#include <linux/dma-mapping.h> #include <linux/errname.h> #include <linux/ethtool.h> #include <linux/file.h> diff --git a/rust/kernel/dma.rs b/rust/kernel/dma.rs new file mode 100644 index 000000000000..ebae7270190e --- /dev/null +++ b/rust/kernel/dma.rs @@ -0,0 +1,281 @@ +// SPDX-License-Identifier: GPL-2.0 + +//! Direct memory access (DMA). +//! +//! C header: [`include/linux/dma-mapping.h`](srctree/include/linux/dma-mapping.h) + +use crate::{ + bindings, build_assert, + device::Device, + error::code::*, + error::Result, + transmute::{AsBytes, FromBytes}, + types::ARef, +}; + +/// Possible attributes associated with a DMA mapping. +/// +/// They can be combined with the operators `|`, `&`, and `!`. +/// +/// Values can be used from the [`attrs`] module. +#[derive(Clone, Copy, PartialEq)] +#[repr(transparent)] +pub struct Attrs(u32); + +impl Attrs { + /// Get the raw representation of this attribute. + pub(crate) fn as_raw(self) -> crate::ffi::c_ulong { + self.0 as _ + } + + /// Check whether `flags` is contained in `self`. + pub fn contains(self, flags: Attrs) -> bool { + (self & flags) == flags + } +} + +impl core::ops::BitOr for Attrs { + type Output = Self; + fn bitor(self, rhs: Self) -> Self::Output { + Self(self.0 | rhs.0) + } +} + +impl core::ops::BitAnd for Attrs { + type Output = Self; + fn bitand(self, rhs: Self) -> Self::Output { + Self(self.0 & rhs.0) + } +} + +impl core::ops::Not for Attrs { + type Output = Self; + fn not(self) -> Self::Output { + Self(!self.0) + } +} + +/// DMA mapping attrributes. +pub mod attrs { + use super::Attrs; + + /// Specifies that reads and writes to the mapping may be weakly ordered, that is that reads + /// and writes may pass each other. + pub const DMA_ATTR_WEAK_ORDERING: Attrs = Attrs(bindings::DMA_ATTR_WEAK_ORDERING); + + /// Specifies that writes to the mapping may be buffered to improve performance. + pub const DMA_ATTR_WRITE_COMBINE: Attrs = Attrs(bindings::DMA_ATTR_WRITE_COMBINE); + + /// Lets the platform to avoid creating a kernel virtual mapping for the allocated buffer. + pub const DMA_ATTR_NO_KERNEL_MAPPING: Attrs = Attrs(bindings::DMA_ATTR_NO_KERNEL_MAPPING); + + /// Allows platform code to skip synchronization of the CPU cache for the given buffer assuming + /// that it has been already transferred to 'device' domain. + pub const DMA_ATTR_SKIP_CPU_SYNC: Attrs = Attrs(bindings::DMA_ATTR_SKIP_CPU_SYNC); + + /// Forces contiguous allocation of the buffer in physical memory. + pub const DMA_ATTR_FORCE_CONTIGUOUS: Attrs = Attrs(bindings::DMA_ATTR_FORCE_CONTIGUOUS); + + /// This is a hint to the DMA-mapping subsystem that it's probably not worth the time to try + /// to allocate memory to in a way that gives better TLB efficiency. + pub const DMA_ATTR_ALLOC_SINGLE_PAGES: Attrs = Attrs(bindings::DMA_ATTR_ALLOC_SINGLE_PAGES); + + /// This tells the DMA-mapping subsystem to suppress allocation failure reports (similarly to + /// __GFP_NOWARN). + pub const DMA_ATTR_NO_WARN: Attrs = Attrs(bindings::DMA_ATTR_NO_WARN); + + /// Used to indicate that the buffer is fully accessible at an elevated privilege level (and + /// ideally inaccessible or at least read-only at lesser-privileged levels). + pub const DMA_ATTR_PRIVILEGED: Attrs = Attrs(bindings::DMA_ATTR_PRIVILEGED); +} + +/// An abstraction of the `dma_alloc_coherent` API. +/// +/// This is an abstraction around the `dma_alloc_coherent` API which is used to allocate and map +/// large consistent DMA regions. +/// +/// A [`CoherentAllocation`] instance contains a pointer to the allocated region (in the +/// processor's virtual address space) and the device address which can be given to the device +/// as the DMA address base of the region. The region is released once [`CoherentAllocation`] +/// is dropped. +/// +/// # Invariants +/// +/// For the lifetime of an instance of [`CoherentAllocation`], the cpu address is a valid pointer +/// to an allocated region of consistent memory and we hold a reference to the device. +pub struct CoherentAllocation<T: AsBytes + FromBytes> { + dev: ARef<Device>, + dma_handle: bindings::dma_addr_t, + count: usize, + cpu_addr: *mut T, + dma_attrs: Attrs, +} + +impl<T: AsBytes + FromBytes> CoherentAllocation<T> { + /// Allocates a region of `size_of::<T> * count` of consistent memory. + /// + /// # Examples + /// + /// ``` + /// use kernel::device::Device; + /// use kernel::dma::{attrs::*, CoherentAllocation}; + /// + /// # fn test(dev: &Device) -> Result { + /// let c: CoherentAllocation<u64> = CoherentAllocation::alloc_attrs(dev.into(), 4, GFP_KERNEL, + /// DMA_ATTR_NO_WARN)?; + /// # Ok::<(), Error>(()) } + /// ``` + pub fn alloc_attrs( + dev: ARef<Device>, + count: usize, + gfp_flags: kernel::alloc::Flags, + dma_attrs: Attrs, + ) -> Result<CoherentAllocation<T>> { + build_assert!( + core::mem::size_of::<T>() > 0, + "It doesn't make sense for the allocated type to be a ZST" + ); + + let size = count + .checked_mul(core::mem::size_of::<T>()) + .ok_or(EOVERFLOW)?; + let mut dma_handle = 0; + // SAFETY: device pointer is guaranteed as valid by invariant on `Device`. + // We ensure that we catch the failure on this function and throw an ENOMEM + let ret = unsafe { + bindings::dma_alloc_attrs( + dev.as_raw(), + size, + &mut dma_handle, + gfp_flags.as_raw(), + dma_attrs.as_raw(), + ) + }; + if ret.is_null() { + return Err(ENOMEM); + } + // INVARIANT: We just successfully allocated a coherent region which is accessible for + // `count` elements, hence the cpu address is valid. We also hold a refcounted reference + // to the device. + Ok(Self { + dev, + dma_handle, + count, + cpu_addr: ret as *mut T, + dma_attrs, + }) + } + + /// Performs the same functionality as `alloc_attrs`, except the `dma_attrs` is 0 by default. + pub fn alloc_coherent( + dev: ARef<Device>, + count: usize, + gfp_flags: kernel::alloc::Flags, + ) -> Result<CoherentAllocation<T>> { + CoherentAllocation::alloc_attrs(dev, count, gfp_flags, Attrs(0)) + } + + /// Returns the device, base address, dma handle, attributes and the size of the + /// allocated region. + /// + /// The caller takes ownership of the returned resources, i.e., will have the responsibility + /// in calling `bindings::dma_free_attrs`. The allocated region is valid as long as + /// the returned device exists. + pub fn into_parts( + self, + ) -> ( + ARef<Device>, + *mut T, + bindings::dma_addr_t, + crate::ffi::c_ulong, + usize, + ) { + let size = self.count * core::mem::size_of::<T>(); + let ret = ( + self.dev.clone(), + self.cpu_addr, + self.dma_handle, + self.dma_attrs.as_raw(), + size, + ); + core::mem::forget(self); + ret + } + + /// Returns the base address to the allocated region in the CPU's virtual address space. + pub fn start_ptr(&self) -> *const T { + self.cpu_addr + } + + /// Returns the base address to the allocated region in the CPU's virtual address space as + /// a mutable pointer. + pub fn start_ptr_mut(&mut self) -> *mut T { + self.cpu_addr + } + + /// Returns a DMA handle which may given to the device as the DMA address base of + /// the region. + pub fn dma_handle(&self) -> bindings::dma_addr_t { + self.dma_handle + } + + /// Reads data from the region starting from `offset` as a slice. + /// `offset` and `count` are in units of `T`, not the number of bytes. + /// + /// Due to the safety requirements of slice, the data returned should be regarded by the + /// caller as a snapshot of the region when this function is called, as the region could + /// be modified by the device at anytime. For ringbuffer type of r/w access or use-cases + /// where the pointer to the live data is needed, `start_ptr()` or `start_ptr_mut()` + /// could be used instead. + /// + /// # Safety + /// + /// Callers must ensure that no hardware operations that involve the buffer are currently + /// taking place while the returned slice is live. + pub unsafe fn read(&self, offset: usize, count: usize) -> Result<&[T]> { + if offset + count >= self.count { + return Err(EINVAL); + } + // SAFETY: + // - The pointer is valid due to type invariant on `CoherentAllocation`, + // we've just checked that the range and index is within bounds. The immutability of the + // of data is also guaranteed by the safety requirements of the function. + // - `offset` can't overflow since it is smaller than `self.count` and we've checked + // that `self.count` won't overflow early in the constructor. + Ok(unsafe { core::slice::from_raw_parts(self.cpu_addr.add(offset), count) }) + } + + /// Writes data to the region starting from `offset`. `offset` is in units of `T`, not the + /// number of bytes. + pub fn write(&self, src: &[T], offset: usize) -> Result { + if offset + src.len() >= self.count { + return Err(EINVAL); + } + // SAFETY: + // - The pointer is valid due to type invariant on `CoherentAllocation` + // and we've just checked that the range and index is within bounds. + // - `offset` can't overflow since it is smaller than `self.count` and we've checked + // that `self.count` won't overflow early in the constructor. + unsafe { + core::ptr::copy_nonoverlapping(src.as_ptr(), self.cpu_addr.add(offset), src.len()) + }; + Ok(()) + } +} + +impl<T: AsBytes + FromBytes> Drop for CoherentAllocation<T> { + fn drop(&mut self) { + let size = self.count * core::mem::size_of::<T>(); + // SAFETY: the device, cpu address, and the dma handle is valid due to the + // type invariants on `CoherentAllocation`. + unsafe { + bindings::dma_free_attrs( + self.dev.as_raw(), + size, + self.cpu_addr as _, + self.dma_handle, + self.dma_attrs.as_raw(), + ) + } + } +} diff --git a/rust/kernel/lib.rs b/rust/kernel/lib.rs index 545d1170ee63..36ac88fd91e7 100644 --- a/rust/kernel/lib.rs +++ b/rust/kernel/lib.rs @@ -37,6 +37,7 @@ pub mod build_assert; pub mod cred; pub mod device; +pub mod dma; pub mod error; #[cfg(CONFIG_RUST_FW_LOADER_ABSTRACTIONS)] pub mod firmware; -- 2.43.0 ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: [PATCH v10 2/3] rust: add dma coherent allocator abstraction. 2025-01-21 19:14 ` [PATCH v10 2/3] rust: add dma coherent allocator abstraction Abdiel Janulgue @ 2025-01-22 9:13 ` Alice Ryhl 2025-01-22 10:14 ` Abdiel Janulgue 0 siblings, 1 reply; 10+ messages in thread From: Alice Ryhl @ 2025-01-22 9:13 UTC (permalink / raw) To: Abdiel Janulgue Cc: rust-for-linux, daniel.almeida, dakr, robin.murphy, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS On Tue, Jan 21, 2025 at 8:14 PM Abdiel Janulgue <abdiel.janulgue@gmail.com> wrote: > > Add a simple dma coherent allocator rust abstraction. Based on > Andreas Hindborg's dma abstractions from the rnvme driver, which > was also based on earlier work by Wedson Almeida Filho. > > Signed-off-by: Abdiel Janulgue <abdiel.janulgue@gmail.com> > --- > rust/bindings/bindings_helper.h | 1 + > rust/kernel/dma.rs | 281 ++++++++++++++++++++++++++++++++ > rust/kernel/lib.rs | 1 + > 3 files changed, 283 insertions(+) > create mode 100644 rust/kernel/dma.rs > > diff --git a/rust/bindings/bindings_helper.h b/rust/bindings/bindings_helper.h > index 5c4dfe22f41a..49bf713b9bb6 100644 > --- a/rust/bindings/bindings_helper.h > +++ b/rust/bindings/bindings_helper.h > @@ -11,6 +11,7 @@ > #include <linux/blk_types.h> > #include <linux/blkdev.h> > #include <linux/cred.h> > +#include <linux/dma-mapping.h> > #include <linux/errname.h> > #include <linux/ethtool.h> > #include <linux/file.h> > diff --git a/rust/kernel/dma.rs b/rust/kernel/dma.rs > new file mode 100644 > index 000000000000..ebae7270190e > --- /dev/null > +++ b/rust/kernel/dma.rs > @@ -0,0 +1,281 @@ > +// SPDX-License-Identifier: GPL-2.0 > + > +//! Direct memory access (DMA). > +//! > +//! C header: [`include/linux/dma-mapping.h`](srctree/include/linux/dma-mapping.h) > + > +use crate::{ > + bindings, build_assert, > + device::Device, > + error::code::*, > + error::Result, > + transmute::{AsBytes, FromBytes}, > + types::ARef, > +}; > + > +/// Possible attributes associated with a DMA mapping. > +/// > +/// They can be combined with the operators `|`, `&`, and `!`. > +/// > +/// Values can be used from the [`attrs`] module. > +#[derive(Clone, Copy, PartialEq)] > +#[repr(transparent)] > +pub struct Attrs(u32); > + > +impl Attrs { > + /// Get the raw representation of this attribute. > + pub(crate) fn as_raw(self) -> crate::ffi::c_ulong { > + self.0 as _ > + } > + > + /// Check whether `flags` is contained in `self`. > + pub fn contains(self, flags: Attrs) -> bool { > + (self & flags) == flags > + } > +} > + > +impl core::ops::BitOr for Attrs { > + type Output = Self; > + fn bitor(self, rhs: Self) -> Self::Output { > + Self(self.0 | rhs.0) > + } > +} > + > +impl core::ops::BitAnd for Attrs { > + type Output = Self; > + fn bitand(self, rhs: Self) -> Self::Output { > + Self(self.0 & rhs.0) > + } > +} > + > +impl core::ops::Not for Attrs { > + type Output = Self; > + fn not(self) -> Self::Output { > + Self(!self.0) > + } > +} > + > +/// DMA mapping attrributes. > +pub mod attrs { > + use super::Attrs; > + > + /// Specifies that reads and writes to the mapping may be weakly ordered, that is that reads > + /// and writes may pass each other. > + pub const DMA_ATTR_WEAK_ORDERING: Attrs = Attrs(bindings::DMA_ATTR_WEAK_ORDERING); > + > + /// Specifies that writes to the mapping may be buffered to improve performance. > + pub const DMA_ATTR_WRITE_COMBINE: Attrs = Attrs(bindings::DMA_ATTR_WRITE_COMBINE); > + > + /// Lets the platform to avoid creating a kernel virtual mapping for the allocated buffer. > + pub const DMA_ATTR_NO_KERNEL_MAPPING: Attrs = Attrs(bindings::DMA_ATTR_NO_KERNEL_MAPPING); > + > + /// Allows platform code to skip synchronization of the CPU cache for the given buffer assuming > + /// that it has been already transferred to 'device' domain. > + pub const DMA_ATTR_SKIP_CPU_SYNC: Attrs = Attrs(bindings::DMA_ATTR_SKIP_CPU_SYNC); > + > + /// Forces contiguous allocation of the buffer in physical memory. > + pub const DMA_ATTR_FORCE_CONTIGUOUS: Attrs = Attrs(bindings::DMA_ATTR_FORCE_CONTIGUOUS); > + > + /// This is a hint to the DMA-mapping subsystem that it's probably not worth the time to try > + /// to allocate memory to in a way that gives better TLB efficiency. > + pub const DMA_ATTR_ALLOC_SINGLE_PAGES: Attrs = Attrs(bindings::DMA_ATTR_ALLOC_SINGLE_PAGES); > + > + /// This tells the DMA-mapping subsystem to suppress allocation failure reports (similarly to > + /// __GFP_NOWARN). > + pub const DMA_ATTR_NO_WARN: Attrs = Attrs(bindings::DMA_ATTR_NO_WARN); > + > + /// Used to indicate that the buffer is fully accessible at an elevated privilege level (and > + /// ideally inaccessible or at least read-only at lesser-privileged levels). > + pub const DMA_ATTR_PRIVILEGED: Attrs = Attrs(bindings::DMA_ATTR_PRIVILEGED); > +} > + > +/// An abstraction of the `dma_alloc_coherent` API. > +/// > +/// This is an abstraction around the `dma_alloc_coherent` API which is used to allocate and map > +/// large consistent DMA regions. > +/// > +/// A [`CoherentAllocation`] instance contains a pointer to the allocated region (in the > +/// processor's virtual address space) and the device address which can be given to the device > +/// as the DMA address base of the region. The region is released once [`CoherentAllocation`] > +/// is dropped. > +/// > +/// # Invariants > +/// > +/// For the lifetime of an instance of [`CoherentAllocation`], the cpu address is a valid pointer > +/// to an allocated region of consistent memory and we hold a reference to the device. > +pub struct CoherentAllocation<T: AsBytes + FromBytes> { > + dev: ARef<Device>, > + dma_handle: bindings::dma_addr_t, > + count: usize, > + cpu_addr: *mut T, > + dma_attrs: Attrs, > +} > + > +impl<T: AsBytes + FromBytes> CoherentAllocation<T> { > + /// Allocates a region of `size_of::<T> * count` of consistent memory. > + /// > + /// # Examples > + /// > + /// ``` > + /// use kernel::device::Device; > + /// use kernel::dma::{attrs::*, CoherentAllocation}; > + /// > + /// # fn test(dev: &Device) -> Result { > + /// let c: CoherentAllocation<u64> = CoherentAllocation::alloc_attrs(dev.into(), 4, GFP_KERNEL, > + /// DMA_ATTR_NO_WARN)?; > + /// # Ok::<(), Error>(()) } > + /// ``` > + pub fn alloc_attrs( > + dev: ARef<Device>, > + count: usize, > + gfp_flags: kernel::alloc::Flags, > + dma_attrs: Attrs, > + ) -> Result<CoherentAllocation<T>> { > + build_assert!( > + core::mem::size_of::<T>() > 0, > + "It doesn't make sense for the allocated type to be a ZST" > + ); > + > + let size = count > + .checked_mul(core::mem::size_of::<T>()) > + .ok_or(EOVERFLOW)?; > + let mut dma_handle = 0; > + // SAFETY: device pointer is guaranteed as valid by invariant on `Device`. > + // We ensure that we catch the failure on this function and throw an ENOMEM > + let ret = unsafe { > + bindings::dma_alloc_attrs( > + dev.as_raw(), > + size, > + &mut dma_handle, > + gfp_flags.as_raw(), > + dma_attrs.as_raw(), > + ) > + }; > + if ret.is_null() { > + return Err(ENOMEM); > + } > + // INVARIANT: We just successfully allocated a coherent region which is accessible for > + // `count` elements, hence the cpu address is valid. We also hold a refcounted reference > + // to the device. > + Ok(Self { > + dev, > + dma_handle, > + count, > + cpu_addr: ret as *mut T, > + dma_attrs, > + }) > + } > + > + /// Performs the same functionality as `alloc_attrs`, except the `dma_attrs` is 0 by default. > + pub fn alloc_coherent( > + dev: ARef<Device>, > + count: usize, > + gfp_flags: kernel::alloc::Flags, > + ) -> Result<CoherentAllocation<T>> { > + CoherentAllocation::alloc_attrs(dev, count, gfp_flags, Attrs(0)) > + } > + > + /// Returns the device, base address, dma handle, attributes and the size of the > + /// allocated region. > + /// > + /// The caller takes ownership of the returned resources, i.e., will have the responsibility > + /// in calling `bindings::dma_free_attrs`. The allocated region is valid as long as > + /// the returned device exists. > + pub fn into_parts( > + self, > + ) -> ( > + ARef<Device>, > + *mut T, > + bindings::dma_addr_t, > + crate::ffi::c_ulong, > + usize, > + ) { > + let size = self.count * core::mem::size_of::<T>(); > + let ret = ( > + self.dev.clone(), Calling clone here increments the refcount, but you don't want to do that since it leaks a count. otherwise lgtm Alice ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH v10 2/3] rust: add dma coherent allocator abstraction. 2025-01-22 9:13 ` Alice Ryhl @ 2025-01-22 10:14 ` Abdiel Janulgue 2025-01-22 10:15 ` Alice Ryhl 0 siblings, 1 reply; 10+ messages in thread From: Abdiel Janulgue @ 2025-01-22 10:14 UTC (permalink / raw) To: Alice Ryhl Cc: rust-for-linux, daniel.almeida, dakr, robin.murphy, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS On 22/01/2025 11:13, Alice Ryhl wrote: > On Tue, Jan 21, 2025 at 8:14 PM Abdiel Janulgue > <abdiel.janulgue@gmail.com> wrote: >> >> Add a simple dma coherent allocator rust abstraction. Based on >> Andreas Hindborg's dma abstractions from the rnvme driver, which >> was also based on earlier work by Wedson Almeida Filho. >> >> Signed-off-by: Abdiel Janulgue <abdiel.janulgue@gmail.com> >> --- >> rust/bindings/bindings_helper.h | 1 + >> rust/kernel/dma.rs | 281 ++++++++++++++++++++++++++++++++ >> rust/kernel/lib.rs | 1 + >> 3 files changed, 283 insertions(+) >> create mode 100644 rust/kernel/dma.rs >> >> diff --git a/rust/bindings/bindings_helper.h b/rust/bindings/bindings_helper.h >> index 5c4dfe22f41a..49bf713b9bb6 100644 >> --- a/rust/bindings/bindings_helper.h >> +++ b/rust/bindings/bindings_helper.h >> @@ -11,6 +11,7 @@ >> #include <linux/blk_types.h> >> #include <linux/blkdev.h> >> #include <linux/cred.h> >> +#include <linux/dma-mapping.h> >> #include <linux/errname.h> >> #include <linux/ethtool.h> >> #include <linux/file.h> >> diff --git a/rust/kernel/dma.rs b/rust/kernel/dma.rs >> new file mode 100644 >> index 000000000000..ebae7270190e >> --- /dev/null >> +++ b/rust/kernel/dma.rs >> @@ -0,0 +1,281 @@ >> +// SPDX-License-Identifier: GPL-2.0 >> + >> +//! Direct memory access (DMA). >> +//! >> +//! C header: [`include/linux/dma-mapping.h`](srctree/include/linux/dma-mapping.h) >> + >> +use crate::{ >> + bindings, build_assert, >> + device::Device, >> + error::code::*, >> + error::Result, >> + transmute::{AsBytes, FromBytes}, >> + types::ARef, >> +}; >> + >> +/// Possible attributes associated with a DMA mapping. >> +/// >> +/// They can be combined with the operators `|`, `&`, and `!`. >> +/// >> +/// Values can be used from the [`attrs`] module. >> +#[derive(Clone, Copy, PartialEq)] >> +#[repr(transparent)] >> +pub struct Attrs(u32); >> + >> +impl Attrs { >> + /// Get the raw representation of this attribute. >> + pub(crate) fn as_raw(self) -> crate::ffi::c_ulong { >> + self.0 as _ >> + } >> + >> + /// Check whether `flags` is contained in `self`. >> + pub fn contains(self, flags: Attrs) -> bool { >> + (self & flags) == flags >> + } >> +} >> + >> +impl core::ops::BitOr for Attrs { >> + type Output = Self; >> + fn bitor(self, rhs: Self) -> Self::Output { >> + Self(self.0 | rhs.0) >> + } >> +} >> + >> +impl core::ops::BitAnd for Attrs { >> + type Output = Self; >> + fn bitand(self, rhs: Self) -> Self::Output { >> + Self(self.0 & rhs.0) >> + } >> +} >> + >> +impl core::ops::Not for Attrs { >> + type Output = Self; >> + fn not(self) -> Self::Output { >> + Self(!self.0) >> + } >> +} >> + >> +/// DMA mapping attrributes. >> +pub mod attrs { >> + use super::Attrs; >> + >> + /// Specifies that reads and writes to the mapping may be weakly ordered, that is that reads >> + /// and writes may pass each other. >> + pub const DMA_ATTR_WEAK_ORDERING: Attrs = Attrs(bindings::DMA_ATTR_WEAK_ORDERING); >> + >> + /// Specifies that writes to the mapping may be buffered to improve performance. >> + pub const DMA_ATTR_WRITE_COMBINE: Attrs = Attrs(bindings::DMA_ATTR_WRITE_COMBINE); >> + >> + /// Lets the platform to avoid creating a kernel virtual mapping for the allocated buffer. >> + pub const DMA_ATTR_NO_KERNEL_MAPPING: Attrs = Attrs(bindings::DMA_ATTR_NO_KERNEL_MAPPING); >> + >> + /// Allows platform code to skip synchronization of the CPU cache for the given buffer assuming >> + /// that it has been already transferred to 'device' domain. >> + pub const DMA_ATTR_SKIP_CPU_SYNC: Attrs = Attrs(bindings::DMA_ATTR_SKIP_CPU_SYNC); >> + >> + /// Forces contiguous allocation of the buffer in physical memory. >> + pub const DMA_ATTR_FORCE_CONTIGUOUS: Attrs = Attrs(bindings::DMA_ATTR_FORCE_CONTIGUOUS); >> + >> + /// This is a hint to the DMA-mapping subsystem that it's probably not worth the time to try >> + /// to allocate memory to in a way that gives better TLB efficiency. >> + pub const DMA_ATTR_ALLOC_SINGLE_PAGES: Attrs = Attrs(bindings::DMA_ATTR_ALLOC_SINGLE_PAGES); >> + >> + /// This tells the DMA-mapping subsystem to suppress allocation failure reports (similarly to >> + /// __GFP_NOWARN). >> + pub const DMA_ATTR_NO_WARN: Attrs = Attrs(bindings::DMA_ATTR_NO_WARN); >> + >> + /// Used to indicate that the buffer is fully accessible at an elevated privilege level (and >> + /// ideally inaccessible or at least read-only at lesser-privileged levels). >> + pub const DMA_ATTR_PRIVILEGED: Attrs = Attrs(bindings::DMA_ATTR_PRIVILEGED); >> +} >> + >> +/// An abstraction of the `dma_alloc_coherent` API. >> +/// >> +/// This is an abstraction around the `dma_alloc_coherent` API which is used to allocate and map >> +/// large consistent DMA regions. >> +/// >> +/// A [`CoherentAllocation`] instance contains a pointer to the allocated region (in the >> +/// processor's virtual address space) and the device address which can be given to the device >> +/// as the DMA address base of the region. The region is released once [`CoherentAllocation`] >> +/// is dropped. >> +/// >> +/// # Invariants >> +/// >> +/// For the lifetime of an instance of [`CoherentAllocation`], the cpu address is a valid pointer >> +/// to an allocated region of consistent memory and we hold a reference to the device. >> +pub struct CoherentAllocation<T: AsBytes + FromBytes> { >> + dev: ARef<Device>, >> + dma_handle: bindings::dma_addr_t, >> + count: usize, >> + cpu_addr: *mut T, >> + dma_attrs: Attrs, >> +} >> + >> +impl<T: AsBytes + FromBytes> CoherentAllocation<T> { >> + /// Allocates a region of `size_of::<T> * count` of consistent memory. >> + /// >> + /// # Examples >> + /// >> + /// ``` >> + /// use kernel::device::Device; >> + /// use kernel::dma::{attrs::*, CoherentAllocation}; >> + /// >> + /// # fn test(dev: &Device) -> Result { >> + /// let c: CoherentAllocation<u64> = CoherentAllocation::alloc_attrs(dev.into(), 4, GFP_KERNEL, >> + /// DMA_ATTR_NO_WARN)?; >> + /// # Ok::<(), Error>(()) } >> + /// ``` >> + pub fn alloc_attrs( >> + dev: ARef<Device>, >> + count: usize, >> + gfp_flags: kernel::alloc::Flags, >> + dma_attrs: Attrs, >> + ) -> Result<CoherentAllocation<T>> { >> + build_assert!( >> + core::mem::size_of::<T>() > 0, >> + "It doesn't make sense for the allocated type to be a ZST" >> + ); >> + >> + let size = count >> + .checked_mul(core::mem::size_of::<T>()) >> + .ok_or(EOVERFLOW)?; >> + let mut dma_handle = 0; >> + // SAFETY: device pointer is guaranteed as valid by invariant on `Device`. >> + // We ensure that we catch the failure on this function and throw an ENOMEM >> + let ret = unsafe { >> + bindings::dma_alloc_attrs( >> + dev.as_raw(), >> + size, >> + &mut dma_handle, >> + gfp_flags.as_raw(), >> + dma_attrs.as_raw(), >> + ) >> + }; >> + if ret.is_null() { >> + return Err(ENOMEM); >> + } >> + // INVARIANT: We just successfully allocated a coherent region which is accessible for >> + // `count` elements, hence the cpu address is valid. We also hold a refcounted reference >> + // to the device. >> + Ok(Self { >> + dev, >> + dma_handle, >> + count, >> + cpu_addr: ret as *mut T, >> + dma_attrs, >> + }) >> + } >> + >> + /// Performs the same functionality as `alloc_attrs`, except the `dma_attrs` is 0 by default. >> + pub fn alloc_coherent( >> + dev: ARef<Device>, >> + count: usize, >> + gfp_flags: kernel::alloc::Flags, >> + ) -> Result<CoherentAllocation<T>> { >> + CoherentAllocation::alloc_attrs(dev, count, gfp_flags, Attrs(0)) >> + } >> + >> + /// Returns the device, base address, dma handle, attributes and the size of the >> + /// allocated region. >> + /// >> + /// The caller takes ownership of the returned resources, i.e., will have the responsibility >> + /// in calling `bindings::dma_free_attrs`. The allocated region is valid as long as >> + /// the returned device exists. >> + pub fn into_parts( >> + self, >> + ) -> ( >> + ARef<Device>, >> + *mut T, >> + bindings::dma_addr_t, >> + crate::ffi::c_ulong, >> + usize, >> + ) { >> + let size = self.count * core::mem::size_of::<T>(); >> + let ret = ( >> + self.dev.clone(), > > Calling clone here increments the refcount, but you don't want to do > that since it leaks a count. > You mean return the raw *mut bindings::device instead of ARef<Device>. Otherwise the compiler complains not being able to move out ARef<Device> because it doesn't implement Copy. Thanks, Abdiel ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH v10 2/3] rust: add dma coherent allocator abstraction. 2025-01-22 10:14 ` Abdiel Janulgue @ 2025-01-22 10:15 ` Alice Ryhl 0 siblings, 0 replies; 10+ messages in thread From: Alice Ryhl @ 2025-01-22 10:15 UTC (permalink / raw) To: Abdiel Janulgue Cc: rust-for-linux, daniel.almeida, dakr, robin.murphy, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS On Wed, Jan 22, 2025 at 11:14 AM Abdiel Janulgue <abdiel.janulgue@gmail.com> wrote: > >> + /// Returns the device, base address, dma handle, attributes and the size of the > >> + /// allocated region. > >> + /// > >> + /// The caller takes ownership of the returned resources, i.e., will have the responsibility > >> + /// in calling `bindings::dma_free_attrs`. The allocated region is valid as long as > >> + /// the returned device exists. > >> + pub fn into_parts( > >> + self, > >> + ) -> ( > >> + ARef<Device>, > >> + *mut T, > >> + bindings::dma_addr_t, > >> + crate::ffi::c_ulong, > >> + usize, > >> + ) { > >> + let size = self.count * core::mem::size_of::<T>(); > >> + let ret = ( > >> + self.dev.clone(), > > > > Calling clone here increments the refcount, but you don't want to do > > that since it leaks a count. > > > > You mean return the raw *mut bindings::device instead of ARef<Device>. > Otherwise the compiler complains not being able to move out ARef<Device> > because it doesn't implement Copy. What I mean is that you should return an ARef<Device> without incrementing the refcount. I acknowledge that this requires unsafe, but it's the correct semantics and the unsafe is a consequence of using mem::forget. You can write `core::ptr::read(&self.dev)`. Alice ^ permalink raw reply [flat|nested] 10+ messages in thread
* [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API 2025-01-21 19:14 [PATCH v10 0/3] Add dma coherent allocator abstraction Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 1/3] rust: error: Add EOVERFLOW Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 2/3] rust: add dma coherent allocator abstraction Abdiel Janulgue @ 2025-01-21 19:14 ` Abdiel Janulgue 2025-01-23 13:18 ` Danilo Krummrich 2025-01-23 14:16 ` Miguel Ojeda 2 siblings, 2 replies; 10+ messages in thread From: Abdiel Janulgue @ 2025-01-21 19:14 UTC (permalink / raw) To: rust-for-linux, daniel.almeida, dakr, robin.murphy, aliceryhl Cc: Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS, Abdiel Janulgue Add an entry for the Rust dma mapping helpers abstractions. Signed-off-by: Abdiel Janulgue <abdiel.janulgue@gmail.com> --- MAINTAINERS | 10 ++++++++++ 1 file changed, 10 insertions(+) diff --git a/MAINTAINERS b/MAINTAINERS index baf0eeb9a355..4808c9880b3e 100644 --- a/MAINTAINERS +++ b/MAINTAINERS @@ -6822,6 +6822,16 @@ F: include/linux/dma-mapping.h F: include/linux/swiotlb.h F: kernel/dma/ +DMA MAPPING HELPERS DEVICE DRIVER API [RUST] +M: Abdiel Janulgue <abdiel.janulgue@gmail.com> +M: Danilo Krummrich <dakr@kernel.org> +R: Daniel Almeida <daniel.almeida@collabora.com> +L: rust-for-linux@vger.kernel.org +S: Supported +W: https://rust-for-linux.com +T: git https://github.com/Rust-for-Linux/linux.git rust-next +F: rust/kernel/dma.rs + DMA-BUF HEAPS FRAMEWORK M: Sumit Semwal <sumit.semwal@linaro.org> R: Benjamin Gaignard <benjamin.gaignard@collabora.com> -- 2.43.0 ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API 2025-01-21 19:14 ` [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API Abdiel Janulgue @ 2025-01-23 13:18 ` Danilo Krummrich 2025-01-23 14:16 ` Miguel Ojeda 1 sibling, 0 replies; 10+ messages in thread From: Danilo Krummrich @ 2025-01-23 13:18 UTC (permalink / raw) To: Abdiel Janulgue Cc: rust-for-linux, daniel.almeida, robin.murphy, aliceryhl, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS On Tue, Jan 21, 2025 at 09:14:04PM +0200, Abdiel Janulgue wrote: > Add an entry for the Rust dma mapping helpers abstractions. > > Signed-off-by: Abdiel Janulgue <abdiel.janulgue@gmail.com> Acked-by: Danilo Krummrich <dakr@kernel.org> ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API 2025-01-21 19:14 ` [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API Abdiel Janulgue 2025-01-23 13:18 ` Danilo Krummrich @ 2025-01-23 14:16 ` Miguel Ojeda 2025-01-23 14:19 ` Miguel Ojeda 1 sibling, 1 reply; 10+ messages in thread From: Miguel Ojeda @ 2025-01-23 14:16 UTC (permalink / raw) To: Abdiel Janulgue Cc: rust-for-linux, daniel.almeida, dakr, robin.murphy, aliceryhl, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS On Tue, Jan 21, 2025 at 8:14 PM Abdiel Janulgue <abdiel.janulgue@gmail.com> wrote: > > Add an entry for the Rust dma mapping helpers abstractions. Acked-by: Miguel Ojeda <ojeda@kernel.org> Thanks! Cheers, Miguel ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API 2025-01-23 14:16 ` Miguel Ojeda @ 2025-01-23 14:19 ` Miguel Ojeda 0 siblings, 0 replies; 10+ messages in thread From: Miguel Ojeda @ 2025-01-23 14:19 UTC (permalink / raw) To: Abdiel Janulgue Cc: rust-for-linux, daniel.almeida, dakr, robin.murphy, aliceryhl, Miguel Ojeda, Alex Gaynor, Boqun Feng, Gary Guo, Björn Roy Baron, Benno Lossin, Andreas Hindborg, Trevor Gross, Valentin Obst, open list, Christoph Hellwig, Marek Szyprowski, airlied, open list:DMA MAPPING HELPERS On Thu, Jan 23, 2025 at 3:16 PM Miguel Ojeda <miguel.ojeda.sandonis@gmail.com> wrote: > > Acked-by: Miguel Ojeda <ojeda@kernel.org> To clarify: in case DMA MAPPING HELPERS decided to pick this initial one. Cheers, Miguel ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2025-01-23 14:19 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2025-01-21 19:14 [PATCH v10 0/3] Add dma coherent allocator abstraction Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 1/3] rust: error: Add EOVERFLOW Abdiel Janulgue 2025-01-21 19:14 ` [PATCH v10 2/3] rust: add dma coherent allocator abstraction Abdiel Janulgue 2025-01-22 9:13 ` Alice Ryhl 2025-01-22 10:14 ` Abdiel Janulgue 2025-01-22 10:15 ` Alice Ryhl 2025-01-21 19:14 ` [PATCH v10 3/3] MAINTAINERS: add entry for Rust dma mapping helpers device driver API Abdiel Janulgue 2025-01-23 13:18 ` Danilo Krummrich 2025-01-23 14:16 ` Miguel Ojeda 2025-01-23 14:19 ` Miguel Ojeda
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox