* RFC: extend IOMMU attributes @ 2016-02-18 16:16 Stuart Yoder [not found] ` <HE1PR04MB1641F22E38710B68F4ED891F8DAF0-6LN7OEpIatU5tNmRkpaxD89NdZoXdze2vxpqHgZTriW3zl9H0oFU5g@public.gmane.org> 0 siblings, 1 reply; 5+ messages in thread From: Stuart Yoder @ 2016-02-18 16:16 UTC (permalink / raw) To: will.deacon@arm.com, joro@8bytes.org Cc: robin.murphy@arm.com, Peter Newton, iommu@lists.linux-foundation.org, Varun Sethi, Nipun Gupta, Bharat Bhushan, linux-arm-kernel@lists.infradead.org Hi Joerg and Will, We are implementing support for some specialized NXP SoC network devices and have the desire to extend the mapping attributes beyond those currently in iommu.h. (I see there is a recent proposal to add an IOMMU_MMIO flag) What we have right now in Linux is a least-common-denominator set of attributes, while for the ARM SMMU there is a much richer set of attributes that seem useful to support. Specifically, we have one SoC device we're dealing with right now that is the target of DMA that functionally requires a "cacheable, non-shareable" attribute in its SMMU mapping. In addition, there are many other attributes such as r/w allocate hints, transient hints, write-back/write-thru, etc in the SMMU. We wanted to see what your thinking is with respect to the direction the Linux IOMMU layer will head over the longer term with respect to attributes. Is there anything problematic with continuing to grow the attributes in iommu.h?...e.g.: #define IOMMU_READ (1 << 0) #define IOMMU_WRITE (1 << 1) -#define IOMMU_CACHE (1 << 2) /* DMA cache coherency */ +#define IOMMU_CACHE_COHERENT (1 << 2) /* cacheable and coherent */ #define IOMMU_NOEXEC (1 << 3) #define IOMMU_MMIO (1 << 4) /* e.g. things like MSI doorbells */ +#define IOMMU_CACHEABLE (1 << 5) /* cacheable, not coherent */ +#define IOMMU_CACHE_ALLOCATE (1 << 6) /* hint to allocate in the cache */ Also, are we willing to let somewhat architecture specific flags onto that list? For, example the ARM 'transient' hint. Another potential direction would be to provide some approach where IOMMU PTE attributes could be somehow derived from a corresponding MMU PTE in an arch-specific manner and the IOMMU API itself would not have to explicitly define all possible attributes. Thoughts? Thanks, Stuart ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <HE1PR04MB1641F22E38710B68F4ED891F8DAF0-6LN7OEpIatU5tNmRkpaxD89NdZoXdze2vxpqHgZTriW3zl9H0oFU5g@public.gmane.org>]
* Re: RFC: extend IOMMU attributes [not found] ` <HE1PR04MB1641F22E38710B68F4ED891F8DAF0-6LN7OEpIatU5tNmRkpaxD89NdZoXdze2vxpqHgZTriW3zl9H0oFU5g@public.gmane.org> @ 2016-02-18 16:21 ` Will Deacon [not found] ` <20160218162145.GB16883-5wv7dgnIgG8@public.gmane.org> 2016-02-25 14:38 ` joro-zLv9SwRftAIdnm+yROfE0A 1 sibling, 1 reply; 5+ messages in thread From: Will Deacon @ 2016-02-18 16:21 UTC (permalink / raw) To: Stuart Yoder Cc: Peter Newton, iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org, Varun Sethi, Bharat Bhushan, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org Hi Stuart, On Thu, Feb 18, 2016 at 04:16:26PM +0000, Stuart Yoder wrote: > We are implementing support for some specialized NXP SoC network > devices and have the desire to extend the mapping attributes beyond > those currently in iommu.h. (I see there is a recent proposal to > add an IOMMU_MMIO flag) > > What we have right now in Linux is a least-common-denominator set of > attributes, while for the ARM SMMU there is a much richer set of > attributes that seem useful to support. Specifically, we have one > SoC device we're dealing with right now that is the target of DMA > that functionally requires a "cacheable, non-shareable" attribute > in its SMMU mapping. > > In addition, there are many other attributes such as r/w allocate > hints, transient hints, write-back/write-thru, etc in the SMMU. > > We wanted to see what your thinking is with respect to the > direction the Linux IOMMU layer will head over the longer term with > respect to attributes. > > Is there anything problematic with continuing to grow the > attributes in iommu.h?...e.g.: > > #define IOMMU_READ (1 << 0) > #define IOMMU_WRITE (1 << 1) > -#define IOMMU_CACHE (1 << 2) /* DMA cache coherency */ > +#define IOMMU_CACHE_COHERENT (1 << 2) /* cacheable and coherent */ > #define IOMMU_NOEXEC (1 << 3) > #define IOMMU_MMIO (1 << 4) /* e.g. things like MSI doorbells */ > +#define IOMMU_CACHEABLE (1 << 5) /* cacheable, not coherent */ What does that even mean? > +#define IOMMU_CACHE_ALLOCATE (1 << 6) /* hint to allocate in the cache */ > > Also, are we willing to let somewhat architecture specific flags > onto that list? For, example the ARM 'transient' hint. If we're going to support fine-grained attribute control, I think it needs to be done in a page-table specific manner. That is, io-pgtable-arm could provide those attribute controls which feature in the ARMv8 architecture... ...but that brings me onto my next question: Who on Earth is actually going to provide these attributes to the IOMMU API? There seems to be a missing piece. Will ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <20160218162145.GB16883-5wv7dgnIgG8@public.gmane.org>]
* RE: RFC: extend IOMMU attributes [not found] ` <20160218162145.GB16883-5wv7dgnIgG8@public.gmane.org> @ 2016-02-18 19:33 ` Stuart Yoder 0 siblings, 0 replies; 5+ messages in thread From: Stuart Yoder @ 2016-02-18 19:33 UTC (permalink / raw) To: Will Deacon Cc: Peter Newton, iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org, Varun Sethi, Bharat Bhushan, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org > -----Original Message----- > From: Will Deacon [mailto:will.deacon-5wv7dgnIgG8@public.gmane.org] > Sent: Thursday, February 18, 2016 10:22 AM > To: Stuart Yoder <stuart.yoder-3arQi8VN3Tc@public.gmane.org> > Cc: joro-zLv9SwRftAIdnm+yROfE0A@public.gmane.org; robin.murphy-5wv7dgnIgG8@public.gmane.org; iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org; linux-arm- > kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org; Varun Sethi <V.Sethi-3arQi8VN3Tc@public.gmane.org>; Bharat Bhushan > <bharat.bhushan-3arQi8VN3Tc@public.gmane.org>; Nipun Gupta <nipun.gupta-3arQi8VN3Tc@public.gmane.org>; Peter Newton > <peter.newton-3arQi8VN3Tc@public.gmane.org> > Subject: Re: RFC: extend IOMMU attributes > > Hi Stuart, > > On Thu, Feb 18, 2016 at 04:16:26PM +0000, Stuart Yoder wrote: > > We are implementing support for some specialized NXP SoC network > > devices and have the desire to extend the mapping attributes beyond > > those currently in iommu.h. (I see there is a recent proposal to > > add an IOMMU_MMIO flag) > > > > What we have right now in Linux is a least-common-denominator set of > > attributes, while for the ARM SMMU there is a much richer set of > > attributes that seem useful to support. Specifically, we have one > > SoC device we're dealing with right now that is the target of DMA > > that functionally requires a "cacheable, non-shareable" attribute > > in its SMMU mapping. > > > > In addition, there are many other attributes such as r/w allocate > > hints, transient hints, write-back/write-thru, etc in the SMMU. > > > > We wanted to see what your thinking is with respect to the > > direction the Linux IOMMU layer will head over the longer term with > > respect to attributes. > > > > Is there anything problematic with continuing to grow the > > attributes in iommu.h?...e.g.: > > > > #define IOMMU_READ (1 << 0) > > #define IOMMU_WRITE (1 << 1) > > -#define IOMMU_CACHE (1 << 2) /* DMA cache coherency */ > > +#define IOMMU_CACHE_COHERENT (1 << 2) /* cacheable and coherent */ > > #define IOMMU_NOEXEC (1 << 3) > > #define IOMMU_MMIO (1 << 4) /* e.g. things like MSI doorbells */ > > +#define IOMMU_CACHEABLE (1 << 5) /* cacheable, not coherent */ > > What does that even mean? It means allocate in the cache, but don't propagate coherency. In our case we have a device using an A57 cluster ACP port to do stashing. cluster +------------------+ | A57 A57 | +----+ | ......ACP..<............|SMMU|<...device | +--------.--+ | non-coherent +----+ | | L2 v | | write w/allocate | +--------.--+ | +-----------.------+ . +-----------v------+ | CCN-504 | +-------------.----+ . +------v------+ | device | | (descriptor | | rings) | +-------------+ The device in question is doing a cacheable write targeting some device descriptor rings. The purpose of this transaction is to stash the descriptor ring data in the L2 cache so the driver on the A57s can get at it with low latency. It has to be non-coherent because of stuff related to how things are hooked up to CCN-504 (can take more details offline if you want). Because it is non coherent, the driver obviously has to be affined to the cluster. It's a lot of hoops to jump through, but the performance benefit is worth it. > > +#define IOMMU_CACHE_ALLOCATE (1 << 6) /* hint to allocate in the cache */ > > > > Also, are we willing to let somewhat architecture specific flags > > onto that list? For, example the ARM 'transient' hint. > > If we're going to support fine-grained attribute control, I think it needs > to be done in a page-table specific manner. That is, io-pgtable-arm could > provide those attribute controls which feature in the ARMv8 architecture... So there would be some new API for users like vfio? > ...but that brings me onto my next question: Who on Earth is actually > going to provide these attributes to the IOMMU API? > > There seems to be a missing piece. Currently it would be vfio, because the driver is in user space. There are some details to be worked out regarding how vfio would determine and provide the attributes. This is a device on the fsl-mc bus (see drivers/staging/fsl-mc/README.txt) and the vfio-fsl-mc bus layer should be able to determine what attributes to use. Once the kernel dma map API is hooked up to the SMMU, I think there could be similar requirements down the road for kernel drivers. If a driver is intelligent enough to use some of the fine grained attributes, how can that be supported? Thanks, Stuart ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: RFC: extend IOMMU attributes [not found] ` <HE1PR04MB1641F22E38710B68F4ED891F8DAF0-6LN7OEpIatU5tNmRkpaxD89NdZoXdze2vxpqHgZTriW3zl9H0oFU5g@public.gmane.org> 2016-02-18 16:21 ` Will Deacon @ 2016-02-25 14:38 ` joro-zLv9SwRftAIdnm+yROfE0A [not found] ` <20160225143855.GD16675-zLv9SwRftAIdnm+yROfE0A@public.gmane.org> 1 sibling, 1 reply; 5+ messages in thread From: joro-zLv9SwRftAIdnm+yROfE0A @ 2016-02-25 14:38 UTC (permalink / raw) To: Stuart Yoder Cc: Bharat Bhushan, will.deacon-5wv7dgnIgG8@public.gmane.org, Peter Newton, iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org, Varun Sethi, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org On Thu, Feb 18, 2016 at 04:16:26PM +0000, Stuart Yoder wrote: > #define IOMMU_READ (1 << 0) > #define IOMMU_WRITE (1 << 1) > -#define IOMMU_CACHE (1 << 2) /* DMA cache coherency */ > +#define IOMMU_CACHE_COHERENT (1 << 2) /* cacheable and coherent */ > #define IOMMU_NOEXEC (1 << 3) > #define IOMMU_MMIO (1 << 4) /* e.g. things like MSI doorbells */ > +#define IOMMU_CACHEABLE (1 << 5) /* cacheable, not coherent */ > +#define IOMMU_CACHE_ALLOCATE (1 << 6) /* hint to allocate in the cache */ Hmm, this might be a valid use-case to introduce the map_attr()/unmap_attr() calls to the IOMMU-API. I have been resistant to these functions until now because the proposed use-cases were not convincing, but for hardware-specific map-attributes like this one it could make sense. Joerg ^ permalink raw reply [flat|nested] 5+ messages in thread
[parent not found: <20160225143855.GD16675-zLv9SwRftAIdnm+yROfE0A@public.gmane.org>]
* Re: RFC: extend IOMMU attributes [not found] ` <20160225143855.GD16675-zLv9SwRftAIdnm+yROfE0A@public.gmane.org> @ 2016-02-25 15:00 ` Will Deacon 0 siblings, 0 replies; 5+ messages in thread From: Will Deacon @ 2016-02-25 15:00 UTC (permalink / raw) To: joro-zLv9SwRftAIdnm+yROfE0A@public.gmane.org Cc: Stuart Yoder, iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org, Varun Sethi, Peter Newton, Bharat Bhushan, linux-arm-kernel-IAPFreCvJWM7uuMidbF8XUB+6BGkLq7r@public.gmane.org On Thu, Feb 25, 2016 at 03:38:55PM +0100, joro-zLv9SwRftAIdnm+yROfE0A@public.gmane.org wrote: > On Thu, Feb 18, 2016 at 04:16:26PM +0000, Stuart Yoder wrote: > > #define IOMMU_READ (1 << 0) > > #define IOMMU_WRITE (1 << 1) > > -#define IOMMU_CACHE (1 << 2) /* DMA cache coherency */ > > +#define IOMMU_CACHE_COHERENT (1 << 2) /* cacheable and coherent */ > > #define IOMMU_NOEXEC (1 << 3) > > #define IOMMU_MMIO (1 << 4) /* e.g. things like MSI doorbells */ > > +#define IOMMU_CACHEABLE (1 << 5) /* cacheable, not coherent */ > > +#define IOMMU_CACHE_ALLOCATE (1 << 6) /* hint to allocate in the cache */ > > Hmm, this might be a valid use-case to introduce the > map_attr()/unmap_attr() calls to the IOMMU-API. I have been resistant to > these functions until now because the proposed use-cases were not > convincing, but for hardware-specific map-attributes like this one it > could make sense. So this particular use-case is completely insane, but something like read/write allocation hints could make sense for arm-smmu systems. Will ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2016-02-25 15:00 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2016-02-18 16:16 RFC: extend IOMMU attributes Stuart Yoder [not found] ` <HE1PR04MB1641F22E38710B68F4ED891F8DAF0-6LN7OEpIatU5tNmRkpaxD89NdZoXdze2vxpqHgZTriW3zl9H0oFU5g@public.gmane.org> 2016-02-18 16:21 ` Will Deacon [not found] ` <20160218162145.GB16883-5wv7dgnIgG8@public.gmane.org> 2016-02-18 19:33 ` Stuart Yoder 2016-02-25 14:38 ` joro-zLv9SwRftAIdnm+yROfE0A [not found] ` <20160225143855.GD16675-zLv9SwRftAIdnm+yROfE0A@public.gmane.org> 2016-02-25 15:00 ` Will Deacon
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).