From: david.vrabel@citrix.com (David Vrabel)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH v2 2/2] xen/arm: introduce GNTTABOP_cache_flush
Date: Fri, 3 Oct 2014 17:36:26 +0100 [thread overview]
Message-ID: <542ED08A.1090507@citrix.com> (raw)
In-Reply-To: <1412354047.12695.38.camel@citrix.com>
On 03/10/14 17:34, Ian Campbell wrote:
> On Fri, 2014-10-03 at 17:20 +0100, Stefano Stabellini wrote:
>> On Fri, 3 Oct 2014, David Vrabel wrote:
>>> On 03/10/14 15:53, Stefano Stabellini wrote:
>>>> Introduce support for new hypercall GNTTABOP_cache_flush.
>>>> Use it to perform cache flashing on pages used for dma when necessary.
>>> [..]
>>>> /* functions called by SWIOTLB */
>>>> @@ -22,16 +25,31 @@ static void dma_cache_maint(dma_addr_t handle, unsigned long offset,
>>>> size_t len = left;
>>>> void *vaddr;
>>>>
>>>> + if (len + offset > PAGE_SIZE)
>>>> + len = PAGE_SIZE - offset;
>>>> +
>>>> if (!pfn_valid(pfn))
>>>> {
>>>> - /* TODO: cache flush */
>>>> + struct gnttab_cache_flush cflush;
>>>> +
>>>> + cflush.op = 0;
>>>> + cflush.a.dev_bus_addr = pfn << PAGE_SHIFT;
>>>> + cflush.offset = offset;
>>>> + cflush.size = len;
>>>> +
>>>> + if (op == dmac_unmap_area && dir != DMA_TO_DEVICE)
>>>> + cflush.op = GNTTAB_CACHE_INVAL;
>>>> + if (op == dmac_map_area) {
>>>> + cflush.op = GNTTAB_CACHE_CLEAN;
>>>> + if (dir == DMA_FROM_DEVICE)
>>>> + cflush.op |= GNTTAB_CACHE_INVAL;
>>>> + }
>>>
>>> Are all these cache operations needed? You do a clean on map regardless
>>> of the direction and INVAL on map seems unnecessary.
>
> Isn't the inval on map so that the processor doesn't decide to
> evict/clean the cache line all over your newly DMA'd data?
Ah, yes that makes sense.
>>> I would have thought it would be:
>>>
>>> map && (TO_DEVICE || BOTH)
>>> op = CLEAN
>>>
>>> unmap && (FROM_DEVICE || BOTH)
>>> op = INVAL
>>
>> I was trying to do the same thing Linux is already doing on native to
>> stay on the safe side.
>>
>> See arch/arm/mm/cache-v7.S:v7_dma_map_area and
>> arch/arm/mm/cache-v7.S:v7_dma_unmap_area.
>>
>> Unless I misread the assembly they should match.
>
> I think you have, beq doesn't set lr, so the called function will return
> to its "grandparent". i.e. the caller of v7_dma_map_area in this case
> (which will have used bl), so:
> ENTRY(v7_dma_map_area)
> add r1, r1, r0
> teq r2, #DMA_FROM_DEVICE
> beq v7_dma_inv_range
> b v7_dma_clean_range
> ENDPROC(v7_dma_map_area)
>
> Is actually
> if (dir == from device)
> inv
> else
> clean
>
> which makes much more sense I think.
This is how I read the assembler too.
David
WARNING: multiple messages have this Message-ID (diff)
From: David Vrabel <david.vrabel@citrix.com>
To: Ian Campbell <Ian.Campbell@citrix.com>,
Stefano Stabellini <stefano.stabellini@eu.citrix.com>
Cc: <xen-devel@lists.xensource.com>, <konrad.wilk@oracle.com>,
<linux-kernel@vger.kernel.org>,
<linux-arm-kernel@lists.infradead.org>
Subject: Re: [PATCH v2 2/2] xen/arm: introduce GNTTABOP_cache_flush
Date: Fri, 3 Oct 2014 17:36:26 +0100 [thread overview]
Message-ID: <542ED08A.1090507@citrix.com> (raw)
In-Reply-To: <1412354047.12695.38.camel@citrix.com>
On 03/10/14 17:34, Ian Campbell wrote:
> On Fri, 2014-10-03 at 17:20 +0100, Stefano Stabellini wrote:
>> On Fri, 3 Oct 2014, David Vrabel wrote:
>>> On 03/10/14 15:53, Stefano Stabellini wrote:
>>>> Introduce support for new hypercall GNTTABOP_cache_flush.
>>>> Use it to perform cache flashing on pages used for dma when necessary.
>>> [..]
>>>> /* functions called by SWIOTLB */
>>>> @@ -22,16 +25,31 @@ static void dma_cache_maint(dma_addr_t handle, unsigned long offset,
>>>> size_t len = left;
>>>> void *vaddr;
>>>>
>>>> + if (len + offset > PAGE_SIZE)
>>>> + len = PAGE_SIZE - offset;
>>>> +
>>>> if (!pfn_valid(pfn))
>>>> {
>>>> - /* TODO: cache flush */
>>>> + struct gnttab_cache_flush cflush;
>>>> +
>>>> + cflush.op = 0;
>>>> + cflush.a.dev_bus_addr = pfn << PAGE_SHIFT;
>>>> + cflush.offset = offset;
>>>> + cflush.size = len;
>>>> +
>>>> + if (op == dmac_unmap_area && dir != DMA_TO_DEVICE)
>>>> + cflush.op = GNTTAB_CACHE_INVAL;
>>>> + if (op == dmac_map_area) {
>>>> + cflush.op = GNTTAB_CACHE_CLEAN;
>>>> + if (dir == DMA_FROM_DEVICE)
>>>> + cflush.op |= GNTTAB_CACHE_INVAL;
>>>> + }
>>>
>>> Are all these cache operations needed? You do a clean on map regardless
>>> of the direction and INVAL on map seems unnecessary.
>
> Isn't the inval on map so that the processor doesn't decide to
> evict/clean the cache line all over your newly DMA'd data?
Ah, yes that makes sense.
>>> I would have thought it would be:
>>>
>>> map && (TO_DEVICE || BOTH)
>>> op = CLEAN
>>>
>>> unmap && (FROM_DEVICE || BOTH)
>>> op = INVAL
>>
>> I was trying to do the same thing Linux is already doing on native to
>> stay on the safe side.
>>
>> See arch/arm/mm/cache-v7.S:v7_dma_map_area and
>> arch/arm/mm/cache-v7.S:v7_dma_unmap_area.
>>
>> Unless I misread the assembly they should match.
>
> I think you have, beq doesn't set lr, so the called function will return
> to its "grandparent". i.e. the caller of v7_dma_map_area in this case
> (which will have used bl), so:
> ENTRY(v7_dma_map_area)
> add r1, r1, r0
> teq r2, #DMA_FROM_DEVICE
> beq v7_dma_inv_range
> b v7_dma_clean_range
> ENDPROC(v7_dma_map_area)
>
> Is actually
> if (dir == from device)
> inv
> else
> clean
>
> which makes much more sense I think.
This is how I read the assembler too.
David
WARNING: multiple messages have this Message-ID (diff)
From: David Vrabel <david.vrabel@citrix.com>
To: Ian Campbell <Ian.Campbell@citrix.com>,
Stefano Stabellini <stefano.stabellini@eu.citrix.com>
Cc: xen-devel@lists.xensource.com, konrad.wilk@oracle.com,
linux-kernel@vger.kernel.org,
linux-arm-kernel@lists.infradead.org
Subject: Re: [PATCH v2 2/2] xen/arm: introduce GNTTABOP_cache_flush
Date: Fri, 3 Oct 2014 17:36:26 +0100 [thread overview]
Message-ID: <542ED08A.1090507@citrix.com> (raw)
In-Reply-To: <1412354047.12695.38.camel@citrix.com>
On 03/10/14 17:34, Ian Campbell wrote:
> On Fri, 2014-10-03 at 17:20 +0100, Stefano Stabellini wrote:
>> On Fri, 3 Oct 2014, David Vrabel wrote:
>>> On 03/10/14 15:53, Stefano Stabellini wrote:
>>>> Introduce support for new hypercall GNTTABOP_cache_flush.
>>>> Use it to perform cache flashing on pages used for dma when necessary.
>>> [..]
>>>> /* functions called by SWIOTLB */
>>>> @@ -22,16 +25,31 @@ static void dma_cache_maint(dma_addr_t handle, unsigned long offset,
>>>> size_t len = left;
>>>> void *vaddr;
>>>>
>>>> + if (len + offset > PAGE_SIZE)
>>>> + len = PAGE_SIZE - offset;
>>>> +
>>>> if (!pfn_valid(pfn))
>>>> {
>>>> - /* TODO: cache flush */
>>>> + struct gnttab_cache_flush cflush;
>>>> +
>>>> + cflush.op = 0;
>>>> + cflush.a.dev_bus_addr = pfn << PAGE_SHIFT;
>>>> + cflush.offset = offset;
>>>> + cflush.size = len;
>>>> +
>>>> + if (op == dmac_unmap_area && dir != DMA_TO_DEVICE)
>>>> + cflush.op = GNTTAB_CACHE_INVAL;
>>>> + if (op == dmac_map_area) {
>>>> + cflush.op = GNTTAB_CACHE_CLEAN;
>>>> + if (dir == DMA_FROM_DEVICE)
>>>> + cflush.op |= GNTTAB_CACHE_INVAL;
>>>> + }
>>>
>>> Are all these cache operations needed? You do a clean on map regardless
>>> of the direction and INVAL on map seems unnecessary.
>
> Isn't the inval on map so that the processor doesn't decide to
> evict/clean the cache line all over your newly DMA'd data?
Ah, yes that makes sense.
>>> I would have thought it would be:
>>>
>>> map && (TO_DEVICE || BOTH)
>>> op = CLEAN
>>>
>>> unmap && (FROM_DEVICE || BOTH)
>>> op = INVAL
>>
>> I was trying to do the same thing Linux is already doing on native to
>> stay on the safe side.
>>
>> See arch/arm/mm/cache-v7.S:v7_dma_map_area and
>> arch/arm/mm/cache-v7.S:v7_dma_unmap_area.
>>
>> Unless I misread the assembly they should match.
>
> I think you have, beq doesn't set lr, so the called function will return
> to its "grandparent". i.e. the caller of v7_dma_map_area in this case
> (which will have used bl), so:
> ENTRY(v7_dma_map_area)
> add r1, r1, r0
> teq r2, #DMA_FROM_DEVICE
> beq v7_dma_inv_range
> b v7_dma_clean_range
> ENDPROC(v7_dma_map_area)
>
> Is actually
> if (dir == from device)
> inv
> else
> clean
>
> which makes much more sense I think.
This is how I read the assembler too.
David
next prev parent reply other threads:[~2014-10-03 16:36 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-10-03 14:52 [PATCH v2 0/2] introduce XENMEM_cache_flush Stefano Stabellini
2014-10-03 14:52 ` Stefano Stabellini
2014-10-03 14:52 ` Stefano Stabellini
2014-10-03 14:53 ` [PATCH v2 1/2] xen/arm: remove handling of XENFEAT_grant_map_identity Stefano Stabellini
2014-10-03 14:53 ` Stefano Stabellini
2014-10-03 14:53 ` Stefano Stabellini
2014-10-03 14:53 ` [PATCH v2 2/2] xen/arm: introduce GNTTABOP_cache_flush Stefano Stabellini
2014-10-03 14:53 ` Stefano Stabellini
2014-10-03 14:53 ` Stefano Stabellini
2014-10-03 15:05 ` David Vrabel
2014-10-03 15:05 ` David Vrabel
2014-10-03 15:05 ` David Vrabel
2014-10-03 16:20 ` Stefano Stabellini
2014-10-03 16:20 ` Stefano Stabellini
2014-10-03 16:20 ` Stefano Stabellini
2014-10-03 16:34 ` Ian Campbell
2014-10-03 16:34 ` Ian Campbell
2014-10-03 16:34 ` Ian Campbell
2014-10-03 16:36 ` David Vrabel [this message]
2014-10-03 16:36 ` David Vrabel
2014-10-03 16:36 ` David Vrabel
2014-10-03 16:57 ` Russell King - ARM Linux
2014-10-03 16:57 ` Russell King - ARM Linux
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=542ED08A.1090507@citrix.com \
--to=david.vrabel@citrix.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.