* [CFT:PATCH] Remove a seemingly unnecessary cache flush
@ 2009-12-07 22:12 Russell King - ARM Linux
2009-12-10 10:40 ` Russell King - ARM Linux
` (3 more replies)
0 siblings, 4 replies; 12+ messages in thread
From: Russell King - ARM Linux @ 2009-12-07 22:12 UTC (permalink / raw)
To: linux-arm-kernel
While looking into the possible msync() issue (which turned out to be a
non-issue - it's covered by clever code), I found that we're flushing
the same page multiple times.
Normally, update_mmu_cache() would not be doing anything on architectures
which don't have a software-loaded TLB. However, we use this hook for
two things:
1. for our delayed flush_dcache_page().
2. to fix up multiple shared write-able mappings of the same page.
However, as of 8 years ago, I committed this patch (sorry for the
bkbits URL):
http://linux.bkbits.net:8080/linux-2.6/?PAGE=patch&REV=3dd7c1f9DypjNd1HyQFAOsA3p7mYig
What the majority of that patch is doing is adding the user-mapping
flushing to __flush_dcache_page() - that's fine, and it's explained in
the changeset comments.
It also tries to merge the new __flush_dcache_page() code inside
make_coherent() - make_coherent() is sort-of already doing most of what
__flush_dcache_page() does, and for good measure it does a final
flush_cache_page().
The unused addition of 'dirty' seems to suggest that this extra
flush_cache_page should have been conditional on that.
Moreover, this final flush_cache_page() should not be needed - the whole
"delayed flush_dcache_page" thing only happens when there are no pre-
existing mappings of the page, and so there should be no user mappings
to worry about.
Hence, I believe it is safe to get rid of this - but this patch will
need some rigorous testing. My ARM VersatilePB926 still boots without
this additional patch - but that's not a very good test. I can't say
much more than that for the time being.
Note: for ARMv6 and later, flush_cache_page() is a no-op, so please
don't bother testing this patch there.
Signed-off-by: Russell King <rmk+kernel@arm.linux.org.uk>
---
arch/arm/mm/fault-armv.c | 2 --
1 files changed, 0 insertions(+), 2 deletions(-)
diff --git a/arch/arm/mm/fault-armv.c b/arch/arm/mm/fault-armv.c
index 7296022..666a871 100644
--- a/arch/arm/mm/fault-armv.c
+++ b/arch/arm/mm/fault-armv.c
@@ -127,8 +127,6 @@ make_coherent(struct address_space *mapping, struct vm_area_struct *vma, unsigne
flush_dcache_mmap_unlock(mapping);
if (aliases)
adjust_pte(vma, addr);
- else
- flush_cache_page(vma, addr, pfn);
}
/*
^ permalink raw reply related [flat|nested] 12+ messages in thread* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-07 22:12 [CFT:PATCH] Remove a seemingly unnecessary cache flush Russell King - ARM Linux
@ 2009-12-10 10:40 ` Russell King - ARM Linux
2009-12-14 11:50 ` Mikael Pettersson
2009-12-10 16:04 ` Mike Rapoport
` (2 subsequent siblings)
3 siblings, 1 reply; 12+ messages in thread
From: Russell King - ARM Linux @ 2009-12-10 10:40 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, Dec 07, 2009 at 10:12:10PM +0000, Russell King - ARM Linux wrote:
> While looking into the possible msync() issue (which turned out to be a
> non-issue - it's covered by clever code), I found that we're flushing
> the same page multiple times.
I'm going to queue this patch up today since I believe it is correct,
though it would be nice to have a few more people test it on pre-ARMv6
hardware.
^ permalink raw reply [flat|nested] 12+ messages in thread* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-10 10:40 ` Russell King - ARM Linux
@ 2009-12-14 11:50 ` Mikael Pettersson
2009-12-14 11:53 ` Russell King - ARM Linux
0 siblings, 1 reply; 12+ messages in thread
From: Mikael Pettersson @ 2009-12-14 11:50 UTC (permalink / raw)
To: linux-arm-kernel
Russell King - ARM Linux writes:
> On Mon, Dec 07, 2009 at 10:12:10PM +0000, Russell King - ARM Linux wrote:
> > While looking into the possible msync() issue (which turned out to be a
> > non-issue - it's covered by clever code), I found that we're flushing
> > the same page multiple times.
>
> I'm going to queue this patch up today since I believe it is correct,
> though it would be nice to have a few more people test it on pre-ARMv6
> hardware.
Tested here with gcc and glibc builds and testsuite runs on IXP420,
IOP80219, and Kirkwood (Feroceon?), all ARMv5TE. No signs of problems.
/Mikael
^ permalink raw reply [flat|nested] 12+ messages in thread
* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-14 11:50 ` Mikael Pettersson
@ 2009-12-14 11:53 ` Russell King - ARM Linux
2009-12-14 12:15 ` Mikael Pettersson
0 siblings, 1 reply; 12+ messages in thread
From: Russell King - ARM Linux @ 2009-12-14 11:53 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, Dec 14, 2009 at 12:50:11PM +0100, Mikael Pettersson wrote:
> Russell King - ARM Linux writes:
> > On Mon, Dec 07, 2009 at 10:12:10PM +0000, Russell King - ARM Linux wrote:
> > > While looking into the possible msync() issue (which turned out to be a
> > > non-issue - it's covered by clever code), I found that we're flushing
> > > the same page multiple times.
> >
> > I'm going to queue this patch up today since I believe it is correct,
> > though it would be nice to have a few more people test it on pre-ARMv6
> > hardware.
>
> Tested here with gcc and glibc builds and testsuite runs on IXP420,
> IOP80219, and Kirkwood (Feroceon?), all ARMv5TE. No signs of problems.
Do the builds appear to be any faster?
^ permalink raw reply [flat|nested] 12+ messages in thread
* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-14 11:53 ` Russell King - ARM Linux
@ 2009-12-14 12:15 ` Mikael Pettersson
2009-12-20 19:32 ` Mikael Pettersson
0 siblings, 1 reply; 12+ messages in thread
From: Mikael Pettersson @ 2009-12-14 12:15 UTC (permalink / raw)
To: linux-arm-kernel
Russell King - ARM Linux writes:
> On Mon, Dec 14, 2009 at 12:50:11PM +0100, Mikael Pettersson wrote:
> > Russell King - ARM Linux writes:
> > > On Mon, Dec 07, 2009 at 10:12:10PM +0000, Russell King - ARM Linux wrote:
> > > > While looking into the possible msync() issue (which turned out to be a
> > > > non-issue - it's covered by clever code), I found that we're flushing
> > > > the same page multiple times.
> > >
> > > I'm going to queue this patch up today since I believe it is correct,
> > > though it would be nice to have a few more people test it on pre-ARMv6
> > > hardware.
> >
> > Tested here with gcc and glibc builds and testsuite runs on IXP420,
> > IOP80219, and Kirkwood (Feroceon?), all ARMv5TE. No signs of problems.
>
> Do the builds appear to be any faster?
I didn't record how long it took because I was mainly interested in
the correctness aspect. I can do a pair of properly measured gcc
builds on the fastest machine next weekend.
^ permalink raw reply [flat|nested] 12+ messages in thread
* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-14 12:15 ` Mikael Pettersson
@ 2009-12-20 19:32 ` Mikael Pettersson
0 siblings, 0 replies; 12+ messages in thread
From: Mikael Pettersson @ 2009-12-20 19:32 UTC (permalink / raw)
To: linux-arm-kernel
Mikael Pettersson writes:
> Russell King - ARM Linux writes:
> > On Mon, Dec 14, 2009 at 12:50:11PM +0100, Mikael Pettersson wrote:
> > > Russell King - ARM Linux writes:
> > > > On Mon, Dec 07, 2009 at 10:12:10PM +0000, Russell King - ARM Linux wrote:
> > > > > While looking into the possible msync() issue (which turned out to be a
> > > > > non-issue - it's covered by clever code), I found that we're flushing
> > > > > the same page multiple times.
> > > >
> > > > I'm going to queue this patch up today since I believe it is correct,
> > > > though it would be nice to have a few more people test it on pre-ARMv6
> > > > hardware.
> > >
> > > Tested here with gcc and glibc builds and testsuite runs on IXP420,
> > > IOP80219, and Kirkwood (Feroceon?), all ARMv5TE. No signs of problems.
> >
> > Do the builds appear to be any faster?
>
> I didn't record how long it took because I was mainly interested in
> the correctness aspect. I can do a pair of properly measured gcc
> builds on the fastest machine next weekend.
On my Kirkwood (1.2GHz QNAP TS-119) the time to do a gcc-4.4 bootstrap
(all default languages) and testsuite run is 87561 seconds with a
vanilla 2.6.32 kernel and 86719 seconds when this patch is applied.
That's a 1% improvement.
Tested-by: Mikael Pettersson <mikpe@it.uu.se>
^ permalink raw reply [flat|nested] 12+ messages in thread
* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-07 22:12 [CFT:PATCH] Remove a seemingly unnecessary cache flush Russell King - ARM Linux
2009-12-10 10:40 ` Russell King - ARM Linux
@ 2009-12-10 16:04 ` Mike Rapoport
2009-12-18 16:09 ` Russell King - ARM Linux
2010-03-09 15:46 ` Mikael Pettersson
3 siblings, 0 replies; 12+ messages in thread
From: Mike Rapoport @ 2009-12-10 16:04 UTC (permalink / raw)
To: linux-arm-kernel
Hello Russell,
I've tested boot and hackbench on PXA270.
cm-debian:~# cat /proc/cpuinfo
Processor : XScale-PXA270 rev 7 (v5l)
BogoMIPS : 311.29
Features : swp half thumb fastmult edsp iwmmxt
CPU implementer : 0x69
CPU architecture: 5TE
CPU variant : 0x0
CPU part : 0x411
CPU revision : 7
Hardware : Compulab CM-X2XX
Revision : 0000
Serial : 0000000000000000
--
Sincerely yours,
Mike.
^ permalink raw reply [flat|nested] 12+ messages in thread
* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-07 22:12 [CFT:PATCH] Remove a seemingly unnecessary cache flush Russell King - ARM Linux
2009-12-10 10:40 ` Russell King - ARM Linux
2009-12-10 16:04 ` Mike Rapoport
@ 2009-12-18 16:09 ` Russell King - ARM Linux
2010-03-09 15:46 ` Mikael Pettersson
3 siblings, 0 replies; 12+ messages in thread
From: Russell King - ARM Linux @ 2009-12-18 16:09 UTC (permalink / raw)
To: linux-arm-kernel
Could I have some Tested-by: lines for this patch please?
On Mon, Dec 07, 2009 at 10:12:10PM +0000, Russell King - ARM Linux wrote:
> While looking into the possible msync() issue (which turned out to be a
> non-issue - it's covered by clever code), I found that we're flushing
> the same page multiple times.
>
> Normally, update_mmu_cache() would not be doing anything on architectures
> which don't have a software-loaded TLB. However, we use this hook for
> two things:
>
> 1. for our delayed flush_dcache_page().
> 2. to fix up multiple shared write-able mappings of the same page.
>
> However, as of 8 years ago, I committed this patch (sorry for the
> bkbits URL):
>
> http://linux.bkbits.net:8080/linux-2.6/?PAGE=patch&REV=3dd7c1f9DypjNd1HyQFAOsA3p7mYig
>
> What the majority of that patch is doing is adding the user-mapping
> flushing to __flush_dcache_page() - that's fine, and it's explained in
> the changeset comments.
>
> It also tries to merge the new __flush_dcache_page() code inside
> make_coherent() - make_coherent() is sort-of already doing most of what
> __flush_dcache_page() does, and for good measure it does a final
> flush_cache_page().
>
> The unused addition of 'dirty' seems to suggest that this extra
> flush_cache_page should have been conditional on that.
>
> Moreover, this final flush_cache_page() should not be needed - the whole
> "delayed flush_dcache_page" thing only happens when there are no pre-
> existing mappings of the page, and so there should be no user mappings
> to worry about.
>
> Hence, I believe it is safe to get rid of this - but this patch will
> need some rigorous testing. My ARM VersatilePB926 still boots without
> this additional patch - but that's not a very good test. I can't say
> much more than that for the time being.
>
> Note: for ARMv6 and later, flush_cache_page() is a no-op, so please
> don't bother testing this patch there.
>
> Signed-off-by: Russell King <rmk+kernel@arm.linux.org.uk>
> ---
> arch/arm/mm/fault-armv.c | 2 --
> 1 files changed, 0 insertions(+), 2 deletions(-)
>
> diff --git a/arch/arm/mm/fault-armv.c b/arch/arm/mm/fault-armv.c
> index 7296022..666a871 100644
> --- a/arch/arm/mm/fault-armv.c
> +++ b/arch/arm/mm/fault-armv.c
> @@ -127,8 +127,6 @@ make_coherent(struct address_space *mapping, struct vm_area_struct *vma, unsigne
> flush_dcache_mmap_unlock(mapping);
> if (aliases)
> adjust_pte(vma, addr);
> - else
> - flush_cache_page(vma, addr, pfn);
> }
>
> /*
>
> _______________________________________________
> linux-arm-kernel mailing list
> linux-arm-kernel at lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
^ permalink raw reply [flat|nested] 12+ messages in thread
* [CFT:PATCH] Remove a seemingly unnecessary cache flush
2009-12-07 22:12 [CFT:PATCH] Remove a seemingly unnecessary cache flush Russell King - ARM Linux
` (2 preceding siblings ...)
2009-12-18 16:09 ` Russell King - ARM Linux
@ 2010-03-09 15:46 ` Mikael Pettersson
2010-03-09 17:20 ` Russell King - ARM Linux
3 siblings, 1 reply; 12+ messages in thread
From: Mikael Pettersson @ 2010-03-09 15:46 UTC (permalink / raw)
To: linux-arm-kernel
On Mon, 7 Dec 2009, Russell King - ARM Linux wrote:
> While looking into the possible msync() issue (which turned out to be a
> non-issue - it's covered by clever code), I found that we're flushing
> the same page multiple times.
>
> Normally, update_mmu_cache() would not be doing anything on architectures
> which don't have a software-loaded TLB. However, we use this hook for
> two things:
>
> 1. for our delayed flush_dcache_page().
> 2. to fix up multiple shared write-able mappings of the same page.
>
> However, as of 8 years ago, I committed this patch (sorry for the
> bkbits URL):
>
> http://linux.bkbits.net:8080/linux-2.6/?PAGE=patch&REV=3dd7c1f9DypjNd1HyQFAOsA3p7mYig
>
> What the majority of that patch is doing is adding the user-mapping
> flushing to __flush_dcache_page() - that's fine, and it's explained in
> the changeset comments.
>
> It also tries to merge the new __flush_dcache_page() code inside
> make_coherent() - make_coherent() is sort-of already doing most of what
> __flush_dcache_page() does, and for good measure it does a final
> flush_cache_page().
>
> The unused addition of 'dirty' seems to suggest that this extra
> flush_cache_page should have been conditional on that.
>
> Moreover, this final flush_cache_page() should not be needed - the whole
> "delayed flush_dcache_page" thing only happens when there are no pre-
> existing mappings of the page, and so there should be no user mappings
> to worry about.
>
> Hence, I believe it is safe to get rid of this - but this patch will
> need some rigorous testing. My ARM VersatilePB926 still boots without
> this additional patch - but that's not a very good test. I can't say
> much more than that for the time being.
>
> Note: for ARMv6 and later, flush_cache_page() is a no-op, so please
> don't bother testing this patch there.
>
> Signed-off-by: Russell King <rmk+kernel@arm.linux.org.uk>
> ---
> arch/arm/mm/fault-armv.c | 2 --
> 1 files changed, 0 insertions(+), 2 deletions(-)
>
> diff --git a/arch/arm/mm/fault-armv.c b/arch/arm/mm/fault-armv.c
> index 7296022..666a871 100644
> --- a/arch/arm/mm/fault-armv.c
> +++ b/arch/arm/mm/fault-armv.c
> @@ -127,8 +127,6 @@ make_coherent(struct address_space *mapping, struct vm_area_struct *vma, unsigne
> flush_dcache_mmap_unlock(mapping);
> if (aliases)
> adjust_pte(vma, addr);
> - else
> - flush_cache_page(vma, addr, pfn);
> }
>
> /*
>
> _______________________________________________
> linux-arm-kernel mailing list
> linux-arm-kernel at lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
What's the status of this patch? I see that it's still not included
in the 2.6.34-rc1 kernel. Did it turn out to be invalid, or was it
just forgotten?
As I reported earlier, it sped up an important use case for me by 1%,
so I've been using it in my ARM kernels for almost 3 months now.
/Mikael
^ permalink raw reply [flat|nested] 12+ messages in thread
[parent not found: <20091207203258.GH26821@n2100.arm.linux.org.uk>]
end of thread, other threads:[~2010-03-09 19:15 UTC | newest]
Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-12-07 22:12 [CFT:PATCH] Remove a seemingly unnecessary cache flush Russell King - ARM Linux
2009-12-10 10:40 ` Russell King - ARM Linux
2009-12-14 11:50 ` Mikael Pettersson
2009-12-14 11:53 ` Russell King - ARM Linux
2009-12-14 12:15 ` Mikael Pettersson
2009-12-20 19:32 ` Mikael Pettersson
2009-12-10 16:04 ` Mike Rapoport
2009-12-18 16:09 ` Russell King - ARM Linux
2010-03-09 15:46 ` Mikael Pettersson
2010-03-09 17:20 ` Russell King - ARM Linux
2010-03-09 19:15 ` Mikael Pettersson
[not found] <20091207203258.GH26821@n2100.arm.linux.org.uk>
2009-12-08 13:26 ` Uwe Kleine-König
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).