From: robherring2@gmail.com (Rob Herring)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH 0/5] Kernel mode NEON for XOR and RAID6
Date: Thu, 06 Jun 2013 18:08:30 -0500 [thread overview]
Message-ID: <51B1166E.10303@gmail.com> (raw)
In-Reply-To: <alpine.LFD.2.03.1306061156330.18597@syhkavp.arg>
On 06/06/2013 11:17 AM, Nicolas Pitre wrote:
> On Thu, 6 Jun 2013, Will Deacon wrote:
>
>> On Thu, Jun 06, 2013 at 04:03:00PM +0100, Ard Biesheuvel wrote:
>>> Hi all,
>>
>> Hi Ard,
>>
>>> This is a partial repost of the patches I proposed a couple of weeks ago to add
>>> support for VFP/NEON in kernel mode.
>>>
>>> This time, I have included two use cases that I have been using, XOR and RAID-6
>>> checksumming. The former gets a 60% performance boost on the NEON, the latter
>>> over 400%.
>>
>> Whilst that sounds impressive, can you achieve similar results across all
>> NEON-capable CPUs? In particular, we need to make sure this doesn't cause
>> performance regressions on some cores.
>
> Note that the kernel performs runtime benchmarking of all the different
> implementations it has available at boot time and selects the best one.
> So if this would turn out to make things worse on some cores then the
> Neon code would simply not be used.
>
>> Furthermore, do you have any power figures to complement your
>> findings?
>
> This is going to be most useful in server type environments where a bit
> more power is not such an issue but throughput is ... unless you start
> using RAID6 arrays on your phone that is. :-) Otherwise this can be
> left configured out for mobile targets.
Agreed. Any power difference will be noise for a server.
Rob
>> The increased context-switch overhead
>> is also worth measuring if you can (i.e. run some userspace NEON-based
>> benchmarks in parallel with NEON and non-NEON implementations of the
>> checksumming).
>
> Do we know the context switch cost of normal task scheduling between
> tasks using FP operations? The in-kernel Neon usage should bring about
> the same cost. Measuring it would be interesting albeit probably
> difficult.
>
>> We support building the kernel with older toolchains, so I don't see the
>> benefit of using intrinsics here.
>
> These days the compiler tends to do a better job than humans at properly
> scheduling instructions for some code. We shouldn't deprive ourselves
> from it when a recent enough gcc is available.
>
>
> Nicolas
>
> _______________________________________________
> linux-arm-kernel mailing list
> linux-arm-kernel at lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
>
next prev parent reply other threads:[~2013-06-06 23:08 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-06-06 15:03 [PATCH 0/5] Kernel mode NEON for XOR and RAID6 Ard Biesheuvel
2013-06-06 15:03 ` [PATCH 1/5] ARM: add support for kernel mode NEON Ard Biesheuvel
2013-06-06 15:03 ` [PATCH 2/5] ARM: move VFP init to an earlier boot stage Ard Biesheuvel
2013-06-06 15:03 ` [PATCH 3/5] ARM: be strict about FP exceptions in kernel mode Ard Biesheuvel
2013-06-06 15:03 ` [PATCH 4/5] ARM: crypto: add NEON accelerated XOR implementation Ard Biesheuvel
2013-06-06 15:45 ` Nicolas Pitre
2013-06-06 15:03 ` [PATCH 5/5] lib/raid6: add ARM-NEON accelerated syndrome calculation Ard Biesheuvel
2013-06-06 15:55 ` Nicolas Pitre
2013-06-06 15:17 ` [PATCH 0/5] Kernel mode NEON for XOR and RAID6 Will Deacon
2013-06-06 15:52 ` Ard Biesheuvel
2013-06-06 16:17 ` Nicolas Pitre
2013-06-06 23:08 ` Rob Herring [this message]
2013-06-07 17:50 ` Will Deacon
2013-06-07 19:49 ` Ard Biesheuvel
2013-06-08 3:09 ` Nicolas Pitre
2013-06-21 9:33 ` Will Deacon
2013-06-21 10:08 ` Ard Biesheuvel
2013-06-21 14:58 ` Christopher Covington
2013-06-24 8:08 ` Ard Biesheuvel
2013-06-24 8:54 ` Russell King - ARM Linux
2013-06-24 9:10 ` Ard Biesheuvel
2013-06-25 13:56 ` Dave Martin
2013-06-25 14:14 ` Ard Biesheuvel
2013-06-25 14:29 ` Dave Martin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51B1166E.10303@gmail.com \
--to=robherring2@gmail.com \
--cc=linux-arm-kernel@lists.infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.