qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* neon acceleration via mmx/sse (was: Re: [Qemu-devel] CFP: 1st International QEMU Users Forum)
@ 2010-11-28 12:11 Peter Maydell
  2010-11-28 22:39 ` Frédéric Pétrot
  0 siblings, 1 reply; 2+ messages in thread
From: Peter Maydell @ 2010-11-28 12:11 UTC (permalink / raw)
  To: Frédéric Pétrot
  Cc: wolfgang mueller, Nathan Froyd, Arnd Bergmann,
	QEMU-devel Developers, Alexander Graf, Wolfgang Mueller

2010/11/28 Frédéric Pétrot <Frederic.Petrot@imag.fr>:
> PS: We have indeed ourselves worked on the acceleration of the neon support
>    (neon on mmx/sse instead of helpers)

Slight tangent, but: How well did you find that worked?
Were you trying to retain bit-for-bit accuracy in the results?

-- PMM

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: neon acceleration via mmx/sse (was: Re: [Qemu-devel] CFP: 1st International QEMU Users Forum)
  2010-11-28 12:11 neon acceleration via mmx/sse (was: Re: [Qemu-devel] CFP: 1st International QEMU Users Forum) Peter Maydell
@ 2010-11-28 22:39 ` Frédéric Pétrot
  0 siblings, 0 replies; 2+ messages in thread
From: Frédéric Pétrot @ 2010-11-28 22:39 UTC (permalink / raw)
  To: Peter Maydell
  Cc: wolfgang mueller, Nathan Froyd, Arnd Bergmann,
	QEMU-devel Developers, Alexander Graf, Wolfgang Mueller

Peter Maydell a écrit :
> 2010/11/28 Frédéric Pétrot <Frederic.Petrot@imag.fr>:
>> PS: We have indeed ourselves worked on the acceleration of the neon support
>>    (neon on mmx/sse instead of helpers)
> 
> Slight tangent, but: How well did you find that worked?
> Were you trying to retain bit-for-bit accuracy in the results?
> 
> -- PMM
> 
Ok, we worked on the integer neon only, as in integrated devices, we
prefer fixed point for energy efficiency reasons (no plug, no fan,
specialized applications).
It works quite well, on synthetic benchmarks (loops with a growing
number of simd instructions) we have a good speedup (4 to 6x)
compared to the helper approach.
Amdahl's law may well reduce this to almost nothing for most applications,
but for video decoding and the like, it may have a value.
We (do our best to) have an exact translation, and we checked against
the arm of a beagleboard for the behavior.
Incidentally, we have an interactive paper at DATE this year on this very
topic.
Don't hesitate to drop me a mail if you want to have a look at it.
Frédéric Pétrot
--
+-----------------------------------------------------------------+
| Frédéric Pétrot, Pr. ENSIMAG-TIMA/SLS   Frederic.Petrot@imag.fr |
| Phone : +33 4 76 57 48 70               Fluctuat  nec  mergitur |
| Mobile: +33 6 74 57 99 65               Ad augusta  per angusta |
| Fax   : +33 4 76 57 49 81               Eppur si muove          |
+-----------------------------------------------------------------+

-- 
+-----------------------------------------------------------------+
| Frédéric Pétrot, Pr. ENSIMAG-TIMA/SLS   Frederic.Petrot@imag.fr |
| Phone : +33 4 76 57 48 70               Fluctuat  nec  mergitur |
| Mobile: +33 6 74 57 99 65               Ad augusta  per angusta |
| Fax   : +33 4 76 57 49 81               Eppur si muove          |
+-----------------------------------------------------------------+

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2010-11-28 22:39 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-11-28 12:11 neon acceleration via mmx/sse (was: Re: [Qemu-devel] CFP: 1st International QEMU Users Forum) Peter Maydell
2010-11-28 22:39 ` Frédéric Pétrot

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).