From: "Alexander van Heukelum" <heukelum@fastmail.fm>
To: "Joe Perches" <joe@perches.com>
Cc: "dean gaudet" <dean@arctic.org>,
"Harvey Harrison" <harvey.harrison@gmail.com>,
"Alexander van Heukelum" <heukelum@mailshack.com>,
"Ingo Molnar" <mingo@elte.hu>, "Andi Kleen" <andi@firstfloor.org>,
"LKML" <linux-kernel@vger.kernel.org>
Subject: Re: Alternative implementation of the generic __ffs
Date: Sat, 19 Apr 2008 22:26:34 +0200 [thread overview]
Message-ID: <1208636794.26656.1248793071@webmail.messagingengine.com> (raw)
In-Reply-To: <1208629021.12388.25.camel@localhost>
On Sat, 19 Apr 2008 11:17:01 -0700, "Joe Perches" <joe@perches.com>
said:
> On Sat, 2008-04-19 at 14:10 +0200, Alexander van Heukelum wrote:
> > I've added that to the benchmark, which you can now find here:
> > http://heukelum.fastmail.fm/ffs/.
Thanks! Added the version you sent to the program and added the results
of the ARM processor to the page.
More ideas welcome ;).
> retested on arm:
>
> $ gcc -Os -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3170 tics, 8326 tics
> New: 4214 tics, 8793 tics
> Smallest: 4023 tics, 7733 tics
> Small const: 3442 tics, 6188 tics
> Empty loop: 1517 tics, 2243 tics
>
> $ gcc -O2 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3172 tics, 7832 tics
> New: 4805 tics, 8790 tics
> Smallest: 4405 tics, 7154 tics
> Small const: 3442 tics, 5612 tics
> Empty loop: 1516 tics, 2145 tics
>
> $ gcc -O3 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3080 tics, 7709 tics
> New: 4723 tics, 8656 tics
> Smallest: 4333 tics, 7121 tics
> Small const: 3379 tics, 5483 tics
> Empty loop: 1447 tics, 2016 tics
>
> > Testing the same with
> > "return x4 + x3 + x2 + x1 + x0;" as the last line would be
> > interesting too.
>
> Adding is slower:
>
> $ gcc -Os -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3152 tics, 8310 tics
> New: 4214 tics, 8789 tics
> Smallest: 4024 tics, 7737 tics
> Small const: 3538 tics, 6295 tics
> Empty loop: 1517 tics, 2243 tics
>
> $ gcc -O2 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3184 tics, 7849 tics
> New: 4790 tics, 8814 tics
> Smallest: 4406 tics, 7161 tics
> Small const: 3538 tics, 5806 tics
> Empty loop: 1521 tics, 2153 tics
>
> $ gcc -O3 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3091 tics, 7694 tics
> New: 4718 tics, 8656 tics
> Smallest: 4333 tics, 7124 tics
> Small const: 3467 tics, 5687 tics
> Empty loop: 1445 tics, 2066 tics
>
>
--
Alexander van Heukelum
heukelum@fastmail.fm
--
http://www.fastmail.fm - And now for something completely different
next prev parent reply other threads:[~2008-04-19 20:26 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-03-31 17:15 [PATCH] x86: generic versions of find_first_(zero_)bit, convert i386 Alexander van Heukelum
2008-03-31 17:22 ` Stephen Hemminger
2008-03-31 19:38 ` Alexander van Heukelum
2008-03-31 21:58 ` Andi Kleen
2008-04-01 8:47 ` Ingo Molnar
2008-04-01 9:46 ` Alexander van Heukelum
2008-04-01 15:41 ` [PATCH] x86: switch x86_64 to generic find_first_bit Alexander van Heukelum
2008-04-01 15:42 ` [PATCH] x86: optimize find_first_bit for small bitmaps Alexander van Heukelum
2008-04-01 15:47 ` [PATCH] x86: remove x86-specific implementations of find_first_bit Alexander van Heukelum
2008-04-03 9:34 ` Alexander van Heukelum
2008-04-04 8:47 ` Ingo Molnar
2008-04-06 17:03 ` [PATCH] x86: generic versions of find_first_(zero_)bit, convert i386 dean gaudet
2008-04-06 18:51 ` Alexander van Heukelum
2008-04-06 20:22 ` dean gaudet
2008-04-07 8:43 ` Ingo Molnar
2008-04-07 10:25 ` Alexander van Heukelum
2008-04-18 20:18 ` Alternative implementation of the generic __ffs Alexander van Heukelum
2008-04-18 23:46 ` dean gaudet
2008-04-19 0:09 ` Harvey Harrison
2008-04-19 0:20 ` dean gaudet
2008-04-19 0:58 ` Joe Perches
2008-04-19 1:04 ` Harvey Harrison
2008-04-19 1:11 ` dean gaudet
2008-04-19 2:55 ` Joe Perches
2008-04-19 4:13 ` dean gaudet
2008-04-19 10:05 ` Mikael Pettersson
2008-04-19 12:10 ` Alexander van Heukelum
2008-04-19 18:17 ` Joe Perches
2008-04-19 20:26 ` Alexander van Heukelum [this message]
2008-04-19 22:29 ` Matti Aarnio
2008-04-20 3:06 ` Joe Perches
2008-04-20 8:42 ` Alexander van Heukelum
2008-04-20 12:31 ` Matti Aarnio
2008-04-21 11:43 ` Alexander van Heukelum
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1208636794.26656.1248793071@webmail.messagingengine.com \
--to=heukelum@fastmail.fm \
--cc=andi@firstfloor.org \
--cc=dean@arctic.org \
--cc=harvey.harrison@gmail.com \
--cc=heukelum@mailshack.com \
--cc=joe@perches.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.