From: "Alexander van Heukelum" <heukelum@fastmail.fm>
To: "Joe Perches" <joe@perches.com>
Cc: "dean gaudet" <dean@arctic.org>,
"Harvey Harrison" <harvey.harrison@gmail.com>,
"Alexander van Heukelum" <heukelum@mailshack.com>,
"Ingo Molnar" <mingo@elte.hu>, "Andi Kleen" <andi@firstfloor.org>,
"LKML" <linux-kernel@vger.kernel.org>
Subject: Re: Alternative implementation of the generic __ffs
Date: Sat, 19 Apr 2008 22:26:34 +0200
Message-ID: <1208636794.26656.1248793071@webmail.messagingengine.com>
In-Reply-To: <1208629021.12388.25.camel@localhost>

On Sat, 19 Apr 2008 11:17:01 -0700, "Joe Perches" <joe@perches.com> said:
> On Sat, 2008-04-19 at 14:10 +0200, Alexander van Heukelum wrote:
> > I've added that to the benchmark, which you can now find here:
> > http://heukelum.fastmail.fm/ffs/.
Thanks! I've added the version you sent to the benchmark program, and
put the results from the ARM processor on the page.
More ideas are welcome ;).
> retested on arm:
>
> $ gcc -Os -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3170 tics, 8326 tics
> New: 4214 tics, 8793 tics
> Smallest: 4023 tics, 7733 tics
> Small const: 3442 tics, 6188 tics
> Empty loop: 1517 tics, 2243 tics
>
> $ gcc -O2 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3172 tics, 7832 tics
> New: 4805 tics, 8790 tics
> Smallest: 4405 tics, 7154 tics
> Small const: 3442 tics, 5612 tics
> Empty loop: 1516 tics, 2145 tics
>
> $ gcc -O3 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3080 tics, 7709 tics
> New: 4723 tics, 8656 tics
> Smallest: 4333 tics, 7121 tics
> Small const: 3379 tics, 5483 tics
> Empty loop: 1447 tics, 2016 tics
>
> > Testing the same with
> > "return x4 + x3 + x2 + x1 + x0;" as the last line would be
> > interesting too.
>
> Adding is slower:
>
> $ gcc -Os -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3152 tics, 8310 tics
> New: 4214 tics, 8789 tics
> Smallest: 4024 tics, 7737 tics
> Small const: 3538 tics, 6295 tics
> Empty loop: 1517 tics, 2243 tics
>
> $ gcc -O2 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3184 tics, 7849 tics
> New: 4790 tics, 8814 tics
> Smallest: 4406 tics, 7161 tics
> Small const: 3538 tics, 5806 tics
> Empty loop: 1521 tics, 2153 tics
>
> $ gcc -O3 -fomit-frame-pointer ffs.c
> $ ./a.out
> Original: 3091 tics, 7694 tics
> New: 4718 tics, 8656 tics
> Smallest: 4333 tics, 7124 tics
> Small const: 3467 tics, 5687 tics
> Empty loop: 1445 tics, 2066 tics
>
>
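For readers without ffs.c at hand: the benchmark source is not included
in this message, so the following is only a sketch of what a branchless
__ffs with partial results x4..x0 might look like, and of the two ways of
combining them on the last line. The function name ffs_sketch, the use_add
flag and the 32-bit word size are assumptions made for illustration; they
are not taken from the benchmark.

```c
#include <stdint.h>

/*
 * Sketch only: index of the lowest set bit of a non-zero 32-bit word.
 * Each step tests whether the low half of the remaining range is empty
 * and, if so, shifts it away. x4..x0 hold 16, 8, 4, 2, 1 or 0.
 * use_add selects "+" instead of "|" for the final combination.
 */
static unsigned ffs_sketch(uint32_t word, int use_add)
{
	uint32_t x4, x3, x2, x1, x0;

	x4 = ((word & 0xffffu) == 0) << 4;	/* low 16 bits empty? */
	word >>= x4;
	x3 = ((word & 0xffu) == 0) << 3;	/* low 8 bits empty? */
	word >>= x3;
	x2 = ((word & 0xfu) == 0) << 2;		/* low 4 bits empty? */
	word >>= x2;
	x1 = ((word & 0x3u) == 0) << 1;		/* low 2 bits empty? */
	word >>= x1;
	x0 = (word & 0x1u) == 0;		/* lowest bit empty? */

	/* The partial results occupy distinct bits, so both forms agree. */
	return use_add ? x4 + x3 + x2 + x1 + x0
		       : x4 | x3 | x2 | x1 | x0;
}
```

Since each partial result is either zero or a distinct power of two, the
two return expressions always give the same value; any difference in the
timings would come from the instructions the compiler picks, not from the
result.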
--
Alexander van Heukelum
heukelum@fastmail.fm
--
http://www.fastmail.fm - And now for something completely different