From: Joe Perches <joe@perches.com>
To: Alexander van Heukelum <heukelum@fastmail.fm>
Cc: dean gaudet <dean@arctic.org>,
Harvey Harrison <harvey.harrison@gmail.com>,
Alexander van Heukelum <heukelum@mailshack.com>,
Ingo Molnar <mingo@elte.hu>, Andi Kleen <andi@firstfloor.org>,
LKML <linux-kernel@vger.kernel.org>
Subject: Re: Alternative implementation of the generic __ffs
Date: Sat, 19 Apr 2008 11:17:01 -0700 [thread overview]
Message-ID: <1208629021.12388.25.camel@localhost> (raw)
In-Reply-To: <1208607019.13829.1248748343@webmail.messagingengine.com>
On Sat, 2008-04-19 at 14:10 +0200, Alexander van Heukelum wrote:
> I've added that to the benchmark, which you can now find here:
> http://heukelum.fastmail.fm/ffs/.
retested on arm:
$ gcc -Os -fomit-frame-pointer ffs.c
$ ./a.out
Original: 3170 tics, 8326 tics
New: 4214 tics, 8793 tics
Smallest: 4023 tics, 7733 tics
Small const: 3442 tics, 6188 tics
Empty loop: 1517 tics, 2243 tics
$ gcc -O2 -fomit-frame-pointer ffs.c
$ ./a.out
Original: 3172 tics, 7832 tics
New: 4805 tics, 8790 tics
Smallest: 4405 tics, 7154 tics
Small const: 3442 tics, 5612 tics
Empty loop: 1516 tics, 2145 tics
$ gcc -O3 -fomit-frame-pointer ffs.c
$ ./a.out
Original: 3080 tics, 7709 tics
New: 4723 tics, 8656 tics
Smallest: 4333 tics, 7121 tics
Small const: 3379 tics, 5483 tics
Empty loop: 1447 tics, 2016 tics
> Testing the same with
> "return x4 + x3 + x2 + x1 + x0;" as the last line would be
> interesting too.
Adding is slower:
$ gcc -Os -fomit-frame-pointer ffs.c
$ ./a.out
Original: 3152 tics, 8310 tics
New: 4214 tics, 8789 tics
Smallest: 4024 tics, 7737 tics
Small const: 3538 tics, 6295 tics
Empty loop: 1517 tics, 2243 tics
$ gcc -O2 -fomit-frame-pointer ffs.c
$ ./a.out
Original: 3184 tics, 7849 tics
New: 4790 tics, 8814 tics
Smallest: 4406 tics, 7161 tics
Small const: 3538 tics, 5806 tics
Empty loop: 1521 tics, 2153 tics
$ gcc -O3 -fomit-frame-pointer ffs.c
$ ./a.out
Original: 3091 tics, 7694 tics
New: 4718 tics, 8656 tics
Smallest: 4333 tics, 7124 tics
Small const: 3467 tics, 5687 tics
Empty loop: 1445 tics, 2066 tics
next prev parent reply other threads:[~2008-04-19 18:17 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-03-31 17:15 [PATCH] x86: generic versions of find_first_(zero_)bit, convert i386 Alexander van Heukelum
2008-03-31 17:22 ` Stephen Hemminger
2008-03-31 19:38 ` Alexander van Heukelum
2008-03-31 21:58 ` Andi Kleen
2008-04-01 8:47 ` Ingo Molnar
2008-04-01 9:46 ` Alexander van Heukelum
2008-04-01 15:41 ` [PATCH] x86: switch x86_64 to generic find_first_bit Alexander van Heukelum
2008-04-01 15:42 ` [PATCH] x86: optimize find_first_bit for small bitmaps Alexander van Heukelum
2008-04-01 15:47 ` [PATCH] x86: remove x86-specific implementations of find_first_bit Alexander van Heukelum
2008-04-03 9:34 ` Alexander van Heukelum
2008-04-04 8:47 ` Ingo Molnar
2008-04-06 17:03 ` [PATCH] x86: generic versions of find_first_(zero_)bit, convert i386 dean gaudet
2008-04-06 18:51 ` Alexander van Heukelum
2008-04-06 20:22 ` dean gaudet
2008-04-07 8:43 ` Ingo Molnar
2008-04-07 10:25 ` Alexander van Heukelum
2008-04-18 20:18 ` Alternative implementation of the generic __ffs Alexander van Heukelum
2008-04-18 23:46 ` dean gaudet
2008-04-19 0:09 ` Harvey Harrison
2008-04-19 0:20 ` dean gaudet
2008-04-19 0:58 ` Joe Perches
2008-04-19 1:04 ` Harvey Harrison
2008-04-19 1:11 ` dean gaudet
2008-04-19 2:55 ` Joe Perches
2008-04-19 4:13 ` dean gaudet
2008-04-19 10:05 ` Mikael Pettersson
2008-04-19 12:10 ` Alexander van Heukelum
2008-04-19 18:17 ` Joe Perches [this message]
2008-04-19 20:26 ` Alexander van Heukelum
2008-04-19 22:29 ` Matti Aarnio
2008-04-20 3:06 ` Joe Perches
2008-04-20 8:42 ` Alexander van Heukelum
2008-04-20 12:31 ` Matti Aarnio
2008-04-21 11:43 ` Alexander van Heukelum
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1208629021.12388.25.camel@localhost \
--to=joe@perches.com \
--cc=andi@firstfloor.org \
--cc=dean@arctic.org \
--cc=harvey.harrison@gmail.com \
--cc=heukelum@fastmail.fm \
--cc=heukelum@mailshack.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.