Re: Alternative implementation of the generic __ffs

linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

From: "Alexander van Heukelum" <heukelum@fastmail.fm>
To: "dean gaudet" <dean@arctic.org>, "Joe Perches" <joe@perches.com>
Cc: "Harvey Harrison" <harvey.harrison@gmail.com>,
	"Alexander van Heukelum" <heukelum@mailshack.com>,
	"Ingo Molnar" <mingo@elte.hu>, "Andi Kleen" <andi@firstfloor.org>,
	"LKML" <linux-kernel@vger.kernel.org>
Subject: Re: Alternative implementation of the generic __ffs
Date: Sat, 19 Apr 2008 14:10:19 +0200	[thread overview]
Message-ID: <1208607019.13829.1248748343@webmail.messagingengine.com> (raw)
In-Reply-To: <alpine.DEB.1.10.0804182112360.6108@twinlark.arctic.org>

On Fri, 18 Apr 2008 21:13:47 -0700 (PDT), "dean gaudet"
<dean@arctic.org> said:
> On Fri, 18 Apr 2008, Joe Perches wrote:
> > On Fri, 2008-04-18 at 18:11 -0700, dean gaudet wrote: 
> > > have you benchmarked it?
> > 
> > I modified Alexander's benchmark:
> > http://lkml.org/lkml/2008/4/18/267
> > to include 32 and 64 bit variants called smallest.
> > 
> > On an old ARM:
> 
> i'm guessing the 32-bit constants suck :(
> 
> the code could be modified to use 16-bit constants only -- it would add 
> some dependent operations though (to move the hot bit into the low 
> 16-bits).
> 
> -dean

That would look like this (although I chose to reduce to less than 128,
due to completely irrelevant x86 considerations ;) ).

static ATTR int __ffs32_smallconstant(unsigned int value)
{
        int x0, x1, x2, x3, x4;
        unsigned int t2, t4;

        value &= -value;
        t2 = value | (value >> 16);
        t4 = t2 | (t2 >> 8);
        x4 = (value << 16) ? 0 : 16;
        x3 = (t2 << 24) ? 0 : 8;
        x2 = (t4 & 0x0f) ? 0 : 4;
        x1 = (t4 & 0x33) ? 0 : 2;
        x0 = (t4 & 0x55) ? 0 : 1;

        return x4 | x3 | x2 | x1 | x0;
}

I've added that to the benchmark, which you can now find here:
http://heukelum.fastmail.fm/ffs/. Testing the same with 
"return x4 + x3 + x2 + x1 + x0;" as the last line would be
interesting too.

Greetings,
    Alexander

> > $ gcc --version gcc (GCC) 3.4.6
> > 
> > $ cat /proc/cpuinfo 
> > Processor	: Intel StrongARM-110 rev 4 (v4l)
> > BogoMIPS	: 262.14
> > Hardware	: Rebel-NetWinder
> > Revision	: 57ff
> > Serial		: 000000000000185c
> > 
> > $ gcc -Os -fomit-frame-pointer ffs.c
> > $ ./a.out
> > Original:       3180 tics,  8379 tics
> > New:            4280 tics,  8890 tics
> > Smallest:       4027 tics,  7835 tics
> > Empty loop:     1543 tics,  2260 tics
> > 
> > $ gcc -O2 -fomit-frame-pointer ffs.c
> > $ ./a.out
> > Original:       3161 tics,  7843 tics
> > New:            4778 tics,  8783 tics
> > Smallest:       4408 tics,  7149 tics
> > Empty loop:     1515 tics,  2140 tics
> > 
> > $ gcc -O3 -fomit-frame-pointer ffs.c
> > $ ./a.out
> > Original:       3078 tics,  7692 tics
> > New:            4714 tics,  8671 tics
> > Smallest:       4344 tics,  7117 tics
> > Empty loop:     1444 tics,  2024 tics

Thanks for testing, Harvey!
-- 
  Alexander van Heukelum
  heukelum@fastmail.fm

-- 
http://www.fastmail.fm - IMAP accessible web-mail

next prev parent reply	other threads:[~2008-04-19 12:10 UTC|newest]

Thread overview: 34+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-03-31 17:15 [PATCH] x86: generic versions of find_first_(zero_)bit, convert i386 Alexander van Heukelum
2008-03-31 17:22 ` Stephen Hemminger
2008-03-31 19:38   ` Alexander van Heukelum
2008-03-31 21:58     ` Andi Kleen
2008-04-01  8:47 ` Ingo Molnar
2008-04-01  9:46   ` Alexander van Heukelum
2008-04-01 15:41     ` [PATCH] x86: switch x86_64 to generic find_first_bit Alexander van Heukelum
2008-04-01 15:42       ` [PATCH] x86: optimize find_first_bit for small bitmaps Alexander van Heukelum
2008-04-01 15:47         ` [PATCH] x86: remove x86-specific implementations of find_first_bit Alexander van Heukelum
2008-04-03  9:34           ` Alexander van Heukelum
2008-04-04  8:47           ` Ingo Molnar
2008-04-06 17:03     ` [PATCH] x86: generic versions of find_first_(zero_)bit, convert i386 dean gaudet
2008-04-06 18:51       ` Alexander van Heukelum
2008-04-06 20:22         ` dean gaudet
2008-04-07  8:43           ` Ingo Molnar
2008-04-07 10:25           ` Alexander van Heukelum
2008-04-18 20:18             ` Alternative implementation of the generic __ffs Alexander van Heukelum
2008-04-18 23:46               ` dean gaudet
2008-04-19  0:09                 ` Harvey Harrison
2008-04-19  0:20                   ` dean gaudet
2008-04-19  0:58                     ` Joe Perches
2008-04-19  1:04                       ` Harvey Harrison
2008-04-19  1:11                         ` dean gaudet
2008-04-19  2:55                           ` Joe Perches
2008-04-19  4:13                             ` dean gaudet
2008-04-19 10:05                               ` Mikael Pettersson
2008-04-19 12:10                               ` Alexander van Heukelum [this message]
2008-04-19 18:17                                 ` Joe Perches
2008-04-19 20:26                                   ` Alexander van Heukelum
2008-04-19 22:29                             ` Matti Aarnio
2008-04-20  3:06                               ` Joe Perches
2008-04-20  8:42                                 ` Alexander van Heukelum
2008-04-20 12:31                                   ` Matti Aarnio
2008-04-21 11:43                                     ` Alexander van Heukelum

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1208607019.13829.1248748343@webmail.messagingengine.com \
    --to=heukelum@fastmail.fm \
    --cc=andi@firstfloor.org \
    --cc=dean@arctic.org \
    --cc=harvey.harrison@gmail.com \
    --cc=heukelum@mailshack.com \
    --cc=joe@perches.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).