From: Linus Torvalds <torvalds@osdl.org>
To: Junio C Hamano <junkio@cox.net>
Cc: git@vger.kernel.org, Davide Libenzi <davidel@xmailserver.org>,
Jim Meyering <jim@meyering.net>
Subject: Re: git-diff-tree inordinately (O(M*N)) slow on files with many changes
Date: Mon, 16 Oct 2006 16:24:05 -0700 (PDT) [thread overview]
Message-ID: <Pine.LNX.4.64.0610161604360.3962@g5.osdl.org> (raw)
In-Reply-To: <7vy7rfub36.fsf@assigned-by-dhcp.cox.net>
On Mon, 16 Oct 2006, Junio C Hamano wrote:
>
> I agree (although I am not sure about the "do it twice for
> small" bit), and I think Davide agrees with you in his reply:
Sure. Davide's all-macro version is fine. I don't like re-using the same
value twice even in a ALL-CAPS macro, so I'm used to inline functions, but
all the uses of XDL_HASHLONG() are fine with multiple uses of the
arguments.
Somebody should just double-check that all the parentheses ended up being
right ;)
It might be easier to read if you write it as
#define BITS_IN_LONG (CHAR_BIT * sizeof(unsigned long))
#define XDL_HIGHBITS(v,b) ((v) >> (BITS_IN_LONG - (b)))
#define XDL_MASKBITS(b) ((1UL << (b)) - 1)
#define XDL_HASHBITS(v,b) (((v) + XDL_HIGHBITS(v,b)) & XDL_MASKBITS(b))
#define XDL_HASHLONG(v,b) XDL_HASHBITS( (unsigned long)(v) , b )
just to avoid one huge #define.
That said, it unnecessarily calculates "BITS_IN_LONG - (b)" to shift with,
because it really shouldn't matter _which_ high bits you use for hashing,
so you might as well just use the "next" b bits, and have
#define XDL_ADDBITS(v,b) ((v) + ((v) >> (b)))
#define XDL_MASKBITS(b) ((1UL << (b)) - 1)
#define XDL_HASHLONG(v,b) (XDL_ADDBITS((unsigned long)(v), b) & XDL_MASKBITS(b))
which generates better code at least on x86 (and x86-64), because the
shift count stays the same for all shifts and can thus be kept in %ecx.
For example, on x86-64, you get
movq %rdi, %rax # copy 'val'
movl $1, %edx # const 1: start generating (1 << b) - 1
shrq %cl, %rax # val >> b
salq %cl, %rdx # 1 << b
leaq (%rdi,%rax), %rax # val + (val >> b)
subq $1, %rdx # (1 << b) -1
andq %rdx, %rax # final hash
which is short and sweet. And on ppc32 (or ppc64) you get
li 9,1 # const 1: start generating (1 << b) - 1
srw 0,3,4 # val >> b
slw 9,9,4 # 1 << b
add 0,0,3 # val + (val >> b)
addi 9,9,-1 # (1 << b) - 1
and 3,0,9 # final hash
in other words, apart from having two shifts (which you can't really
avoid, although a multiply can do one of them) it's just a very efficient
way to mix together (2*b) bits into a (b)-bit hash.
But taking the high bits from the "unsigned long" doesn't add _that_ much
cost. I just suspect that it's a good way to continue to get different
answers on 32-bit and 64-bit architectures.
Linus
next prev parent reply other threads:[~2006-10-16 23:24 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-10-16 14:12 git-diff-tree inordinately (O(M*N)) slow on files with many changes Jim Meyering
2006-10-16 15:47 ` Linus Torvalds
2006-10-16 16:12 ` Linus Torvalds
2006-10-16 16:33 ` Jim Meyering
2006-10-16 16:42 ` Davide Libenzi
2006-10-16 16:50 ` Jim Meyering
2006-10-16 16:54 ` Davide Libenzi
2006-10-16 16:57 ` Jim Meyering
2006-10-16 17:02 ` Davide Libenzi
2006-10-16 17:56 ` Linus Torvalds
2006-10-16 18:03 ` Linus Torvalds
2006-10-16 18:41 ` Davide Libenzi
2006-10-16 18:18 ` Davide Libenzi
2006-10-16 18:51 ` Linus Torvalds
2006-10-16 19:44 ` Davide Libenzi
2006-10-16 20:29 ` Jakub Narebski
2006-10-16 22:53 ` Junio C Hamano
2006-10-16 23:24 ` Linus Torvalds [this message]
2006-10-16 23:52 ` Davide Libenzi
2006-10-16 18:24 ` Jim Meyering
2006-10-16 18:30 ` Davide Libenzi
2006-10-16 18:43 ` Jim Meyering
2006-10-16 16:54 ` Linus Torvalds
2006-10-16 16:36 ` Davide Libenzi
2006-10-16 16:57 ` Linus Torvalds
2006-10-16 16:24 ` Davide Libenzi
2006-10-16 16:54 ` Jakub Narebski
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Pine.LNX.4.64.0610161604360.3962@g5.osdl.org \
--to=torvalds@osdl.org \
--cc=davidel@xmailserver.org \
--cc=git@vger.kernel.org \
--cc=jim@meyering.net \
--cc=junkio@cox.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).