From: "René Scharfe" <l.s.r@web.de>
To: Jeff King <peff@peff.net>, Jonathan Nieder <jrnieder@gmail.com>,
Git Mailing List <git@vger.kernel.org>,
Junio C Hamano <gitster@pobox.com>
Subject: Re: [PATCH] fast-import: use hashcmp() for SHA1 hash comparison
Date: Sat, 19 Jul 2014 19:53:04 +0200 [thread overview]
Message-ID: <53CAB080.8060005@web.de> (raw)
In-Reply-To: <20140719164324.GA5616@vauxhall.crustytoothpaste.net>
Am 19.07.2014 18:43, schrieb brian m. carlson:
> On Sat, Jul 19, 2014 at 02:11:30PM +0200, René Scharfe wrote:
>> I'd say if a platform doesn't bother optimizing memcmp() then they
>> deserve the resulting performance. And it's probably not too bad a
>> penalty because such comparisons probably won't make up a significant
>> part of most applications.
>
> I tend to agree with this. On many modern versions of GCC, the compiler
> can generate an appropriately optimized inline version when it sees a
> memcmp call, so it's more of a compiler issue then, since no actual call
> to the function will be emitted.
I just found this open GCC bug entry about glibc memcmp being faster
than the inlined version of the compiler:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43052.
(Found through
http://randomascii.wordpress.com/2012/10/31/comparing-memory-is-still-tricky/,
which says that the compilers coming with Microsoft Visual Studio 2010
and 2012 are not optimizing memcmp() as much as they could as well.)
>> static inline int hashcmp(const unsigned char *sha1, const unsigned char *sha2)
>> {
>> + const uint32_t *p1 = (const uint32_t *)sha1;
>> + const uint32_t *p2 = (const uint32_t *)sha2;
>
> You can't make this cast. The guaranteed alignment for sha1 and sha2 is
> 1, and for p1 and p2, it's 4. If sha1 and sha2 are not suitably
> aligned, this will get a SIGBUS on sparc and possibly a wrong value on
> ARM[0].
>
> [0] http://www.aleph1.co.uk/chapter-10-arm-structured-alignment-faq
Yeah, it was just a test balloon that happens to work on amd64. We
could invent a hash type with correct alignment (a struct with a
uint32_t[5] member?) and replace all those unsigned char pointers if we
wanted to go with such a "vectorized" hashcmp, but that would be
maximally invasive.
René
prev parent reply other threads:[~2014-07-19 17:54 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-07-18 16:00 [PATCH] fast-import: use hashcmp() for SHA1 hash comparison René Scharfe
2014-07-18 18:42 ` Jonathan Nieder
2014-07-18 19:14 ` René Scharfe
2014-07-18 23:57 ` Jeff King
2014-07-19 12:11 ` René Scharfe
2014-07-19 16:43 ` brian m. carlson
2014-07-19 17:53 ` René Scharfe [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=53CAB080.8060005@web.de \
--to=l.s.r@web.de \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=jrnieder@gmail.com \
--cc=peff@peff.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.