From: "René Scharfe" <l.s.r@web.de>
To: Jeff King <peff@peff.net>, Jonathan Nieder <jrnieder@gmail.com>,
Git Mailing List <git@vger.kernel.org>,
Junio C Hamano <gitster@pobox.com>
Subject: Re: [PATCH] fast-import: use hashcmp() for SHA1 hash comparison
Date: Sat, 19 Jul 2014 19:53:04 +0200 [thread overview]
Message-ID: <53CAB080.8060005@web.de> (raw)
In-Reply-To: <20140719164324.GA5616@vauxhall.crustytoothpaste.net>
Am 19.07.2014 18:43, schrieb brian m. carlson:
> On Sat, Jul 19, 2014 at 02:11:30PM +0200, René Scharfe wrote:
>> I'd say if a platform doesn't bother optimizing memcmp() then they
>> deserve the resulting performance. And it's probably not too bad a
>> penalty because such comparisons probably won't make up a significant
>> part of most applications.
>
> I tend to agree with this. On many modern versions of GCC, the compiler
> can generate an appropriately optimized inline version when it sees a
> memcmp call, so it's more of a compiler issue then, since no actual call
> to the function will be emitted.
I just found this open GCC bug entry about glibc memcmp being faster
than the inlined version of the compiler:
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43052.
(Found through
http://randomascii.wordpress.com/2012/10/31/comparing-memory-is-still-tricky/,
which says that the compilers coming with Microsoft Visual Studio 2010
and 2012 are not optimizing memcmp() as much as they could as well.)
>> static inline int hashcmp(const unsigned char *sha1, const unsigned char *sha2)
>> {
>> + const uint32_t *p1 = (const uint32_t *)sha1;
>> + const uint32_t *p2 = (const uint32_t *)sha2;
>
> You can't make this cast. The guaranteed alignment for sha1 and sha2 is
> 1, and for p1 and p2, it's 4. If sha1 and sha2 are not suitably
> aligned, this will get a SIGBUS on sparc and possibly a wrong value on
> ARM[0].
>
> [0] http://www.aleph1.co.uk/chapter-10-arm-structured-alignment-faq
Yeah, it was just a test balloon that happens to work on amd64. We
could invent a hash type with correct alignment (a struct with a
uint32_t[5] member?) and replace all those unsigned char pointers if we
wanted to go with such a "vectorized" hashcmp, but that would be
maximally invasive.
René
prev parent reply other threads:[~2014-07-19 17:54 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-07-18 16:00 [PATCH] fast-import: use hashcmp() for SHA1 hash comparison René Scharfe
2014-07-18 18:42 ` Jonathan Nieder
2014-07-18 19:14 ` René Scharfe
2014-07-18 23:57 ` Jeff King
2014-07-19 12:11 ` René Scharfe
2014-07-19 16:43 ` brian m. carlson
2014-07-19 17:53 ` René Scharfe [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=53CAB080.8060005@web.de \
--to=l.s.r@web.de \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=jrnieder@gmail.com \
--cc=peff@peff.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).