* [PATCH v5 0/2] cleanup duplicate name_compare() functions @ 2014-06-20 2:06 Jeremiah Mahler 2014-06-20 2:06 ` [PATCH v5 1/2] name-hash.c: replace cache_name_compare() with memcmp() Jeremiah Mahler 2014-06-20 2:06 ` [PATCH v5 2/2] cleanup duplicate name_compare() functions Jeremiah Mahler 0 siblings, 2 replies; 5+ messages in thread From: Jeremiah Mahler @ 2014-06-20 2:06 UTC (permalink / raw) To: Junio C Hamano; +Cc: Jonathan Nieder, git, Jeremiah Mahler Version 5 of the patch series to cleanup the duplicate name_compare() functions. - name-hash.c had a call to cache_name_compare() but it required that the lengths were equal. Since cache_name_compare() is equivalent to memcmp() when the lengths are equal, replace it with memcmp(). This avoids renaming cache_name_compare() to name_compare() in a later patch. - Cleanup of log message by Junio C Humano. Jeremiah Mahler (2): name-hash.c: replace cache_name_compare() with memcmp() cleanup duplicate name_compare() functions cache.h | 2 +- dir.c | 3 +-- name-hash.c | 2 +- read-cache.c | 23 +++++++++++++---------- tree-walk.c | 10 ---------- unpack-trees.c | 11 ----------- 6 files changed, 16 insertions(+), 35 deletions(-) -- 2.0.0.694.g5736dad ^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH v5 1/2] name-hash.c: replace cache_name_compare() with memcmp() 2014-06-20 2:06 [PATCH v5 0/2] cleanup duplicate name_compare() functions Jeremiah Mahler @ 2014-06-20 2:06 ` Jeremiah Mahler 2014-06-20 17:10 ` Junio C Hamano 2014-06-20 2:06 ` [PATCH v5 2/2] cleanup duplicate name_compare() functions Jeremiah Mahler 1 sibling, 1 reply; 5+ messages in thread From: Jeremiah Mahler @ 2014-06-20 2:06 UTC (permalink / raw) To: Junio C Hamano; +Cc: Jonathan Nieder, git, Jeremiah Mahler When cache_name_compare() is used on counted strings of the same length, it is equivalent to a memcmp(). Since the one use of cache_name_compare() in name-hash.c requires that the lengths are equal, just replace it with memcmp(). Signed-off-by: Jeremiah Mahler <jmmahler@gmail.com> --- name-hash.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/name-hash.c b/name-hash.c index be7c4ae..63cc188 100644 --- a/name-hash.c +++ b/name-hash.c @@ -179,7 +179,7 @@ static int same_name(const struct cache_entry *ce, const char *name, int namelen * Always do exact compare, even if we want a case-ignoring comparison; * we do the quick exact one first, because it will be the common case. */ - if (len == namelen && !cache_name_compare(name, namelen, ce->name, len)) + if (len == namelen && !memcmp(name, ce->name, len)) return 1; if (!icase) -- 2.0.0.694.g5736dad ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH v5 1/2] name-hash.c: replace cache_name_compare() with memcmp() 2014-06-20 2:06 ` [PATCH v5 1/2] name-hash.c: replace cache_name_compare() with memcmp() Jeremiah Mahler @ 2014-06-20 17:10 ` Junio C Hamano 0 siblings, 0 replies; 5+ messages in thread From: Junio C Hamano @ 2014-06-20 17:10 UTC (permalink / raw) To: Jeremiah Mahler; +Cc: Jonathan Nieder, git Jeremiah Mahler <jmmahler@gmail.com> writes: > When cache_name_compare() is used on counted strings of the same > length, it is equivalent to a memcmp(). Since the one use of > cache_name_compare() in name-hash.c requires that the lengths are > equal, just replace it with memcmp(). I do not think it is not "requires that the lengths are equal"; it merely is a premature optimization, as it wants to catch only the case where the names are the same. Your patch is not wrong per-se, but with the above justification of yours, I would actually have expected to see it updated to use !cache_name_compare() and then later !name_compare(). That way, if it ever turns out that giving name_compare() semantics specific to "name" (as opposed to just byte-for-byte comparison given by memcmp(3)) is a good idea, we will use that comparison with semantics specific to "name"s here, without having to change it from memcmp(). Having said all that, I think we see a more correct justification for this change in the pre-context of the patch. We want the exact comparison, without any funky "name"-specific semantics in the quick-and-exact case. I've queued it like this (no need to reroll). Thanks. name-hash.c: replace cache_name_compare() with memcmp(3) The same_name() private function wants a quick-and-exact check to see if they two names are byte-for-byte identical first and then fall back to the slow path. Use memcmp(3) for the former to make it clear that we do not want any "name" specific comparison. Signed-off-by: Jeremiah Mahler <jmmahler@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> diff --git a/name-hash.c b/name-hash.c index 97444d0..49fd508 100644 --- a/name-hash.c +++ b/name-hash.c @@ -179,7 +179,7 @@ static int same_name(const struct cache_entry *ce, const char *name, int namelen * Always do exact compare, even if we want a case-ignoring comparison; * we do the quick exact one first, because it will be the common case. */ - if (len == namelen && !cache_name_compare(name, namelen, ce->name, len)) + if (len == namelen && !memcmp(name, ce->name, len)) return 1; if (!icase) ^ permalink raw reply related [flat|nested] 5+ messages in thread
* [PATCH v5 2/2] cleanup duplicate name_compare() functions 2014-06-20 2:06 [PATCH v5 0/2] cleanup duplicate name_compare() functions Jeremiah Mahler 2014-06-20 2:06 ` [PATCH v5 1/2] name-hash.c: replace cache_name_compare() with memcmp() Jeremiah Mahler @ 2014-06-20 2:06 ` Jeremiah Mahler 2014-06-20 17:15 ` Junio C Hamano 1 sibling, 1 reply; 5+ messages in thread From: Jeremiah Mahler @ 2014-06-20 2:06 UTC (permalink / raw) To: Junio C Hamano; +Cc: Jonathan Nieder, git, Jeremiah Mahler We often represent our strings as a counted string, i.e. a pair of the pointer to the beginning of the string and its length, and the string may not be NUL terminated to that length. To compare a pair of such counted strings, unpack-trees.c and read-cache.c implement their own name_compare() functions identically. In addition, the cache_name_compare() function in read-cache.c is nearly identical. The only difference is when one string is the prefix of the other string, in which case the former returns -1/+1 to show which one is longer and the latter returns the difference of the lengths to show the same information. Unify these three functions by using the implementation from cache_name_compare(). This does not make any difference to the existing and future callers, as they must be paying attention only to the sign of the returned value (and not the magnitude) because the original implementations of these two functions return values returned by memcmp(3) when the one string is not a prefix of the other string, and the only thing memcmp(3) guarantees its callers is the sign of the returned value, not the magnitude. Signed-off-by: Jeremiah Mahler <jmmahler@gmail.com> --- cache.h | 2 +- dir.c | 3 +-- read-cache.c | 23 +++++++++++++---------- tree-walk.c | 10 ---------- unpack-trees.c | 11 ----------- 5 files changed, 15 insertions(+), 34 deletions(-) diff --git a/cache.h b/cache.h index c498a30..e3205fe 100644 --- a/cache.h +++ b/cache.h @@ -1027,7 +1027,7 @@ extern int validate_headref(const char *ref); extern int base_name_compare(const char *name1, int len1, int mode1, const char *name2, int len2, int mode2); extern int df_name_compare(const char *name1, int len1, int mode1, const char *name2, int len2, int mode2); -extern int cache_name_compare(const char *name1, int len1, const char *name2, int len2); +extern int name_compare(const char *name1, size_t len1, const char *name2, size_t len2); extern int cache_name_stage_compare(const char *name1, int len1, int stage1, const char *name2, int len2, int stage2); extern void *read_object_with_reference(const unsigned char *sha1, diff --git a/dir.c b/dir.c index 797805d..e65888d 100644 --- a/dir.c +++ b/dir.c @@ -1354,8 +1354,7 @@ static int cmp_name(const void *p1, const void *p2) const struct dir_entry *e1 = *(const struct dir_entry **)p1; const struct dir_entry *e2 = *(const struct dir_entry **)p2; - return cache_name_compare(e1->name, e1->len, - e2->name, e2->len); + return name_compare(e1->name, e1->len, e2->name, e2->len); } static struct path_simplify *create_simplify(const char **pathspec) diff --git a/read-cache.c b/read-cache.c index 9f56d76..158241d 100644 --- a/read-cache.c +++ b/read-cache.c @@ -434,18 +434,26 @@ int df_name_compare(const char *name1, int len1, int mode1, return c1 - c2; } -int cache_name_stage_compare(const char *name1, int len1, int stage1, const char *name2, int len2, int stage2) +int name_compare(const char *name1, size_t len1, const char *name2, size_t len2) { - int len = len1 < len2 ? len1 : len2; - int cmp; - - cmp = memcmp(name1, name2, len); + size_t min_len = (len1 < len2) ? len1 : len2; + int cmp = memcmp(name1, name2, min_len); if (cmp) return cmp; if (len1 < len2) return -1; if (len1 > len2) return 1; + return 0; +} + +int cache_name_stage_compare(const char *name1, int len1, int stage1, const char *name2, int len2, int stage2) +{ + int cmp; + + cmp = name_compare(name1, len1, name2, len2); + if (cmp) + return cmp; if (stage1 < stage2) return -1; @@ -454,11 +462,6 @@ int cache_name_stage_compare(const char *name1, int len1, int stage1, const char return 0; } -int cache_name_compare(const char *name1, int len1, const char *name2, int len2) -{ - return cache_name_stage_compare(name1, len1, 0, name2, len2, 0); -} - static int index_name_stage_pos(const struct index_state *istate, const char *name, int namelen, int stage) { int first, last; diff --git a/tree-walk.c b/tree-walk.c index 4dc86c7..5dd9a71 100644 --- a/tree-walk.c +++ b/tree-walk.c @@ -144,16 +144,6 @@ struct tree_desc_x { struct tree_desc_skip *skip; }; -static int name_compare(const char *a, int a_len, - const char *b, int b_len) -{ - int len = (a_len < b_len) ? a_len : b_len; - int cmp = memcmp(a, b, len); - if (cmp) - return cmp; - return (a_len - b_len); -} - static int check_entry_match(const char *a, int a_len, const char *b, int b_len) { /* diff --git a/unpack-trees.c b/unpack-trees.c index 4a9cdf2..c4a97ca 100644 --- a/unpack-trees.c +++ b/unpack-trees.c @@ -629,17 +629,6 @@ static int unpack_failed(struct unpack_trees_options *o, const char *message) return -1; } -/* NEEDSWORK: give this a better name and share with tree-walk.c */ -static int name_compare(const char *a, int a_len, - const char *b, int b_len) -{ - int len = (a_len < b_len) ? a_len : b_len; - int cmp = memcmp(a, b, len); - if (cmp) - return cmp; - return (a_len - b_len); -} - /* * The tree traversal is looking at name p. If we have a matching entry, * return it. If name p is a directory in the index, do not return -- 2.0.0.694.g5736dad ^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH v5 2/2] cleanup duplicate name_compare() functions 2014-06-20 2:06 ` [PATCH v5 2/2] cleanup duplicate name_compare() functions Jeremiah Mahler @ 2014-06-20 17:15 ` Junio C Hamano 0 siblings, 0 replies; 5+ messages in thread From: Junio C Hamano @ 2014-06-20 17:15 UTC (permalink / raw) To: Jeremiah Mahler; +Cc: Jonathan Nieder, git Jeremiah Mahler <jmmahler@gmail.com> writes: > We often represent our strings as a counted string, i.e. a pair of the > pointer to the beginning of the string and its length, and the string > may not be NUL terminated to that length. > > To compare a pair of such counted strings, unpack-trees.c and > read-cache.c implement their own name_compare() functions identically. > In addition, the cache_name_compare() function in read-cache.c is nearly > identical. The only difference is when one string is the prefix of the > other string, in which case the former returns -1/+1 to show which one > is longer and the latter returns the difference of the lengths to show > the same information. I think I got the former/latter swapped by mistake when I wrote this (two name_compare() give us the difference, and cache_name_compare() gives -1/+1); I'll spell their names out when I queue this patch. Thanks. ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2014-06-25 21:30 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2014-06-20 2:06 [PATCH v5 0/2] cleanup duplicate name_compare() functions Jeremiah Mahler 2014-06-20 2:06 ` [PATCH v5 1/2] name-hash.c: replace cache_name_compare() with memcmp() Jeremiah Mahler 2014-06-20 17:10 ` Junio C Hamano 2014-06-20 2:06 ` [PATCH v5 2/2] cleanup duplicate name_compare() functions Jeremiah Mahler 2014-06-20 17:15 ` Junio C Hamano
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).