From: Junio C Hamano <gitster@pobox.com>
To: Ben Peart <peartben@gmail.com>
Cc: git@vger.kernel.org, pclouds@gmail.com,
Ben Peart <benpeart@microsoft.com>
Subject: Re: [PATCH v1] speed up refresh_index() by utilizing preload_index()
Date: Tue, 30 Oct 2018 11:28:28 +0900 [thread overview]
Message-ID: <xmqq7ei02mnn.fsf@gitster-ct.c.googlers.com> (raw)
In-Reply-To: <20181029204159.18208-1-peartben@gmail.com> (Ben Peart's message of "Mon, 29 Oct 2018 16:41:59 -0400")
Ben Peart <peartben@gmail.com> writes:
> From: Ben Peart <benpeart@microsoft.com>
>
> Speed up refresh_index() by utilizing preload_index() to do most of the work
> spread across multiple threads. This works because most cache entries will
> get marked CE_UPTODATE so that refresh_cache_ent() can bail out early when
> called from within refresh_index().
>
> On a Windows repo with ~200K files, this drops refresh times from 6.64
> seconds to 2.87 seconds for a savings of 57%.
>
> Signed-off-by: Ben Peart <benpeart@microsoft.com>
> ---
OK. We used to only expose the whole "read the index file into an
istate, and then do the lstat() part in parallel", but now we also
make the "do the lstat() part" available separately.
Which makes sense.
> diff --git a/cache.h b/cache.h
> index f7fabdde8f..883099db08 100644
> --- a/cache.h
> +++ b/cache.h
> @@ -659,6 +659,9 @@ extern int daemonize(void);
> /* Initialize and use the cache information */
> struct lock_file;
> extern int read_index(struct index_state *);
> +extern void preload_index(struct index_state *index,
> + const struct pathspec *pathspec,
> + unsigned int refresh_flags);
> extern int read_index_preload(struct index_state *,
> const struct pathspec *pathspec,
> unsigned int refresh_flags);
> diff --git a/preload-index.c b/preload-index.c
> index 9e7152ab14..222792ccbc 100644
> --- a/preload-index.c
> +++ b/preload-index.c
> @@ -9,7 +9,7 @@
> #include "progress.h"
>
> #ifdef NO_PTHREADS
> -static void preload_index(struct index_state *index,
> +void preload_index(struct index_state *index,
> const struct pathspec *pathspec,
> unsigned int refresh_flags)
> {
> @@ -100,9 +100,9 @@ static void *preload_thread(void *_data)
> return NULL;
> }
>
> -static void preload_index(struct index_state *index,
> - const struct pathspec *pathspec,
> - unsigned int refresh_flags)
> +void preload_index(struct index_state *index,
> + const struct pathspec *pathspec,
> + unsigned int refresh_flags)
> {
> int threads, i, work, offset;
> struct thread_data data[MAX_PARALLEL];
> diff --git a/read-cache.c b/read-cache.c
> index d57958233e..53733d651d 100644
> --- a/read-cache.c
> +++ b/read-cache.c
> @@ -1496,6 +1496,12 @@ int refresh_index(struct index_state *istate, unsigned int flags,
> typechange_fmt = (in_porcelain ? "T\t%s\n" : "%s needs update\n");
> added_fmt = (in_porcelain ? "A\t%s\n" : "%s needs update\n");
> unmerged_fmt = (in_porcelain ? "U\t%s\n" : "%s: needs merge\n");
> + /*
> + * Use the multi-threaded preload_index() to refresh most of the
> + * cache entries quickly then in the single threaded loop below,
> + * we only have to do the special cases that are left.
> + */
> + preload_index(istate, pathspec, 0);
> for (i = 0; i < istate->cache_nr; i++) {
> struct cache_entry *ce, *new_entry;
> int cache_errno = 0;
>
> base-commit: c670b1f876521c9f7cd40184bf7ed05aad843433
prev parent reply other threads:[~2018-10-30 2:28 UTC|newest]
Thread overview: 2+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-10-29 20:41 [PATCH v1] speed up refresh_index() by utilizing preload_index() Ben Peart
2018-10-30 2:28 ` Junio C Hamano [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=xmqq7ei02mnn.fsf@gitster-ct.c.googlers.com \
--to=gitster@pobox.com \
--cc=benpeart@microsoft.com \
--cc=git@vger.kernel.org \
--cc=pclouds@gmail.com \
--cc=peartben@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).