From: Ben Peart <peartben@gmail.com>
To: Christian Couder <christian.couder@gmail.com>
Cc: git <git@vger.kernel.org>, "Junio C Hamano" <gitster@pobox.com>,
"Ben Peart" <benpeart@microsoft.com>,
"Nguyen Thai Ngoc Duy" <pclouds@gmail.com>,
"Johannes Schindelin" <johannes.schindelin@gmx.de>,
"David Turner" <David.Turner@twosigma.com>,
"Jeff King" <peff@peff.net>,
"Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
Subject: Re: [PATCH v5 3/7] fsmonitor: teach git to optionally utilize a file system monitor to speed up detecting new or changed files.
Date: Mon, 3 Jul 2017 17:25:32 -0400 [thread overview]
Message-ID: <6b05b590-2fda-31e4-e07e-bda10f510cc4@gmail.com> (raw)
In-Reply-To: <CAP8UFD1kL9i93HnnigYcB9U8u2dUijHj5djey5k=33VrFaWUBw@mail.gmail.com>
On 6/27/2017 11:43 AM, Christian Couder wrote:
> On Sat, Jun 10, 2017 at 3:40 PM, Ben Peart <peartben@gmail.com> wrote:
>
>> +int read_fsmonitor_extension(struct index_state *istate, const void *data,
>> + unsigned long sz)
>> +{
>> + const char *index = data;
>> + uint32_t hdr_version;
>> + uint32_t ewah_size;
>> + int ret;
>> +
>> + if (sz < sizeof(uint32_t) + sizeof(uint64_t) + sizeof(uint32_t))
>> + return error("corrupt fsmonitor extension (too short)");
>> +
>> + hdr_version = get_be32(index);
>
> Here we use get_be32(), ...
>
>> + index += sizeof(uint32_t);
>> + if (hdr_version != INDEX_EXTENSION_VERSION)
>> + return error("bad fsmonitor version %d", hdr_version);
>> +
>> + istate->fsmonitor_last_update = get_be64(index);
>
> ...get_be64(), ...
>
>> + index += sizeof(uint64_t);
>> +
>> + ewah_size = get_be32(index);
>
> ... and get_be32 again, ...
>
>> + index += sizeof(uint32_t);
>> +
>> + istate->fsmonitor_dirty = ewah_new();
>> + ret = ewah_read_mmap(istate->fsmonitor_dirty, index, ewah_size);
>> + if (ret != ewah_size) {
>> + ewah_free(istate->fsmonitor_dirty);
>> + istate->fsmonitor_dirty = NULL;
>> + return error("failed to parse ewah bitmap reading fsmonitor index extension");
>> + }
>> +
>> + return 0;
>> +}
>> +
>> +void write_fsmonitor_extension(struct strbuf *sb, struct index_state *istate)
>> +{
>> + uint32_t hdr_version;
>> + uint64_t tm;
>> + struct ewah_bitmap *bitmap;
>> + int i;
>> + uint32_t ewah_start;
>> + uint32_t ewah_size = 0;
>> + int fixup = 0;
>> +
>> + hdr_version = htonl(INDEX_EXTENSION_VERSION);
>
> ... but here we use htonl() instead of put_be32(), ...
>
>> + strbuf_add(sb, &hdr_version, sizeof(uint32_t));
>> +
>> + tm = htonll((uint64_t)istate->fsmonitor_last_update);
>
> ... htonll(), ...
>
>> + strbuf_add(sb, &tm, sizeof(uint64_t));
>> + fixup = sb->len;
>> + strbuf_add(sb, &ewah_size, sizeof(uint32_t)); /* we'll fix this up later */
>> +
>> + ewah_start = sb->len;
>> + bitmap = ewah_new();
>> + for (i = 0; i < istate->cache_nr; i++)
>> + if (istate->cache[i]->ce_flags & CE_FSMONITOR_DIRTY)
>> + ewah_set(bitmap, i);
>> + ewah_serialize_strbuf(bitmap, sb);
>> + ewah_free(bitmap);
>> +
>> + /* fix up size field */
>> + ewah_size = htonl(sb->len - ewah_start);
>
> ... and htonl() again.
>
> It would be more consistent (and perhaps more correct) to use
> put_beXX() functions, instead of the htonl() and htonll() functions.
I agree that these are asymmetric. I followed the pattern used in the
untracked cache in which write_untracked_extension uses htonl and
read_untracked_extension uses get_be32. I can change this to be more
symmetric.
>
>> + memcpy(sb->buf + fixup, &ewah_size, sizeof(uint32_t));
>> +}
>
>> +/*
>> + * Call the query-fsmonitor hook passing the time of the last saved results.
>> + */
>> +static int query_fsmonitor(int version, uint64_t last_update, struct strbuf *query_result)
>> +{
>> + struct child_process cp = CHILD_PROCESS_INIT;
>> + char ver[64];
>> + char date[64];
>> + const char *argv[4];
>> +
>> + if (!(argv[0] = find_hook("query-fsmonitor")))
>> + return -1;
>> +
>> + snprintf(ver, sizeof(version), "%d", version);
>> + snprintf(date, sizeof(date), "%" PRIuMAX, (uintmax_t)last_update);
>> + argv[1] = ver;
>> + argv[2] = date;
>> + argv[3] = NULL;
>> + cp.argv = argv;
>
> Maybe it would be nicer using the argv_array_pushX() functions.
When the number of arguments is fixed and known, I prefer avoiding the
dynamic allocations that come along with the argv_array_pushX() functions.
>
>> + cp.out = -1;
>> +
>> + return capture_command(&cp, query_result, 1024);
>> +}
>> +
>> +static void mark_file_dirty(struct index_state *istate, const char *name)
>> +{
>> + struct untracked_cache_dir *dir;
>> + int pos;
>> +
>> + /* find it in the index and mark that entry as dirty */
>> + pos = index_name_pos(istate, name, strlen(name));
>> + if (pos >= 0) {
>> + if (!(istate->cache[pos]->ce_flags & CE_FSMONITOR_DIRTY)) {
>> + istate->cache[pos]->ce_flags |= CE_FSMONITOR_DIRTY;
>> + istate->cache_changed |= FSMONITOR_CHANGED;
>> + }
>> + }
>> +
>> + /*
>> + * Find the corresponding directory in the untracked cache
>> + * and mark it as invalid
>> + */
>> + if (!istate->untracked || !istate->untracked->root)
>> + return;
>> +
>> + dir = find_untracked_cache_dir(istate->untracked, istate->untracked->root, name);
>> + if (dir) {
>> + if (dir->valid) {
>> + dir->valid = 0;
>> + istate->cache_changed |= FSMONITOR_CHANGED;
>> + }
>> + }
>
> The code above is quite similar as what is in mark_fsmonitor_dirty(),
> so I wonder if a refactoring is possible.
I've felt the same way and looked at how to refactor it better a number
of times but never came up with a way that made it simpler, clearer and
easier to maintain. I'm happy to review a patch if you can figure out
something better. :)
>
>> +}
>> +
>> +void refresh_by_fsmonitor(struct index_state *istate)
>> +{
>> + static int has_run_once = 0;
>> + struct strbuf query_result = STRBUF_INIT;
>> + int query_success = 0;
>> + size_t bol = 0; /* beginning of line */
>> + uint64_t last_update;
>> + char *buf, *entry;
>> + int i;
>> +
>> + if (!core_fsmonitor || has_run_once)
>> + return;
>> + has_run_once = 1;
>> +
>> + /*
>> + * This could be racy so save the date/time now and the hook
>> + * should be inclusive to ensure we don't miss potential changes.
>> + */
>> + last_update = getnanotime();
>> +
>> + /*
>> + * If we have a last update time, call query-monitor for the set of
>> + * changes since that time.
>> + */
>> + if (istate->fsmonitor_last_update) {
>> + query_success = !query_fsmonitor(HOOK_INTERFACE_VERSION,
>> + istate->fsmonitor_last_update, &query_result);
>> + trace_performance_since(last_update, "query-fsmonitor");
>> + }
>> +
>> + if (query_success) {
>> + /* Mark all entries returned by the monitor as dirty */
>> + buf = entry = query_result.buf;
>> + for (i = 0; i < query_result.len; i++) {
>> + if (buf[i] != '\0')
>> + continue;
>> + mark_file_dirty(istate, buf + bol);
>
> It looks like bol is always equal to i here...
>
>> + bol = i + 1;
>> + }
>> + if (bol < query_result.len)
>> + mark_file_dirty(istate, buf + bol);
>
> ... and here too. As it is not used below, I wonder if you really need
> the bol variable.
bol saves the position of the beginning of the current line while i
iterates through the remainder of the string looking for the NULL.
However, the entry variable is no longer used so I will remove it.
>
>> + /* Mark all clean entries up-to-date */
>> + for (i = 0; i < istate->cache_nr; i++) {
>> + struct cache_entry *ce = istate->cache[i];
>> + if (ce_stage(ce) || (ce->ce_flags & CE_FSMONITOR_DIRTY))
>> + continue;
>> + ce_mark_uptodate(ce);
>> + }
next prev parent reply other threads:[~2017-07-03 21:25 UTC|newest]
Thread overview: 137+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-06-10 13:40 [PATCH v5 0/7] Fast git status via a file system watcher Ben Peart
2017-06-10 13:40 ` [PATCH v5 1/7] bswap: add 64 bit endianness helper get_be64 Ben Peart
2017-06-10 13:40 ` [PATCH v5 2/7] dir: make lookup_untracked() available outside of dir.c Ben Peart
2017-06-10 13:40 ` [PATCH v5 3/7] fsmonitor: teach git to optionally utilize a file system monitor to speed up detecting new or changed files Ben Peart
2017-06-27 15:43 ` Christian Couder
2017-07-03 21:25 ` Ben Peart [this message]
2017-06-10 13:40 ` [PATCH v5 4/7] fsmonitor: add test cases for fsmonitor extension Ben Peart
2017-06-27 16:20 ` Christian Couder
2017-07-07 18:50 ` Ben Peart
2017-06-10 13:40 ` [PATCH v5 5/7] fsmonitor: add documentation for the " Ben Peart
2017-06-10 13:40 ` [PATCH v5 6/7] fsmonitor: add a sample query-fsmonitor hook script for Watchman Ben Peart
2017-06-10 13:40 ` [PATCH v5 7/7] fsmonitor: add a performance test Ben Peart
2017-06-10 14:04 ` Ben Peart
2017-06-12 22:04 ` Junio C Hamano
2017-06-14 14:12 ` Ben Peart
2017-06-14 18:36 ` Junio C Hamano
2017-07-07 18:14 ` Ben Peart
2017-07-07 18:35 ` Junio C Hamano
2017-07-07 19:07 ` Ben Peart
2017-07-07 19:33 ` David Turner
2017-07-08 7:19 ` Christian Couder
2017-06-28 5:11 ` [PATCH v5 0/7] Fast git status via a file system watcher Christian Couder
2017-07-10 13:36 ` Ben Peart
2017-07-10 14:40 ` Ben Peart
2017-09-15 19:20 ` [PATCH v6 00/12] " Ben Peart
2017-09-15 19:20 ` [PATCH v6 01/12] bswap: add 64 bit endianness helper get_be64 Ben Peart
2017-09-15 19:20 ` [PATCH v6 02/12] preload-index: add override to enable testing preload-index Ben Peart
2017-09-15 19:20 ` [PATCH v6 03/12] update-index: add a new --force-write-index option Ben Peart
2017-09-15 19:20 ` [PATCH v6 04/12] fsmonitor: teach git to optionally utilize a file system monitor to speed up detecting new or changed files Ben Peart
2017-09-15 21:35 ` David Turner
2017-09-18 13:07 ` Ben Peart
2017-09-18 13:32 ` David Turner
2017-09-18 13:49 ` Ben Peart
2017-09-15 19:20 ` [PATCH v6 05/12] fsmonitor: add documentation for the fsmonitor extension Ben Peart
2017-09-15 19:43 ` David Turner
2017-09-18 13:27 ` Ben Peart
2017-09-17 8:03 ` Junio C Hamano
2017-09-18 13:29 ` Ben Peart
2017-09-15 19:20 ` [PATCH v6 06/12] ls-files: Add support in ls-files to display the fsmonitor valid bit Ben Peart
2017-09-15 20:34 ` David Turner
2017-09-15 19:20 ` [PATCH v6 07/12] update-index: add fsmonitor support to update-index Ben Peart
2017-09-15 19:20 ` [PATCH v6 08/12] fsmonitor: add a test tool to dump the index extension Ben Peart
2017-09-17 8:02 ` Junio C Hamano
2017-09-18 13:38 ` Ben Peart
2017-09-18 15:43 ` Torsten Bögershausen
2017-09-18 16:28 ` Ben Peart
2017-09-19 14:16 ` Torsten Bögershausen
2017-09-19 15:36 ` Ben Peart
2017-09-15 19:20 ` [PATCH v6 09/12] split-index: disable the fsmonitor extension when running the split index test Ben Peart
2017-09-19 20:43 ` Jonathan Nieder
2017-09-20 17:11 ` Ben Peart
2017-09-20 17:46 ` Jonathan Nieder
2017-09-21 0:05 ` Ben Peart
2017-09-15 19:20 ` [PATCH v6 10/12] fsmonitor: add test cases for fsmonitor extension Ben Peart
2017-09-15 22:00 ` David Turner
2017-09-19 19:32 ` David Turner
2017-09-19 20:30 ` Ben Peart
2017-09-16 15:27 ` Torsten Bögershausen
2017-09-17 5:43 ` [PATCH v1 1/1] test-lint: echo -e (or -E) is not portable tboegi
2017-09-19 20:37 ` Jonathan Nieder
2017-09-20 13:49 ` Torsten Bögershausen
2017-09-22 1:04 ` Junio C Hamano
2017-09-18 14:06 ` [PATCH v6 10/12] fsmonitor: add test cases for fsmonitor extension Ben Peart
2017-09-17 4:47 ` Junio C Hamano
2017-09-18 15:25 ` Ben Peart
2017-09-19 20:34 ` Jonathan Nieder
2017-09-15 19:20 ` [PATCH v6 11/12] fsmonitor: add a sample integration script for Watchman Ben Peart
2017-09-15 19:20 ` [PATCH v6 12/12] fsmonitor: add a performance test Ben Peart
2017-09-15 21:56 ` David Turner
2017-09-18 14:24 ` Johannes Schindelin
2017-09-18 18:19 ` Ben Peart
2017-09-19 15:28 ` Johannes Schindelin
2017-09-19 19:27 ` [PATCH v7 00/12] Fast git status via a file system watcher Ben Peart
2017-09-19 19:27 ` [PATCH v7 01/12] bswap: add 64 bit endianness helper get_be64 Ben Peart
2017-09-19 19:27 ` [PATCH v7 02/12] preload-index: add override to enable testing preload-index Ben Peart
2017-09-20 22:06 ` Stefan Beller
2017-09-21 0:02 ` Ben Peart
2017-09-21 0:44 ` Stefan Beller
2017-09-19 19:27 ` [PATCH v7 03/12] update-index: add a new --force-write-index option Ben Peart
2017-09-20 5:47 ` Junio C Hamano
2017-09-20 14:58 ` Ben Peart
2017-09-21 1:46 ` Junio C Hamano
2017-09-21 2:06 ` Ben Peart
2017-09-21 2:18 ` Junio C Hamano
2017-09-21 2:32 ` Junio C Hamano
2017-09-19 19:27 ` [PATCH v7 04/12] fsmonitor: teach git to optionally utilize a file system monitor to speed up detecting new or changed files Ben Peart
2017-09-20 2:28 ` Junio C Hamano
2017-09-20 16:19 ` Ben Peart
2017-09-21 2:00 ` Junio C Hamano
2017-09-21 2:24 ` Ben Peart
2017-09-21 14:35 ` Ben Peart
2017-09-22 1:02 ` Junio C Hamano
2017-09-20 6:23 ` Junio C Hamano
2017-09-20 16:29 ` Ben Peart
2017-09-19 19:27 ` [PATCH v7 05/12] fsmonitor: add documentation for the fsmonitor extension Ben Peart
2017-09-20 10:00 ` Martin Ågren
2017-09-20 17:02 ` Ben Peart
2017-09-20 17:11 ` Martin Ågren
2017-09-19 19:27 ` [PATCH v7 06/12] ls-files: Add support in ls-files to display the fsmonitor valid bit Ben Peart
2017-09-19 19:46 ` David Turner
2017-09-19 20:44 ` Ben Peart
2017-09-19 21:27 ` David Turner
2017-09-19 22:44 ` Ben Peart
2017-09-19 19:27 ` [PATCH v7 07/12] update-index: add fsmonitor support to update-index Ben Peart
2017-09-19 19:27 ` [PATCH v7 08/12] fsmonitor: add a test tool to dump the index extension Ben Peart
2017-09-19 19:27 ` [PATCH v7 09/12] split-index: disable the fsmonitor extension when running the split index test Ben Peart
2017-09-19 19:27 ` [PATCH v7 10/12] fsmonitor: add test cases for fsmonitor extension Ben Peart
2017-09-19 19:27 ` [PATCH v7 11/12] fsmonitor: add a sample integration script for Watchman Ben Peart
2017-09-19 19:27 ` [PATCH v7 12/12] fsmonitor: add a performance test Ben Peart
2017-09-22 16:35 ` [PATCH v8 00/12] Fast git status via a file system watcher Ben Peart
2017-09-22 16:35 ` [PATCH v8 01/12] bswap: add 64 bit endianness helper get_be64 Ben Peart
2017-09-22 23:37 ` Martin Ågren
2017-09-23 23:31 ` Ben Peart
2017-09-24 3:51 ` Jeff King
2017-09-24 3:52 ` Junio C Hamano
2017-09-22 16:35 ` [PATCH v8 02/12] preload-index: add override to enable testing preload-index Ben Peart
2017-09-22 16:35 ` [PATCH v8 03/12] update-index: add a new --force-write-index option Ben Peart
2017-09-22 16:35 ` [PATCH v8 04/12] fsmonitor: teach git to optionally utilize a file system monitor to speed up detecting new or changed files Ben Peart
2017-09-22 16:35 ` [PATCH v8 05/12] fsmonitor: add documentation for the fsmonitor extension Ben Peart
2017-09-22 16:35 ` [PATCH v8 06/12] ls-files: Add support in ls-files to display the fsmonitor valid bit Ben Peart
2017-09-22 16:35 ` [PATCH v8 07/12] update-index: add fsmonitor support to update-index Ben Peart
2017-09-22 16:35 ` [PATCH v8 08/12] fsmonitor: add a test tool to dump the index extension Ben Peart
2017-09-22 23:37 ` Martin Ågren
2017-09-23 23:33 ` Ben Peart
2017-09-24 3:51 ` Junio C Hamano
2017-09-22 16:35 ` [PATCH v8 09/12] split-index: disable the fsmonitor extension when running the split index test Ben Peart
2017-09-22 16:35 ` [PATCH v8 10/12] fsmonitor: add test cases for fsmonitor extension Ben Peart
2017-09-22 16:35 ` [PATCH v8 11/12] fsmonitor: add a sample integration script for Watchman Ben Peart
2017-09-22 16:35 ` [PATCH v8 12/12] fsmonitor: add a performance test Ben Peart
2017-09-29 2:20 ` [PATCH v8 00/12] Fast git status via a file system watcher Junio C Hamano
2017-09-29 12:07 ` Ben Peart
2017-10-01 8:24 ` Junio C Hamano
2017-10-03 19:48 ` Ben Peart
2017-10-04 2:09 ` Junio C Hamano
2017-10-04 6:38 ` Alex Vandiver
2017-10-04 12:48 ` Ben Peart
2017-10-04 12:27 ` Ben Peart
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=6b05b590-2fda-31e4-e07e-bda10f510cc4@gmail.com \
--to=peartben@gmail.com \
--cc=David.Turner@twosigma.com \
--cc=avarab@gmail.com \
--cc=benpeart@microsoft.com \
--cc=christian.couder@gmail.com \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=johannes.schindelin@gmx.de \
--cc=pclouds@gmail.com \
--cc=peff@peff.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).