From: "Victoria Dye via GitGitGadget" <gitgitgadget@gmail.com>
To: git@vger.kernel.org
Cc: Eric Sunshine <sunshine@sunshineco.com>,
Patrick Steinhardt <ps@pks.im>, Victoria Dye <vdye@github.com>,
Victoria Dye <vdye@github.com>
Subject: [PATCH v2 15/17] mktree: optionally add to an existing tree
Date: Wed, 19 Jun 2024 21:58:03 +0000 [thread overview]
Message-ID: <4b88f84b933b1598d12e3620f0c9fb85c559e8fb.1718834285.git.gitgitgadget@gmail.com> (raw)
In-Reply-To: <pull.1746.v2.git.1718834285.gitgitgadget@gmail.com>
From: Victoria Dye <vdye@github.com>
Allow users to specify a single "tree-ish" value as a positional argument.
If provided, the contents of the given tree serve as the basis for the new
tree (or trees, in --batch mode) created by 'mktree', on top of which all of
the stdin-provided tree entries are applied.
At a high level, the entries are "applied" to a base tree by iterating
through the base tree using 'read_tree' in parallel with iterating through
the sorted & deduplicated stdin entries via their iterator. That is, for
each call to the 'build_index_from_tree callback of 'read_tree':
* If the iterator entry precedes the base tree entry, add it to the in-core
index, increment the iterator, and repeat.
* If the iterator entry has the same name as the base tree entry, add the
iterator entry to the index, increment the iterator, and return from the
callback to continue the 'read_tree' iteration.
* If the iterator entry follows the base tree entry, first check
'df_name_hash' to ensure we won't be adding an entry with the same name
later (with a different mode). If there's no directory/file conflict, add
the base tree entry to the index. In either case, return from the callback
to continue the 'read_tree' iteration.
Finally, once 'read_tree' is complete, add the remaining entries in the
iterator to the index and write out the index as a tree.
Signed-off-by: Victoria Dye <vdye@github.com>
---
Documentation/git-mktree.txt | 7 +-
builtin/mktree.c | 138 +++++++++++++++++++++++++++++------
t/t1010-mktree.sh | 36 +++++++++
3 files changed, 159 insertions(+), 22 deletions(-)
diff --git a/Documentation/git-mktree.txt b/Documentation/git-mktree.txt
index cf1fd82f754..260d0e0bd7b 100644
--- a/Documentation/git-mktree.txt
+++ b/Documentation/git-mktree.txt
@@ -9,7 +9,7 @@ git-mktree - Build a tree-object from formatted tree entries
SYNOPSIS
--------
[verse]
-'git mktree' [-z] [--missing] [--literally] [--batch]
+'git mktree' [-z] [--missing] [--literally] [--batch] [<tree-ish>]
DESCRIPTION
-----------
@@ -41,6 +41,11 @@ OPTIONS
optional. Note - if the `-z` option is used, lines are terminated
with NUL.
+<tree-ish>::
+ If provided, the tree entries provided in stdin are added to this
+ tree rather than a new empty one, replacing existing entries with
+ identical names. Not compatible with `--literally`.
+
INPUT FORMAT
------------
Tree entries may be specified in any of the formats compatible with the
diff --git a/builtin/mktree.c b/builtin/mktree.c
index b4d71dcdd02..96f06547a2a 100644
--- a/builtin/mktree.c
+++ b/builtin/mktree.c
@@ -12,7 +12,9 @@
#include "read-cache-ll.h"
#include "strbuf.h"
#include "tree.h"
+#include "object-name.h"
#include "parse-options.h"
+#include "pathspec.h"
#include "object-store-ll.h"
struct tree_entry {
@@ -204,47 +206,124 @@ static void tree_entry_iterator_advance(struct tree_entry_iterator *iter)
: NULL;
}
-static int add_tree_entry_to_index(struct index_state *istate,
+struct build_index_data {
+ struct tree_entry_iterator iter;
+ struct hashmap *df_name_hash;
+ struct index_state istate;
+};
+
+static int add_tree_entry_to_index(struct build_index_data *data,
struct tree_entry *ent)
{
struct cache_entry *ce;
- struct strbuf ce_name = STRBUF_INIT;
- strbuf_add(&ce_name, ent->name, ent->len);
-
- ce = make_cache_entry(istate, ent->mode, &ent->oid, ent->name, 0, 0);
+ ce = make_cache_entry(&data->istate, ent->mode, &ent->oid, ent->name, 0, 0);
if (!ce)
return error(_("make_cache_entry failed for path '%s'"), ent->name);
- add_index_entry(istate, ce, ADD_CACHE_JUST_APPEND);
- strbuf_release(&ce_name);
+ add_index_entry(&data->istate, ce, ADD_CACHE_JUST_APPEND);
return 0;
}
-static void write_tree(struct tree_entry_array *arr, struct object_id *oid)
+static int build_index_from_tree(const struct object_id *oid,
+ struct strbuf *base, const char *filename,
+ unsigned mode, void *context)
{
- struct tree_entry_iterator iter = { NULL };
- struct index_state istate = INDEX_STATE_INIT(the_repository);
- istate.sparse_index = 1;
+ int result;
+ struct tree_entry *base_tree_ent;
+ struct build_index_data *cbdata = context;
+ size_t filename_len = strlen(filename);
+ size_t path_len = S_ISDIR(mode) ? st_add3(filename_len, base->len, 1)
+ : st_add(filename_len, base->len);
+
+ /* Create a tree entry from the current entry in read_tree iteration */
+ base_tree_ent = xcalloc(1, st_add3(sizeof(struct tree_entry), path_len, 1));
+ base_tree_ent->len = path_len;
+ base_tree_ent->mode = mode;
+ oidcpy(&base_tree_ent->oid, oid);
+
+ memcpy(base_tree_ent->name, base->buf, base->len);
+ memcpy(base_tree_ent->name + base->len, filename, filename_len);
+ if (S_ISDIR(mode))
+ base_tree_ent->name[base_tree_ent->len - 1] = '/';
+
+ while (cbdata->iter.current) {
+ struct tree_entry *ent = cbdata->iter.current;
+
+ int cmp = name_compare(ent->name, ent->len,
+ base_tree_ent->name, base_tree_ent->len);
+ if (!cmp || cmp < 0) {
+ tree_entry_iterator_advance(&cbdata->iter);
+
+ if (add_tree_entry_to_index(cbdata, ent) < 0) {
+ result = error(_("failed to add tree entry '%s'"), ent->name);
+ goto cleanup_and_return;
+ }
+
+ if (!cmp) {
+ result = 0;
+ goto cleanup_and_return;
+ } else
+ continue;
+ }
+
+ break;
+ }
+
+ /*
+ * If the tree entry should be replaced with an entry with the same name
+ * (but different mode), skip it.
+ */
+ hashmap_entry_init(&base_tree_ent->ent,
+ memhash(base_tree_ent->name, df_path_len(base_tree_ent->len, base_tree_ent->mode)));
+ if (hashmap_get_entry(cbdata->df_name_hash, base_tree_ent, ent, NULL)) {
+ result = 0;
+ goto cleanup_and_return;
+ }
+
+ if (add_tree_entry_to_index(cbdata, base_tree_ent)) {
+ result = -1;
+ goto cleanup_and_return;
+ }
+
+ result = 0;
+
+cleanup_and_return:
+ FREE_AND_NULL(base_tree_ent);
+ return result;
+}
+
+static void write_tree(struct tree_entry_array *arr, struct tree *base_tree,
+ struct object_id *oid)
+{
+ struct build_index_data cbdata = { 0 };
+ struct pathspec ps = { 0 };
sort_and_dedup_tree_entry_array(arr);
- tree_entry_iterator_init(&iter, arr);
+ index_state_init(&cbdata.istate, the_repository);
+ cbdata.istate.sparse_index = 1;
+ tree_entry_iterator_init(&cbdata.iter, arr);
+ cbdata.df_name_hash = &arr->df_name_hash;
/* Construct an in-memory index from the provided entries & base tree */
- while (iter.current) {
- struct tree_entry *ent = iter.current;
- tree_entry_iterator_advance(&iter);
+ if (base_tree &&
+ read_tree(the_repository, base_tree, &ps, build_index_from_tree, &cbdata) < 0)
+ die(_("failed to create tree"));
+
+ while (cbdata.iter.current) {
+ struct tree_entry *ent = cbdata.iter.current;
+ tree_entry_iterator_advance(&cbdata.iter);
- if (add_tree_entry_to_index(&istate, ent))
+ if (add_tree_entry_to_index(&cbdata, ent))
die(_("failed to add tree entry '%s'"), ent->name);
}
/* Write out new tree */
- if (cache_tree_update(&istate, WRITE_TREE_SILENT | WRITE_TREE_MISSING_OK))
+ if (cache_tree_update(&cbdata.istate, WRITE_TREE_SILENT | WRITE_TREE_MISSING_OK))
die(_("failed to write tree"));
- oidcpy(oid, &istate.cache_tree->oid);
+ oidcpy(oid, &cbdata.istate.cache_tree->oid);
- release_index(&istate);
+ release_index(&cbdata.istate);
}
static void write_tree_literally(struct tree_entry_array *arr,
@@ -268,7 +347,7 @@ static void write_tree_literally(struct tree_entry_array *arr,
}
static const char *mktree_usage[] = {
- "git mktree [-z] [--missing] [--literally] [--batch]",
+ "git mktree [-z] [--missing] [--literally] [--batch] [<tree-ish>]",
NULL
};
@@ -334,6 +413,7 @@ int cmd_mktree(int ac, const char **av, const char *prefix)
struct tree_entry_array arr = { 0 };
struct mktree_line_data mktree_line_data = { .arr = &arr };
struct strbuf line = STRBUF_INIT;
+ struct tree *base_tree = NULL;
int ret;
const struct option option[] = {
@@ -346,6 +426,22 @@ int cmd_mktree(int ac, const char **av, const char *prefix)
};
ac = parse_options(ac, av, prefix, option, mktree_usage, 0);
+ if (ac > 1)
+ usage_with_options(mktree_usage, option);
+
+ if (ac) {
+ struct object_id base_tree_oid;
+
+ if (mktree_line_data.literally)
+ die(_("option '%s' and tree-ish cannot be used together"), "--literally");
+
+ if (repo_get_oid(the_repository, av[0], &base_tree_oid))
+ die(_("not a valid object name %s"), av[0]);
+
+ base_tree = parse_tree_indirect(&base_tree_oid);
+ if (!base_tree)
+ die(_("not a tree object: %s"), oid_to_hex(&base_tree_oid));
+ }
tree_entry_array_init(&arr);
@@ -373,7 +469,7 @@ int cmd_mktree(int ac, const char **av, const char *prefix)
if (mktree_line_data.literally)
write_tree_literally(&arr, &oid);
else
- write_tree(&arr, &oid);
+ write_tree(&arr, base_tree, &oid);
puts(oid_to_hex(&oid));
fflush(stdout);
}
diff --git a/t/t1010-mktree.sh b/t/t1010-mktree.sh
index 08760141d6f..435ac23bd50 100755
--- a/t/t1010-mktree.sh
+++ b/t/t1010-mktree.sh
@@ -234,4 +234,40 @@ test_expect_success 'mktree with duplicate entries' '
test_cmp expect actual
'
+test_expect_success 'mktree with base tree' '
+ tree_oid=$(cat tree) &&
+ folder_oid=$(git rev-parse ${tree_oid}:folder) &&
+ before_oid=$(git rev-parse ${tree_oid}:before) &&
+ head_oid=$(git rev-parse HEAD) &&
+
+ {
+ printf "040000 tree $folder_oid\ttest\n" &&
+ printf "100644 blob $before_oid\ttest.txt\n" &&
+ printf "040000 tree $folder_oid\ttest-\n" &&
+ printf "160000 commit $head_oid\ttest0\n"
+ } >top.base &&
+ git mktree <top.base >tree.base &&
+
+ {
+ printf "100755 blob $before_oid\tz\n" &&
+ printf "160000 commit $head_oid\ttest.xyz\n" &&
+ printf "040000 tree $folder_oid\ta\n" &&
+ printf "100644 blob $before_oid\ttest\n"
+ } >top.append &&
+ git mktree $(cat tree.base) <top.append >tree.actual &&
+
+ {
+ printf "040000 tree $folder_oid\ta\n" &&
+ printf "100644 blob $before_oid\ttest\n" &&
+ printf "040000 tree $folder_oid\ttest-\n" &&
+ printf "100644 blob $before_oid\ttest.txt\n" &&
+ printf "160000 commit $head_oid\ttest.xyz\n" &&
+ printf "160000 commit $head_oid\ttest0\n" &&
+ printf "100755 blob $before_oid\tz\n"
+ } >expect &&
+ git ls-tree $(cat tree.actual) >actual &&
+
+ test_cmp expect actual
+'
+
test_done
--
gitgitgadget
next prev parent reply other threads:[~2024-06-19 21:58 UTC|newest]
Thread overview: 65+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-06-11 18:24 [PATCH 00/16] mktree: support more flexible usage Victoria Dye via GitGitGadget
2024-06-11 18:24 ` [PATCH 01/16] mktree: use OPT_BOOL Victoria Dye via GitGitGadget
2024-06-11 18:24 ` [PATCH 02/16] mktree: rename treeent to tree_entry Victoria Dye via GitGitGadget
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-11 18:24 ` [PATCH 03/16] mktree: use non-static tree_entry array Victoria Dye via GitGitGadget
2024-06-11 18:45 ` Eric Sunshine
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-11 18:24 ` [PATCH 04/16] update-index: generalize 'read_index_info' Victoria Dye via GitGitGadget
2024-06-11 22:45 ` Junio C Hamano
2024-06-11 18:24 ` [PATCH 05/16] index-info.c: identify empty input lines in read_index_info Victoria Dye via GitGitGadget
2024-06-11 22:52 ` Junio C Hamano
2024-06-18 17:33 ` Victoria Dye
2024-06-11 18:24 ` [PATCH 06/16] index-info.c: parse object type in provided " Victoria Dye via GitGitGadget
2024-06-12 1:54 ` Junio C Hamano
2024-06-11 18:24 ` [PATCH 07/16] mktree: use read_index_info to read stdin lines Victoria Dye via GitGitGadget
2024-06-12 2:11 ` Junio C Hamano
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-12 18:35 ` Junio C Hamano
2024-06-11 18:24 ` [PATCH 08/16] mktree: add a --literally option Victoria Dye via GitGitGadget
2024-06-12 2:18 ` Junio C Hamano
2024-06-11 18:24 ` [PATCH 09/16] mktree: validate paths more carefully Victoria Dye via GitGitGadget
2024-06-12 2:26 ` Junio C Hamano
2024-06-12 19:01 ` Victoria Dye
2024-06-12 19:45 ` Junio C Hamano
2024-06-11 18:24 ` [PATCH 10/16] mktree: overwrite duplicate entries Victoria Dye via GitGitGadget
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-12 18:48 ` Victoria Dye
2024-06-11 18:24 ` [PATCH 11/16] mktree: create tree using an in-core index Victoria Dye via GitGitGadget
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-11 18:24 ` [PATCH 12/16] mktree: use iterator struct to add tree entries to index Victoria Dye via GitGitGadget
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-13 18:38 ` Victoria Dye
2024-06-11 18:24 ` [PATCH 13/16] mktree: add directory-file conflict hashmap Victoria Dye via GitGitGadget
2024-06-11 18:24 ` [PATCH 14/16] mktree: optionally add to an existing tree Victoria Dye via GitGitGadget
2024-06-12 9:40 ` Patrick Steinhardt
2024-06-12 19:50 ` Junio C Hamano
2024-06-17 19:23 ` Victoria Dye
2024-06-11 18:24 ` [PATCH 15/16] mktree: allow deeper paths in input Victoria Dye via GitGitGadget
2024-06-11 18:24 ` [PATCH 16/16] mktree: remove entries when mode is 0 Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 00/17] mktree: support more flexible usage Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 01/17] mktree: use OPT_BOOL Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 02/17] mktree: rename treeent to tree_entry Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 03/17] mktree: use non-static tree_entry array Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 04/17] update-index: generalize 'read_index_info' Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 05/17] index-info.c: return unrecognized lines to caller Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 06/17] index-info.c: parse object type in provided in read_index_info Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 07/17] mktree: use read_index_info to read stdin lines Victoria Dye via GitGitGadget
2024-06-20 20:18 ` Junio C Hamano
2024-06-19 21:57 ` [PATCH v2 08/17] mktree.c: do not fail on mismatched submodule type Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 09/17] mktree: add a --literally option Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 10/17] mktree: validate paths more carefully Victoria Dye via GitGitGadget
2024-06-19 21:57 ` [PATCH v2 11/17] mktree: overwrite duplicate entries Victoria Dye via GitGitGadget
2024-06-20 22:05 ` Junio C Hamano
2024-06-19 21:58 ` [PATCH v2 12/17] mktree: create tree using an in-core index Victoria Dye via GitGitGadget
2024-06-20 22:26 ` Junio C Hamano
2024-06-19 21:58 ` [PATCH v2 13/17] mktree: use iterator struct to add tree entries to index Victoria Dye via GitGitGadget
2024-06-26 21:10 ` Junio C Hamano
2024-06-19 21:58 ` [PATCH v2 14/17] mktree: add directory-file conflict hashmap Victoria Dye via GitGitGadget
2024-06-19 21:58 ` Victoria Dye via GitGitGadget [this message]
2024-06-26 21:23 ` [PATCH v2 15/17] mktree: optionally add to an existing tree Junio C Hamano
2024-06-19 21:58 ` [PATCH v2 16/17] mktree: allow deeper paths in input Victoria Dye via GitGitGadget
2024-06-27 19:29 ` Junio C Hamano
2024-06-19 21:58 ` [PATCH v2 17/17] mktree: remove entries when mode is 0 Victoria Dye via GitGitGadget
2024-06-25 23:26 ` [PATCH v2 00/17] mktree: support more flexible usage Junio C Hamano
2024-07-10 21:40 ` Junio C Hamano
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4b88f84b933b1598d12e3620f0c9fb85c559e8fb.1718834285.git.gitgitgadget@gmail.com \
--to=gitgitgadget@gmail.com \
--cc=git@vger.kernel.org \
--cc=ps@pks.im \
--cc=sunshine@sunshineco.com \
--cc=vdye@github.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).