From: Linus Torvalds <torvalds@linux-foundation.org>
To: Junio C Hamano <gitster@pobox.com>
Cc: git@vger.kernel.org
Subject: Re: [Junio C Hamano] Re: Consolidate SHA1 object file close
Date: Sun, 15 Jun 2008 11:08:10 -0700 (PDT) [thread overview]
Message-ID: <alpine.LFD.1.10.0806151057100.2949@woody.linux-foundation.org> (raw)
In-Reply-To: <7viqwbfxk6.fsf@gitster.siamese.dyndns.org>
On Sat, 14 Jun 2008, Junio C Hamano wrote:
>
> Other than that I had to stop for a few seconds to think at the magic
> "20 bytes longer", they look scary-but-correct ;-).
Actually, thinking about it, the +20 was not just fairly arbitrary and
would have been better off with a comment, it was also not a very good
number to begin with - it just was the old tempfile pattern rounded up to
the next "even" number.
The _correct_ number to use is +39, with a better tempfile pattern.
That needs a few comments, though. So I added those too.
This patch is totally unimportant, but it does mean that a really
traditional filesystem can now do the final rename in-place, because the
temporary file is not just in the same directory as the final one, but ti
also has the same length, so old-style filesystems can literally just edit
the name in place and mark the buffer dirty.
And more modern filesystems will generally end up doing more complex
things regardless of what we do, but at least this won't _hurt_ either.
There is one more funny (or sad) detail to this: the old temporary
filename was an interesting size: 14 characters. That happens to be the
traditional path component limit on really old UNIX filesystems, so if you
were to try to use such a filesystem, you'd always be able to create the
temporary file, but then you'd never actually be able to do the final link
or rename due to the final name being too long.
So take this or leave it as you want - it's really not very important, I
just wrote this because I was thinking about what really goes on at a very
low level when we do that final atomic rename to create the actual
directory entry. This makes it theoretically just a tiny bit more atomic
on old-fashioned filesystems.
Linus
---
sha1_file.c | 21 ++++++++++++++++++---
1 files changed, 18 insertions(+), 3 deletions(-)
diff --git a/sha1_file.c b/sha1_file.c
index 37bcc54..f9ec069 100644
--- a/sha1_file.c
+++ b/sha1_file.c
@@ -2110,17 +2110,31 @@ static inline int directory_size(const char *filename)
*
* We want to avoid cross-directory filename renames, because those
* can have problems on various filesystems (FAT, NFS, Coda).
+ *
+ * In order to make the final rename as simple as possible to do
+ * in-place for traditional filesystems, we make the temporary
+ * filename have the same size as the final one - 38 characters
+ * (19 bytes of SHA1 info - the first byte is used for the sub-
+ * directory fan-out).
+ *
+ * The temporary filename pattern is
+ * - 8 characters of "tmp_obj_"
+ * - 23 characters of the object filename
+ * - 7 characters of "_XXXXXX" for mkstemp().
*/
static int create_tmpfile(char *buffer, size_t bufsiz, const char *filename)
{
int fd, dirlen = directory_size(filename);
- if (dirlen + 20 > bufsiz) {
+ if (dirlen + 39 > bufsiz) {
errno = ENAMETOOLONG;
return -1;
}
memcpy(buffer, filename, dirlen);
- strcpy(buffer + dirlen, "tmp_obj_XXXXXX");
+ memcpy(buffer + dirlen, "tmp_obj_", 8);
+ memcpy(buffer + dirlen + 8, filename + dirlen, 23);
+ memcpy(buffer + dirlen + 31, "_XXXXXX", 8);
+
fd = mkstemp(buffer);
if (fd < 0 && dirlen) {
/* Make sure the directory exists */
@@ -2129,7 +2143,8 @@ static int create_tmpfile(char *buffer, size_t bufsiz, const char *filename)
return -1;
/* Try again */
- strcpy(buffer + dirlen - 1, "/tmp_obj_XXXXXX");
+ buffer[dirlen-1] = '/';
+ memcpy(buffer + dirlen + 32, "XXXXXX", 6);
fd = mkstemp(buffer);
}
return fd;
next prev parent reply other threads:[~2008-06-15 18:09 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-06-14 22:42 [Junio C Hamano] Re: Consolidate SHA1 object file close Junio C Hamano
2008-06-15 18:08 ` Linus Torvalds [this message]
2008-06-16 1:13 ` Nicolas Pitre
2008-06-17 5:01 ` Junio C Hamano
2008-06-17 16:30 ` Linus Torvalds
2008-07-04 17:44 ` Pierre Habouzit
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.LFD.1.10.0806151057100.2949@woody.linux-foundation.org \
--to=torvalds@linux-foundation.org \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).