From: Jeff King <peff@github.com>
To: "René Scharfe" <rene.scharfe@lsrfire.ath.cx>
Cc: Junio C Hamano <gitster@pobox.com>,
"J.H." <warthog19@eaglescrag.net>,
git@vger.kernel.org, git-dev@github.com
Subject: Re: [RFC/PATCH 0/7] user-configurable git-archive output formats
Date: Tue, 21 Jun 2011 12:01:59 -0400 [thread overview]
Message-ID: <20110621160159.GA17334@sigill.intra.peff.net> (raw)
In-Reply-To: <4DFCBB92.5040308@lsrfire.ath.cx>
On Sat, Jun 18, 2011 at 04:52:02PM +0200, René Scharfe wrote:
> >> The gzip path is not configurable at all. Probably it should read the
> >> path and arguments from the config file. In fact, we could even allow
> >> arbitrary config like:
> >>
> >> [tarfilter "tgz"]
> >> command = gzip -c
> >> extension = tgz
> >> extension = tar.gz
>
> Configuration options whose values are appended instead of overwritten
> by duplicate definitions are a new concept for git, I think. Perhaps
> it's not a big thing, but I think it's better avoided.
>
> The only (stupid) practical shortcoming I can think if is this, though:
> You can't remove anything from the list of supported extensions in a
> user config if the system config already contains e.g. tgz and tar.gz.
Yeah, I have mixed feelings on that.
As Jakub pointed out, we already have them in several places. I don't
know that removal is that big a deal in this instance. If we did want to
support it, I think it would make more sense to have a generic solution
at the config level, like:
[some-section]
multivalue = foo
multivalue = bar
!multivalue
multivalue = baz
multivalue = whee
at which point the value is ("baz", "whee"). That matches what we do on
the command line, where:
git foo --multivalue=foo --multivalue=bar --no-multivalue \
--multivalue=baz --multivalue=whee
handles the same issue in a similar way.
The other option, of course, is having a single value with list
semantics. But then you have to invent separator syntax. In this
instance whitespace would probably be fine, but I'd rather that each new
multi-valued option did not invent its own syntax, and in the general
case you may need to handle quoting. Plus you may need some kind of
append syntax. For example, if we support "tgz" and "tar.gz" internally,
how do you say 'add "pax.gz"' to that list without reiterating the whole
list?
> The pax format is identical to the ustar format, which --format=tar
> produces. The other major format that comes to mind is cpio. The
> (never merged) predecessor of tar-tree actually used that format.
Thanks, cpio is probably the most likely example.
> Since then I have been waiting for users to request being able to export
> using cpio format (which is simpler and slightly smaller than tar), but
> that never happened. It seems the existence of the pax format really
> has pacified the tar vs. cpio war of old.
Fair enough. I haven't heard anybody clamoring for it either. I just
didn't want to paint us into a corner. Since it seems like the most
likely format and nobody really wants it, it's perhaps not worth
worrying about.
> I'm not sure "filter" is a good name, though. We have core.pager, which
> is technically a filter as well, but for a specific purpose.
Yeah, any name would have to be "archive filter" or similar. But I would
think being under the "tar" section would be enough to disambiguate it.
> And we have the tar.umask setting as a precedence for format specfic
> config options. So how about tar.<extension>.compressor?
>
> [tar "tgz"]
> compressor = gzip -cn
> [tar "tar.gz"]
> compressor = gzip -cn
> [tar "tar.bz2"]
> compressor = bzip2 -c
My two complaints are:
1. The user has to repeat themselves in describing the command for
multiple extensions. In practice, that's probably not a big deal,
though.
2. The namespace for user-defined extensions is the same as the
namespace for tar options. I guess we can disambiguate based on the
number of dots (so, e.g., I know that "tar.umask" is not the umask
extension, because it doesn't have a third component). It does
limit us a little bit for adding future options.
I don't know if it's worth caring about. We have the same problem
with the diff.* namespace (e.g., diff.color.* exists, but is not a
userdiff driver). In that case, besides the code being a little
careful to be tolerant of the clash, I don't think it has been a
problem.
> We don't need a compressionlevels option here because we can simply
> assume that the compressor commands do support them.
But we discussed elsewhere the concept of a tar-to-7z filter. I'm not
sure I'd call that a "compressor" as much as a filter. And it wouldn't
want the compression-level options (or maybe you would; I don't use it,
but skimming the manpage, it looks like you would want to convert -5
into "-mx=5"; so maybe you would want a wrapper script anyway).
> (Side note: this is not fully true for bzip2, as it doesn't support
> -0, but I don't think this is worth special consideration in our code,
> as long as errors of the filter are displayed properly.)
Yeah, I think that can be ignored. bzip can take care of complaining
itself.
> And we can also add a config option to restrict the formats creatable by
> upload-archive, to address concerns over DoS attacks with expensive
> compressors:
>
> [archive]
> remoteFormats = tar zip tgz tar.gz
Right. It does have the ad-hoc list syntax I complained about above,
though.
-Peff
next prev parent reply other threads:[~2011-06-21 16:02 UTC|newest]
Thread overview: 56+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-06-14 18:17 [PATCH 1/2] archive: factor out write phase of tar format Jeff King
2011-06-14 18:18 ` [PATCH 2/2] archive: support gzipped tar files Jeff King
2011-06-14 19:25 ` J.H.
2011-06-14 19:30 ` Jeff King
2011-06-14 19:39 ` René Scharfe
2011-06-14 20:14 ` Jeff King
2011-06-14 20:45 ` Jeff King
2011-06-15 22:30 ` [RFC/PATCH 0/7] user-configurable git-archive output formats Jeff King
2011-06-15 22:31 ` [PATCH 1/7] archive: reorder option parsing and config reading Jeff King
2011-06-15 22:33 ` [PATCH 2/7] archive: add user-configurable tar-filter infrastructure Jeff King
2011-06-15 23:33 ` Junio C Hamano
2011-06-16 0:29 ` Jeff King
2011-06-15 22:33 ` [PATCH 3/7] archive: support user tar-filters via --format Jeff King
2011-06-15 22:33 ` [PATCH 4/7] archive: advertise user tar-filters in --list Jeff King
2011-06-15 22:34 ` [PATCH 5/7] archive: refactor format-guessing from filename Jeff King
2011-06-15 23:48 ` Junio C Hamano
2011-06-16 0:34 ` Jeff King
2011-06-15 22:34 ` [PATCH 6/7] archive: match extensions from user-configured formats Jeff King
2011-06-15 22:35 ` [PATCH 7/7] archive: provide builtin .tar.gz filter Jeff King
2011-06-15 23:55 ` Junio C Hamano
2011-06-15 23:57 ` Junio C Hamano
2011-06-16 0:38 ` Jeff King
2011-06-16 6:27 ` Junio C Hamano
2011-06-16 6:51 ` Jeff King
2011-06-16 7:56 ` Chris Webb
2011-06-16 17:46 ` Jeff King
2011-06-16 18:02 ` Junio C Hamano
2011-06-16 18:21 ` Jeff King
2011-06-16 18:27 ` John Szakmeister
2011-06-16 18:42 ` Junio C Hamano
2011-06-16 18:57 ` Jeff King
2011-06-18 14:52 ` [RFC/PATCH 0/7] user-configurable git-archive output formats René Scharfe
2011-06-18 15:28 ` Jakub Narebski
2011-06-20 15:58 ` Junio C Hamano
2011-06-22 1:19 ` [PATCHv2 0/9] configurable tar compressors Jeff King
2011-06-22 1:20 ` [PATCHv2 1/9] archive: reorder option parsing and config reading Jeff King
2011-06-22 1:22 ` [PATCHv2 2/9] archive-tar: don't reload default config options Jeff King
2011-06-22 1:23 ` [PATCHv2 3/9] archive: refactor list of archive formats Jeff King
2011-06-23 17:05 ` Thiago Farina
2011-06-23 17:30 ` Jeff King
2011-06-22 1:24 ` [PATCHv2 4/9] archive: pass archiver struct to write_archive callback Jeff King
2011-06-22 1:24 ` [PATCHv2 5/9] archive: move file extension format-guessing lower Jeff King
2011-06-22 1:25 ` [PATCHv2 6/9] archive: refactor file extension format-guessing Jeff King
2011-06-22 1:26 ` [PATCHv2 7/9] archive: implement configurable tar filters Jeff King
2011-06-22 1:45 ` Jeff King
2011-06-22 6:09 ` René Scharfe
2011-06-22 14:59 ` Jeff King
2011-06-22 1:27 ` [PATCHv2 8/9] archive: provide builtin .tar.gz filter Jeff King
2011-06-22 1:35 ` [PATCHv2 9/9] upload-archive: allow user to turn off filters Jeff King
2011-06-22 3:17 ` Jeff King
2011-06-21 16:01 ` Jeff King [this message]
2011-06-18 15:40 ` [RFC/PATCH 0/7] user-configurable git-archive output formats René Scharfe
2011-06-14 20:30 ` [PATCH 2/2] archive: support gzipped tar files Junio C Hamano
2011-06-14 20:49 ` Jeff King
2011-06-14 23:40 ` Miles Bader
2011-06-15 22:46 ` Jeff King
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20110621160159.GA17334@sigill.intra.peff.net \
--to=peff@github.com \
--cc=git-dev@github.com \
--cc=git@vger.kernel.org \
--cc=gitster@pobox.com \
--cc=rene.scharfe@lsrfire.ath.cx \
--cc=warthog19@eaglescrag.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).