git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>
To: "brian m. carlson" <sandals@crustytoothpaste.net>
Cc: Konstantin Ryabitsev <konstantin@linuxfoundation.org>,
	Eli Schwartz <eschwartz93@gmail.com>,
	Git List <git@vger.kernel.org>
Subject: Re: Stability of git-archive, breaking (?) the Github universe, and a possible solution
Date: Thu, 02 Feb 2023 01:42:48 +0100	[thread overview]
Message-ID: <230202.86r0v8q3oz.gmgdl@evledraar.gmail.com> (raw)
In-Reply-To: <Y9ry5Wxck4s/X2B+@tapette.crustytoothpaste.net>


On Wed, Feb 01 2023, brian m. carlson wrote:

> [[PGP Signed Part:Undecided]]
> On 2023-02-01 at 09:40:57, Ævar Arnfjörð Bjarmason wrote:
>> "A spec" here seems like overkill to me, so far on that front we've been
>> shelling out to gzip(1), and the breakage/event that triggered this
>> thread is rectified by starting to do that again by default.
>
> Sure, that will fix the immediate problem.
>
>> But so what? We don't need to make promises for all potential git
>> implementations, just this one. So we could add a blurb like this to the
>> docs:
>> 
>> 	As people have come to rely on the exact "deflate"
>> 	implementation "git archive" promises to invoke the system's
>> 	"gzip" binary by default, under the assumption that its output
>> 	is stable. If that's no longer the case you'll need to complain
>> 	to whoever maintains your local "gzip".
>
> I don't think a blurb is necessary, but you're basically underscoring
> the problem, which is that nobody is willing to promise that compression
> is consistent, but yet people want to rely on that fact.  I'm willing to
> write and implement a consistent tar spec and to guarantee compatibility
> with that, but the tension here is that people also want gzip to never
> change its byte format ever, which frankly seems unrealistic without
> explicit guarantees.  Maybe the authors will agree to promise that, but
> it seems unlikely.

Maybe they won't, the point is that an upgrade of git wouldn't break
github in the way that's been observed, instead that potential breakage
would happen whenever the OS (or whatever's providing "gzip") is
upgraded.

So, if gzip promises to never change such sites can upgrade it without
issues, but if it does they'll presumably need to pin it forever.

And those sites that don't care about "git archive" stability can use
whatever their local "gzip" is, without caring that the output might
change.

>> If we wanted to be even more helpful we could bunde and ship an old
>> version of GNU gzip with our sources, and either default to that, or
>> offer it as a "--stable" implementation of deflate.
>
> That would probably break things, because gzip is GPLv3, and we'd need
> to ship a much older GPLv2 gzip, which would probably differ from the
> current behaviour, and might also have some security problems.

We're way off in the realm of the hypothetical, I don't think we need a
gzip fallback, we can make it the issue of the rare downstream user who
needs such stability.

But if we shipped a last-good gzip my understanding of software
licensing is that we could ship the GPLv3 version.

The issue with combining GPLv3 and GPLv2 works is if you do something
like upgrade our wildmatch.c to the GPLv3 version (ours is derived from
an older GPLv2 version). Then our combined work is derived from two
different licenses.

But if you're just invoking a different process those two sources can
use incompatible licenses. There's established precedence for that
throughout the industry, and it's the FSF's position on the matter.

So if we offered to build a gzip for you from GPLv3 sources shipped
in-tree that wouldn't infect the rest of git's GPLv2 code, any more than
Debian shipping both git and gzip is cross-contaminating the two.

It might cause us some hassle with distributors for whom any mention of
GPLv3 is anathema (e.g. Apple), but I understand that that's general
paranoia about its patent clauses impacting the distributor, not a
license incompatiblity.

  parent reply	other threads:[~2023-02-02  1:03 UTC|newest]

Thread overview: 57+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-01-31  0:06 Stability of git-archive, breaking (?) the Github universe, and a possible solution Eli Schwartz
2023-01-31  7:49 ` Ævar Arnfjörð Bjarmason
2023-01-31  9:11   ` Eli Schwartz
2023-02-02  9:32   ` [PATCH 0/9] git archive: use gzip again by default, document output stabilty Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 1/9] archive & tar config docs: de-duplicate configuration section Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 2/9] git config docs: document "tar.<format>.{command,remote}" Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 3/9] archiver API: make the "flags" in "struct archiver" an enum Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 4/9] archive: omit the shell for built-in "command" filters Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 5/9] archive-tar.c: move internal gzip implementation to a function Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 6/9] archive: use "gzip -cn" for stability, not "git archive gzip" Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 7/9] test-lib.sh: add a lazy GZIP prerequisite Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 8/9] archive tests: test for "gzip -cn" and "git archive gzip" stability Ævar Arnfjörð Bjarmason
2023-02-02  9:32     ` [PATCH 9/9] git archive docs: document output non-stability Ævar Arnfjörð Bjarmason
2023-02-02 10:25       ` brian m. carlson
2023-02-02 10:30         ` Ævar Arnfjörð Bjarmason
2023-02-02 16:34         ` Junio C Hamano
2023-02-04 17:46           ` brian m. carlson
2023-02-02 16:17     ` [PATCH 0/9] git archive: use gzip again by default, document output stabilty Phillip Wood
2023-02-02 16:40       ` Junio C Hamano
2023-02-03 13:49       ` Ævar Arnfjörð Bjarmason
2023-02-06 14:46         ` Phillip Wood
2023-02-03 15:47       ` Theodore Ts'o
2023-02-02 16:25     ` Junio C Hamano
2023-02-04 18:08       ` René Scharfe
2023-02-05 21:30         ` Ævar Arnfjörð Bjarmason
2023-02-12 17:41           ` René Scharfe
2023-02-02 19:23     ` Raymond E. Pasco
2023-02-03  8:06       ` [PATCH] archive: document output stability concerns Raymond E. Pasco
2023-01-31  9:54 ` Stability of git-archive, breaking (?) the Github universe, and a possible solution brian m. carlson
2023-01-31 11:31   ` Ævar Arnfjörð Bjarmason
2023-01-31 15:05   ` Konstantin Ryabitsev
2023-01-31 22:32     ` brian m. carlson
2023-02-01  9:40       ` Ævar Arnfjörð Bjarmason
2023-02-01 11:34         ` demerphq
2023-02-01 12:21           ` Michal Suchánek
2023-02-01 12:48             ` demerphq
2023-02-01 13:43               ` Ævar Arnfjörð Bjarmason
2023-02-01 15:21                 ` demerphq
2023-02-01 18:56                   ` Theodore Ts'o
2023-02-02 21:19                     ` Joey Hess
2023-02-03  4:02                       ` Theodore Ts'o
2023-02-03 13:32                         ` Ævar Arnfjörð Bjarmason
2023-02-01 23:16         ` brian m. carlson
2023-02-01 23:37           ` Junio C Hamano
2023-02-02 23:01             ` brian m. carlson
2023-02-02 23:47               ` rsbecker
2023-02-03 13:18                 ` Ævar Arnfjörð Bjarmason
2023-02-02  0:42           ` Ævar Arnfjörð Bjarmason [this message]
2023-02-01 12:17       ` Raymond E. Pasco
2023-01-31 15:56   ` Eli Schwartz
2023-01-31 16:20     ` Konstantin Ryabitsev
2023-01-31 16:34       ` Eli Schwartz
2023-01-31 20:34         ` Konstantin Ryabitsev
2023-01-31 20:45         ` Michal Suchánek
2023-02-01  1:33     ` brian m. carlson
2023-02-01 12:42   ` Ævar Arnfjörð Bjarmason
2023-02-01 23:18     ` brian m. carlson

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=230202.86r0v8q3oz.gmgdl@evledraar.gmail.com \
    --to=avarab@gmail.com \
    --cc=eschwartz93@gmail.com \
    --cc=git@vger.kernel.org \
    --cc=konstantin@linuxfoundation.org \
    --cc=sandals@crustytoothpaste.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).