git.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* pack.packSizeLimit, safety checks
@ 2010-02-01  9:20 Sergio
  2010-02-01 16:11 ` Nicolas Pitre
  0 siblings, 1 reply; 11+ messages in thread
From: Sergio @ 2010-02-01  9:20 UTC (permalink / raw)
  To: git

Hi,

documentation about pack.packSizeLimit

says:

The default maximum size of a pack. This setting only affects packing to a file,
i.e. the git:// protocol is unaffected. It can be overridden by the
--max-pack-size option of git-repack(1).

I would suggest clarifying it into

The default maximum size of a pack in bytes. This setting only affects packing
to a file, i.e. the git:// protocol is unaffected. It can be overridden by the
--max-pack-size option of git-repack(1).

Since --max-pack-size takes MB and one might be tempted to assume that the same
is valid for pack.packSizeLimit.

Also note that some safety check on pack.packSizeLimit could probably be
desirable to avoid an unreasonably small limit. For instance:

Assume that pack.packSizeLimit is set to 1 (believing it would be 1MB, but it is
in fact 1B). With this at the first git gc every object goes in its own pack.
You realize the mistake, you fix pack.packSizeLimit to 1000000, but at this
point you cannot go back since git gc cannot run anymore (too many open files).

Maybe, considering all the possible use cases of pack.packSizeLimit could help
finding a reasonable lower bound.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01  9:20 pack.packSizeLimit, safety checks Sergio
@ 2010-02-01 16:11 ` Nicolas Pitre
  2010-02-01 16:26   ` Johannes Sixt
  2010-02-01 17:19   ` Junio C Hamano
  0 siblings, 2 replies; 11+ messages in thread
From: Nicolas Pitre @ 2010-02-01 16:11 UTC (permalink / raw)
  To: Sergio; +Cc: git

On Mon, 1 Feb 2010, Sergio wrote:

> Hi,
> 
> documentation about pack.packSizeLimit
> 
> says:
> 
> The default maximum size of a pack. This setting only affects packing to a file,
> i.e. the git:// protocol is unaffected. It can be overridden by the
> --max-pack-size option of git-repack(1).
> 
> I would suggest clarifying it into
> 
> The default maximum size of a pack in bytes. This setting only affects packing
> to a file, i.e. the git:// protocol is unaffected. It can be overridden by the
> --max-pack-size option of git-repack(1).
> 
> Since --max-pack-size takes MB and one might be tempted to assume that the same
> is valid for pack.packSizeLimit.

Grrrrr.  This is a terrible discrepency given that all the other 
arguments in Git are always byte based, with the optional k/m/g suffix, 
by using git_parse_ulong().  So IMHO I'd just change --max-pack-size to 
be in line with all the rest and have it accept bytes instead of MB.  
And of course I'd push such a change to be included in v1.7.0 along with 
the other incompatible fixes.

Your suggested precision above is still worth it of course.

> Also note that some safety check on pack.packSizeLimit could probably be
> desirable to avoid an unreasonably small limit. For instance:
> 
> Assume that pack.packSizeLimit is set to 1 (believing it would be 1MB, but it is
> in fact 1B). With this at the first git gc every object goes in its own pack.
> You realize the mistake, you fix pack.packSizeLimit to 1000000, but at this
> point you cannot go back since git gc cannot run anymore (too many open files).

That's a totally orthogonal issue.  There are other ways to get into 
trouble with too many open files and that deserves a fix of its own 
(such as limiting the number of simultaneous opened packs).


Nicolas

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 16:11 ` Nicolas Pitre
@ 2010-02-01 16:26   ` Johannes Sixt
  2010-02-01 16:28     ` Shawn O. Pearce
  2010-02-01 17:19   ` Junio C Hamano
  1 sibling, 1 reply; 11+ messages in thread
From: Johannes Sixt @ 2010-02-01 16:26 UTC (permalink / raw)
  To: Nicolas Pitre; +Cc: Sergio, git

Nicolas Pitre schrieb:
> Grrrrr.  This is a terrible discrepency given that all the other 
> arguments in Git are always byte based, with the optional k/m/g suffix, 
> by using git_parse_ulong().  So IMHO I'd just change --max-pack-size to 
> be in line with all the rest and have it accept bytes instead of MB.  
> And of course I'd push such a change to be included in v1.7.0 along with 
> the other incompatible fixes.

While at it, also change --big-file-threshold that fast-import learnt the
other day...

-- Hannes

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 16:26   ` Johannes Sixt
@ 2010-02-01 16:28     ` Shawn O. Pearce
  2010-02-01 17:20       ` Junio C Hamano
  0 siblings, 1 reply; 11+ messages in thread
From: Shawn O. Pearce @ 2010-02-01 16:28 UTC (permalink / raw)
  To: Johannes Sixt; +Cc: Nicolas Pitre, Sergio, git

Johannes Sixt <j.sixt@viscovery.net> wrote:
> Nicolas Pitre schrieb:
> > Grrrrr.  This is a terrible discrepency given that all the other 
> > arguments in Git are always byte based, with the optional k/m/g suffix, 
> > by using git_parse_ulong().  So IMHO I'd just change --max-pack-size to 
> > be in line with all the rest and have it accept bytes instead of MB.  
> > And of course I'd push such a change to be included in v1.7.0 along with 
> > the other incompatible fixes.
> 
> While at it, also change --big-file-threshold that fast-import learnt the
> other day...

Yup.  WTF was I thinking when I did megabytes as the default unit
on the command line...

-- 
Shawn.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 16:11 ` Nicolas Pitre
  2010-02-01 16:26   ` Johannes Sixt
@ 2010-02-01 17:19   ` Junio C Hamano
  2010-02-01 18:04     ` Nicolas Pitre
  1 sibling, 1 reply; 11+ messages in thread
From: Junio C Hamano @ 2010-02-01 17:19 UTC (permalink / raw)
  To: Nicolas Pitre; +Cc: Sergio, git

Nicolas Pitre <nico@fluxnic.net> writes:

> Grrrrr.  This is a terrible discrepency given that all the other 
> arguments in Git are always byte based, with the optional k/m/g suffix, 
> by using git_parse_ulong().  So IMHO I'd just change --max-pack-size to 
> be in line with all the rest and have it accept bytes instead of MB.  
> And of course I'd push such a change to be included in v1.7.0 along with 
> the other incompatible fixes.

All of the "other incompatible" changes had ample leading time for
transition with warnings and all.

I am afraid that doing this "unit change" is way too late for 1.7.0, and
it makes me somewhat unhappy to hear such a suggestion.  It belittles all
the careful planning that has been done for these other changes to help
protect the users from transition pain.

Introduce --max-pack-megabytes that is a synonym for --max-pack-size for
now, and warn when --max-pack-size is used; warn that --max-pack-size will
count in bytes in 1.8.0. Ship 1.7.0 with that change.  --max-pack-bytes
can also be added if you feel like, while at it.

But changing the unit --max-pack-size counts in to bytes in 1.7.0 feels
a bit too irresponsible for the existing users.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 16:28     ` Shawn O. Pearce
@ 2010-02-01 17:20       ` Junio C Hamano
  0 siblings, 0 replies; 11+ messages in thread
From: Junio C Hamano @ 2010-02-01 17:20 UTC (permalink / raw)
  To: Shawn O. Pearce; +Cc: Johannes Sixt, Nicolas Pitre, Sergio, git

"Shawn O. Pearce" <spearce@spearce.org> writes:

> Yup.  WTF was I thinking when I did megabytes as the default unit
> on the command line...

That thing is new, so it is worth fixing before 1.7.0 final.
Please make it so.

Thanks.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 17:19   ` Junio C Hamano
@ 2010-02-01 18:04     ` Nicolas Pitre
  2010-02-01 18:45       ` Junio C Hamano
  2010-02-04  2:14       ` Junio C Hamano
  0 siblings, 2 replies; 11+ messages in thread
From: Nicolas Pitre @ 2010-02-01 18:04 UTC (permalink / raw)
  To: Junio C Hamano; +Cc: Sergio, git

On Mon, 1 Feb 2010, Junio C Hamano wrote:

> Nicolas Pitre <nico@fluxnic.net> writes:
> 
> > Grrrrr.  This is a terrible discrepency given that all the other 
> > arguments in Git are always byte based, with the optional k/m/g suffix, 
> > by using git_parse_ulong().  So IMHO I'd just change --max-pack-size to 
> > be in line with all the rest and have it accept bytes instead of MB.  
> > And of course I'd push such a change to be included in v1.7.0 along with 
> > the other incompatible fixes.
> 
> All of the "other incompatible" changes had ample leading time for
> transition with warnings and all.
> 
> I am afraid that doing this "unit change" is way too late for 1.7.0, and
> it makes me somewhat unhappy to hear such a suggestion.  It belittles all
> the careful planning that has been done for these other changes to help
> protect the users from transition pain.
> 
> Introduce --max-pack-megabytes that is a synonym for --max-pack-size for
> now, and warn when --max-pack-size is used; warn that --max-pack-size will
> count in bytes in 1.8.0. Ship 1.7.0 with that change.  --max-pack-bytes
> can also be added if you feel like, while at it.
> 
> But changing the unit --max-pack-size counts in to bytes in 1.7.0 feels
> a bit too irresponsible for the existing users.

Thing is... I don't know if the --max-pack-size argument is really that 
used.  I'd expect people relying on that feature to use the config 
variable instead, given that 'git gc' has no idea about --max-pack-size 
anyway.  People using the --max-pack-size argument directly are probably 
doing so only to experiment with it, and then setting the config 
variable, probably using the wrong unit.  The fact that such a 
discrepancy just came to our attention after all the time this feature 
has existed is certainly a good indicator of its popularity.

I understand the really unfortunate timing for such a change.  OTOH 
there is a big advantage to bundle as much incompatibilities together at 
the same time, as people will be prepared for such things already.

While I share your concern for advance warning and such, I think those 
concerns are worth an effort proportional to the depth of user exposure.  
Like for the THREADED_DELTA_SEARCH case, I'm wondering how much pain if 
at all might be saved with a transition plan vs the cost of maintaining 
that plan and carrying the discrepancy further.


Nicolas

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 18:04     ` Nicolas Pitre
@ 2010-02-01 18:45       ` Junio C Hamano
  2010-02-04  2:14       ` Junio C Hamano
  1 sibling, 0 replies; 11+ messages in thread
From: Junio C Hamano @ 2010-02-01 18:45 UTC (permalink / raw)
  To: Nicolas Pitre; +Cc: Sergio, git

Nicolas Pitre <nico@fluxnic.net> writes:

> While I share your concern for advance warning and such, I think those 
> concerns are worth an effort proportional to the depth of user exposure.  
> Like for the THREADED_DELTA_SEARCH case, I'm wondering how much pain if 
> at all might be saved with a transition plan vs the cost of maintaining 
> that plan and carrying the discrepancy further.

Absolutely.  I just wanted to hear that --max-pack-size command line
option is not widely used and it is Ok to change it.

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-01 18:04     ` Nicolas Pitre
  2010-02-01 18:45       ` Junio C Hamano
@ 2010-02-04  2:14       ` Junio C Hamano
  2010-02-04  2:31         ` Nicolas Pitre
  1 sibling, 1 reply; 11+ messages in thread
From: Junio C Hamano @ 2010-02-04  2:14 UTC (permalink / raw)
  To: Nicolas Pitre; +Cc: Junio C Hamano, Sergio, git

Nicolas Pitre <nico@fluxnic.net> writes:

> Thing is... I don't know if the --max-pack-size argument is really that 
> used.  I'd expect people relying on that feature to use the config 
> variable instead,...

I suspect one of us need to be careful not to forget this thing...

-- >8 --
Subject: pack-objects --max-pack-size=<n> counts in bytes

The --window-memory argument and pack.packsizelimit configuration used by
the same program counted in bytes and honored the standard k/m/g suffixes.
Make this option do the same for consistency.

Signed-off-by: Junio C Hamano <gitster@pobox.com>
---
 Documentation/RelNotes-1.7.0.txt   |    6 ++++++
 Documentation/git-pack-objects.txt |    3 ++-
 builtin-pack-objects.c             |    7 +++----
 3 files changed, 11 insertions(+), 5 deletions(-)

diff --git a/Documentation/RelNotes-1.7.0.txt b/Documentation/RelNotes-1.7.0.txt
index 323ae54..adf8824 100644
--- a/Documentation/RelNotes-1.7.0.txt
+++ b/Documentation/RelNotes-1.7.0.txt
@@ -46,6 +46,12 @@ Notes on behaviour change
    environment, and diff.*.command and diff.*.textconv in the config
    file.
 
+ * "git pack-objects --max-pack-size=<n>" used to count in megabytes,
+   which was inconsistent with its corresponding configuration
+   variable and other options the command takes.  Now it counts in bytes
+   and allows standard k/m/g suffixes to be given.
+
+
 Updates since v1.6.6
 --------------------
 
diff --git a/Documentation/git-pack-objects.txt b/Documentation/git-pack-objects.txt
index 097a147..fdaf775 100644
--- a/Documentation/git-pack-objects.txt
+++ b/Documentation/git-pack-objects.txt
@@ -106,7 +106,8 @@ base-name::
 	default.
 
 --max-pack-size=<n>::
-	Maximum size of each output packfile, expressed in MiB.
+	Maximum size of each output packfile, expressed in bytes.  The
+	size can be suffixed with "k", "m", or "g".
 	If specified,  multiple packfiles may be created.
 	The default is unlimited, unless the config variable
 	`pack.packSizeLimit` is set.
diff --git a/builtin-pack-objects.c b/builtin-pack-objects.c
index 4a41547..33e11d7 100644
--- a/builtin-pack-objects.c
+++ b/builtin-pack-objects.c
@@ -2203,11 +2203,10 @@ int cmd_pack_objects(int argc, const char **argv, const char *prefix)
 			continue;
 		}
 		if (!prefixcmp(arg, "--max-pack-size=")) {
-			char *end;
-			pack_size_limit_cfg = 0;
-			pack_size_limit = strtoul(arg+16, &end, 0) * 1024 * 1024;
-			if (!arg[16] || *end)
+			unsigned long ul = 0;
+			if (!git_parse_ulong(arg + 16, &ul))
 				usage(pack_usage);
+			pack_size_limit_cfg = ul;
 			continue;
 		}
 		if (!prefixcmp(arg, "--window=")) {

^ permalink raw reply related	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-04  2:14       ` Junio C Hamano
@ 2010-02-04  2:31         ` Nicolas Pitre
  2010-02-04  2:34           ` Junio C Hamano
  0 siblings, 1 reply; 11+ messages in thread
From: Nicolas Pitre @ 2010-02-04  2:31 UTC (permalink / raw)
  To: Junio C Hamano; +Cc: Sergio, git

On Wed, 3 Feb 2010, Junio C Hamano wrote:

> Nicolas Pitre <nico@fluxnic.net> writes:
> 
> > Thing is... I don't know if the --max-pack-size argument is really that 
> > used.  I'd expect people relying on that feature to use the config 
> > variable instead,...
> 
> I suspect one of us need to be careful not to forget this thing...

Please hold on.  I think I have a better patch.


Nicolas

^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: pack.packSizeLimit, safety checks
  2010-02-04  2:31         ` Nicolas Pitre
@ 2010-02-04  2:34           ` Junio C Hamano
  0 siblings, 0 replies; 11+ messages in thread
From: Junio C Hamano @ 2010-02-04  2:34 UTC (permalink / raw)
  To: Nicolas Pitre; +Cc: Junio C Hamano, Sergio, git

Nicolas Pitre <nico@fluxnic.net> writes:

>> I suspect one of us need to be careful not to forget this thing...
>
> Please hold on.  I think I have a better patch.

Ok, I dropped both fast-import.c mismerge fix and the one you are
responding to.

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2010-02-04  2:35 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-02-01  9:20 pack.packSizeLimit, safety checks Sergio
2010-02-01 16:11 ` Nicolas Pitre
2010-02-01 16:26   ` Johannes Sixt
2010-02-01 16:28     ` Shawn O. Pearce
2010-02-01 17:20       ` Junio C Hamano
2010-02-01 17:19   ` Junio C Hamano
2010-02-01 18:04     ` Nicolas Pitre
2010-02-01 18:45       ` Junio C Hamano
2010-02-04  2:14       ` Junio C Hamano
2010-02-04  2:31         ` Nicolas Pitre
2010-02-04  2:34           ` Junio C Hamano

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).