From: Jean Delvare <khali@linux-fr.org>
To: "J.H." <warthog9@kernel.org>
Cc: linux-kernel <linux-kernel@vger.kernel.org>,
mirrors@kernel.org, users@kernel.org,
"FTPAdmin Kernel.org" <ftpadmin@kernel.org>,
lasse.collin@tukaani.org
Subject: Re: [kernel.org users] XZ Migration discussion
Date: Fri, 12 Feb 2010 15:01:37 +0100 [thread overview]
Message-ID: <20100212150137.648dca7c@hyperion.delvare> (raw)
In-Reply-To: <4B744E13.8040004@kernel.org>
On Thu, 11 Feb 2010 10:36:03 -0800, J.H. wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hey Everyone,
>
> So as the subject states this is more a centralized discussion on
> migration plans to using and providing xz for content on kernel.org.
> Currently we provide gz and bz2, with gz acting as the original content
> and kernel.org itself generating the resulting bz2 files. There are a
> couple of possible proposals and wanted to toss them out there, and get
> feedback from everyone: the kernel community, the mirrors of kernel.org
> and the direct users of kernel.org.
Don't you have download statistics available? If we knew which
compression format is preferred, an by which margin, it would help make
an educated decision.
> ========================================================================
>
> Option 1)
>
> Leave gz as the master, and migrate bz2 to xz. This will happen in
> stages obviously. with bz2 ultimately being phased out.
>
> Migration option 1)
>
> All new content would be provided in .bz2 and .xz with
> an ultimate date set that the .bz2 files would stop
> being generated with new content. This would leave all
> existing content alone and it would not be a migration
> of the current .bz2 files to xz
>
> Migration option 2)
>
> At some point there would be a mass conversion of all
> existing content to include .bz2 and .xz. These would
> be run in parallel for a time period until it was
> determined that .bz2 was no longer needed and it would
> be removed from the servers leaving .gz and .xz
>
> Option 2)
>
> Convert the master data from gz to bz2 and use xz as the new file
> format. This has the downside of causing more tool churn as it means
> the kernel developers will have to eventually convert from gz to bz2,
> which means for a time there will be nag e-mails if you upload gz
> instead of bz2 and such. It would also mean that we (kernel.org) would
> need to be able to support .gz and .bz2 as master data for a time.
>
> Migration options are identical to Option 1 more or less, with either
> just new content getting converted, or all content getting converted.
>
> ========================================================================
>
> I'm personally leaning towards option 1, though personally don't really
> have a preference on the migration options, as both obviously offer
> different advantages, and again this e-mail is more to spur on the
> discussion and come to some general consensus across all of the groups
> concerned before moving forward with a more specific plan.
>
> So I'm inviting discussion, questions and comments on this so we know
> which way to ultimately go.
Maybe that's just me, but my main concern is neither download times nor
decompression times. My main concern is the access time to directory
indexes when browsing the kernel archive, because there are 5 entries
for every patch or tarball: .bz2, .bz2.sign, .gz, .gz.sign and .sign.
This is horribly slow. The main directory for 2.6 kernels has an index
weighting over 300 kB raw, turning into a ~600 kB document when
HTML-ized. Just fetching it takes 3 seconds and then my browser takes a
long time to format it. There are 3881 entries in that directory today,
and it keeps growing!
So, once we have settled for a compression strategy, I think it would
be the right time to discuss the directory structure. With the advent
of the stable branches and the new development model - which pretty
much implies that we'll live with main version 2.6 forever - the file
count is much higher than it used to me.
I can think of several ways to improve the situation here, some of
which could be combined.
1* Keep a single compression format. This saves almost 40% of the
files.
2* Move one of the compression formats somewhere else, so that it
doesn't get in the way but is still available if needed.
3* Create a new subdirectory for every 2.6.x kernel, and move all the
related files there. This would shrink the main index drastically, and
each subdirectory would have a reasonable size (except maybe 2.6.16 and
2.6.27.) Oddly enough this has been done for the files under testing/
already, so I am curious why we don't do it for the release files (and
the testing/incr/ files, while we're at it.)
4* Get rid of the LATEST-IS-* files. This is a small count, won't save
much, but these files seem totally useless to me these days. Depending
on what you want exactly, there are many versions which can be
considered the latest, and there are better ways to know which they are
(for example http://www.eu.kernel.org/kdist/finger_banner ). And these
files tend to get stuck so you can't rely on them anyway.
I wouldn't worry too much about breaking the current locations. Just
give some time for software authors (ketchup comes to mind) to update
their code and it shouldn't be a big problem.
Thanks,
--
Jean Delvare
next prev parent reply other threads:[~2010-02-12 14:01 UTC|newest]
Thread overview: 106+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-02-11 18:36 XZ Migration discussion J.H.
2010-02-11 19:44 ` david
2010-02-11 19:48 ` H. Peter Anvin
2010-02-12 0:14 ` [kernel.org mirrors] " Carlos Carvalho
2010-02-11 20:22 ` [kernel.org users] " Willy Tarreau
2010-02-11 20:51 ` Pavel Machek
2010-02-11 22:22 ` H. Peter Anvin
2010-02-12 14:35 ` Jean Delvare
2010-02-13 17:10 ` Jean Delvare
2010-02-13 18:49 ` Geert Uytterhoeven
2010-02-13 19:30 ` Randy Dunlap
2010-02-16 14:55 ` Steven Rostedt
2010-02-13 23:28 ` Stefan Richter
2010-02-14 9:07 ` Jean Delvare
2010-02-13 23:52 ` Phillip Lougher
2010-02-14 9:23 ` Jean Delvare
2010-02-14 9:33 ` Justin P. Mattock
2010-02-14 9:49 ` Willy Tarreau
2010-02-14 12:43 ` Jean Delvare
2010-02-15 21:31 ` James Cloos
2010-02-17 5:40 ` Willy Tarreau
2010-02-17 5:54 ` H. Peter Anvin
2010-02-17 10:22 ` Petri Kaukasoina
2010-02-17 10:25 ` Petri Kaukasoina
2010-02-14 9:56 ` Andi Kleen
2010-02-18 23:59 ` Jan Engelhardt
2010-02-14 10:16 ` Lasse Collin
2010-02-12 14:01 ` Jean Delvare [this message]
2010-02-12 15:21 ` Linus Torvalds
2010-02-12 19:02 ` J.H.
2010-02-12 19:23 ` Jean Delvare
2010-02-12 23:07 ` Willy Tarreau
2010-02-13 6:20 ` Pavel Machek
2010-02-13 10:06 ` Stefan Richter
2010-02-13 10:21 ` Stefan Richter
2010-02-14 9:56 ` Lasse Collin
2010-02-13 8:17 ` Jean Delvare
2010-02-13 9:59 ` Stefan Richter
2010-02-13 10:18 ` Avi Kivity
2010-02-13 21:37 ` [kernel] " Mr. James W. Laferriere
2010-02-13 22:39 ` Bernd Petrovitsch
2010-02-15 19:33 ` [kernel.org users] [kernel] " Steve French
2010-02-16 9:16 ` Bernd Petrovitsch
2010-02-16 15:13 ` J.H.
2010-02-14 17:13 ` [kernel.org users] " Pavel Machek
2010-02-14 17:33 ` Stefan Richter
2010-02-14 20:51 ` Pavel Machek
2010-02-15 8:48 ` Stefan Richter
2010-02-15 23:32 ` Tom Rini
2010-02-16 8:21 ` Stefan Richter
2010-02-14 17:07 ` Pavel Machek
2010-02-14 18:39 ` Stefan Richter
2010-02-14 21:04 ` david
2010-02-14 21:32 ` Jean Delvare
2010-02-14 18:59 ` H. Peter Anvin
2010-02-14 19:08 ` Jean Delvare
2010-02-15 15:15 ` tytso
2010-02-16 15:29 ` J.H.
2010-02-16 16:03 ` Jean Delvare
2010-02-17 1:36 ` David Rees
2010-02-16 14:00 ` Pavel Machek
2010-02-19 0:08 ` Jan Engelhardt
2010-02-19 4:38 ` J.H.
2010-02-12 15:25 ` Steven Rostedt
2010-02-12 16:11 ` Jean Delvare
2010-02-12 16:30 ` Steven Rostedt
2010-02-12 16:44 ` Jean Delvare
2010-02-12 20:34 ` Junio C Hamano
2010-02-12 21:16 ` Steven Rostedt
2010-02-16 15:39 ` ketchup was " Pavel Machek
2010-02-16 16:17 ` Steven Rostedt
2010-02-16 16:27 ` Pavel Machek
2010-02-21 13:53 ` Pavel Machek
2010-02-21 14:26 ` Jean Delvare
2010-02-21 19:28 ` Pavel Machek
2010-02-22 18:59 ` Pavel Machek
2010-02-23 13:14 ` Steven Rostedt
2010-02-23 20:32 ` Pavel Machek
2010-02-12 19:02 ` Phillip Lougher
2010-02-12 19:32 ` Jean Delvare
2010-02-12 19:57 ` Phillip Lougher
2010-02-12 21:59 ` Jean Delvare
2010-02-12 23:30 ` Phillip Lougher
2010-02-12 23:39 ` Phillip Lougher
2010-02-13 7:31 ` Jean Delvare
2010-02-13 22:37 ` Phillip Lougher
2010-02-14 9:32 ` Jean Delvare
2010-02-14 9:53 ` Justin P. Mattock
[not found] ` <20100212223547.GN5186@tux>
2010-02-13 7:20 ` Jean Delvare
2010-02-12 19:03 ` H. Peter Anvin
2010-02-12 20:25 ` Matthew Wilcox
2010-02-12 21:54 ` Greg KH
2010-02-12 21:58 ` Steven Rostedt
2010-02-14 14:49 ` Harald Arnesen
2010-02-14 18:34 ` H. Peter Anvin
2010-02-12 20:31 ` [kernel.org mirrors] " Sleddens, J.P.G.
2010-02-12 22:11 ` H. Peter Anvin
2010-02-12 23:14 ` [kernel.org users] [kernel.org mirrors] " Willy Tarreau
2010-02-13 7:42 ` [kernel.org users] " Tony Luck
2010-02-13 8:14 ` H. Peter Anvin
2010-02-13 8:53 ` Jean Delvare
2010-02-14 5:43 ` H. Peter Anvin
[not found] ` <987664A83D2D224EAE907B061CE93D53123E0ED5@orsmsx505.amr.corp.intel.com>
2010-02-17 0:11 ` H. Peter Anvin
2010-02-14 18:03 ` Eric W. Biederman
2010-02-14 22:27 ` Stephen Hemminger
2010-02-15 16:15 ` Jean Delvare
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100212150137.648dca7c@hyperion.delvare \
--to=khali@linux-fr.org \
--cc=ftpadmin@kernel.org \
--cc=lasse.collin@tukaani.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mirrors@kernel.org \
--cc=users@kernel.org \
--cc=warthog9@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox