From: Andrea Arcangeli <andrea@suse.de>
To: linux-kernel@vger.kernel.org, darcs-users@darcs.net
Subject: Re: [darcs-users] Re: [BK] upgrade will be needed
Date: Sat, 19 Feb 2005 18:53:48 +0100 [thread overview]
Message-ID: <20050219175348.GE7247@opteron.random> (raw)
In-Reply-To: <20050219171457.GA20285@abridgegame.org>
On Sat, Feb 19, 2005 at 12:15:02PM -0500, David Roundy wrote:
> The linux-2.5 tree right now (I'm re-doing the conversion, and am up to
> October of last year, so far) is at 141M, if you don't count the pristine
> cache or working directory. That's already compressed, so you don't get
> any extra bonuses. Darcs stores with each changeset both the old and new
> versions of each hunk, which gives you some redundancy, and probably
> accounts for the factor of two greater size than CVS. This gives a bit of
> redundancy, which can be helpful in cases of repository corruption.
Double size of the compressed backup is about the same as SVM with fsfs
(not tested on l-k tree but in something much smaller). Why not to
simply checksum instead of having data redundancy? Knowing when
something is corrupted is a great feature, but doing raid1 without the
user asking for it, is a worthless overhead.
The same is true for arch of course, last time I checked they were using
the default -U 3 format instead of -U 0.
> I presume you're referring to a local checkout? That is already done using
> hard links by darcs--only of course the working directory has to actually
> be copied over, since there are editors out there that aren't friendly to
> hard-linked files.
arch allows to hardlink the copy too (optionally) and it's up to you to
use the right switch in the editor (Davide had a LD_PRELOAD to do a
copy-on-write since the kernel doesn't provide the feature).
> And here's where darcs really falls down. To track the history of a single
> file it has to read the contents of every changeset since the creation of
> that file, which will take forever (well, not quite... but close).
Indeed, and as I mentioned this is the *major* feature as far as I'm
concerned (and frankly the only one I really care about and that helps a
lot to track changes in the tree and understand why the code evolved).
Note that cvsps works great for this, it's very efficient as well (not
remotely comparable to arch at least, even if arch provided a tool
equivalent to cvsps), the only problem is that CVS is out of sync...
> I hope to someday (when more pressing issues are dealt with) add a per-file
> cache indicating which patches affect which files, which should largely
> address this problem, although it won't help at all with files that are
> touched by most of the changesets, and won't be optimimal in any case. :(
Wouldn't using the CVS format help an order of magnitude here? With
CVS/SCCS format you can extract all the patchsets that affected a single
file in a extremely efficient manner, memory utilization will be
extremely low too (like in cvsps indeed). You'll only have to lookup the
"global changeset file", and then open the few ,v files that are
affected and extract their patchsets. cvsps does this optimally
already. The only difference is that what cvsps is a "readonly" cache,
while with a real SCM it would be a global file that control all the
changesets in an atomic way.
Infact *that* global file could be a bsddb too, I don't care about how
the changset file is being encoded, all I care is that the data is a ,v
file or SCCS file so cvsps won't have to read >20000 files every time I
ask that question, which is currently unavoidable with both darcs and
arch.
next prev parent reply other threads:[~2005-02-19 17:53 UTC|newest]
Thread overview: 156+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-02-14 2:08 [BK] upgrade will be needed Larry McVoy
2005-02-14 6:02 ` Matthew Jacob
2005-02-14 6:17 ` Adam Sulmicki
2005-02-14 9:49 ` Geert Uytterhoeven
2005-02-14 12:08 ` Bartlomiej Zolnierkiewicz
2005-02-14 15:08 ` Jeff Sipek
2005-02-14 15:40 ` Larry McVoy
2005-02-14 16:21 ` linux-os
2005-02-14 17:12 ` Larry McVoy
2005-02-16 23:55 ` Pavel Machek
2005-02-17 20:33 ` David Weinehall
2005-02-17 21:31 ` Chris Wright
2005-02-14 17:14 ` Marcin Dalecki
2005-02-14 17:23 ` Russell Miller
2005-02-14 17:49 ` Larry McVoy
2005-02-14 18:33 ` Marcin Dalecki
2005-02-14 18:47 ` Steven Rostedt
[not found] ` <20050214190137.GB16029@bitmover.com>
[not found] ` <1108415541.8413.48.camel@localhost.localdomain>
[not found] ` <20050214231148.GP13174@bitmover.com>
[not found] ` <1108425420.8413.78.camel@localhost.localdomain>
[not found] ` <20050215000028.GS13174@bitmover.com>
[not found] ` <1108426451.8413.84.camel@localhost.localdomain>
[not found] ` <20050215003535.GB32158@bitmover.com>
2005-02-15 1:00 ` Steven Rostedt
2005-02-15 3:01 ` Larry McVoy
2005-02-15 3:39 ` Steven Rostedt
2005-02-15 14:05 ` Alexandre Oliva
2005-02-14 19:29 ` Larry McVoy
2005-02-14 21:13 ` Erik Andersen
2005-02-14 18:17 ` Matthew Jacob
2005-02-14 18:36 ` Marcin Dalecki
2005-02-14 18:45 ` Matthew Jacob
2005-02-14 20:02 ` Matthias Andree
2005-02-15 5:13 ` Scott Lockwood
2005-02-14 18:56 ` Larry McVoy
2005-02-14 20:36 ` Adrian Bunk
2005-02-14 21:25 ` Paolo Ciarrocchi
2005-02-14 23:02 ` Henrik Persson
2005-02-14 21:30 ` Tom Felker
2005-02-15 9:04 ` James Bruce
2005-02-16 17:31 ` Jeff Sipek
2005-02-14 23:24 ` Marcin Dalecki
2005-02-15 3:35 ` Horst von Brand
2005-02-15 5:23 ` Hua Zhong
2005-02-15 12:19 ` kernel
2005-02-15 12:45 ` linux-os
2005-02-15 13:39 ` Helge Hafting
2005-02-16 9:45 ` Clemens Schwaighofer
2005-02-16 10:21 ` Catalin Marinas
2005-02-16 13:40 ` Schwaighofer Clemens
2005-02-16 15:39 ` d.c
2005-02-17 0:14 ` Clemens Schwaighofer
2005-02-16 15:43 ` Olivier Galibert
2005-02-16 17:01 ` Sean
2005-02-17 0:11 ` Clemens Schwaighofer
2005-02-17 4:57 ` Theodore Ts'o
2005-02-17 5:57 ` Sean
2005-02-17 6:22 ` d.c
2005-02-17 6:49 ` Sean
2005-02-17 16:34 ` Lee Revell
2005-02-17 16:40 ` Sean
2005-02-17 16:55 ` Chris Friesen
2005-02-17 16:58 ` Sean
2005-02-17 20:52 ` Horst von Brand
2005-02-17 21:24 ` Sean
2005-02-18 1:42 ` Horst von Brand
2005-02-18 7:52 ` Sean
2005-02-18 16:27 ` Theodore Ts'o
2005-02-18 18:34 ` Sean
2005-02-18 19:26 ` Dmitry Torokhov
2005-02-18 19:31 ` Sean
2005-02-18 19:46 ` John Stoffel
2005-02-18 19:49 ` Dmitry Torokhov
2005-02-18 19:57 ` Sean
2005-02-19 0:59 ` Horst von Brand
2005-02-18 20:45 ` d.c
2005-02-18 21:13 ` David S. Miller
2005-02-18 21:34 ` Anton Altaparmakov
2005-02-18 22:18 ` Vojtech Pavlik
2005-02-18 22:35 ` Dmitry Torokhov
2005-02-19 7:53 ` Anton Altaparmakov
2005-02-23 19:15 ` Bill Davidsen
2005-02-21 5:43 ` Miles Bader
2005-02-21 6:56 ` Dmitry Torokhov
2005-02-21 7:00 ` N/A Christof Dorner
2005-02-21 15:40 ` [BK] upgrade will be needed Kevin P. Fleming
2005-02-17 23:25 ` Ed Tomlinson
2005-02-17 23:32 ` Sean
2005-02-17 23:54 ` Lee Revell
2005-02-17 23:56 ` Sean
2005-02-18 4:00 ` Theodore Ts'o
2005-02-18 4:03 ` Clemens Schwaighofer
2005-02-18 7:26 ` Sean
2005-02-18 12:31 ` Ed Tomlinson
2005-02-18 0:29 ` Clemens Schwaighofer
2005-02-18 2:31 ` Horst von Brand
2005-02-17 7:55 ` Roland Kuhn
2005-02-17 8:09 ` Clemens Schwaighofer
2005-02-17 9:27 ` Roland Kuhn
2005-02-17 10:27 ` Sean
2005-02-18 0:32 ` Clemens Schwaighofer
2005-02-14 18:54 ` Juergen Stuber
2005-02-14 20:13 ` Matthew Dharm
2005-02-14 20:17 ` Matthias Andree
2005-02-15 17:26 ` Juergen Stuber
2005-02-15 2:46 ` Tristan Wibberley
2005-02-15 13:24 ` Tristan Wibberley
2005-02-14 19:44 ` Matthias Andree
2005-02-14 20:05 ` Larry McVoy
2005-02-14 22:24 ` Gerold Jury
2005-02-14 22:57 ` Larry McVoy
2005-02-14 23:23 ` David Lang
2005-02-15 0:03 ` Larry McVoy
2005-02-15 1:23 ` David Lang
2005-02-15 16:34 ` Olivier Galibert
2005-02-14 23:29 ` Gerold Jury
2005-02-15 13:55 ` Alexandre Oliva
2005-02-15 16:57 ` Alan Cox
2005-02-17 0:00 ` Pavel Machek
2005-02-17 1:41 ` Alexandre Oliva
2005-02-17 9:46 ` Geert Uytterhoeven
2005-02-17 12:36 ` Pavel Machek
2005-02-17 14:07 ` Citizen Number 24655
2005-02-17 15:05 ` Theodore Ts'o
2005-02-17 16:27 ` Florian Weimer
2005-02-18 8:32 ` Andrea Arcangeli
2005-02-15 2:13 ` Ed Tomlinson
2005-02-15 2:40 ` Larry McVoy
2005-02-15 3:04 ` Paul Jackson
2005-02-15 12:41 ` Ed Tomlinson
2005-02-15 13:56 ` Alexandre Oliva
2005-02-15 14:55 ` Anton Ertl
2005-02-14 13:13 ` Mws
2005-02-14 15:03 ` Steven Rostedt
2005-02-14 16:00 ` Larry McVoy
2005-02-14 17:21 ` Marcin Dalecki
2005-02-15 12:13 ` kernel
2005-02-16 23:46 ` Pavel Machek
2005-02-18 8:56 ` Andrea Arcangeli
2005-02-18 2:05 ` Patrick McFarland
2005-02-18 2:24 ` [darcs-users] " Tupshin Harper
2005-02-18 2:33 ` Sean
2005-02-18 9:09 ` Andrea Arcangeli
2005-02-18 10:58 ` Tomasz Zielonka
2005-02-18 11:53 ` Erik Bågfors
2005-02-18 12:50 ` Andrea Arcangeli
2005-02-19 9:10 ` Patrick McFarland
2005-02-19 10:17 ` Sean
2005-02-19 16:42 ` Andrea Arcangeli
2005-02-19 17:15 ` David Roundy
2005-02-19 17:53 ` Andrea Arcangeli [this message]
2005-02-21 12:41 ` David Roundy
2005-02-21 18:33 ` David Brown
2005-02-21 13:48 ` Patrick McFarland
2005-02-20 10:36 ` Ralph Corderoy
2005-02-18 17:50 ` Dustin Sallings
2005-02-21 5:39 ` Miles Bader
2005-02-21 15:53 ` Andrea Arcangeli
2005-02-21 19:45 ` zander
2005-02-21 20:27 ` Horst von Brand
2005-02-23 17:14 ` Andreas Gruenbacher
[not found] <fa.g6qsmpt.131ka8d@ifi.uio.no>
[not found] ` <fa.i74fd0g.1j44204@ifi.uio.no>
2005-02-22 1:04 ` walt
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20050219175348.GE7247@opteron.random \
--to=andrea@suse.de \
--cc=darcs-users@darcs.net \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox