All of lore.kernel.org
 help / color / mirror / Atom feed
From: Padraig Brady <padraig@antefacto.com>
To: William Stearns <wstearns@pobox.com>
Cc: Larry McVoy <lm@bitmover.com>,
	ML-linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: [bk patch] Make cardbus compile in -pre4
Date: Mon, 11 Feb 2002 17:30:45 +0000	[thread overview]
Message-ID: <3C67FFC5.7020701@antefacto.com> (raw)
In-Reply-To: <Pine.LNX.4.33.0202092358500.1868-100000@sparrow.websense.net>

William Stearns wrote:
> Good day, Larry,
> 
> On Sat, 9 Feb 2002, Larry McVoy wrote:
> 
> 
>>On Sat, Feb 09, 2002 at 01:01:34PM -0800, David Lang wrote:
>>
>>>do you have a script that can go back after the fact and see what can be
>>>hardlinked?
>>>
>>>I'm thinking specififcly of the type of thing that will be happening to
>>>your server where you have a bunch of people putting in a clone of one
>>>tree who will probably not be doing a clone -l to set it up, but who could
>>>have and you want to clean up after the fact (and perhapse again on a
>>>periodic basis, becouse after all of these trees apply a changeset from
>>>linus they will all have changed (breaking the origional hardlinks) but
>>>will still be duplicates of each other.
>>>
>>We don't, but we can, and we should.  "bk relink tree1 tree2" seems like 
>>the right interface.
>>
>>Right now we aren't too worried about the disk space, the data is sitting 
>>on a pair of 40GB drives and we're running the trees in gzip mode, so they
>>are 75MB each.  But yes, it's a good idea, we should do it, and probably
>>should figure out some way to make it automatic.  I'll add it to the
>>(ever growing) list, thanks.
>>
> 
> 	Larry, I'll save you the time.
> 	"freedups -a -d /some/dir [/other/dirs]" will look for identical
> files (the -d requires dates to be equal as well as the content) and
> hardlink them.  It's not terribly efficient, but works marvelously well.  
> Run it from cron once a week or so, perhaps?
> 	http://www.stearns.org/freedups/
> 	Cheers,
> 	- Bill


Not terribly efficient? That's a bit of an understatement :-)
The findup component of fslint is MUCH quicker, and it's
also written in bash. A quick test against two 2.4.17 trees gives:

1m36s for  ./findup /usr/src/linux[12] | ./fstool/mergeDup
18m17s for ./freedups -a /usr/src/linux[12]

Note mergeDup was a quick hack and took 1m30s of findup's time!
I'm going to rewrite it in python ASAP to help with this.

You can download the current version of fslint from
http://developers.antefacto.net/~padraig/fslint.tar.gz

Padraig.


  reply	other threads:[~2002-02-11 17:37 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-02-09  2:25 [bk patch] Make cardbus compile in -pre4 Patrick Mochel
2002-02-09  3:39 ` Andreas Dilger
2002-02-09  4:02   ` Jeff Garzik
2002-02-09  7:29     ` Andreas Dilger
2002-02-09  7:41       ` Larry McVoy
2002-02-10  2:39       ` Jeff Garzik
2002-02-10  3:52       ` Linus Torvalds
2002-02-10  7:47       ` Andreas Dilger
2002-02-10 20:57         ` Linus Torvalds
2002-02-11 18:38           ` Andreas Dilger
2002-02-09  5:12   ` Larry McVoy
2002-02-09  5:32     ` Andrew Morton
2002-02-09  9:36       ` Rob Landley
2002-02-09  9:57         ` Momchil Velikov
2002-02-09 10:01           ` Alexander Viro
2002-02-09 18:09             ` Rob Landley
2002-02-09 15:08           ` Daniel Phillips
2002-02-10  4:07             ` Linus Torvalds
2002-02-09 10:14     ` David Lang
2002-02-09 15:54       ` Larry McVoy
2002-02-09 16:50         ` Tom Rini
2002-02-09 17:05           ` Larry McVoy
2002-02-09 21:01             ` David Lang
2002-02-09 21:41               ` Larry McVoy
2002-02-09 23:36                 ` Andreas Dilger
2002-02-09 23:45                   ` Tom Rini
2002-02-10  0:42                     ` Andreas Dilger
2002-02-09 23:52                   ` Larry McVoy
2002-02-10  4:13                     ` Linus Torvalds
2002-02-10 18:02                     ` Tom Rini
2002-02-10  5:25                 ` William Stearns
2002-02-11 17:30                   ` Padraig Brady [this message]
2002-02-13 11:59                     ` Padraig Brady
2002-02-09  9:27   ` pull vs push (was Re: [bk patch] Make cardbus compile in -pre4) Rob Landley
2002-02-09 10:08     ` Andreas Dilger
2002-02-09 18:12       ` Stelian Pop
2002-02-09 20:59         ` Linus Torvalds
2002-02-09 20:12           ` Stelian Pop
2002-02-09 20:26             ` Larry McVoy
2002-02-09 20:51               ` Stelian Pop
2002-02-09 23:45                 ` Jeff Garzik
2002-02-09 23:49                 ` Larry McVoy
2002-02-09 20:57               ` Pau Aliagas
2002-02-09 21:07                 ` David Lang
2002-02-09 21:13                   ` Pau Aliagas
2002-02-09 21:45               ` Rob Landley
2002-02-10  0:19               ` Andreas Dilger
2002-02-10  0:36               ` Herbert Xu
2002-02-10  0:54                 ` ssh primer (was Re: pull vs push (was Re: [bk patch] Make cardbus compile in -pre4)) Jeff Garzik
2002-02-10  0:59                   ` Herbert Xu
2002-02-10  1:24                     ` Jeff Garzik
2002-02-10  8:13                       ` Herbert Xu
2002-02-13 17:13                       ` Aaron Lehmann
2002-02-14  0:22                         ` Rob Landley
2002-02-14  6:57                           ` Aaron Lehmann
2002-02-14 11:00                           ` Harald Arnesen
2002-02-10  0:59                   ` Ben Pfaff
2002-02-10  1:14                   ` David Lang
2002-02-10  1:22                     ` ssh primer (was Re: pull vs push (was Re: [bk patch] Make cardbuscompile " Jeff Garzik
2002-02-10  2:46               ` pull vs push (was Re: [bk patch] Make cardbus compile in -pre4) Alan Cox
2002-02-11 11:51     ` Pavel Machek
2002-02-11 18:42       ` John Alvord
2002-02-09 11:44 ` [bk patch] Make cardbus compile in -pre4 Peter Osterlund
  -- strict thread matches above, loose matches on Subject: below --
2002-02-10 20:12 Chris Adams

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3C67FFC5.7020701@antefacto.com \
    --to=padraig@antefacto.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=lm@bitmover.com \
    --cc=wstearns@pobox.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.