public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Padraig Brady <padraig@antefacto.com>
To: William Stearns <wstearns@pobox.com>
Cc: Larry McVoy <lm@bitmover.com>,
	ML-linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: [bk patch] Make cardbus compile in -pre4
Date: Mon, 11 Feb 2002 17:30:45 +0000	[thread overview]
Message-ID: <3C67FFC5.7020701@antefacto.com> (raw)
In-Reply-To: <Pine.LNX.4.33.0202092358500.1868-100000@sparrow.websense.net>

William Stearns wrote:
> Good day, Larry,
> 
> On Sat, 9 Feb 2002, Larry McVoy wrote:
> 
> 
>>On Sat, Feb 09, 2002 at 01:01:34PM -0800, David Lang wrote:
>>
>>>do you have a script that can go back after the fact and see what can be
>>>hardlinked?
>>>
>>>I'm thinking specififcly of the type of thing that will be happening to
>>>your server where you have a bunch of people putting in a clone of one
>>>tree who will probably not be doing a clone -l to set it up, but who could
>>>have and you want to clean up after the fact (and perhapse again on a
>>>periodic basis, becouse after all of these trees apply a changeset from
>>>linus they will all have changed (breaking the origional hardlinks) but
>>>will still be duplicates of each other.
>>>
>>We don't, but we can, and we should.  "bk relink tree1 tree2" seems like 
>>the right interface.
>>
>>Right now we aren't too worried about the disk space, the data is sitting 
>>on a pair of 40GB drives and we're running the trees in gzip mode, so they
>>are 75MB each.  But yes, it's a good idea, we should do it, and probably
>>should figure out some way to make it automatic.  I'll add it to the
>>(ever growing) list, thanks.
>>
> 
> 	Larry, I'll save you the time.
> 	"freedups -a -d /some/dir [/other/dirs]" will look for identical
> files (the -d requires dates to be equal as well as the content) and
> hardlink them.  It's not terribly efficient, but works marvelously well.  
> Run it from cron once a week or so, perhaps?
> 	http://www.stearns.org/freedups/
> 	Cheers,
> 	- Bill


Not terribly efficient? That's a bit of an understatement :-)
The findup component of fslint is MUCH quicker, and it's
also written in bash. A quick test against two 2.4.17 trees gives:

1m36s for  ./findup /usr/src/linux[12] | ./fstool/mergeDup
18m17s for ./freedups -a /usr/src/linux[12]

Note mergeDup was a quick hack and took 1m30s of findup's time!
I'm going to rewrite it in python ASAP to help with this.

You can download the current version of fslint from
http://developers.antefacto.net/~padraig/fslint.tar.gz

Padraig.


  reply	other threads:[~2002-02-11 17:37 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2002-02-09  2:25 [bk patch] Make cardbus compile in -pre4 Patrick Mochel
2002-02-09  3:39 ` Andreas Dilger
2002-02-09  4:02   ` Jeff Garzik
2002-02-09  7:29     ` Andreas Dilger
2002-02-09  7:41       ` Larry McVoy
2002-02-10  2:39       ` Jeff Garzik
2002-02-10  3:52       ` Linus Torvalds
2002-02-10  7:47       ` Andreas Dilger
2002-02-10 20:57         ` Linus Torvalds
2002-02-11 18:38           ` Andreas Dilger
2002-02-09  5:12   ` Larry McVoy
2002-02-09  5:32     ` Andrew Morton
2002-02-09  9:36       ` Rob Landley
2002-02-09  9:57         ` Momchil Velikov
2002-02-09 10:01           ` Alexander Viro
2002-02-09 18:09             ` Rob Landley
2002-02-09 15:08           ` Daniel Phillips
2002-02-10  4:07             ` Linus Torvalds
2002-02-09 10:14     ` David Lang
2002-02-09 15:54       ` Larry McVoy
2002-02-09 16:50         ` Tom Rini
2002-02-09 17:05           ` Larry McVoy
2002-02-09 21:01             ` David Lang
2002-02-09 21:41               ` Larry McVoy
2002-02-09 23:36                 ` Andreas Dilger
2002-02-09 23:45                   ` Tom Rini
2002-02-10  0:42                     ` Andreas Dilger
2002-02-09 23:52                   ` Larry McVoy
2002-02-10  4:13                     ` Linus Torvalds
2002-02-10 18:02                     ` Tom Rini
2002-02-10  5:25                 ` William Stearns
2002-02-11 17:30                   ` Padraig Brady [this message]
2002-02-13 11:59                     ` Padraig Brady
2002-02-09  9:27   ` pull vs push (was Re: [bk patch] Make cardbus compile in -pre4) Rob Landley
2002-02-09 10:08     ` Andreas Dilger
2002-02-09 18:12       ` Stelian Pop
2002-02-09 20:59         ` Linus Torvalds
2002-02-09 20:12           ` Stelian Pop
2002-02-09 20:26             ` Larry McVoy
2002-02-09 20:51               ` Stelian Pop
2002-02-09 23:45                 ` Jeff Garzik
2002-02-09 23:49                 ` Larry McVoy
2002-02-09 20:57               ` Pau Aliagas
2002-02-09 21:07                 ` David Lang
2002-02-09 21:13                   ` Pau Aliagas
2002-02-09 21:45               ` Rob Landley
2002-02-10  0:19               ` Andreas Dilger
2002-02-10  0:36               ` Herbert Xu
2002-02-10  0:54                 ` ssh primer (was Re: pull vs push (was Re: [bk patch] Make cardbus compile in -pre4)) Jeff Garzik
2002-02-10  0:59                   ` Herbert Xu
2002-02-10  1:24                     ` Jeff Garzik
2002-02-10  8:13                       ` Herbert Xu
2002-02-13 17:13                       ` Aaron Lehmann
2002-02-14  0:22                         ` Rob Landley
2002-02-14  6:57                           ` Aaron Lehmann
2002-02-14 11:00                           ` Harald Arnesen
2002-02-10  0:59                   ` Ben Pfaff
2002-02-10  1:14                   ` David Lang
2002-02-10  1:22                     ` ssh primer (was Re: pull vs push (was Re: [bk patch] Make cardbuscompile " Jeff Garzik
2002-02-10  2:46               ` pull vs push (was Re: [bk patch] Make cardbus compile in -pre4) Alan Cox
2002-02-11 11:51     ` Pavel Machek
2002-02-11 18:42       ` John Alvord
2002-02-09 11:44 ` [bk patch] Make cardbus compile in -pre4 Peter Osterlund
  -- strict thread matches above, loose matches on Subject: below --
2002-02-10 20:12 Chris Adams

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3C67FFC5.7020701@antefacto.com \
    --to=padraig@antefacto.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=lm@bitmover.com \
    --cc=wstearns@pobox.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox