All of lore.kernel.org
 help / color / mirror / Atom feed
* Congratulations! we have got hash function screwed up
@ 2004-12-28 22:12 Lehmann 
  2004-12-29 18:55 ` Stefan Traby
  2005-01-06 12:45 ` Alex Zarochentsev
  0 siblings, 2 replies; 72+ messages in thread
From: Lehmann  @ 2004-12-28 22:12 UTC (permalink / raw)
  To: reiserfs-list; +Cc: stefan

Hi!

When trying to upgrade or reinstall the xfonts-75dpi,
xfonts-75dpi-transcoded or 100dpi & transcoded debian packages on my
2.6.10-rc3 amd64 reiserfsv3 host, I get the following errors:

   dpkg: error processing
   /var/cache/apt/archives/xfonts-75dpi_4.3.0.dfsg.1-10_all.deb (--unpack):
    unable to make backup link of
    `./usr/X11R6/lib/X11/fonts/75dpi/lutBS19-ISO8859-1.pcf.gz' before
    installing new version: Device or resource busy
    dpkg-deb: subprocess paste killed by signal (Broken pipe)
   Preparing to replace xfonts-75dpi-transcoded 4.3.0.dfsg.1-10 (using
         .../xfonts-75dpi-transcoded_4.3.0.dfsg.1-10_all.deb) ...
   dpkg: error processing /var/cache/apt/archives/xfonts-75dpi-transcoded_4.3.0.dfsg.1-10_all.deb (--unpack):
    unable to make backup link of `./usr/X11R6/lib/X11/fonts/75dpi/lutBS19-ISO8859-10.pcf.gz' before installing new version: Device or resource busy
   dpkg-deb: subprocess paste killed by signal (Broken pipe)
   Errors were encountered while processing:
    /var/cache/apt/archives/xfonts-75dpi_4.3.0.dfsg.1-10_all.deb
    /var/cache/apt/archives/xfonts-75dpi-transcoded_4.3.0.dfsg.1-10_all.deb

And at the same time, I get this in my kernel log:

   ReiserFS: hdg2: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up

Sure sounds like a filesystem bug to me. Is this 2.6.10-rc3-specific or a
generic bug in handling hash collisions?

Deleteing the fonts and installing the package works, but the next upgrade
makes the error appear again.

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-28 22:12 Congratulations! we have got hash function screwed up Lehmann 
@ 2004-12-29 18:55 ` Stefan Traby
  2004-12-29 21:04   ` Lehmann 
  2004-12-29 21:05   ` Hans Reiser
  2005-01-06 12:45 ` Alex Zarochentsev
  1 sibling, 2 replies; 72+ messages in thread
From: Stefan Traby @ 2004-12-29 18:55 UTC (permalink / raw)
  To:  Marc A. Lehmann ; +Cc: reiserfs-list, stefan

On Tue, Dec 28, 2004 at 11:12:18PM +0100,  Marc A. Lehmann  wrote:
 
>    ReiserFS: hdg2: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up
> 
> Sure sounds like a filesystem bug to me. Is this 2.6.10-rc3-specific or a
> generic bug in handling hash collisions?

I can confirm that with 2.6.10.
It is independent of hash function (r5, rupasov, tea) used.

Here a script that works independent of hash (feel free to forward it to
bugtraq - it's a showstopper bug):

#! /bin/sh
# reiserfs v3 denial of "00000000" creation attack (hash collisions)
# insider: EHASHCOLLISION EAGAIN!
#
ATTACK=00000000
R5="
000003435823 000022067556 000040799289 000047672563 000079051844 000097783577 
000119162858 000125037032 000137894590 000156516313 000169273871 000175148046 
000193879879 000209384885 000228006608 000246738340 000252611615 000259495899 
000278117621 000296849354 000305480087 000311354361 000318228635 000342833642 
000361465375 000374212923 000392944656 000401576389 000414323937 000439929944 
000445803118 000464434950 000470309125 000483066683 000504545964 000523177697 
000560530152 000567404427 000573288700 000598893708 000600641166 000607515440 
000626147173 000632020447 000657626454 000676258187 000689005735 000729116749 
000747848481 000753721756 000779227762 000797959495 000806590218 000819338776 
000843943783 000862575506 000888080512 000921308252 000934065800 000946913259 
000952797533 000971419266 000984176814 001008625581 001027257304 001033130588 
001051862310 001058736595 001070494043 001077368318 001117479331 001136100064 
001148958612 001154831897 001160706070 001186211078 001213574633 001226322091 
001257801372 001263685647 001289190653 001295064928 001303796660 001316544109 
001322418393 001335175941 001366655122 001385286955 001406766136 001425397969 
001431271143 001462750424 001481382157 001502861438 001509735712 001521493170 
001528367445 001534240719 001552972451 001559846726 001578478459 001611705198 
001618589472 001661816201 001687321209 001714684774 001727432222 001758911503 
001764795788 001777543236 001783417510 001823528524 001849033530 001867765263 
001873639538 001892270270 001907876277 001932381284 001945129832 001951003007 
001963860565 001976609013 001982492298 002012815329 002019699603 002031447061 
002038320336 002050078894 002062926342 002081558075"

RUPASOV="
000000000000 000016777216 000033554432 000050331648 000067108864 000083886080 
000100663296 000117440512 000134217728 000150994944 000167772160 000184549376 
000201326592 000218103808 000234881024 000251658240 000268435456 000285212672 
000301989888 000318767104 000335544320 000352321536 000369098752 000385875968 
000402653184 000419430400 000436207616 000452984832 000469762048 000486539264 
000503316480 000520093696 000536870912 000553648128 000570425344 000587202560 
000603979776 000620756992 000637534208 000654311424 000671088640 000687865856 
000704643072 000721420288 000738197504 000754974720 000771751936 000788529152 
000805306368 000822083584 000838860800 000855638016 000872415232 000889192448 
000905969664 000922746880 000939524096 000956301312 000973078528 000989855744 
001006632960 001023410176 001040187392 001056964608 001073741824 001090519040 
001107296256 001124073472 001140850688 001157627904 001174405120 001191182336 
001207959552 001224736768 001241513984 001258291200 001275068416 001291845632 
001308622848 001325400064 001342177280 001358954496 001375731712 001392508928 
001409286144 001426063360 001442840576 001459617792 001476395008 001493172224 
001509949440 001526726656 001543503872 001560281088 001577058304 001593835520 
001610612736 001627389952 001644167168 001660944384 001677721600 001694498816 
001711276032 001728053248 001744830464 001761607680 001778384896 001795162112 
001811939328 001828716544 001845493760 001862270976 001879048192 001895825408 
001912602624 001929379840 001946157056 001962934272 001979711488 001996488704 
002013265920 002030043136 002046820352 002063597568 002080374784 002097152000 
002113929216 002130706432 002147483648 002164260864"

TEA="
000004464160 000041804440 000080240100 000091329029 000104181015 000113725885 
000126527488 000140392446 000158910938 000228997445 000230956744 000265118409 
000278488948 000294393023 000295253722 000300066283 000302103786 000330187358 
000345002932 000351581026 000363320013 000366148241 000398298703 000411084407 
000430270876 000450889104 000457353842 000459620112 000464658163 000465039241 
000472966466 000479773493 000485638992 000490029225 000519300138 000523222490 
000543871739 000550161091 000614863063 000628859470 000658101403 000705881242 
000707428465 000709541412 000710835913 000712765852 000747815906 000751391777 
000758206682 000759473821 000761493018 000807141251 000819925766 000822342439 
000844968698 000846939644 000856679997 000862332598 000897273990 000903164600 
000959685453 000966591643 000975714799 001026819859 001030872126 001052008464 
001101513177 001111878931 001114914486 001126417564 001134342742 001145636506 
001156983946 001160376993 001181076193 001187045552 001229906330 001237307675 
001240018281 001248584794 001258422117 001295880517 001329389368 001334233160 
001350505390 001377143821 001393244657 001501373481 001530260023 001544062865 
001610909439 001642156853 001659354644 001663482837 001665935205 001709618183 
001759947280 001787829412 001793154973 001822800512 001896514533 001927905568 
001943169212 001959135151 001971173446 001990765252 001994429464 002029003899 
002067143236 002083825957 002092102503 002097735725 002117257216 002119310814 
002135908515 002158320625 002167193043 002172339658 002184965785 002188948362 
002206033194 002206592488 002207146649 002209003597 002219996827 002235862165 
002237358329 002267747802 002273109830 002284855435"

for i in $RUPASOV $R5 $TEA ; do
  touch $i
done

touch $ATTACK
if [ \! -f $ATTACK ] ; then
  echo "FATAL: can't create $ATTACK"
fi

rm -f $RUPASOV $R5 $TEA 


Dec 28 23:38:24 kotzmaster kernel: ReiserFS: loop0: Using rupasov hash to sort names
Dec 29 00:38:59 kotzmaster kernel: ReiserFS: loop0: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up

Dec 29 00:40:17 kotzmaster kernel: ReiserFS: loop0: Using tea hash to sort names
Dec 29 00:40:22 kotzmaster kernel: ReiserFS: loop0: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up

Dec 29 00:41:16 kotzmaster kernel: ReiserFS: loop0: Using r5 hash to sort names
Dec 29 00:41:23 kotzmaster kernel: ReiserFS: loop0: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up


--

  ciao - 
    Stefan

"          GNU's Not Unix          --            IIS Isn't Secure          "

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-29 18:55 ` Stefan Traby
@ 2004-12-29 21:04   ` Lehmann 
  2004-12-29 21:05   ` Hans Reiser
  1 sibling, 0 replies; 72+ messages in thread
From: Lehmann  @ 2004-12-29 21:04 UTC (permalink / raw)
  To: Stefan Traby; +Cc: reiserfs-list

On Wed, Dec 29, 2004 at 07:55:29PM +0100, Stefan Traby <stefan@hello-penguin.com> wrote:
> On Tue, Dec 28, 2004 at 11:12:18PM +0100,  Marc A. Lehmann  wrote:
>  
> >    ReiserFS: hdg2: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up
> > 
> > Sure sounds like a filesystem bug to me. Is this 2.6.10-rc3-specific or a
> > generic bug in handling hash collisions?
> 
> I can confirm that with 2.6.10.
> It is independent of hash function (r5, rupasov, tea) used.

Interesting, I would have hoped it's not so easy to generate
collisions. Now that a debian package creates collisions, this issue has
become very real.

Another note: It seems that the error returned is wrong. I would expect
ENOSPC if reiserfs runs out of (key-)space, not EBUSY or whatever it
returns.

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-29 18:55 ` Stefan Traby
  2004-12-29 21:04   ` Lehmann 
@ 2004-12-29 21:05   ` Hans Reiser
  2004-12-29 21:43     ` Lehmann 
  1 sibling, 1 reply; 72+ messages in thread
From: Hans Reiser @ 2004-12-29 21:05 UTC (permalink / raw)
  To: Stefan Traby; +Cc: Marc A. Lehmann, reiserfs-list

Stefan Traby wrote:

>
>
>Here a script that works independent of hash (feel free to forward it to
>bugtraq - it's a showstopper bug):
>  
>
It is not independent of hash, it is hardcoded to be hash specific.  It 
is not a showstopper bug --- almost nobody cared about it for the last 5 
years.  If you don't accept that quality of service condition on 
filename creation, use reiser4 or ext3. 

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-29 21:05   ` Hans Reiser
@ 2004-12-29 21:43     ` Lehmann 
  2004-12-29 21:46       ` Christian Iversen
  2004-12-30  2:05       ` Hans Reiser
  0 siblings, 2 replies; 72+ messages in thread
From: Lehmann  @ 2004-12-29 21:43 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Stefan Traby, reiserfs-list

On Wed, Dec 29, 2004 at 01:05:38PM -0800, Hans Reiser <reiser@namesys.com> wrote:
> Stefan Traby wrote:
> 
> >
> >
> >Here a script that works independent of hash (feel free to forward it to
> >bugtraq - it's a showstopper bug):
> > 
> >
> is not a showstopper bug

If it keeps debian from being usable on reiserfs (mind you, xfonts-75 and
xfonts-100 are not unimportant packages), I'd call this a showstopper
indeed *g*.

> --- almost nobody cared about it for the last 5 
> years.

This is a lame excuse for a bug - after all, you promoted reiserfs of being
capable of storing many files in one directory instead of having to rely on
directory hierarchies for e.g. squid and other apps. But exactly that is not
possible with reiserfs, as too many files in one directory == collisions.

Also, it's a lie that nobody cared about this, after all, there ahd been
earlier reports.

And last not least, most apps do not create many files in one directory by
default, for compatibility with other filesystems, where this is too slow.

> If you don't accept that quality of service condition on filename
> creation,

Again, this is a lame excuse for a bug. First you declare some features on
your filesystem, later, when it turns out that it isn't being delivered,
you act as if this were a known condition.

(Even if it were ok to fail file creation, the error generated is still
wrong. It is a bug, no matter how you try to twist it).

> use reiser4 or ext3. 

reiser4, is, of course, far from being stable enough for such uses still.

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-29 21:43     ` Lehmann 
@ 2004-12-29 21:46       ` Christian Iversen
  2004-12-29 22:27         ` Lehmann 
  2004-12-30  2:05       ` Hans Reiser
  1 sibling, 1 reply; 72+ messages in thread
From: Christian Iversen @ 2004-12-29 21:46 UTC (permalink / raw)
  To: reiserfs-list

On Wednesday 29 December 2004 22:43, pcg@goof.com ( Marc) (A.) (Lehmann ) 
wrote:
> On Wed, Dec 29, 2004 at 01:05:38PM -0800, Hans Reiser <reiser@namesys.com> 
wrote:
> > Stefan Traby wrote:
> > >Here a script that works independent of hash (feel free to forward it to
> > >bugtraq - it's a showstopper bug):
> >
> > is not a showstopper bug
>
> If it keeps debian from being usable on reiserfs (mind you, xfonts-75 and
> xfonts-100 are not unimportant packages), I'd call this a showstopper
> indeed *g*.

Under what conditions does this occur? I have 5 installs of debian linux on 
reiserfs here at home, and I have never had such problems. Is it only in 
directories with thousands of other files?

-- 
Regards,
Christian Iversen

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-29 21:46       ` Christian Iversen
@ 2004-12-29 22:27         ` Lehmann 
  0 siblings, 0 replies; 72+ messages in thread
From: Lehmann  @ 2004-12-29 22:27 UTC (permalink / raw)
  To: Christian Iversen; +Cc: reiserfs-list

On Wed, Dec 29, 2004 at 10:46:46PM +0100, Christian Iversen <chrivers@iversen-net.dk> wrote:
> > On Wed, Dec 29, 2004 at 01:05:38PM -0800, Hans Reiser <reiser@namesys.com> 
> wrote:
> > > Stefan Traby wrote:
> > > >Here a script that works independent of hash (feel free to forward it to
> > > >bugtraq - it's a showstopper bug):
> > >
> > > is not a showstopper bug
> >
> > If it keeps debian from being usable on reiserfs (mind you, xfonts-75 and
> > xfonts-100 are not unimportant packages), I'd call this a showstopper
> > indeed *g*.
> 
> Under what conditions does this occur?

Under the conditions that I already wrote about: when upgrading an
existing xfonts-75dpi package. Installing works fine, upgrading does not,
presumably because dpkg creates a backup copy of every file upgraded
first, which then exceeds some internal reiserfs limit.

> I have 5 installs of debian linux on 
> reiserfs here at home, and I have never had such problems. Is it only in 
> directories with thousands of other files?

As I understands the bug, it happens when too many filenames in the same
directory happen to hash to the same file - reiserfs requires the hash of
filenames to be "unqiue enough", otherwise it will not be able to create
more files with the same hashed name.

As the examples show, getting collisions is pretty straightforward and
easy, even with the tea hash (I can faintly remember hans reiser claiming
that this bug has been solved some years ago, and indeed this is the first
time it really bites me, albeit with a very real example).

As such, reiserfs v3 is not suitable for server operations, where such
irregular behaviour simply must not occur - consider this happening when
installing a kernel package, leaving your system in a non-bootable state
or so.

My specific case might depend on other packages - as I maintain some
i18n'ed software I have lots of extra font packages installed, although
I doubt many of them will end up in the 75dpi directory, but it might
be that my 75dpi and 100dpi dirs might be somewhat crowded - they both
contain 1888 files with similar names (and the standard hash, r5, is very
susceptible to similar names leading to similar hashes).

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-29 21:43     ` Lehmann 
  2004-12-29 21:46       ` Christian Iversen
@ 2004-12-30  2:05       ` Hans Reiser
  2004-12-30 10:22         ` Matthias Andree
  2004-12-30 17:02         ` Lehmann 
  1 sibling, 2 replies; 72+ messages in thread
From: Hans Reiser @ 2004-12-30  2:05 UTC (permalink / raw)
  To: reiserfs-list; +Cc: Stefan Traby

pcg( Marc)@goof(A.).(Lehmann )com wrote:

>
>
>Again, this is a lame excuse for a bug. First you declare some features on
>your filesystem, later, when it turns out that it isn't being delivered,
>you act as if this were a known condition.
>  
>
Well this is true, you are right.  Reiser4 is the fix though.

>(Even if it were ok to fail file creation, the error generated is still
>wrong. It is a bug, no matter how you try to twist it).
>  
>
Blame Alan Cox for that, he changed it from -EHASHCOLLISION (or some 
such error I invented, I forget) over my objections.


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30  2:05       ` Hans Reiser
@ 2004-12-30 10:22         ` Matthias Andree
  2004-12-30 17:02         ` Lehmann 
  1 sibling, 0 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 10:22 UTC (permalink / raw)
  To: Hans Reiser; +Cc: reiserfs-list, Stefan Traby

Hans Reiser <reiser@namesys.com> writes:

>>Again, this is a lame excuse for a bug. First you declare some features on
>>your filesystem, later, when it turns out that it isn't being delivered,
>>you act as if this were a known condition.
>>
> Well this is true, you are right.  Reiser4 is the fix though.

No, it isn't. Reiser4 is an alternative beast. Or will it transparently
"fix" the collision problem in a 3.5 or 3.6 file system, in a way that
is backwards compatible with 3.6 drivers? If not, please fix reiser3.6.

Given that Reiser4 isn't "proven" yet in the field (for that, it would
have to be used as the default file system by at least one major
distributor for at least a year), it is certainly not an option for
servers _yet_.

A file system that intransparently (i. e. not inode count or block
count) refuses to create a new file doesn't belong on _my_ production
machines, which shall migrate away from reiserfs on the next suitable
occasion (such as upgrades). There's ext3fs, jfs, xfs, and in 2006 or
2007, we'll talk about reiser4 again. Yes, I am conservative WRT file
systems and storage.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* RE: Congratulations! we have got hash function screwed up
@ 2004-12-30 11:52 Yiannis Mavroukakis
  2004-12-30 12:40 ` Matthias Andree
  2004-12-30 17:09 ` Lehmann 
  0 siblings, 2 replies; 72+ messages in thread
From: Yiannis Mavroukakis @ 2004-12-30 11:52 UTC (permalink / raw)
  To: reiserfs-list

 
Hello Matthias,

Your "proven" reasoning sounds a bit strange to me..Microsoft (aka major
distributor at least in my books) had her filesystems "in the field" for
ages, does this prove any of them good (or bad for that matter)?
I don't think I'd wait for a distributor to shove reiser4 down my
throat, just because the distributor seems to trust it, so the logical
course would be for me to try it out. I'll grant you that I am not using
it on the mission critical server, because our hosting provider will not
support it (ext3 addicts..oh well) but I do have it on my development
server, that does house critical code and receives all kinds of
hammering from yours truly; And I use it at home.
I suppose my point is, filesystem testing and adoption belongs to the
masses be they your average Joe Linux user or a sysadmin who feels
confident enough in the filesystem's abilities to take the plunge. I run
reiser4, I'm happy with it, it is stable enough to carry out my *own*
activities. 

Happy holidays,

Yiannis.

-----Original Message-----
From: Matthias Andree [mailto:ma@dt.e-technik.uni-dortmund.de] 
Sent: 30 December 2004 10:23
To: Hans Reiser
Cc: reiserfs-list@namesys.com; Stefan Traby
Subject: Re: Congratulations! we have got hash function screwed up

Hans Reiser <reiser@namesys.com> writes:

>>Again, this is a lame excuse for a bug. First you declare some 
>>features on your filesystem, later, when it turns out that it isn't 
>>being delivered, you act as if this were a known condition.
>>
> Well this is true, you are right.  Reiser4 is the fix though.

No, it isn't. Reiser4 is an alternative beast. Or will it transparently
"fix" the collision problem in a 3.5 or 3.6 file system, in a way that
is backwards compatible with 3.6 drivers? If not, please fix reiser3.6.

Given that Reiser4 isn't "proven" yet in the field (for that, it would
have to be used as the default file system by at least one major
distributor for at least a year), it is certainly not an option for
servers _yet_.

A file system that intransparently (i. e. not inode count or block
count) refuses to create a new file doesn't belong on _my_ production
machines, which shall migrate away from reiserfs on the next suitable
occasion (such as upgrades). There's ext3fs, jfs, xfs, and in 2006 or
2007, we'll talk about reiser4 again. Yes, I am conservative WRT file
systems and storage.

--
Matthias Andree

________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The
service is powered by MessageLabs.

Note:__________________________________________________________________
This message is for the named person's use only. It may contain
confidential, proprietary or legally privileged information. No
confidentiality or privilege is waived or lost by any mistransmission.
If you receive this message in error, please immediately delete it and
all copies of it from your system, destroy any hard copies of it and
notify the sender. You must not, directly or indirectly, use, disclose,
distribute, print, or copy any part of this message if you are not the
intended recipient. Jaguar Freight Services and any of its subsidiaries
each reserve the right to monitor all e-mail communications through its
networks.
Any views expressed in this message are those of the individual sender,
except where the message states otherwise and the sender is authorized
to state them to be the views of any such entity.
________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The
service is powered by MessageLabs.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 11:52 Yiannis Mavroukakis
@ 2004-12-30 12:40 ` Matthias Andree
  2004-12-30 12:59   ` Cal
  2004-12-30 17:09 ` Lehmann 
  1 sibling, 1 reply; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 12:40 UTC (permalink / raw)
  To: Yiannis Mavroukakis; +Cc: reiserfs-list

"Yiannis Mavroukakis" <yiannis@jaguarfreight.com> writes:

> Your "proven" reasoning sounds a bit strange to me..Microsoft (aka
> major distributor at least in my books) had her filesystems "in the
> field" for ages, does this prove any of them good (or bad for that
> matter)?

My reasoning mentioned a /required/, but not a /sufficient/ criterion.

In other words: not before it is proven in the field will I consider it
for production use.

Remember the Linux 2.2 reiserfs 3.5 NFS woes?
Remember the early XFS-NFS woes?

These are all reasons to avoid a shiny new file system for serious work.

> I don't think I'd wait for a distributor to shove reiser4 down my
> throat, just because the distributor seems to trust it, so the logical
> course would be for me to try it out. I'll grant you that I am not
> using it on the mission critical server, because our hosting provider
> will not support it (ext3 addicts..oh well)

For practical recovery reasons (error on root FS after a crash), ext3fs
is easier to handle. You can fsck the (R/O) root partition just fine
(e2fsck then asks you to reboot right away); for reiserfs, you'll have
to boot into some emergency or rescue system...

> but I do have it on my development server, that does house critical
> code and receives all kinds of hammering from yours truly; And I use
> it at home.

I reformatted my last "at home" reiserfs an hour ago and unloaded the
reiserfs kernel module, as the way how Hans has responded to the error
report is inacceptable.

Anyone is free to choose the file system, and as the simple
demonstration code posted earlier shows a serious flaw in reiserfs,
Hans's response was boldfaced, I ditched reiserfs3. End of story.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 12:40 ` Matthias Andree
@ 2004-12-30 12:59   ` Cal
  2004-12-30 14:18     ` Matthias Andree
  0 siblings, 1 reply; 72+ messages in thread
From: Cal @ 2004-12-30 12:59 UTC (permalink / raw)
  To: Matthias Andree; +Cc: reiserfs-list

----------
and then at Thu, 30 Dec 2004 13:40:53 +0100, it was written ...
 ... 
 >Anyone is free to choose the file system, and as the simple
 >demonstration code posted earlier shows a serious flaw in reiserfs,
 >Hans's response was boldfaced, I ditched reiserfs3. End of story.
 >

Your policy and philosophy on file system selection are yours to enjoy as
you see fit, but the anger and angst ... ?   Phew!! 

cheers, Cal


^ permalink raw reply	[flat|nested] 72+ messages in thread

* RE: Congratulations! we have got hash function screwed up
@ 2004-12-30 13:24 Yiannis Mavroukakis
  2004-12-30 14:11 ` Matthias Andree
  0 siblings, 1 reply; 72+ messages in thread
From: Yiannis Mavroukakis @ 2004-12-30 13:24 UTC (permalink / raw)
  To: reiserfs-list

>My reasoning mentioned a /required/, but not a /sufficient/ criterion.
>In other words: not before it is proven in the field will I consider it
for production use.
>Remember the Linux 2.2 reiserfs 3.5 NFS woes?
>Remember the early XFS-NFS woes?
>These are all reasons to avoid a shiny new file system for serious
work.

I agree, but you're generalising, this is not xfs and reiser4 is not 3.5
;)
If you don't try out the shiny new filesystem yourself, how can you
possibly dismiss it based on the past failures
of other filesystems? 


>For practical recovery reasons (error on root FS after a crash), ext3fs
is easier to handle. You can fsck the (R/O) root partition just fine
(e2fsck then asks you to reboot right away); for reiserfs, you'll have
to boot into some emergency or rescue system...

No biggie for me, just have a removable media of some sort with your
running kernel and some basic tools.


Y.

Note:__________________________________________________________________
This message is for the named person's use only. It may contain
confidential, proprietary or legally privileged information. No
confidentiality or privilege is waived or lost by any mistransmission.
If you receive this message in error, please immediately delete it and
all copies of it from your system, destroy any hard copies of it and
notify the sender. You must not, directly or indirectly, use, disclose,
distribute, print, or copy any part of this message if you are not the
intended recipient. Jaguar Freight Services and any of its subsidiaries
each reserve the right to monitor all e-mail communications through its
networks.
Any views expressed in this message are those of the individual sender,
except where the message states otherwise and the sender is authorized
to state them to be the views of any such entity.
________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The
service is powered by MessageLabs.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 13:24 Yiannis Mavroukakis
@ 2004-12-30 14:11 ` Matthias Andree
  0 siblings, 0 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 14:11 UTC (permalink / raw)
  To: Yiannis Mavroukakis; +Cc: reiserfs-list

"Yiannis Mavroukakis" <yiannis@jaguarfreight.com> writes:

> I agree, but you're generalising, this is not xfs and reiser4 is not 3.5
> ;)
> If you don't try out the shiny new filesystem yourself, how can you
> possibly dismiss it based on the past failures
> of other filesystems? 

I doubt new software is bug-free. I don't expect NFS problems with
reiser4 though, these should be in the regression tests. :-)

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 12:59   ` Cal
@ 2004-12-30 14:18     ` Matthias Andree
  2004-12-30 16:40       ` Hans Reiser
  2004-12-30 17:07       ` Esben Stien
  0 siblings, 2 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 14:18 UTC (permalink / raw)
  To: hihone; +Cc: Matthias Andree, reiserfs-list

Cal <hihone@bigpond.net.au> writes:

> ----------
> and then at Thu, 30 Dec 2004 13:40:53 +0100, it was written ...
>  ... 
>  >Anyone is free to choose the file system, and as the simple
>  >demonstration code posted earlier shows a serious flaw in reiserfs,
>  >Hans's response was boldfaced, I ditched reiserfs3. End of story.
>  >
>
> Your policy and philosophy on file system selection are yours to enjoy as
> you see fit, but the anger and angst ... ?   Phew!! 

I have no interest to deal with systems that have known and reproducible
cases of failure that are nondeterministic in practical use.

And Marc's documentation showed this is a real-world problem, not an
ivory tower problem.

The reiserfs story is over for me. All private machines I deal with are
reiserfs-free as of a few hours ago.

It was just one bug too many, and it was handled unprofessionally,
unlike many bugs before which had been dealt with on short notice
usually, or at least accepted for looking into.

I'll phase reiser3 out on my work machines as I see fit.

I have seen too many bugs in reiserfs3.

I do believe reiserfs4 fixes some design flaws of reiser3, and when the
implementation issues are all shaken out in one or two years' time, it
may be a good file system and I will look at it - I trust the reiserfs
team can learn from their mistakes.

I hope they learn that THIS handling of the error was wrong.

"Who cares, not us for the past five years" is not a proper response to
a real-world problem.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 14:18     ` Matthias Andree
@ 2004-12-30 16:40       ` Hans Reiser
  2004-12-30 16:51         ` Matthias Andree
  2005-01-18 21:17         ` Grzegorz Jaśkiewicz
  2004-12-30 17:07       ` Esben Stien
  1 sibling, 2 replies; 72+ messages in thread
From: Hans Reiser @ 2004-12-30 16:40 UTC (permalink / raw)
  To: Matthias Andree; +Cc: hihone, reiserfs-list

Fixing hash collisions in V3 to do them the way V4 does them would 
create more bugs and user disruption than the current bug we have all 
lived with for 5 years until now.  If someone thinks it is a small 
change to fix it, send me a patch.  Better by far to fix bugs in V4, 
which is pretty stable these days.

Hans

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 16:40       ` Hans Reiser
@ 2004-12-30 16:51         ` Matthias Andree
  2005-01-18 21:17         ` Grzegorz Jaśkiewicz
  1 sibling, 0 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 16:51 UTC (permalink / raw)
  To: reiserfs-list

On Thu, 30 Dec 2004, Hans Reiser wrote:

> Fixing hash collisions in V3 to do them the way V4 does them would 
> create more bugs and user disruption than the current bug we have all 
> lived with for 5 years until now.  If someone thinks it is a small 
> change to fix it, send me a patch.  Better by far to fix bugs in V4, 
> which is pretty stable these days.

Better to fix a known bug than create a file system vacuum before V4 is
really stable.

Anyways, I don't care any more, I'm phasing out ReiserFS v3 and have no
plans to try V4 before 2006.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30  2:05       ` Hans Reiser
  2004-12-30 10:22         ` Matthias Andree
@ 2004-12-30 17:02         ` Lehmann 
  1 sibling, 0 replies; 72+ messages in thread
From: Lehmann  @ 2004-12-30 17:02 UTC (permalink / raw)
  To: Hans Reiser; +Cc: reiserfs-list, Stefan Traby

On Wed, Dec 29, 2004 at 06:05:59PM -0800, Hans Reiser <reiser@namesys.com> wrote:
> >
> >Again, this is a lame excuse for a bug. First you declare some features on
> >your filesystem, later, when it turns out that it isn't being delivered,
> >you act as if this were a known condition.
> > 
> >
> Well this is true, you are right.  Reiser4 is the fix though.

So that what happens to the filesystems develop once you have a new toy. Good
to know when planning my next server :)

> >(Even if it were ok to fail file creation, the error generated is still
> >wrong. It is a bug, no matter how you try to twist it).
> > 
> Blame Alan Cox for that, he changed it from -EHASHCOLLISION (or some 
> such error I invented, I forget) over my objections.

Blaming Cox for trying to fix your code and not getting it completely
right is not nice. After all, Cox also found that the error code is
inadequate. The point is that EBUSY is still bad, for open. ENOSPC is a
much better code, as it is a documented error code for open, whereas EBUSY
is not.

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 14:18     ` Matthias Andree
  2004-12-30 16:40       ` Hans Reiser
@ 2004-12-30 17:07       ` Esben Stien
  2004-12-30 17:15         ` Christian Iversen
                           ` (3 more replies)
  1 sibling, 4 replies; 72+ messages in thread
From: Esben Stien @ 2004-12-30 17:07 UTC (permalink / raw)
  To: Matthias Andree; +Cc: hihone, reiserfs-list

Matthias Andree <ma@dt.e-technik.uni-dortmund.de> writes:

> All private machines I deal with are reiserfs-free as of a few hours
> ago.

What do you use instead?

I really don't like that there is no undelete feature in reiserfs -
it's not planned for reiserfs-4 either. I see desperate users all the
time trying to get back what they mistakenly removed. It shouldn't be
hard with reiserfs either. There should be a selection of what files
to restore, so we can avoid the file merging problem with rebuilding
the tree from leaf nodes.

> I have seen too many bugs in reiserfs3.

Is there a list of these current issues?

> "Who cares, not us for the past five years" is not a proper response
> to a real-world problem.

I also reacted to this response. 

A flaw in the filesystem, in my opinion, is equivalent to the space
ship crashing and all crew members die.

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 11:52 Yiannis Mavroukakis
  2004-12-30 12:40 ` Matthias Andree
@ 2004-12-30 17:09 ` Lehmann 
  2004-12-30 20:11   ` Hans Reiser
  1 sibling, 1 reply; 72+ messages in thread
From: Lehmann  @ 2004-12-30 17:09 UTC (permalink / raw)
  To: Yiannis Mavroukakis; +Cc: reiserfs-list

On Thu, Dec 30, 2004 at 11:52:58AM -0000, Yiannis Mavroukakis <yiannis@jaguarfreight.com> wrote:
> Your "proven" reasoning sounds a bit strange to me..Microsoft (aka major
> distributor at least in my books) had her filesystems "in the field" for
> ages, does this prove any of them good (or bad for that matter)?

You state that "proven" is the same as "good", but why you do so escapes
me. In general, you can easily prove that black == white (etc.) by such
illogical reasoning.

The reasoning in fact is that file systems which are out in the field for
years have _known_ properties, are proven to work with some amount of
software, possibly including software that specicially has workarounds for
filesystem shortcomins (i.e. squid with it's multi-dir-hierarchy, which is
just a workaround for basically all non-reiserfs filesystems, and now also
for reiserfs3 filesystems, which also doesn't cope with millions of files
in a dir, as opposed to the original claims by it's developers).

This is cerftainly true for microsofts filesystems: you know what to expect
from vfat or ntfs, for example.

For new filesystems, the issue is completely different. Look at the "file
is a directory" approach that formerly was the default in reiserfs. This
breaks a number of programs, despite the expectancy by the reiserfs
authors that this is not the case (in my experience, stat "path/." to
quickly check wether sth. is a file or a directory is pretty common). Unless
the filesystem is in the field for some time these bugs will not be found.

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:07       ` Esben Stien
@ 2004-12-30 17:15         ` Christian Iversen
  2004-12-30 17:47         ` Sander
                           ` (2 subsequent siblings)
  3 siblings, 0 replies; 72+ messages in thread
From: Christian Iversen @ 2004-12-30 17:15 UTC (permalink / raw)
  To: reiserfs-list

On Thursday 30 December 2004 18:07, Esben Stien wrote:
> Matthias Andree <ma@dt.e-technik.uni-dortmund.de> writes:
> > All private machines I deal with are reiserfs-free as of a few hours
> > ago.
>
> What do you use instead?
>
> I really don't like that there is no undelete feature in reiserfs -
> it's not planned for reiserfs-4 either. I see desperate users all the
> time trying to get back what they mistakenly removed. It shouldn't be
> hard with reiserfs either. There should be a selection of what files
> to restore, so we can avoid the file merging problem with rebuilding
> the tree from leaf nodes.
>
> > I have seen too many bugs in reiserfs3.
>
> Is there a list of these current issues?
>
> > "Who cares, not us for the past five years" is not a proper response
> > to a real-world problem.
>
> I also reacted to this response.
>
> A flaw in the filesystem, in my opinion, is equivalent to the space
> ship crashing and all crew members die.

To be more exact, it's equal to a mars probe stranding and making many 
scientists very nervous :-)

-- 
Regards,
Christian Iversen

^ permalink raw reply	[flat|nested] 72+ messages in thread

* RE: Congratulations! we have got hash function screwed up
@ 2004-12-30 17:22 Yiannis Mavroukakis
  0 siblings, 0 replies; 72+ messages in thread
From: Yiannis Mavroukakis @ 2004-12-30 17:22 UTC (permalink / raw)
  To: reiserfs-list

>You state that "proven" is the same as "good", but why you do so
escapes me. In general, you can easily prove that black == white >(etc.)
by such illogical reasoning.

No I don't :) I merely say that "proven" does not equal good OR bad if a
distributor chooses to bundle the filesystem with a distribution. It can
be proven good or proven bad :) Clear?:) 



>[..]Unless the filesystem is in the field for some time these bugs will
not be found.

I agree but only if the filesystem is used in the real world, and it's
adoption is not driven primarily by distributors.

BTW red|purple|brown might as well be black if you are colour blind ;)))
It's all a mater of perception. 


Note:__________________________________________________________________
This message is for the named person's use only. It may contain
confidential, proprietary or legally privileged information. No
confidentiality or privilege is waived or lost by any mistransmission.
If you receive this message in error, please immediately delete it and
all copies of it from your system, destroy any hard copies of it and
notify the sender. You must not, directly or indirectly, use, disclose,
distribute, print, or copy any part of this message if you are not the
intended recipient. Jaguar Freight Services and any of its subsidiaries
each reserve the right to monitor all e-mail communications through its
networks.
Any views expressed in this message are those of the individual sender,
except where the message states otherwise and the sender is authorized
to state them to be the views of any such entity.
________________________________________________________________________
This e-mail has been scanned for all viruses by Star Internet. The
service is powered by MessageLabs.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:07       ` Esben Stien
  2004-12-30 17:15         ` Christian Iversen
@ 2004-12-30 17:47         ` Sander
  2004-12-30 17:59           ` Esben Stien
                             ` (2 more replies)
  2004-12-30 20:08         ` Hans Reiser
  2004-12-30 20:57         ` Adrian Ulrich
  3 siblings, 3 replies; 72+ messages in thread
From: Sander @ 2004-12-30 17:47 UTC (permalink / raw)
  To: Esben Stien; +Cc: Matthias Andree, hihone, reiserfs-list

Esben Stien wrote (ao):
> I really don't like that there is no undelete feature in reiserfs -
> it's not planned for reiserfs-4 either. I see desperate users all the
> time trying to get back what they mistakenly removed.

If you 'see desperate users all the time' you might be amoung the wrong
people ;-)

Maybe you can clue them in the wonderful world of making backups?

Next time it is not their mistake, but instead a broken harddisk.
Undelete wont save them then.

Or they just edited it instead of rm.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:47         ` Sander
@ 2004-12-30 17:59           ` Esben Stien
  2004-12-30 18:30             ` Sander
  2004-12-30 18:16           ` Esben Stien
  2004-12-30 18:26           ` Spam
  2 siblings, 1 reply; 72+ messages in thread
From: Esben Stien @ 2004-12-30 17:59 UTC (permalink / raw)
  To: sander; +Cc: Matthias Andree, hihone, reiserfs-list

Sander <sander@humilis.net> writes:

> If you 'see desperate users all the time' you might be amoung the
> wrong people ;-)

;), on the lists and on user groups.

> Maybe you can clue them in the wonderful world of making backups?

I always promote bacula, but you will always have those who don't do
the dance.

> Next time it is not their mistake, but instead a broken harddisk.
> Undelete wont save them then.

We got pretty good tools to restore from a hd with bad blocks. 

dd it, loop it, fsck it. 

> Or they just edited it instead of rm.

This is handled in user space.

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:47         ` Sander
  2004-12-30 17:59           ` Esben Stien
@ 2004-12-30 18:16           ` Esben Stien
  2004-12-30 18:26           ` Spam
  2 siblings, 0 replies; 72+ messages in thread
From: Esben Stien @ 2004-12-30 18:16 UTC (permalink / raw)
  To: sander; +Cc: Matthias Andree, hihone, reiserfs-list

Sander <sander@humilis.net> writes:

> Maybe you can clue them in the wonderful world of making backups?

This is not always the case, btw

This might be caused by a faulty application as well if the system is
not secure enough.

Tools to scan the partition and give a list of possible restores
should not be hard to implement. 

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* RE: Congratulations! we have got hash function screwed up
@ 2004-12-30 18:16 Burnes, James
  2004-12-30 18:36 ` Esben Stien
                   ` (2 more replies)
  0 siblings, 3 replies; 72+ messages in thread
From: Burnes, James @ 2004-12-30 18:16 UTC (permalink / raw)
  To: Esben Stien, sander; +Cc: Matthias Andree, hihone, reiserfs-list

> We got pretty good tools to restore from a hd with bad blocks.
> 
> dd it, loop it, fsck it.

Heh, heh.  That won't help you if a circuit board, spindle, read head or
other mechanism fails.  Then you better hope the data wasn't *that*
valuable or you know good platter recovery shop.

Backups are cheap.  Recovery is very expensive.  Ask the CIA ;-)

It's Hans and friend's responsibility to patch whatever looks like a
serious problem.  It's the user's responsibility to protect valuable
data.
File system reliability after that is really a question of operational
downtime expenses etc.

If people don't trouble themselves to perform backups:

1. They have accepted that risk or
2. They are too junior to have ever lost a couple months worth of work
because they were too lazy or inexperienced to perform backups.
3. They don't have any work that can't be rebuilt in a matter of hours
(see #1).

(BTW: If Hans is a little tired of working on Reiser3 it's probably
because he is currently stressed out making last minute tweaks on
Reiser4 and managing his team.

Cut him some slack.  Email conversations don't show a number of things
we take for granted, like that fact that the person we're talking to
looks really tired etc.  Unlike ext3, XFS and JFS, Reiser isn't funded
by someone with huge pockets.)

Jim Burnes




^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:47         ` Sander
  2004-12-30 17:59           ` Esben Stien
  2004-12-30 18:16           ` Esben Stien
@ 2004-12-30 18:26           ` Spam
  2004-12-30 20:41             ` Tom Vier
  2004-12-30 23:14             ` Matthias Andree
  2 siblings, 2 replies; 72+ messages in thread
From: Spam @ 2004-12-30 18:26 UTC (permalink / raw)
  To: reiserfs-list


  

> Esben Stien wrote (ao):
>> I really don't like that there is no undelete feature in reiserfs -
>> it's not planned for reiserfs-4 either. I see desperate users all the
>> time trying to get back what they mistakenly removed.

> If you 'see desperate users all the time' you might be amoung the wrong
> people ;-)

> Maybe you can clue them in the wonderful world of making backups?

  This is the standard Linux comment... But in reality, it just a way
  to say you do not know, or that linux lacks an (in this users
  opinion) important feature.

  In any case. Undelete has been since ages on many platforms. It IS a
  useful feature. Accidents CAN happen for many reasons and in some
  cases you may need to recover data.

  Besides, a deletion does not fully remove the data, but just unlinks
  it. In Reiser where there is tailing etc for small files this can be
  a problem. Either the little file might not be able to be recovered
  (shouldn't the data still exist, even if it is tailed), or the user
  need to use a non-tailing policy?

> Next time it is not their mistake, but instead a broken harddisk.
> Undelete wont save them then.

  Indeed. Backups are important. But the backup does not recover the
  last minute changes. They may be a day or a week old at best - even
  at larger sites!

> Or they just edited it instead of rm.
´
  well, overwritten data is not so easy to get back. But from what I
  understand in Linux, is that many applications actually write
  another file and then unlinks the old file? If that is the case then
  it may even be possible to get back some overwritten files!
  
  ~S

-- 


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:59           ` Esben Stien
@ 2004-12-30 18:30             ` Sander
  2004-12-30 18:46               ` Esben Stien
  0 siblings, 1 reply; 72+ messages in thread
From: Sander @ 2004-12-30 18:30 UTC (permalink / raw)
  To: Esben Stien; +Cc: sander, Matthias Andree, hihone, reiserfs-list

Esben Stien wrote (ao):
> Sander <sander@humilis.net> writes:
> > Next time it is not their mistake, but instead a broken harddisk.
> > Undelete wont save them then.
> 
> We got pretty good tools to restore from a hd with bad blocks. 
> 
> dd it, loop it, fsck it. 
 
I'm sure a friend of mine disagrees with you after paying big bucks to a
Norway based disk recovery company after a disk crash and zero backups.
A Dutch recovery company couldn't recover the disk.

And restore from backup is always quicker and less stressful on the
nerves.

> > Or they just edited it instead of rm.
> 
> This is handled in user space.

Maybe in your situation. But in general I advice everybody to make
backups. Might also be the reason nobody wrote a reiser4 undelete plugin
yet.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:16 Burnes, James
@ 2004-12-30 18:36 ` Esben Stien
  2004-12-30 19:26   ` Matthias Andree
  2004-12-30 19:24 ` Matthias Andree
  2004-12-30 20:25 ` Hans Reiser
  2 siblings, 1 reply; 72+ messages in thread
From: Esben Stien @ 2004-12-30 18:36 UTC (permalink / raw)
  To: Burnes, James; +Cc: sander, Matthias Andree, hihone, reiserfs-list

"Burnes, James" <james.burnes@gwl.com> writes:

>> dd it, loop it, fsck it.
> That won't help you if a circuit board, spindle, read head or other
> mechanism fails.

Sure, but that is not a simple case of bad blocks;)

> Then you better hope the data wasn't *that* valuable or you know
> good platter recovery shop.

We got IBAS in Norway; the price for an evaluation is close to a
handful of gold.

> Backups are cheap.  Recovery is very expensive.

Sure, but your not factoring in murphys law here. A tool to undelete
would come many people in handy who even got proper backup solutions.

Besides, a recovery is not the same as a feature to undelete. Such a
feature would maybe save a days work, which is a lot in some circles.

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:30             ` Sander
@ 2004-12-30 18:46               ` Esben Stien
  2004-12-30 18:49                 ` Chris Dukes
  2004-12-30 19:21                 ` Sander
  0 siblings, 2 replies; 72+ messages in thread
From: Esben Stien @ 2004-12-30 18:46 UTC (permalink / raw)
  To: sander; +Cc: Matthias Andree, hihone, reiserfs-list

Sander <sander@humilis.net> writes:

>> dd it, loop it, fsck it. 
>  
> I'm sure a friend of mine disagrees with you after paying big bucks
> to a Norway based disk recovery company after a disk crash and zero
> backups.  A Dutch recovery company couldn't recover the disk.

Probably IBAS;). What was the problem of the hd then?. Why couldn't
the dutchies do the job?

> And restore from backup is always quicker and less stressful on the
> nerves.

Like I said to James, this might not only be the fault of the user. 

> Maybe in your situation. But in general I advice everybody to make
> backups. Might also be the reason nobody wrote a reiser4 undelete
> plugin yet.

It's no use to talk about backups when we talk about undelete
features; backups are for recovery after nuclear explosions, faulty
hardware which makes it inaccessible to even retrieve data out of it
or something similar.

It would be too expensive to do all this backup'ing in userspace. When
a file gets deleted, a proper procedure to retrieve it would be to
umount the filesystem, scan it for the data which was removed and then
put it back in the tree again. It's not like reiserfs overwrites this
data, it's still there, so why should there be an artificial barrier
to getting this data back?.

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:46               ` Esben Stien
@ 2004-12-30 18:49                 ` Chris Dukes
  2004-12-30 19:21                 ` Sander
  1 sibling, 0 replies; 72+ messages in thread
From: Chris Dukes @ 2004-12-30 18:49 UTC (permalink / raw)
  To: Esben Stien; +Cc: sander, Matthias Andree, hihone, reiserfs-list

On Thu, Dec 30, 2004 at 07:46:09PM +0100, Esben Stien wrote:
> 
> It would be too expensive to do all this backup'ing in userspace. When
> a file gets deleted, a proper procedure to retrieve it would be to
> umount the filesystem, scan it for the data which was removed and then
> put it back in the tree again. It's not like reiserfs overwrites this
> data, it's still there, so why should there be an artificial barrier
> to getting this data back?.

You're using the wrong tool for the job.
For what you're asking for you want Plan 9 and its
wormfs.

Thanks for playing.  

-- 
Chris Dukes
Warning: Do not use the reflow toaster oven to prepare foods after
it has been used for solder paste reflow. 
http://www.stencilsunlimited.com/stencil_article_page5.htm

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:46               ` Esben Stien
  2004-12-30 18:49                 ` Chris Dukes
@ 2004-12-30 19:21                 ` Sander
  2004-12-30 19:29                   ` Esben Stien
  1 sibling, 1 reply; 72+ messages in thread
From: Sander @ 2004-12-30 19:21 UTC (permalink / raw)
  To: Esben Stien; +Cc: sander, Matthias Andree, hihone, reiserfs-list

Esben Stien wrote (ao):
> Sander <sander@humilis.net> writes:
> > I'm sure a friend of mine disagrees with you after paying big bucks
> > to a Norway based disk recovery company after a disk crash and zero
> > backups. A Dutch recovery company couldn't recover the disk.
> 
> Probably IBAS;). What was the problem of the hd then?. Why couldn't
> the dutchies do the job?

Maybe the Dutch company is not that good :-)
It could be IBAS, but they didn't mention a company name. It has to be a
well know one. And your mention of gold in your other mail is very true
of course :-)

> When a file gets deleted, a proper procedure to retrieve it would be
> to umount the filesystem, scan it for the data which was removed and
> then put it back in the tree again. It's not like reiserfs overwrites
> this data, it's still there, so why should there be an artificial
> barrier to getting this data back?.

I did recover data that way several times back when I didn't do backups.
The problem is though that one file does not occupy one spot on the
harddisk. It might be spread all over the place. While the data might
still be there (on an idle disk), you miss the much needed pointers to
the data.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:16 Burnes, James
  2004-12-30 18:36 ` Esben Stien
@ 2004-12-30 19:24 ` Matthias Andree
  2004-12-30 20:25 ` Hans Reiser
  2 siblings, 0 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 19:24 UTC (permalink / raw)
  To: reiserfs-list

On Thu, 30 Dec 2004, Burnes, James wrote:

> (BTW: If Hans is a little tired of working on Reiser3 it's probably
> because he is currently stressed out making last minute tweaks on
> Reiser4 and managing his team.
> 
> Cut him some slack.  Email conversations don't show a number of things
> we take for granted, like that fact that the person we're talking to
> looks really tired etc.  Unlike ext3, XFS and JFS, Reiser isn't funded
> by someone with huge pockets.)

I'm willing to grant ANY time-out, if Hans wrote "I have a pile of
deadline reiser4 contract work before I can deal with that", fine, he
didn't but said "use reiser4 instead". And that's inadequate.

And I say this without any emotions, red head, swelling veins and such.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:36 ` Esben Stien
@ 2004-12-30 19:26   ` Matthias Andree
  0 siblings, 0 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 19:26 UTC (permalink / raw)
  To: reiserfs-list

On Thu, 30 Dec 2004, Esben Stien wrote:

> Sure, but your not factoring in murphys law here. A tool to undelete
> would come many people in handy who even got proper backup solutions.

You're asking for a versioned file system. If reiserfs v4 doesn't offer
such properties, find something else that does.

> Besides, a recovery is not the same as a feature to undelete. Such a
> feature would maybe save a days work, which is a lot in some circles.

Backup more often. Staged backup schemes (hourly, daily, weekly,
monthly) with varying levels of differential or complete backups, plus
off-site archives, are probably a good idea for these circles then.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 19:21                 ` Sander
@ 2004-12-30 19:29                   ` Esben Stien
  0 siblings, 0 replies; 72+ messages in thread
From: Esben Stien @ 2004-12-30 19:29 UTC (permalink / raw)
  To: sander; +Cc: Matthias Andree, hihone, reiserfs-list

Sander <sander@humilis.net> writes:

> I did recover data that way several times back when I didn't do backups.
> The problem is though that one file does not occupy one spot on the
> harddisk. It might be spread all over the place. While the data might
> still be there (on an idle disk), you miss the much needed pointers to
> the data.

I'm not talking about grep here. I'm talking about a reiser tool to
scan the partition for metadata and use this to recover the file by
putting it back in the tree.

Methods like grep is an absolute last resort, if, like in this case,
there is no tool that does the job.

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:07       ` Esben Stien
  2004-12-30 17:15         ` Christian Iversen
  2004-12-30 17:47         ` Sander
@ 2004-12-30 20:08         ` Hans Reiser
  2004-12-30 21:55           ` Esben Stien
  2004-12-31  4:05           ` David Masover
  2004-12-30 20:57         ` Adrian Ulrich
  3 siblings, 2 replies; 72+ messages in thread
From: Hans Reiser @ 2004-12-30 20:08 UTC (permalink / raw)
  To: Esben Stien; +Cc: Matthias Andree, hihone, reiserfs-list

Esben Stien wrote:

>
>I really don't like that there is no undelete feature in reiserfs -
>it's not planned for reiserfs-4 either.
>
Blame Linus for that.  I would put it in, but he thinks it belongs in 
userspace.  I may still put it in someday and just not tell him.;-)  
I'll just let the users know about it and not him.;-)

> I see desperate users all the
>time trying to get back what they mistakenly removed. It shouldn't be
>hard with reiserfs either. There should be a selection of what files
>to restore, so we can avoid the file merging problem with rebuilding
>the tree from leaf nodes.
>  
>


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:09 ` Lehmann 
@ 2004-12-30 20:11   ` Hans Reiser
  0 siblings, 0 replies; 72+ messages in thread
From: Hans Reiser @ 2004-12-30 20:11 UTC (permalink / raw)
  Cc: Yiannis Mavroukakis, reiserfs-list

pcg( Marc)@goof(A.).(Lehmann )com wrote:

>
> which also doesn't cope with millions of files
>in a dir, as opposed to the original claims by it's developers).
>  
>
The generation number thing just isn't as good as teaching the tree to 
cope with duplicate keys.  Kudos to Nikita on that one.

>  
>


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:16 Burnes, James
  2004-12-30 18:36 ` Esben Stien
  2004-12-30 19:24 ` Matthias Andree
@ 2004-12-30 20:25 ` Hans Reiser
  2 siblings, 0 replies; 72+ messages in thread
From: Hans Reiser @ 2004-12-30 20:25 UTC (permalink / raw)
  To: Burnes, James; +Cc: Esben Stien, sander, Matthias Andree, hihone, reiserfs-list

Burnes, James wrote:

>>We got pretty good tools to restore from a hd with bad blocks.
>>
>>dd it, loop it, fsck it.
>>    
>>
>
>Heh, heh.  That won't help you if a circuit board, spindle, read head or
>other mechanism fails.  Then you better hope the data wasn't *that*
>valuable or you know good platter recovery shop.
>
>Backups are cheap.  Recovery is very expensive.  Ask the CIA ;-)
>
>It's Hans and friend's responsibility to patch whatever looks like a
>serious problem.  It's the user's responsibility to protect valuable
>data.
>File system reliability after that is really a question of operational
>downtime expenses etc.
>
>If people don't trouble themselves to perform backups:
>
>1. They have accepted that risk or
>2. They are too junior to have ever lost a couple months worth of work
>because they were too lazy or inexperienced to perform backups.
>3. They don't have any work that can't be rebuilt in a matter of hours
>(see #1).
>
>(BTW: If Hans is a little tired of working on Reiser3 it's probably
>because he is currently stressed out making last minute tweaks on
>Reiser4 and managing his team.
>
>Cut him some slack.  Email conversations don't show a number of things
>we take for granted, like that fact that the person we're talking to
>looks really tired etc.  Unlike ext3, XFS and JFS, Reiser isn't funded
>by someone with huge pockets.)
>
>Jim Burnes
>
>
>
>
>
>  
>
It seems a bit more shows than I would like.

My wife just got some 75 year old female judge filling in during the 
christmas holidays for the regular judge to take my kids away from me, 
without me getting a chance for a response to the allegations, until a 
hearing in 25 days from now, because I show my son movies about world 
war II, and my exposing him to violent computer games is alleged to 
induce traumatic stress disorder which somehow has never been reproduced 
in front of me.  Only in California.....

I shall be alleging that little boys take to violent computer games like 
monkeys take to trees, and challenging anyone to reproduce in front of 
me evidence to the contrary.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:26           ` Spam
@ 2004-12-30 20:41             ` Tom Vier
  2004-12-30 23:14             ` Matthias Andree
  1 sibling, 0 replies; 72+ messages in thread
From: Tom Vier @ 2004-12-30 20:41 UTC (permalink / raw)
  To: reiserfs-list

Alias rm to 'mv $@ trash/' (not sure about the $@ - my bash is rusty) and
have a cronjob that reaps files older than x. Or, try pdumpfs (i've been
using it for a long time).

-- 
Tom Vier <tmv@comcast.net>
DSA Key ID 0x15741ECE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 17:07       ` Esben Stien
                           ` (2 preceding siblings ...)
  2004-12-30 20:08         ` Hans Reiser
@ 2004-12-30 20:57         ` Adrian Ulrich
  2004-12-30 21:01           ` Stefan Traby
  3 siblings, 1 reply; 72+ messages in thread
From: Adrian Ulrich @ 2004-12-30 20:57 UTC (permalink / raw)
  To: reiserfs-list


> A flaw in the filesystem, in my opinion, is equivalent to the space
> ship crashing and all crew members die.

No, it isn't..

A dying filesystem is a bad thing.. But it's just a filesystem..



^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 20:57         ` Adrian Ulrich
@ 2004-12-30 21:01           ` Stefan Traby
  2004-12-30 21:20             ` brianmas
  0 siblings, 1 reply; 72+ messages in thread
From: Stefan Traby @ 2004-12-30 21:01 UTC (permalink / raw)
  To: Adrian Ulrich; +Cc: reiserfs-list

On Thu, Dec 30, 2004 at 09:57:29PM +0100, Adrian Ulrich wrote:
> 
> > A flaw in the filesystem, in my opinion, is equivalent to the space
> > ship crashing and all crew members die.
> 
> No, it isn't..
> 
> A dying filesystem is a bad thing.. But it's just a filesystem..

... that causes the space ship to crash.

Please think before you post.

-- 

  ciao - 
    Stefan

"          GNU's Not Unix          --            IIS Isn't Secure          "

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 21:01           ` Stefan Traby
@ 2004-12-30 21:20             ` brianmas
  0 siblings, 0 replies; 72+ messages in thread
From: brianmas @ 2004-12-30 21:20 UTC (permalink / raw)
  To: Stefan Traby; +Cc: reiserfs-list

Quoting Stefan Traby <stefan@hello-penguin.com>:

> On Thu, Dec 30, 2004 at 09:57:29PM +0100, Adrian Ulrich wrote:
> >
> > > A flaw in the filesystem, in my opinion, is equivalent to the space
> > > ship crashing and all crew members die.
> >
> > No, it isn't..
> >
> > A dying filesystem is a bad thing.. But it's just a filesystem..
>
> ... that causes the space ship to crash.
>
> Please think before you post.

he's talking about the value of human life vs. some data.

--



^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 20:08         ` Hans Reiser
@ 2004-12-30 21:55           ` Esben Stien
  2004-12-31  4:05           ` David Masover
  1 sibling, 0 replies; 72+ messages in thread
From: Esben Stien @ 2004-12-30 21:55 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Matthias Andree, hihone, reiserfs-list

Hans Reiser <reiser@namesys.com> writes:

> I may still put it in someday and just not tell him.;-)
> I'll just let the users know about it and not him.;-)

Hehe, nice;)

-- 
Esben Stien is b0ef@esben-stien.name
http://www.esben-stien.name
irc://irc.esben-stien.name/%23contact
[sip|iax]:b0ef@esben-stien.name

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 18:26           ` Spam
  2004-12-30 20:41             ` Tom Vier
@ 2004-12-30 23:14             ` Matthias Andree
  2004-12-30 23:25               ` Spam
  2004-12-31  4:11               ` Hans Reiser
  1 sibling, 2 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-30 23:14 UTC (permalink / raw)
  To: Spam; +Cc: reiserfs-list

Spam <spam@tnonline.net> writes:

>   In any case. Undelete has been since ages on many platforms. It IS a
>   useful feature. Accidents CAN happen for many reasons and in some
>   cases you may need to recover data.
>
>   Besides, a deletion does not fully remove the data, but just unlinks
>   it. In Reiser where there is tailing etc for small files this can be
>   a problem. Either the little file might not be able to be recovered
>   (shouldn't the data still exist, even if it is tailed), or the user
>   need to use a non-tailing policy?

A working undelete can either hog disk space or die the moment some
large write comes in. And if you're at that point, make it a versioning
file system - but then don't complain about space efficiency.

>   well, overwritten data is not so easy to get back. But from what I
>   understand in Linux, is that many applications actually write
>   another file and then unlinks the old file? If that is the case then
>   it may even be possible to get back some overwritten files!

I see enough applications to just overwrite an output file. 

This whole discussion doesn't belong here until someone talks about
implementing a whole versioning system for reiser4.

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 23:14             ` Matthias Andree
@ 2004-12-30 23:25               ` Spam
  2004-12-31  4:11               ` Hans Reiser
  1 sibling, 0 replies; 72+ messages in thread
From: Spam @ 2004-12-30 23:25 UTC (permalink / raw)
  To: reiserfs-list



> Spam <spam@tnonline.net> writes:

>>   In any case. Undelete has been since ages on many platforms. It IS a
>>   useful feature. Accidents CAN happen for many reasons and in some
>>   cases you may need to recover data.
>>
>>   Besides, a deletion does not fully remove the data, but just unlinks
>>   it. In Reiser where there is tailing etc for small files this can be
>>   a problem. Either the little file might not be able to be recovered
>>   (shouldn't the data still exist, even if it is tailed), or the user
>>   need to use a non-tailing policy?

> A working undelete can either hog disk space or die the moment some
> large write comes in. And if you're at that point, make it a versioning
> file system - but then don't complain about space efficiency.

  Yes, When data is overwritten then it is overwritten. The longer the
  user waits to try to recover data, the more risk of this happening.
  Undelete has existed a long time and people know it is not a
  foolproof thing. I do not think anyone asked for automatic backup
  features, but just tools that can try to recover accidental deletions.

>>   well, overwritten data is not so easy to get back. But from what I
>>   understand in Linux, is that many applications actually write
>>   another file and then unlinks the old file? If that is the case then
>>   it may even be possible to get back some overwritten files!

> I see enough applications to just overwrite an output file. 

  Yes, This was an example only.

  In any case. If there were tools that could scan and recover, even
  partly, deleted files then I would welcome them. I am sure lots of
  other people do too.

  It is very easy to say you need backups of your data, that you need
  versioning filesystems etc. But not all of this is possible for
  everyone. Just take a laptop as example. Making backups is not so
  easy to do frequently - especially not when traveling.

  Sure, if you run in a corporate environment you can do shadow
  copying or use other versioning systems and mount that over the
  network. But for the normal "home" or "small business" users this is
  not really what you can expect...

> This whole discussion doesn't belong here until someone talks about
> implementing a whole versioning system for reiser4.

  I think someone said they wanted undelete recovery features in
  reiser4 - which was what started this discussion?

´

-- 


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 20:08         ` Hans Reiser
  2004-12-30 21:55           ` Esben Stien
@ 2004-12-31  4:05           ` David Masover
  2004-12-31  4:26             ` Hans Reiser
  1 sibling, 1 reply; 72+ messages in thread
From: David Masover @ 2004-12-31  4:05 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Esben Stien, Matthias Andree, hihone, reiserfs-list

Hans Reiser wrote:
> Esben Stien wrote:
> 
>>
>> I really don't like that there is no undelete feature in reiserfs -
>> it's not planned for reiserfs-4 either.
>>
> Blame Linus for that.  I would put it in, but he thinks it belongs in 
> userspace.  I may still put it in someday and just not tell him.;-)  
> I'll just let the users know about it and not him.;-)

He has some good points, though.  If you're going to have a kernel, you 
want to keep it small, put stuff in only if it needs to be there.  And 
how much speed do we lose by putting stuff in userspace, if we do it 
right?  C'mon, if Doom 3 can run in userspace, surely some sort of trash 
can / recycle bin can, right?

Oh wait... Gnome/KDE already do that.


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 23:14             ` Matthias Andree
  2004-12-30 23:25               ` Spam
@ 2004-12-31  4:11               ` Hans Reiser
  2004-12-31  8:36                 ` Matthias Andree
  1 sibling, 1 reply; 72+ messages in thread
From: Hans Reiser @ 2004-12-31  4:11 UTC (permalink / raw)
  To: Matthias Andree; +Cc: Spam, reiserfs-list

Matthias Andree wrote:

>Spam <spam@tnonline.net> writes:
>
>  
>
>>  In any case. Undelete has been since ages on many platforms. It IS a
>>  useful feature. Accidents CAN happen for many reasons and in some
>>  cases you may need to recover data.
>>
>>  Besides, a deletion does not fully remove the data, but just unlinks
>>  it. In Reiser where there is tailing etc for small files this can be
>>  a problem. Either the little file might not be able to be recovered
>>  (shouldn't the data still exist, even if it is tailed), or the user
>>  need to use a non-tailing policy?
>>    
>>
>
>A working undelete can either hog disk space or die the moment some
>large write comes in. And if you're at that point, make it a versioning
>file system
>
Well, yes, it should be one.....

darpa is paying for views, add in a little versioning and.....

> - but then don't complain about space efficiency.
>  
>
This is an area where apple was smarter than Unix.  Having a trash can 
is what real users need, more than they need performance..

Yes, there is a high performance cost, but so long as it can be turned 
off/ avoided, the cost is acceptable.

I would however auto-empty the trash can when space got low....

>  
>
>>  well, overwritten data is not so easy to get back. But from what I
>>  understand in Linux, is that many applications actually write
>>  another file and then unlinks the old file? If that is the case then
>>  it may even be possible to get back some overwritten files!
>>    
>>
>
>I see enough applications to just overwrite an output file. 
>
>This whole discussion doesn't belong here until someone talks about
>implementing a whole versioning system for reiser4.
>
>  
>
Well, it hasn't been coded solely because we haven't gotten around to it 
what with all else that needs doing and still needs doing.  Remind me 
about this in a year.:)

Hans

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-31  4:05           ` David Masover
@ 2004-12-31  4:26             ` Hans Reiser
  2004-12-31  5:59               ` David Masover
  0 siblings, 1 reply; 72+ messages in thread
From: Hans Reiser @ 2004-12-31  4:26 UTC (permalink / raw)
  To: David Masover; +Cc: Esben Stien, Matthias Andree, hihone, reiserfs-list

David Masover wrote:

> Hans Reiser wrote:
>
>> Esben Stien wrote:
>>
>>>
>>> I really don't like that there is no undelete feature in reiserfs -
>>> it's not planned for reiserfs-4 either.
>>>
>> Blame Linus for that.  I would put it in, but he thinks it belongs in 
>> userspace.  I may still put it in someday and just not tell him.;-)  
>> I'll just let the users know about it and not him.;-)
>
>
> He has some good points, though.  If you're going to have a kernel, 
> you want to keep it small, put stuff in only if it needs to be there.  
> And how much speed do we lose by putting stuff in userspace, if we do 
> it right?  C'mon, if Doom 3 can run in userspace, surely some sort of 
> trash can / recycle bin can, right?
>
> Oh wait... Gnome/KDE already do that.
>
>
>
the problem being that everyone uses rm not the gnome/kde trashcan....  
literal copying of Apple doesn't work for unix because we use shells 
most of the time.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-31  4:26             ` Hans Reiser
@ 2004-12-31  5:59               ` David Masover
  0 siblings, 0 replies; 72+ messages in thread
From: David Masover @ 2004-12-31  5:59 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Esben Stien, Matthias Andree, hihone, reiserfs-list

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hans Reiser wrote:
| David Masover wrote:
|
|> Hans Reiser wrote:
|>
|>> Esben Stien wrote:
|>>
|>>>
|>>> I really don't like that there is no undelete feature in reiserfs -
|>>> it's not planned for reiserfs-4 either.
|>>>
|>> Blame Linus for that.  I would put it in, but he thinks it belongs in
|>> userspace.  I may still put it in someday and just not tell him.;-)
|>> I'll just let the users know about it and not him.;-)
|>
|>
|>
|> He has some good points, though.  If you're going to have a kernel,
|> you want to keep it small, put stuff in only if it needs to be there.
|> And how much speed do we lose by putting stuff in userspace, if we do
|> it right?  C'mon, if Doom 3 can run in userspace, surely some sort of
|> trash can / recycle bin can, right?
|>
|> Oh wait... Gnome/KDE already do that.
|>
|>
|>
| the problem being that everyone uses rm not the gnome/kde trashcan....
| literal copying of Apple doesn't work for unix because we use shells
| most of the time.

"we" being people who should be able to back things up and avoid really
stupid mistakes, and people who care about performance.

But you are right, this has to be lower down -- maybe glibc, maybe vfs,
maybe fs views -- because there's no standardization once you get to
things like gnome.

My point about Doom 3 is that I like the idea of a microkernel, if it
could be done fast enough.  Games talk to kernel space and hardware very
quickly, but it seems kernel-userspace communication isn't fast enough
for things like filesystems.

Maybe someone can engineer a way around that?
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iQIVAwUBQdTqswisZLIF6uqOAQKczQ/8DeKVlFL5cS+KXnBaq9+w+rBkpak/LRld
1gyIyp4s0mVdpRNTGa5W2YPB0McOZj5TDUVZivAKScbd1srsV1QxlhUxzY6QMUwo
7Y9k+yJbm0J89Tnrap2DJmE/teN40Rw4thY5VyQ5cfugNFZtdrwgRsKqvsxV4cOQ
lIOci7SJuLUdgdtfLWORyEdYSe180P9XJMaGTxoVYSnwnBMbXX5C33OOVmJjEvU7
Pevro6GcfOvALj+ciI3AtvhdCbyasrevgfc1K/iZk3MwQFZKXRUtW5LcT1Lwax08
acn4n6f5ZhEQJWarv93EOn0T6KfXuD0uY9GbNRJS/ldPUg4tCaw3RfzVgdvpUVlw
uI+hPXsVpEGDyBm1aK3TQ30brUzBx/ILgblQvCe4QuHH5NWb63I0xPKbmqqmCdCR
cx06b3cbBCgO6QK3gqURHdyeQjlpJL92EXARmYLFjAnAicnKG95N4QiZsHIIuGvn
9iQcAS0bfU+elPbnfeiJnRTfEU6i1eLwSmsYo0i1m80U58TW604Dcfv2RcQrtluw
96eO9VB++8QWpl2Tv4G7u9yFrmOcx4Vo1tVkbFoPZvsP5sMqY6XhKowX5uWLjkwQ
hiAWb57HKxUTaPcW6CyGh+zr8g+Z8Po7TRA1GcdNMZZBfgNcAlZvlSDmOw7BHlMj
ixpwaj2lWYU=
=Z6t0
-----END PGP SIGNATURE-----

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-31  4:11               ` Hans Reiser
@ 2004-12-31  8:36                 ` Matthias Andree
  0 siblings, 0 replies; 72+ messages in thread
From: Matthias Andree @ 2004-12-31  8:36 UTC (permalink / raw)
  To: Hans Reiser; +Cc: reiserfs-list

On Thu, 30 Dec 2004, Hans Reiser wrote:

> >A working undelete can either hog disk space or die the moment some
> >large write comes in. And if you're at that point, make it a versioning
> >file system
> >
> Well, yes, it should be one.....
> 
> darpa is paying for views, add in a little versioning and.....

If the "view" is something between a transactional view in a SQL
database and a device-mapper snapshot, then yes, it might be close
enough. There's however always the problem of capacity conflicts, and
there may need to be a switch that prefers "keep older versions" over
"discard older versions" so that the admin with - by your leave - idiot
users has a chance to save his users' a**e*.

> >- but then don't complain about space efficiency.
>
> This is an area where apple was smarter than Unix.  Having a trash can 
> is what real users need, more than they need performance..

Does Apple's trash can help against files that get overwritten in situ?
If not, it's insufficient to fix another common failure. My Mom is
"prone" (is that word applicable to human behavior?) to open a file
(say, a half year schedule of the local community), edit it, without
saving it under a new name - by the time she's completed her edit, she
has forgotten to rename it and boom, old file dead. Next week she wants
it back...

> I would however auto-empty the trash can when space got low....

That isn't desired. See above.

> Well, it hasn't been coded solely because we haven't gotten around to it 
> what with all else that needs doing and still needs doing.  Remind me 
> about this in a year.:)

Save this mail to a file and have atd mail it to you. Or use a calendar :)

-- 
Matthias Andree

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-28 22:12 Congratulations! we have got hash function screwed up Lehmann 
  2004-12-29 18:55 ` Stefan Traby
@ 2005-01-06 12:45 ` Alex Zarochentsev
  2005-01-06 14:27   ` Lehmann 
  1 sibling, 1 reply; 72+ messages in thread
From: Alex Zarochentsev @ 2005-01-06 12:45 UTC (permalink / raw)
  To:  Marc A. Lehmann ; +Cc: reiserfs-list, stefan

Hello,

On Tue, Dec 28, 2004 at 11:12:18PM +0100,  Marc A. Lehmann  wrote:
> Hi!
> 
> When trying to upgrade or reinstall the xfonts-75dpi,
> xfonts-75dpi-transcoded or 100dpi & transcoded debian packages on my
> 2.6.10-rc3 amd64 reiserfsv3 host, I get the following errors:
> 
>    dpkg: error processing
>    /var/cache/apt/archives/xfonts-75dpi_4.3.0.dfsg.1-10_all.deb (--unpack):
>     unable to make backup link of
>     `./usr/X11R6/lib/X11/fonts/75dpi/lutBS19-ISO8859-1.pcf.gz' before
>     installing new version: Device or resource busy
>     dpkg-deb: subprocess paste killed by signal (Broken pipe)
>    Preparing to replace xfonts-75dpi-transcoded 4.3.0.dfsg.1-10 (using
>          .../xfonts-75dpi-transcoded_4.3.0.dfsg.1-10_all.deb) ...
>    dpkg: error processing /var/cache/apt/archives/xfonts-75dpi-transcoded_4.3.0.dfsg.1-10_all.deb (--unpack):
>     unable to make backup link of `./usr/X11R6/lib/X11/fonts/75dpi/lutBS19-ISO8859-10.pcf.gz' before installing new version: Device or resource busy
>    dpkg-deb: subprocess paste killed by signal (Broken pipe)
>    Errors were encountered while processing:
>     /var/cache/apt/archives/xfonts-75dpi_4.3.0.dfsg.1-10_all.deb
>     /var/cache/apt/archives/xfonts-75dpi-transcoded_4.3.0.dfsg.1-10_all.deb
> 
> And at the same time, I get this in my kernel log:
> 
>    ReiserFS: hdg2: warning: reiserfs_add_entry: Congratulations! we have got hash function screwed up
> 
> Sure sounds like a filesystem bug to me. Is this 2.6.10-rc3-specific or a
> generic bug in handling hash collisions?

Tea hash is designed to be more resistant.  

there is a generic problem with overloading of the generation counter, but
tea hash should mix file names better and have less chances to 'screw the hash
function up'.  

Does the debian install all X font files into one dir? May be you have your own
font files installed in the same dir? I suggest to split the dir into several
ones.

> Deleteing the fonts and installing the package works, but the next upgrade
> makes the error appear again.
> 
> -- 
>                 The choice of a
>       -----==-     _GNU_
>       ----==-- _       generation     Marc Lehmann
>       ---==---(_)__  __ ____  __      pcg@goof.com
>       --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
>       -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

-- 
Alex.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 12:45 ` Alex Zarochentsev
@ 2005-01-06 14:27   ` Lehmann 
  2005-01-06 15:56     ` Hans Reiser
  2005-01-06 18:55     ` Congratulations! we have got hash function screwed up Edward Shishkin
  0 siblings, 2 replies; 72+ messages in thread
From: Lehmann  @ 2005-01-06 14:27 UTC (permalink / raw)
  To: Alex Zarochentsev; +Cc: reiserfs-list, stefan

On Thu, Jan 06, 2005 at 03:45:06PM +0300, Alex Zarochentsev <zam@namesys.com> wrote:
> > generic bug in handling hash collisions?
> 
> Tea hash is designed to be more resistant.  

As the example posted shows, tea doesn't look better, it generates
nicely-looking collisions, too.

> Does the debian install all X font files into one dir?

No, but xfree nowadays comes with a lot of fonts because it stupidly makes
a copy of about each and every font in each and every encoding, leading to
many font files in the bitmapped category (75dpi and 100dpi).

> May be you have your own font files installed in the same dir?

I also have some other debian packages that install their fonts there, but
it should be less than 10 extra files.

> I suggest to split the dir into several ones.

I'd suggest getting rid of reiserfs on anything important. I can't have it
when my filesystem randomly returns errors when it should be working.

I wonder wether this hasn't any security relevance, as it allows attackers
easily to create filename holes in the filesystem that even root cannot
override.

Thanks for the suggestion, though! However, the workaround I currently use
(delete the dir, reinstall) works better, as it doesn't destroy debian's
idea of the filesystem layout.

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 14:27   ` Lehmann 
@ 2005-01-06 15:56     ` Hans Reiser
  2005-01-06 16:13       ` Spam
  2005-01-06 18:55     ` Congratulations! we have got hash function screwed up Edward Shishkin
  1 sibling, 1 reply; 72+ messages in thread
From: Hans Reiser @ 2005-01-06 15:56 UTC (permalink / raw)
  To: reiserfs-list; +Cc: Alex Zarochentsev, stefan

pcg( Marc)@goof(A.).(Lehmann )com wrote:

>On Thu, Jan 06, 2005 at 03:45:06PM +0300, Alex Zarochentsev <zam@namesys.com> wrote:
>  
>
>>>generic bug in handling hash collisions?
>>>      
>>>
>>Tea hash is designed to be more resistant.  
>>    
>>
>
>As the example posted shows, tea doesn't look better, it generates
>nicely-looking collisions, too.
>  
>
You mean, in practice you hit them, or with an artificially generated 
set of filenames intended to cause collisions you get those collisions?


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 15:56     ` Hans Reiser
@ 2005-01-06 16:13       ` Spam
  2005-01-06 16:26         ` Chris Dukes
  0 siblings, 1 reply; 72+ messages in thread
From: Spam @ 2005-01-06 16:13 UTC (permalink / raw)
  To: reiserfs-list

>>>>generic bug in handling hash collisions?
>>>>      
>>>>
>>>Tea hash is designed to be more resistant.  
>>>    
>>>
>>
>>As the example posted shows, tea doesn't look better, it generates
>>nicely-looking collisions, too.
>>  
>>
> You mean, in practice you hit them, or with an artificially generated
> set of filenames intended to cause collisions you get those collisions?

 Excuse me, but do you mean that there are undocumented limits on what
 files can be named to, and how many files with similar or random
 names in a ReiferFS volume?

 This sounds bad...


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 16:13       ` Spam
@ 2005-01-06 16:26         ` Chris Dukes
  2005-01-06 16:29           ` Spam
  2005-01-07 17:22           ` Hans Reiser
  0 siblings, 2 replies; 72+ messages in thread
From: Chris Dukes @ 2005-01-06 16:26 UTC (permalink / raw)
  To: Spam; +Cc: reiserfs-list

On Thu, Jan 06, 2005 at 05:13:23PM +0100, Spam wrote:
> >>>>generic bug in handling hash collisions?
> >>>>      
> >>>>
> >>>Tea hash is designed to be more resistant.  
> >>>    
> >>>
> >>
> >>As the example posted shows, tea doesn't look better, it generates
> >>nicely-looking collisions, too.
> >>  
> >>
> > You mean, in practice you hit them, or with an artificially generated
> > set of filenames intended to cause collisions you get those collisions?
> 
>  Excuse me, but do you mean that there are undocumented limits on what
>  files can be named to, and how many files with similar or random
>  names in a ReiferFS volume?

No, I'd say it's pretty well documented that reiserfs fails under
certain hash collision conditions instead of continueing to work
(albeit more slowly).

The nature of the hash collisions must be pretty obvious if a shell
script can be written to demonstrate the problem.
> 
>  This sounds bad...

It's a risk assessment.  What are the odds of your normal data sets
hitting the bug or of someone with malicious intent introducing
a demonstration program vs the performance hit of a filesystem
without the problem.

All filesystems will fail or suffer degraded performance under
certain conditions, you need to determine what conditions are acceptable
for your data.

-- 
Chris Dukes
Warning: Do not use the reflow toaster oven to prepare foods after
it has been used for solder paste reflow. 
http://www.stencilsunlimited.com/stencil_article_page5.htm

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 16:26         ` Chris Dukes
@ 2005-01-06 16:29           ` Spam
  2005-01-06 16:56             ` Chris Dukes
  2005-01-07 17:22           ` Hans Reiser
  1 sibling, 1 reply; 72+ messages in thread
From: Spam @ 2005-01-06 16:29 UTC (permalink / raw)
  To: reiserfs-list


  

> On Thu, Jan 06, 2005 at 05:13:23PM +0100, Spam wrote:
>> >>>>generic bug in handling hash collisions?
>> >>>>      
>> >>>>
>> >>>Tea hash is designed to be more resistant.  
>> >>>    
>> >>>
>> >>
>> >>As the example posted shows, tea doesn't look better, it generates
>> >>nicely-looking collisions, too.
>> >>  
>> >>
>> > You mean, in practice you hit them, or with an artificially generated
>> > set of filenames intended to cause collisions you get those collisions?
>> 
>>  Excuse me, but do you mean that there are undocumented limits on what
>>  files can be named to, and how many files with similar or random
>>  names in a ReiferFS volume?

> No, I'd say it's pretty well documented that reiserfs fails under
> certain hash collision conditions instead of continueing to work
> (albeit more slowly).

> The nature of the hash collisions must be pretty obvious if a shell
> script can be written to demonstrate the problem.
>> 
>>  This sounds bad...

> It's a risk assessment.  What are the odds of your normal data sets
> hitting the bug or of someone with malicious intent introducing
> a demonstration program vs the performance hit of a filesystem
> without the problem.

  How can I assess the risk, if I do not know how to produce the bugs?
  You say certain conditions. But from what I read earlier in the
  thread, a directory with a fonts in them.....?

> All filesystems will fail or suffer degraded performance under
> certain conditions, you need to determine what conditions are acceptable
> for your data.

  Slow can be acceptable. But failing? No, a filesystem should not
  fail.

-- 


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 16:29           ` Spam
@ 2005-01-06 16:56             ` Chris Dukes
  0 siblings, 0 replies; 72+ messages in thread
From: Chris Dukes @ 2005-01-06 16:56 UTC (permalink / raw)
  To: Spam; +Cc: reiserfs-list

On Thu, Jan 06, 2005 at 05:29:39PM +0100, Spam wrote:
> 
> > It's a risk assessment.  What are the odds of your normal data sets
> > hitting the bug or of someone with malicious intent introducing
> > a demonstration program vs the performance hit of a filesystem
> > without the problem.
> 
>   How can I assess the risk, if I do not know how to produce the bugs?
>   You say certain conditions. But from what I read earlier in the
>   thread, a directory with a fonts in them.....?

Since the concepts of simulators, hash function analysis, and dataset
modelling seem to escape you, perhaps you need to go for the black
and white "Is any risk acceptable," given anecdotal data of one
unexpected failing condition and one script that can regularly create
the failing condition.
> 
> > All filesystems will fail or suffer degraded performance under
> > certain conditions, you need to determine what conditions are acceptable
> > for your data.
> 
>   Slow can be acceptable. But failing? No, a filesystem should not
>   fail.

It should not fail 
1) When media fails
2) When transport hardware is not compliant with specs (permanently on
write caching anyone?)
3) Media has a limited lifetime
...

One thing I don't think I ever saw in this thread was
1) How old was the drive that saw the problem.
2) What was the drive lifetime used to calculate it's MTBF.
-- 
Chris Dukes
Warning: Do not use the reflow toaster oven to prepare foods after
it has been used for solder paste reflow. 
http://www.stencilsunlimited.com/stencil_article_page5.htm

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 14:27   ` Lehmann 
  2005-01-06 15:56     ` Hans Reiser
@ 2005-01-06 18:55     ` Edward Shishkin
  2005-01-07 17:26       ` Lehmann 
  1 sibling, 1 reply; 72+ messages in thread
From: Edward Shishkin @ 2005-01-06 18:55 UTC (permalink / raw)
  To: pcg; +Cc: Alex Zarochentsev, reiserfs-list, stefan

pcg( Marc)@goof(A.).(Lehmann )com wrote:

>On Thu, Jan 06, 2005 at 03:45:06PM +0300, Alex Zarochentsev <zam@namesys.com> wrote:
>  
>
>>>generic bug in handling hash collisions?
>>>      
>>>
>>Tea hash is designed to be more resistant.  
>>    
>>

Actually this can not be more resistant as it use the same 32-bit output 
size. So to find
a collision you just need to find hashes of  2^16 = 65536 random documents.

>
>As the example posted shows, tea doesn't look better, it generates
>nicely-looking collisions, too.
>
>  
>

>I'd suggest getting rid of reiserfs on anything important. I can't have it
>when my filesystem randomly returns errors when it should be working.
>
>I wonder wether this hasn't any security relevance, as it allows attackers
>easily to create filename holes in the filesystem that even root cannot
>override.
>  
>

It should be a weighty reason to use strong hash function for creating 
entries because
stable hash means bad performance and more occupied place in stat-data: 
I am not
sure that even 160 bit will guarantee absence of collision for a long time..

Edward.

>Thanks for the suggestion, though! However, the workaround I currently use
>(delete the dir, reinstall) works better, as it doesn't destroy debian's
>idea of the filesystem layout.
>
>  
>


^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 16:26         ` Chris Dukes
  2005-01-06 16:29           ` Spam
@ 2005-01-07 17:22           ` Hans Reiser
  2005-01-07 17:28             ` Chris Dukes
  2005-01-07 23:27             ` flush earlier? (was Re: Congratulations! we have got hash function screwed up) David Masover
  1 sibling, 2 replies; 72+ messages in thread
From: Hans Reiser @ 2005-01-07 17:22 UTC (permalink / raw)
  To: Chris Dukes; +Cc: Spam, reiserfs-list

Chris Dukes wrote:

>
>
>All filesystems will fail or suffer degraded performance under
>certain conditions, you need to determine what conditions are acceptable
>for your data.
>
>  
>
and each generation of software reduces the extent of such conditions.  
Reiser4 fixes this problem cleanly.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-06 18:55     ` Congratulations! we have got hash function screwed up Edward Shishkin
@ 2005-01-07 17:26       ` Lehmann 
  0 siblings, 0 replies; 72+ messages in thread
From: Lehmann  @ 2005-01-07 17:26 UTC (permalink / raw)
  To: Edward Shishkin; +Cc: Alex Zarochentsev, reiserfs-list, stefan

On Thu, Jan 06, 2005 at 09:55:20PM +0300, Edward Shishkin <edward@namesys.com> wrote:
> >On Thu, Jan 06, 2005 at 03:45:06PM +0300, Alex Zarochentsev 
> ><zam@namesys.com> wrote:
> >>>
> >>Tea hash is designed to be more resistant.  
> >>   
> >>
> 
> Actually this can not be more resistant as it use the same 32-bit output 
> size.

Sure it can, filenames are not randomly distributed, so your argument doesn't
suffice to show that tea cannot be more resistent, as it could be more
resistent for other reasons.

That's why I originally wrote "nicely-looking", which (if it wasn't clear)
was meant to say that filenames with somewhat similar names do collide
even with tea, which suppossedly was chosen to avoid this case.

> So to find a collision you just need to find hashes of 2^16 = 65536
> random documents.

True. It's even worse if these collisions happen to filenames occuring in
practise.

(I also agree to the rest of your mail)

-- 
                The choice of a
      -----==-     _GNU_
      ----==-- _       generation     Marc Lehmann
      ---==---(_)__  __ ____  __      pcg@goof.com
      --==---/ / _ \/ // /\ \/ /      http://schmorp.de/
      -=====/_/_//_/\_,_/ /_/\_\      XX11-RIPE

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-07 17:22           ` Hans Reiser
@ 2005-01-07 17:28             ` Chris Dukes
  2005-01-07 23:27             ` flush earlier? (was Re: Congratulations! we have got hash function screwed up) David Masover
  1 sibling, 0 replies; 72+ messages in thread
From: Chris Dukes @ 2005-01-07 17:28 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Spam, reiserfs-list

On Fri, Jan 07, 2005 at 09:22:02AM -0800, Hans Reiser wrote:
> Chris Dukes wrote:
> 
> >
> >
> >All filesystems will fail or suffer degraded performance under
> >certain conditions, you need to determine what conditions are acceptable
> >for your data.
> >
> > 
> >
> and each generation of software reduces the extent of such conditions.  
> Reiser4 fixes this problem cleanly.

Should reduce.  I currently have the misfortune of supporting
some software that seems to be of the "For every bug fixed atleast
one bug with an equal level of impact is introduced" variety.

-- 
Chris Dukes
hello stacklimit my old friend, I've come to visit you again

^ permalink raw reply	[flat|nested] 72+ messages in thread

* flush earlier?  (was Re: Congratulations! we have got hash function screwed up)
  2005-01-07 17:22           ` Hans Reiser
  2005-01-07 17:28             ` Chris Dukes
@ 2005-01-07 23:27             ` David Masover
  2005-01-07 23:52               ` Hans Reiser
  1 sibling, 1 reply; 72+ messages in thread
From: David Masover @ 2005-01-07 23:27 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Chris Dukes, Spam, reiserfs-list

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hans Reiser wrote:
| Chris Dukes wrote:
|
|>
|>
|> All filesystems will fail or suffer degraded performance under
|> certain conditions, you need to determine what conditions are acceptable
|> for your data.
|>
|>
|>
| and each generation of software reduces the extent of such conditions.
| Reiser4 fixes this problem cleanly.

I think Reiser4's degraded performance condition is when it gets lots of
RAM.  First, a disclaimer -- I don't have the latest reiser4 patch.  But
in all versions of the FS, I've found that if I'm ever trying to do
anything when reiser finally decides to flush to disk, basically my
whole system is locked up.  I haven't tested, but I think this would
actually be worse with more RAM, because it would be longer until the
flush was forced, so each flush would take longer.

What is needed is some sort of estimator or estimate.  An estimator
would be something that would flush when, based on recent fs load, it
was reasonable to expect that RAM would fill up just as the flush was
completing.  An estimate would be to flush if a certain percentage of
RAM was full, and to go to synchronous mode if memory usage didn't go
back below that percentage.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iQIVAwUBQd8a4HgHNmZLgCUhAQKb+A//fps5uIUtnfrHGe0y3itGbggWGkDqV80O
MPlLDqmlyMRWAGij5f5F345OU7GBi7VVCkgXMJbemO7TJ82Yr8OQXPO1Ywrt1rL5
bdPej6rZD7RW7+2DRlT78XdvP2ZbqKQNjvIlmLvQIziGk2BfGxwzxt1P5vztIvfE
HtqiP9ImSbxFkjRjZmdeQC8koyEK0vqYeasRma2pC1ZsABXxRHdRPnm/OT1gZc5D
YAYqyHCcs+RFr6+KYs9TSh4H5pNcHq1kYUEcoNFI0ubMSnNK+DvlrhBPhi5fkxPT
8D86nvphYFn7gKob1A9DZzKQz0qFcbhY35KW8g4hwHUrAyQWQtOU5sh+eIF0hV+j
LRhI6Ao6lgSAyq/GuxVwZ64emiM70JvoRNDLt4YVs4VDWnTcarhUNCikEUjvSCqj
Ux2RWI+6YILFso1D0Zl+4J0hkSxZPurzvRGL9TWT3Gwkdlyfk77jRbgfvZ5oy2aC
Q4gv5h/j+7AngfzlYY8n20zlocGoW8GEpaSgWa4VimTwAuHYEoDMu0TdlHLZzMB6
u2CBoYy7brCaxUpaV7np5E/rfEPcccINVgea0/dVhvkVGjyMQHb8msyX1r4IXRRC
jVxP2veoyxsyTTmcOCAViF6Epe4fVIhTnafH7hVmcWEk0Vt+aHSbAJtYK9oFTz7E
RX0WHi9z4fs=
=UeYu
-----END PGP SIGNATURE-----

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: flush earlier?  (was Re: Congratulations! we have got hash function screwed up)
  2005-01-07 23:27             ` flush earlier? (was Re: Congratulations! we have got hash function screwed up) David Masover
@ 2005-01-07 23:52               ` Hans Reiser
  2005-01-08  5:03                 ` David Masover
  0 siblings, 1 reply; 72+ messages in thread
From: Hans Reiser @ 2005-01-07 23:52 UTC (permalink / raw)
  To: David Masover; +Cc: Chris Dukes, Spam, reiserfs-list

David Masover wrote:

> Hans Reiser wrote:
> | Chris Dukes wrote:
> |
> |>
> |>
> |> All filesystems will fail or suffer degraded performance under
> |> certain conditions, you need to determine what conditions are 
> acceptable
> |> for your data.
> |>
> |>
> |>
> | and each generation of software reduces the extent of such conditions.
> | Reiser4 fixes this problem cleanly.
>
> I think Reiser4's degraded performance condition is when it gets lots of
> RAM.  First, a disclaimer -- I don't have the latest reiser4 patch.  But
> in all versions of the FS, I've found that if I'm ever trying to do
> anything when reiser finally decides to flush to disk, basically my
> whole system is locked up.  I haven't tested, but I think this would
> actually be worse with more RAM, because it would be longer until the
> flush was forced, so each flush would take longer.
>
> What is needed is some sort of estimator or estimate.  An estimator
> would be something that would flush when, based on recent fs load, it
> was reasonable to expect that RAM would fill up just as the flush was
> completing.  An estimate would be to flush if a certain percentage of
> RAM was full, and to go to synchronous mode if memory usage didn't go
> back below that percentage.

We need to throttle rather than flush, so as to ensure that for every 
page added to an atom, at least X pages must reach disk, until close to 
the end of the atom when we just flush it out.

Another missing and needed feature....

Hans

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: flush earlier?  (was Re: Congratulations! we have got hash function screwed up)
  2005-01-07 23:52               ` Hans Reiser
@ 2005-01-08  5:03                 ` David Masover
  2005-01-08 20:48                   ` Hans Reiser
  0 siblings, 1 reply; 72+ messages in thread
From: David Masover @ 2005-01-08  5:03 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Chris Dukes, Spam, reiserfs-list

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1



Hans Reiser wrote:
| David Masover wrote:
|
|> Hans Reiser wrote:
|> | Chris Dukes wrote:
|> |
|> |>
|> |>
|> |> All filesystems will fail or suffer degraded performance under
|> |> certain conditions, you need to determine what conditions are
|> acceptable
|> |> for your data.
|> |>
|> |>
|> |>
|> | and each generation of software reduces the extent of such conditions.
|> | Reiser4 fixes this problem cleanly.
|>
|> I think Reiser4's degraded performance condition is when it gets lots of
|> RAM.  First, a disclaimer -- I don't have the latest reiser4 patch.  But
|> in all versions of the FS, I've found that if I'm ever trying to do
|> anything when reiser finally decides to flush to disk, basically my
|> whole system is locked up.  I haven't tested, but I think this would
|> actually be worse with more RAM, because it would be longer until the
|> flush was forced, so each flush would take longer.
|>
|> What is needed is some sort of estimator or estimate.  An estimator
|> would be something that would flush when, based on recent fs load, it
|> was reasonable to expect that RAM would fill up just as the flush was
|> completing.  An estimate would be to flush if a certain percentage of
|> RAM was full, and to go to synchronous mode if memory usage didn't go
|> back below that percentage.
|
|
| We need to throttle rather than flush, so as to ensure that for every
| page added to an atom, at least X pages must reach disk, until close to
| the end of the atom when we just flush it out.

I'm not sure I understand that.  Is the idea of that to build up a write
buffer which insists on flushing bytes off the front as they are added
onto the back, without flushing huge chunks at once?

Would that be as efficient at packing (no fragmentation)?

But the main problem was it would basically lock the fs entirely -- no
access at all.  This especially hurts with read access.  When fs is
under heavy load, I can wait several minutes to start a browser,
especially when a lot of writing is happening.

For now, how about a quick fix.  Can we force a flush when we get to
80%?  Is there a synchronous mode, and can we use that after 80%, or do
we have to fake it by trying to flush every few seconds?

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iQIVAwUBQd9pm3gHNmZLgCUhAQLj6BAAnzjp7PNnNr952pOg1LLckELJbG0Cq/Kj
smMR5VknCSeM0ho30gnjWOzI9FE/O1nAk6sMEdEcX6MGwCB8KUeQLo2OebIQCMi2
PdzB+nh6VImqeOZkgccBMQL6OK7o8QHYhQ/QtqBQ+3UIXTuJdBG4YZjNAkYDXF3s
ObJcLDKwaN3yZCzrLmcpK2+BQSckBu797Z3mwnbj1jpOiki76BqCrMCI7+081V/j
A4aOBeSQwKXfsld4vdIbWZzKF2lpUGx1jzfynYMVfQ9sj/aqtpKMbwDWiUOBdSv+
OX01ARcU1O90ZdL+zjT6qhJXgXOjbZu5Tpp9slJaYuvSbv516DJn2l1A+ADlbEkk
fxEHx2RRT0rqtepiBKu+QsGrO3ixl4lGZtyx4d8cj/yRjbHtNNI0YYg5w1kJ+udE
H93Y3JpdJi6iBdKvblJpRW1bA/HR9g+p/PsQy6jdwcKthzjTvFwXjSfGCy0u+nF7
mKry8Axd3QiANs4x8Pc6CDdbFEiPjo4FVpfBASb5qq7TbQVSSpvzEoWPn5AE5xE9
ZBuHireWoGG5XzwxkhfS14Y6letoKx1tbkZHSULwlTdmNffxIXDf0cc//G4H8waU
T3T8FOHQZtK+lnAMD7ImeNInJlPKde90JGYUg1YGeVcJqgfvroc+Ke0F4YTlhKi6
zaj+QkeKf7Q=
=ev3/
-----END PGP SIGNATURE-----

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: flush earlier?  (was Re: Congratulations! we have got hash function screwed up)
  2005-01-08  5:03                 ` David Masover
@ 2005-01-08 20:48                   ` Hans Reiser
  2005-01-09 23:26                     ` David Masover
  0 siblings, 1 reply; 72+ messages in thread
From: Hans Reiser @ 2005-01-08 20:48 UTC (permalink / raw)
  To: David Masover; +Cc: Chris Dukes, Spam, reiserfs-list

David Masover wrote:

>
>
> Hans Reiser wrote:
> | David Masover wrote:
> |
> |> Hans Reiser wrote:
> |> | Chris Dukes wrote:
> |> |
> |> |>
> |> |>
> |> |> All filesystems will fail or suffer degraded performance under
> |> |> certain conditions, you need to determine what conditions are
> |> acceptable
> |> |> for your data.
> |> |>
> |> |>
> |> |>
> |> | and each generation of software reduces the extent of such 
> conditions.
> |> | Reiser4 fixes this problem cleanly.
> |>
> |> I think Reiser4's degraded performance condition is when it gets 
> lots of
> |> RAM.  First, a disclaimer -- I don't have the latest reiser4 
> patch.  But
> |> in all versions of the FS, I've found that if I'm ever trying to do
> |> anything when reiser finally decides to flush to disk, basically my
> |> whole system is locked up.  I haven't tested, but I think this would
> |> actually be worse with more RAM, because it would be longer until the
> |> flush was forced, so each flush would take longer.
> |>
> |> What is needed is some sort of estimator or estimate.  An estimator
> |> would be something that would flush when, based on recent fs load, it
> |> was reasonable to expect that RAM would fill up just as the flush was
> |> completing.  An estimate would be to flush if a certain percentage of
> |> RAM was full, and to go to synchronous mode if memory usage didn't go
> |> back below that percentage.
> |
> |
> | We need to throttle rather than flush, so as to ensure that for every
> | page added to an atom, at least X pages must reach disk, until close to
> | the end of the atom when we just flush it out.
>
> I'm not sure I understand that.  Is the idea of that to build up a write
> buffer which insists on flushing bytes off the front as they are added
> onto the back, without flushing huge chunks at once?

Yes.

>
> Would that be as efficient at packing (no fragmentation)?

No.

>
> But the main problem was it would basically lock the fs entirely -- no
> access at all.

Which is why we need to allow fusing a little bit.

> This especially hurts with read access.

Read access?  Do you have one CPU?  Maybe the problem isn't what I 
thought it was.

> When fs is
> under heavy load, I can wait several minutes to start a browser,
> especially when a lot of writing is happening.
>
> For now, how about a quick fix.  Can we force a flush when we get to
> 80%?

VM already has an asynchronous flush mechanism.  We just need to clean 
up our interaction with it a bit to smooth things out.

> Is there a synchronous mode, and can we use that after 80%, or do
> we have to fake it by trying to flush every few seconds?
>

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: flush earlier?  (was Re: Congratulations! we have got hash function screwed up)
  2005-01-08 20:48                   ` Hans Reiser
@ 2005-01-09 23:26                     ` David Masover
  0 siblings, 0 replies; 72+ messages in thread
From: David Masover @ 2005-01-09 23:26 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Chris Dukes, Spam, reiserfs-list

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hans Reiser wrote:
[...]
|> I'm not sure I understand that.  Is the idea of that to build up a write
|> buffer which insists on flushing bytes off the front as they are added
|> onto the back, without flushing huge chunks at once?
|
|
| Yes.
|
|>
|> Would that be as efficient at packing (no fragmentation)?
|
|
| No.

Really?  I'd think that it would waste more CPU, but you could actually
end up packing better, if you allocate for the end of the buffer as it's
flushed, because you can look at the whole buffer and what's already on
disk.  What would be a good idea with one flush could be inefficient by
the next, and this system can react faster to that change.

But, there's still laptops, and there's CPU usage.

Will it be hard to implement both approaches?  One example would be
someone trying to create a clean fs "image", or someone who's installing
software, so they don't care how long it takes, but they do want it to
be tightly packed.  Someone who can't afford the repacker :(

|> But the main problem was it would basically lock the fs entirely -- no
|> access at all.
|
|
| Which is why we need to allow fusing a little bit.

What do you mean by "fusing" here?

|> This especially hurts with read access.
|
|
| Read access?  Do you have one CPU?  Maybe the problem isn't what I
| thought it was.

Yes, I have one CPU.  But I only have one reiser4 drive per machine.  I
think that reiser4 locks all fs access until the flush is done, or at
least it used to.  But even if it didn't, reads take _forever_ when the
disk is in use, and reads are usually more urgent than writes (hence
asynchronous writes).  And the disk will be in use trying to write a
huge chunk of data for some time, and will take longer the more RAM you
have.

|> For now, how about a quick fix.  Can we force a flush when we get to
|> 80%?
|
|
| VM already has an asynchronous flush mechanism.  We just need to clean
| up our interaction with it a bit to smooth things out.

What I want to avoid is doing away with lazy writes entirely (laptops)
and doing something like ext3's "flush every five seconds" concept.  But
I also want to avoid truly _massive_ spikes in disk activity.

That's what the 80% is about.  You still get lazy writes, but they can
be at a lower priority than reads, hopefully minimizing seeks, too --
thrashing between reading and writing.  But when the lazy write happens,
you don't slow down the rest of the system, because it still has some
room to write.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iQIVAwUBQeG9l3gHNmZLgCUhAQJFqQ/9FH9OVdTEM3DLCfRef0Xc6N+ZA4Wv4kCp
T1W4OHbxq3AL31TcLv5hRDA8xhv3ivp/ODQfcGNyM2mg2UcV61xwHWxvUwQmBTPG
0Nl2o4uM51LuoqW4V1OoVPD4sSr/Ak08ahyeG1vZY5T5kpLk+V6Ms6jklRrQqTDZ
r6PgBvDyw7C9JVuX911MMIzOJpDjx2vJavJZu0C4M/X2k9Tl2A99O2gpLB3Vuvcn
kCKEctXb6K+bt+kiFrnsMl4tqBhrc6fYMSQq+ZP4uU1Ttzxz1T59lT+gtMf2yqZL
hUZgMZ/DNpEdyzU7QOTMFOHFr5l73D4YainEJ7XP9zHzMf4tkHEwUiE3/H7/0ktD
FIqAnJ7khiDwC9r/yqAliecqWg+fspvyLy85aVsmG/5RWbEMYHhyxM4e+vEM5EMs
5dJ1XpSuoVrujiKFx2h/5drQ4KFui9bplbfoSahEsFpdcJ+MtNYHvNOqJ5l2ZLfe
s75j4DtsbWTXK/tGFRmIcL8gPzlUszfXrRVtQw+bhIxAtxmoABuqbCMYDd9LyclO
Kyzy0Ha0AYQy5pIFwm5cTCkC7GBThLBzPk7863J4jxHJ23izg5B/IyEEhOk9NOx2
jC3zf6ZwD1nEIWbZ/yH4s2J8kXw/wno92GLwxI5hpPFPqZxxGMG7xeND1vhcHBvF
1fozX/nMpN4=
=puoc
-----END PGP SIGNATURE-----

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2004-12-30 16:40       ` Hans Reiser
  2004-12-30 16:51         ` Matthias Andree
@ 2005-01-18 21:17         ` Grzegorz Jaśkiewicz
  2005-01-19 16:06           ` Hans Reiser
  1 sibling, 1 reply; 72+ messages in thread
From: Grzegorz Jaśkiewicz @ 2005-01-18 21:17 UTC (permalink / raw)
  To: Hans Reiser; +Cc: Matthias Andree, hihone, reiserfs-list

On Thu, 30 Dec 2004 08:40:51 -0800, Hans Reiser <reiser@namesys.com> wrote:
> Fixing hash collisions in V3 to do them the way V4 does them would
> create more bugs and user disruption than the current bug we have all
> lived with for 5 years until now.  If someone thinks it is a small
> change to fix it, send me a patch.  Better by far to fix bugs in V4,
> which is pretty stable these days.

As I understeand, tea hash is based on tea (tiny encryption aglo),
which was the cause of xbox-linux sucess, and few others.
Pleas consider updating it to use xxtea algo. I know, it won't be
backward compatbile, but well.
Where is about all the others, I don't use them, and for me tea is the
only resonable hash to use on systems where I have very much great
number of files per directory (to name it, Maildirs).
Never had such problem myself, every hash function has a weaknes.
Nothing new. But providing another, much stronger hash, or correct tea
hash to use xxtea, would be something good indeed.

-- 
GJ

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-18 21:17         ` Grzegorz Jaśkiewicz
@ 2005-01-19 16:06           ` Hans Reiser
  2005-01-19 22:41             ` David Masover
  2005-01-20 13:18             ` Edward Shishkin
  0 siblings, 2 replies; 72+ messages in thread
From: Hans Reiser @ 2005-01-19 16:06 UTC (permalink / raw)
  To: Grzegorz Jaśkiewicz, Edward Shishkin
  Cc: Matthias Andree, hihone, reiserfs-list

Grzegorz Jaśkiewicz wrote:

>On Thu, 30 Dec 2004 08:40:51 -0800, Hans Reiser <reiser@namesys.com> wrote:
>  
>
>>Fixing hash collisions in V3 to do them the way V4 does them would
>>create more bugs and user disruption than the current bug we have all
>>lived with for 5 years until now.  If someone thinks it is a small
>>change to fix it, send me a patch.  Better by far to fix bugs in V4,
>>which is pretty stable these days.
>>    
>>
>
>As I understeand, tea hash is based on tea (tiny encryption aglo),
>which was the cause of xbox-linux sucess, and few others.
>Pleas consider updating it to use xxtea algo. I know, it won't be
>backward compatbile, but well.
>Where is about all the others, I don't use them, and for me tea is the
>only resonable hash to use on systems where I have very much great
>number of files per directory (to name it, Maildirs).
>Never had such problem myself, every hash function has a weaknes.
>Nothing new. But providing another, much stronger hash, or correct tea
>hash to use xxtea, would be something good indeed.
>
>  
>
Edward, please look into whether we should use xxtea in Reiser4, and 
make a recommendation to me.  We aren't changing V3, it is stable and I 
want to leave it that way.

Hans

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-19 16:06           ` Hans Reiser
@ 2005-01-19 22:41             ` David Masover
  2005-01-20 13:18             ` Edward Shishkin
  1 sibling, 0 replies; 72+ messages in thread
From: David Masover @ 2005-01-19 22:41 UTC (permalink / raw)
  To: Hans Reiser
  Cc: Grzegorz Jaśkiewicz, Edward Shishkin, Matthias Andree,
	hihone, reiserfs-list

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hans Reiser wrote:
| Grzegorz Jaśkiewicz wrote:
|
|> On Thu, 30 Dec 2004 08:40:51 -0800, Hans Reiser <reiser@namesys.com>
|> wrote:
|>
|>
|>> Fixing hash collisions in V3 to do them the way V4 does them would
|>> create more bugs and user disruption than the current bug we have all
|>> lived with for 5 years until now.  If someone thinks it is a small
|>> change to fix it, send me a patch.  Better by far to fix bugs in V4,
|>> which is pretty stable these days.
|>>
|>
|>
|> As I understeand, tea hash is based on tea (tiny encryption aglo),
|> which was the cause of xbox-linux sucess, and few others.
|> Pleas consider updating it to use xxtea algo. I know, it won't be
|> backward compatbile, but well.
|> Where is about all the others, I don't use them, and for me tea is the
|> only resonable hash to use on systems where I have very much great
|> number of files per directory (to name it, Maildirs).
|> Never had such problem myself, every hash function has a weaknes.
|> Nothing new. But providing another, much stronger hash, or correct tea
|> hash to use xxtea, would be something good indeed.
|>
|>
|>
| Edward, please look into whether we should use xxtea in Reiser4, and
| make a recommendation to me.  We aren't changing V3, it is stable and I
| want to leave it that way.

You don't change, it gets forked.  You have a strange definition of
"stable".
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iQIVAwUBQe7h/HgHNmZLgCUhAQL5OA//TencDKPIk4QR8u3GIJC0gUp2vF+eS3H7
IQTZTufZqXTHhEP3n6ySNied7jeGi7BHL2Oxk2wY+nXWCY5mLGtvBHyOvWaP8oBF
sL7lNqhifljwJkA3yvqSFl7d5BUF3bTZTQ83BYnAPYdZccL4WRuFtxS7jH4xdK0g
lJV7w0x2nNkboy/GWRlqXE1GauBzoLBVJ9twsplxX5RSbHuPECGKZBs/fx2yNN5Y
zkqWxRpRWKIUgkiH/ziE/amAe1BHbV3ZkPx2VE54lbrz1TQox2qEHzOFW9ABQTUh
Nu+W3omw/1OGmZ2sF9X3kBIUhEz3/VK/wk1fG1dKAuhPMfVmjVTVCAyYiRPqZHsC
ZWwKrHxqMbTBvOYHHN8KAyChSLU1Qi2Lm5BnncfdUZ2hM5nQqOBoajX1ymSNiq5f
ykww8P02tJobX9u0UncMfi7wGsA/jW3k7GgWPjHM4A+wxACSiVIWRhR4gcqMFFsc
9nobQSiIiENpeAP6wmJrupPRMPcLUQUGZe+UBjyCHTo9ygAWngBqByTR+74tRrz+
OBcL1WforIOHS6FEHLGyEJVmsjz/0ibOuM6PAUHuNcrDlfa2dMc09Y1YPLXjvCjl
TPRh0OKel3vqq+tFFuh8Xvl/rD4BBM+eUxDjgpqH2NT9DbisYbDpkx+5WwSQMUqk
Z9kOPizEF7o=
=MxVy
-----END PGP SIGNATURE-----

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-19 16:06           ` Hans Reiser
  2005-01-19 22:41             ` David Masover
@ 2005-01-20 13:18             ` Edward Shishkin
  2005-01-20 23:43               ` Grzegorz Jaśkiewicz
  1 sibling, 1 reply; 72+ messages in thread
From: Edward Shishkin @ 2005-01-20 13:18 UTC (permalink / raw)
  To: Hans Reiser
  Cc: Grzegorz Jaśkiewicz, Matthias Andree, hihone, reiserfs-list

Hans Reiser wrote:

> Grzegorz Jaśkiewicz wrote:
>
>> On Thu, 30 Dec 2004 08:40:51 -0800, Hans Reiser <reiser@namesys.com> 
>> wrote:
>>  
>>
>>> Fixing hash collisions in V3 to do them the way V4 does them would
>>> create more bugs and user disruption than the current bug we have all
>>> lived with for 5 years until now.  If someone thinks it is a small
>>> change to fix it, send me a patch.  Better by far to fix bugs in V4,
>>> which is pretty stable these days.
>>>   
>>
>>
>> As I understeand, tea hash is based on tea (tiny encryption aglo),
>> which was the cause of xbox-linux sucess, and few others.
>> Pleas consider updating it to use xxtea algo. I know, it won't be
>> backward compatbile, but well.
>> Where is about all the others, I don't use them, and for me tea is the
>> only resonable hash to use on systems where I have very much great
>> number of files per directory (to name it, Maildirs).
>> Never had such problem myself, every hash function has a weaknes.
>> Nothing new. But providing another, much stronger hash, or correct tea
>> hash to use xxtea, would be something good indeed.
>>
>>  
>>
> Edward, please look into whether we should use xxtea in Reiser4, and 
> make a recommendation to me.  We aren't changing V3, it is stable and 
> I want to leave it that way.
>
> Hans
>
>
I found that:
1. xxtea is a correction to the Blocktea algorithm against the attack 
not related to the original tea or xtea.
2. xtea is an upgrade of tea algo which eliminates two minor weakness of 
the last one related to key attacks,
and not related to the collisions of tea hash (for each name tea hash 
uses ciphering by the key constructed
by  this name).
So imho it doesn't make sense to upgrade the core rounds used in tea 
hash. Any objections?

Edward.

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-20 13:18             ` Edward Shishkin
@ 2005-01-20 23:43               ` Grzegorz Jaśkiewicz
  2005-01-21  9:31                 ` Edward Shishkin
  0 siblings, 1 reply; 72+ messages in thread
From: Grzegorz Jaśkiewicz @ 2005-01-20 23:43 UTC (permalink / raw)
  To: Edward Shishkin; +Cc: Hans Reiser, Matthias Andree, hihone, reiserfs-list

All I know is that xxtea is fixed tea algo. If that fixes weakness in
crypto algo, than so it should make hashing better.
No doubt there is no ideal hash algo, but if base algo has weaknes,
using fixed one only can be better, Right ?

-- 
GJ

^ permalink raw reply	[flat|nested] 72+ messages in thread

* Re: Congratulations! we have got hash function screwed up
  2005-01-20 23:43               ` Grzegorz Jaśkiewicz
@ 2005-01-21  9:31                 ` Edward Shishkin
  0 siblings, 0 replies; 72+ messages in thread
From: Edward Shishkin @ 2005-01-21  9:31 UTC (permalink / raw)
  To: Grzegorz Jaśkiewicz
  Cc: Hans Reiser, Matthias Andree, hihone, reiserfs-list

Grzegorz Jaśkiewicz wrote:

>All I know is that xxtea is fixed tea algo. If that fixes weakness in
>crypto algo, than so it should make hashing better.
>  
>

Not necessary. The xtea, xxtea don't fix mixing portion of tea in 
accordance with the
following papers: http://algolist.manual.ru/defence/well_known/tea.zip

>No doubt there is no ideal hash algo, but if base algo has weaknes,
>using fixed one only can be better, Right ?
>
>  
>
Nop ;)
We don't need to keep a track of teacore upgrades against the attacks 
that allow to reveal secret key.
Tea hash uses tea algorithm only for mixing, not for private ciphering, 
so plain text is known
(this is hardcoded vector  u32 k[] = { 0x9464a485, 0x542e1a94}), keys 
(constructed by names)
are known, output values are known.
Nothing to reveal - nothing to upgrade..

Edward.


^ permalink raw reply	[flat|nested] 72+ messages in thread

end of thread, other threads:[~2005-01-21  9:31 UTC | newest]

Thread overview: 72+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-12-28 22:12 Congratulations! we have got hash function screwed up Lehmann 
2004-12-29 18:55 ` Stefan Traby
2004-12-29 21:04   ` Lehmann 
2004-12-29 21:05   ` Hans Reiser
2004-12-29 21:43     ` Lehmann 
2004-12-29 21:46       ` Christian Iversen
2004-12-29 22:27         ` Lehmann 
2004-12-30  2:05       ` Hans Reiser
2004-12-30 10:22         ` Matthias Andree
2004-12-30 17:02         ` Lehmann 
2005-01-06 12:45 ` Alex Zarochentsev
2005-01-06 14:27   ` Lehmann 
2005-01-06 15:56     ` Hans Reiser
2005-01-06 16:13       ` Spam
2005-01-06 16:26         ` Chris Dukes
2005-01-06 16:29           ` Spam
2005-01-06 16:56             ` Chris Dukes
2005-01-07 17:22           ` Hans Reiser
2005-01-07 17:28             ` Chris Dukes
2005-01-07 23:27             ` flush earlier? (was Re: Congratulations! we have got hash function screwed up) David Masover
2005-01-07 23:52               ` Hans Reiser
2005-01-08  5:03                 ` David Masover
2005-01-08 20:48                   ` Hans Reiser
2005-01-09 23:26                     ` David Masover
2005-01-06 18:55     ` Congratulations! we have got hash function screwed up Edward Shishkin
2005-01-07 17:26       ` Lehmann 
  -- strict thread matches above, loose matches on Subject: below --
2004-12-30 11:52 Yiannis Mavroukakis
2004-12-30 12:40 ` Matthias Andree
2004-12-30 12:59   ` Cal
2004-12-30 14:18     ` Matthias Andree
2004-12-30 16:40       ` Hans Reiser
2004-12-30 16:51         ` Matthias Andree
2005-01-18 21:17         ` Grzegorz Jaśkiewicz
2005-01-19 16:06           ` Hans Reiser
2005-01-19 22:41             ` David Masover
2005-01-20 13:18             ` Edward Shishkin
2005-01-20 23:43               ` Grzegorz Jaśkiewicz
2005-01-21  9:31                 ` Edward Shishkin
2004-12-30 17:07       ` Esben Stien
2004-12-30 17:15         ` Christian Iversen
2004-12-30 17:47         ` Sander
2004-12-30 17:59           ` Esben Stien
2004-12-30 18:30             ` Sander
2004-12-30 18:46               ` Esben Stien
2004-12-30 18:49                 ` Chris Dukes
2004-12-30 19:21                 ` Sander
2004-12-30 19:29                   ` Esben Stien
2004-12-30 18:16           ` Esben Stien
2004-12-30 18:26           ` Spam
2004-12-30 20:41             ` Tom Vier
2004-12-30 23:14             ` Matthias Andree
2004-12-30 23:25               ` Spam
2004-12-31  4:11               ` Hans Reiser
2004-12-31  8:36                 ` Matthias Andree
2004-12-30 20:08         ` Hans Reiser
2004-12-30 21:55           ` Esben Stien
2004-12-31  4:05           ` David Masover
2004-12-31  4:26             ` Hans Reiser
2004-12-31  5:59               ` David Masover
2004-12-30 20:57         ` Adrian Ulrich
2004-12-30 21:01           ` Stefan Traby
2004-12-30 21:20             ` brianmas
2004-12-30 17:09 ` Lehmann 
2004-12-30 20:11   ` Hans Reiser
2004-12-30 13:24 Yiannis Mavroukakis
2004-12-30 14:11 ` Matthias Andree
2004-12-30 17:22 Yiannis Mavroukakis
2004-12-30 18:16 Burnes, James
2004-12-30 18:36 ` Esben Stien
2004-12-30 19:26   ` Matthias Andree
2004-12-30 19:24 ` Matthias Andree
2004-12-30 20:25 ` Hans Reiser

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.