From mboxrd@z Thu Jan 1 00:00:00 1970 From: studdugie Subject: Re: 15M files Date: Fri, 19 Aug 2005 18:39:44 -0400 Message-ID: <5a59ce53050819153961a71cde@mail.gmail.com> References: <5a59ce53050819144970d07ab0@mail.gmail.com> <43065D7D.2000802@namesys.com> Mime-Version: 1.0 Content-Transfer-Encoding: quoted-printable Return-path: list-help: list-unsubscribe: list-post: Errors-To: flx@namesys.com In-Reply-To: <43065D7D.2000802@namesys.com> Content-Disposition: inline List-Id: Content-Type: text/plain; charset="us-ascii" To: Hans Reiser Cc: reiserfs-mailing-list I can't use V4 because I can't introduce an unstable kernel on the box where the app is running. On 8/19/05, Hans Reiser wrote: > studdugie wrote: >=20 > >Hello. I'm looking to replace a couple Berkeley DB data stores w/ > >regular file system directories backed by reiserfs (3.6). The reason > >is Berkeley DB is slow especially for data w/ little or no locality of > >reference. I'm posting to this list because I would like to get some > >opinions on if reiserfs is suitable for the job. Currently there are > >15,079,597 records in 1 of the database. If I moved to a directory > >based db it would result in 15,079,597 discreet files ranging in sizes > >from 1 byte to 1kb. I was reading the FAQ on the namesys site and it > >mentioned that the r5 hash supports 1,200,000 files w/o collision. > >Since 15M is 12.5x greater I'm expecting massive amounts of > >collisions. So the question becomes how bad should I expect it to be? > >Should I assume the file system can handle it or slow to a crawl? I > >would really appreciate some feedback from the experts before I go > >ripping out the Berkeley DB code. > > > >Thanx. > > > > > > > > > Use V4, it has much better hashiing. >