All of lore.kernel.org
 help / color / mirror / Atom feed
* NFSv3, 2.6, ext3 and dir_index
@ 2005-01-20 16:38 Tobias Diedrich
  2005-01-20 19:26 ` Michael Haverkamp
  0 siblings, 1 reply; 6+ messages in thread
From: Tobias Diedrich @ 2005-01-20 16:38 UTC (permalink / raw)
  To: nfs

[-- Attachment #1: Type: text/plain, Size: 2246 bytes --]

Hi,

I recently switched from a local homedir to a NFS mounted homedir
and noticed some strange things:

I have a Cronjob that goes over my mp3 directory and after the
switch find would print "foo/bar/blubb: No such file or directory"
for a few (approx. 2-3 out of 1928) directories, but not always the same
ones.  Also, my mp3 player program would sometimes stop short in the
middle of a song and skip to the next one, because it got ENOENT on
a read of an open file.

Now first I thought it might just be a problem with my Kernel
version or one of the patches I'm using and upgraded to a more
recent one (currently 2.6.10-ac8-nfsacl on the client and
2.6.10-ac8-imq-nfsacl on the server, I also had the same problem
without the nfsacl patches).

Then I read an older thread about trouble with ext3/dir_index and
NFS in older 2.6 versions and tried disabling dir_index (Which was
enabled on all my ext3 filesystems).  With that the problem
vanished.

Are there any known problems with dir_index and NFS, or is this
maybe a new bug?

Getting a tcpdump of the client<->server traffic proved difficult,
because it is a quite sporadic bug and reproducing it would involve
quite a lot of NFS traffic.  I could reliably trigger this with my
CD/DVD burning script, which generates md5sums for each file and
puts them both into the file MD5SUMS in the current directory, as
well as into another file in my Homedir (But the files being
md5summed also live on another NFS export, so there is a _lot_ of
traffic).

I _do_ have a traffic capture of ls returning "No such file or
directory" on the current directory, then cding up a level, down into the
directory again and then a working ls.

Client fstab entry:
nukunuku:/mnt/space1/ranma /home/ranma nfs hard,intr,bg,udp,rsize=4096,wsize=4096 0 0

/proc/mounts entry:
nukunuku:/mnt/space1/ranma /home/ranma nfs rw,v3,rsize=4096,wsize=4096,hard,intr,udp,lock,addr=nukunuku 0 0

Server export file:
/ melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
/mnt/space1 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
/mnt/space2 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
/mnt/space3 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)

-- 
Tobias						PGP: http://9ac7e0bc.uguu.de

[-- Attachment #2: nfs.tcpdump.gz --]
[-- Type: application/octet-stream, Size: 5397 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* NFSv3, 2.6, ext3 and dir_index
@ 2005-01-20 16:38   ` Tobias Diedrich
  2005-03-14 17:22     ` Chip Salzenberg
  0 siblings, 1 reply; 6+ messages in thread
From: Tobias Diedrich @ 2005-01-20 16:38 UTC (permalink / raw)
  To: nfs

[-- Attachment #1: Type: text/plain, Size: 2246 bytes --]

Hi,

I recently switched from a local homedir to a NFS mounted homedir
and noticed some strange things:

I have a Cronjob that goes over my mp3 directory and after the
switch find would print "foo/bar/blubb: No such file or directory"
for a few (approx. 2-3 out of 1928) directories, but not always the same
ones.  Also, my mp3 player program would sometimes stop short in the
middle of a song and skip to the next one, because it got ENOENT on
a read of an open file.

Now first I thought it might just be a problem with my Kernel
version or one of the patches I'm using and upgraded to a more
recent one (currently 2.6.10-ac8-nfsacl on the client and
2.6.10-ac8-imq-nfsacl on the server, I also had the same problem
without the nfsacl patches).

Then I read an older thread about trouble with ext3/dir_index and
NFS in older 2.6 versions and tried disabling dir_index (Which was
enabled on all my ext3 filesystems).  With that the problem
vanished.

Are there any known problems with dir_index and NFS, or is this
maybe a new bug?

Getting a tcpdump of the client<->server traffic proved difficult,
because it is a quite sporadic bug and reproducing it would involve
quite a lot of NFS traffic.  I could reliably trigger this with my
CD/DVD burning script, which generates md5sums for each file and
puts them both into the file MD5SUMS in the current directory, as
well as into another file in my Homedir (But the files being
md5summed also live on another NFS export, so there is a _lot_ of
traffic).

I _do_ have a traffic capture of ls returning "No such file or
directory" on the current directory, then cding up a level, down into the
directory again and then a working ls.

Client fstab entry:
nukunuku:/mnt/space1/ranma /home/ranma nfs hard,intr,bg,udp,rsize=4096,wsize=4096 0 0

/proc/mounts entry:
nukunuku:/mnt/space1/ranma /home/ranma nfs rw,v3,rsize=4096,wsize=4096,hard,intr,udp,lock,addr=nukunuku 0 0

Server export file:
/ melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
/mnt/space1 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
/mnt/space2 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
/mnt/space3 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)

-- 
Tobias						PGP: http://9ac7e0bc.uguu.de

[-- Attachment #2: nfs.tcpdump.gz --]
[-- Type: application/octet-stream, Size: 5397 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: NFSv3, 2.6, ext3 and dir_index
  2005-01-20 16:38 NFSv3, 2.6, ext3 and dir_index Tobias Diedrich
@ 2005-01-20 19:26 ` Michael Haverkamp
  0 siblings, 0 replies; 6+ messages in thread
From: Michael Haverkamp @ 2005-01-20 19:26 UTC (permalink / raw)
  To: nfs

I have had the same problem with LVM2 and reiserfs using kernel 2.6.9. 
Whenever I have a large IO load on the server, these error crop up.  I 
copy everything on /space (/dev/hda5) to /space_bak (/dev/hdc5) using cp 
-a and that always makes the problem happen for me.  I have not tried 
removing LVM2 yet, so I don't know if that is a contributor.

I don't think that this problem is limited to ext3 and dir_index.

Tobias Diedrich wrote:
> Hi,
> 
> I recently switched from a local homedir to a NFS mounted homedir
> and noticed some strange things:
> 
> I have a Cronjob that goes over my mp3 directory and after the
> switch find would print "foo/bar/blubb: No such file or directory"
> for a few (approx. 2-3 out of 1928) directories, but not always the same
> ones.  Also, my mp3 player program would sometimes stop short in the
> middle of a song and skip to the next one, because it got ENOENT on
> a read of an open file.
> 
> Now first I thought it might just be a problem with my Kernel
> version or one of the patches I'm using and upgraded to a more
> recent one (currently 2.6.10-ac8-nfsacl on the client and
> 2.6.10-ac8-imq-nfsacl on the server, I also had the same problem
> without the nfsacl patches).
> 
> Then I read an older thread about trouble with ext3/dir_index and
> NFS in older 2.6 versions and tried disabling dir_index (Which was
> enabled on all my ext3 filesystems).  With that the problem
> vanished.
> 
> Are there any known problems with dir_index and NFS, or is this
> maybe a new bug?
> 
> Getting a tcpdump of the client<->server traffic proved difficult,
> because it is a quite sporadic bug and reproducing it would involve
> quite a lot of NFS traffic.  I could reliably trigger this with my
> CD/DVD burning script, which generates md5sums for each file and
> puts them both into the file MD5SUMS in the current directory, as
> well as into another file in my Homedir (But the files being
> md5summed also live on another NFS export, so there is a _lot_ of
> traffic).
> 
> I _do_ have a traffic capture of ls returning "No such file or
> directory" on the current directory, then cding up a level, down into the
> directory again and then a working ls.
> 
> Client fstab entry:
> nukunuku:/mnt/space1/ranma /home/ranma nfs hard,intr,bg,udp,rsize=4096,wsize=4096 0 0
> 
> /proc/mounts entry:
> nukunuku:/mnt/space1/ranma /home/ranma nfs rw,v3,rsize=4096,wsize=4096,hard,intr,udp,lock,addr=nukunuku 0 0
> 
> Server export file:
> / melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
> /mnt/space1 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
> /mnt/space2 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
> /mnt/space3 melchior.yamamaya.is-a-geek.org(sync,rw,no_root_squash)
> 

-- 
Michael Haverkamp


-------------------------------------------------------
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag-&-drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: NFSv3, 2.6, ext3 and dir_index
  2005-01-20 16:38   ` NFSv3, 2.6, ext3 and dir_index Tobias Diedrich
@ 2005-03-14 17:22     ` Chip Salzenberg
  2005-03-20 21:21       ` Tobias Diedrich
  2005-06-29 20:50       ` More Stale NFS handles Bill Rugolsky Jr.
  0 siblings, 2 replies; 6+ messages in thread
From: Chip Salzenberg @ 2005-03-14 17:22 UTC (permalink / raw)
  To: Tobias Diedrich; +Cc: nfs

According to Tobias Diedrich:
> Are there any known problems with dir_index and NFS, or is this
> maybe a new bug?

Did you ever get any replies on dir_index vs. nfs?

I'm just setting up a new server and I'm wondering if there's
something still out there that might make me sorry to use dir_index.
-- 
Chip Salzenberg            - a.k.a. -            <chip@pobox.com>
         Open Source is not an excuse to write fun code
            then leave the actual work to others.


-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: NFSv3, 2.6, ext3 and dir_index
  2005-03-14 17:22     ` Chip Salzenberg
@ 2005-03-20 21:21       ` Tobias Diedrich
  2005-08-13  9:39         ` Tobias Diedrich
  2005-06-29 20:50       ` More Stale NFS handles Bill Rugolsky Jr.
  1 sibling, 1 reply; 6+ messages in thread
From: Tobias Diedrich @ 2005-03-20 21:21 UTC (permalink / raw)
  To: Chip Salzenberg; +Cc: nfs

Sorry for the late reply.

Chip Salzenberg wrote:

> Did you ever get any replies on dir_index vs. nfs?

Not really.

> I'm just setting up a new server and I'm wondering if there's
> something still out there that might make me sorry to use dir_index.

For now I've just disabled dir_index and I haven't had any problems
since, but I'm not completely sure if that really was the cause of
the problem I was seeing.  At least disabling dir_index is rather
easy (remove dir_index featureflag with tune2fs and run e2fsck to
get rid of the remaining hashtrees).
Maybe I'll try again to get a proper traffic trace, but I guess I'd
have to set up a test client first to separate the unrelated nfs traffic.

-- 
Tobias						PGP: http://9ac7e0bc.uguu.de


-------------------------------------------------------
This SF.net email is sponsored by Microsoft Mobile & Embedded DevCon 2005
Attend MEDC 2005 May 9-12 in Vegas. Learn more about the latest Windows
Embedded(r) & Windows Mobile(tm) platforms, applications & content.  Register
by 3/29 & save $300 http://ads.osdn.com/?ad_id=6883&alloc_id=15149&op=click
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: NFSv3, 2.6, ext3 and dir_index
  2005-03-20 21:21       ` Tobias Diedrich
@ 2005-08-13  9:39         ` Tobias Diedrich
  0 siblings, 0 replies; 6+ messages in thread
From: Tobias Diedrich @ 2005-08-13  9:39 UTC (permalink / raw)
  To: nfs

Followup for people looking at this thread in the archive:

The patch mentioned in Bills Message seems to be in 2.6.13 release candidates,
however my server is still running 2.6.11.12 (without the patch),
but I'm about to upgrade to 2.6.12.4 (with the patch applied).
Hopefully this will fix both errors I was seeing. :-)

Tobias Diedrich wrote on 20 Mar 2005 in
<20050320212106.GA5580@yamamaya.is-a-geek.org>:
> Sorry for the late reply.
> 
> Chip Salzenberg wrote:
> 
> > Did you ever get any replies on dir_index vs. nfs?
> 
> Not really.
> 
> > I'm just setting up a new server and I'm wondering if there's
> > something still out there that might make me sorry to use dir_index.
> 
> For now I've just disabled dir_index and I haven't had any problems
> since, but I'm not completely sure if that really was the cause of
> the problem I was seeing.  At least disabling dir_index is rather
> easy (remove dir_index featureflag with tune2fs and run e2fsck to
> get rid of the remaining hashtrees).
> Maybe I'll try again to get a proper traffic trace, but I guess I'd
> have to set up a test client first to separate the unrelated nfs traffic.
> 
> -- 
> Tobias						PGP: http://9ac7e0bc.uguu.de

Bill Rugolsky Jr. wrote on 29 Jun 2005 in
<20050629205056.GB29952@ti64.telemetry-investments.com>:
> On Wed, Jun 29, 2005 at 01:28:07PM -0700, Kris Vassallo wrote:
> > I was using ext3, upgraded to core 3 from core 1 and had the exact same
> > problems. What did it for me, until someone fixes this problem, was to
> > turn off journaling (so basically its back to ext2). Since reverting
> > back to ext 2 the problem has gone away. Now this is going to be a huge
> > problem if the machine crashes because fscking a 1.5 TB disk array is
> > going to suck! I experimented with the way the journaling gets done
> > (data goes to disk first or journal first) and I wasn't able to fix the
> > problem. 
> > 
> > Aaaaaaargh! 
> > I just thought I would share my ongoing battle story, hopefully someone
> > will figure out what's causing this and will come up with a fix.
> 
> We're you using ext3 directory indexing? [Look for feature "dir_index" on
> the "Filesystem features:" line in "/sbin/tune2fs -l <device>" output.]
> 
> If so, was the following patch in the kernel that you were using?
> 
>    http://marc.theaimsgroup.com/?l=ext2-devel&m=111568753527230&w=2
> 
> Regards,
> 
> 	Bill Rugolsky

-- 
Tobias						PGP: http://9ac7e0bc.uguu.de


-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2005-08-13 11:38 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-01-20 16:38 NFSv3, 2.6, ext3 and dir_index Tobias Diedrich
2005-01-20 19:26 ` Michael Haverkamp
  -- strict thread matches above, loose matches on Subject: below --
2005-06-23 20:55 More Stale NFS handles Eiwe Lingefors
2005-06-29 20:28 ` Kris Vassallo
2005-01-20 16:38   ` NFSv3, 2.6, ext3 and dir_index Tobias Diedrich
2005-03-14 17:22     ` Chip Salzenberg
2005-03-20 21:21       ` Tobias Diedrich
2005-08-13  9:39         ` Tobias Diedrich
2005-06-29 20:50       ` More Stale NFS handles Bill Rugolsky Jr.

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.