public inbox for linux-nfs@vger.kernel.org
 help / color / mirror / Atom feed
* nfs client performance while server is down
@ 2010-01-23 15:45 Whoop Whouzer
       [not found] ` <d7f0b3a81001230745h18dbb14fi42f28adff0c45294-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-23 15:45 UTC (permalink / raw)
  To: linux-nfs

Howdy,

I was wondering why nfs is designed in such a way that the performance
of an nfs client machine gets very bad when the nfs server is offline?
This is even the case with a soft mount (either via mount or fstab).
Just about every application that requires disk access (not talking
about nfs share acces) gets really slow to unresponsive. For instance
nautilus becomes unresponsive when displaying the contents of any
folder on the local disk,
playing movie files (stored on local disk) let totem or vlc get stuck
on set intervals, even the terminal becomes unresponsive at times.

I could understand that these problems would occur while accessing the
nfs share directory while the server is offline, but why for totally
unrelated directories?

I have experienced this behaviour on various distro's, and also found
various bug reports on this issue, they don't seem to get solved as
this is viewed as nfs design.
I see this as a flaw because clients are totally dependent on the
server. This would be less of a deal if the entire home directory
would be stored on nfs (although I even think some sort of
synchronisation technology could and should be implemented in this
case). It is a bit odd that (technically) one machine serving some
"useless" files to a non-trivial directory on client machines can take
down these client machines.

For me the preferred functionality would be:
*If an nfs server gets offline the client's nfs share becomes
unaccessible, but local directories and applications (that only
require local disk access) stay responsive.
*If an nfs server gets online (after being offline while the client
has not been restarted) the nfs share becomes reconnected.

regards,
Whoop

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found] ` <d7f0b3a81001230745h18dbb14fi42f28adff0c45294-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2010-01-23 15:57   ` Peter Chacko
  2010-01-23 16:27     ` Whoop Whouzer
  0 siblings, 1 reply; 24+ messages in thread
From: Peter Chacko @ 2010-01-23 15:57 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: linux-nfs

Which client OS you observed this behavior ?  This has nothing to do
NFS design, and its purely stateless...Its upto the client OS
implementation about aspects like how to deal with local IO, when NFS
share gets  disconnected..

May be a VFS bug on the local OS you found this problem ..

thanks

On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer <tiredandnumb@gmail.com>=
 wrote:
> Howdy,
>
> I was wondering why nfs is designed in such a way that the performanc=
e
> of an nfs client machine gets very bad when the nfs server is offline=
?
> This is even the case with a soft mount (either via mount or fstab).
> Just about every application that requires disk access (not talking
> about nfs share acces) gets really slow to unresponsive. For instance
> nautilus becomes unresponsive when displaying the contents of any
> folder on the local disk,
> playing movie files (stored on local disk) let totem or vlc get stuck
> on set intervals, even the terminal becomes unresponsive at times.
>
> I could understand that these problems would occur while accessing th=
e
> nfs share directoiourry while the server is offline, but why for tota=
lly
> unrelated directories?
>
> I have experienced this behaviour on various distro's, and also found
> various bug reports on this issue, they don't seem to get solved as
> this is viewed as nfs design.
> I see this as a flaw because clients are totally dependent on the
> server. This would be less of a deal if the entire home directory
> would be stored on nfs (although I even think some sort of
> synchronisation technology could and should be implemented in this
> case). It is a bit odd that (technically) one machine serving some
> "useless" files to a non-trivial directory on client machines can tak=
e
> down these client machines.
>
> For me the preferred functionality would be:
> *If an nfs server gets offline the client's nfs share becomes
> unaccessible, but local directories and applications (that only
> require local disk access) stay responsive.
> *If an nfs server gets online (after being offline while the client
> has not been restarted) the nfs share becomes reconnected.
>
> regards,
> Whoop
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" =
in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-23 15:57   ` Peter Chacko
@ 2010-01-23 16:27     ` Whoop Whouzer
       [not found]       ` <d7f0b3a81001230827y52727993nf60210ae610643b7-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-23 16:27 UTC (permalink / raw)
  To: Peter Chacko; +Cc: linux-nfs

I don't remember all the different set-ups I tried it on, but I just
confirmed this with the following combinations:

ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu desktop
10.04 (alpha 2), fedora 12
ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
(alpha 2), fedora 12

I'll be happy to test it on another client machine (distro) even
another server (although it would require a little more time)

Here are some examples on the bugreports I noticed and how they do not
seem to get solved:
https://bugzilla.redhat.com/show_bug.cgi?id=175283
https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120

regards,
Whoop

On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko <peterchacko35@gmail.com> wrote:
> Which client OS you observed this behavior ?  This has nothing to do
> NFS design, and its purely stateless...Its upto the client OS
> implementation about aspects like how to deal with local IO, when NFS
> share gets  disconnected..
>
> May be a VFS bug on the local OS you found this problem ..
>
> thanks
>
> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer <tiredandnumb@gmail.com> wrote:
>> Howdy,
>>
>> I was wondering why nfs is designed in such a way that the performance
>> of an nfs client machine gets very bad when the nfs server is offline?
>> This is even the case with a soft mount (either via mount or fstab).
>> Just about every application that requires disk access (not talking
>> about nfs share acces) gets really slow to unresponsive. For instance
>> nautilus becomes unresponsive when displaying the contents of any
>> folder on the local disk,
>> playing movie files (stored on local disk) let totem or vlc get stuck
>> on set intervals, even the terminal becomes unresponsive at times.
>>
>> I could understand that these problems would occur while accessing the
>> nfs share directoiourry while the server is offline, but why for totally
>> unrelated directories?
>>
>> I have experienced this behaviour on various distro's, and also found
>> various bug reports on this issue, they don't seem to get solved as
>> this is viewed as nfs design.
>> I see this as a flaw because clients are totally dependent on the
>> server. This would be less of a deal if the entire home directory
>> would be stored on nfs (although I even think some sort of
>> synchronisation technology could and should be implemented in this
>> case). It is a bit odd that (technically) one machine serving some
>> "useless" files to a non-trivial directory on client machines can take
>> down these client machines.
>>
>> For me the preferred functionality would be:
>> *If an nfs server gets offline the client's nfs share becomes
>> unaccessible, but local directories and applications (that only
>> require local disk access) stay responsive.
>> *If an nfs server gets online (after being offline while the client
>> has not been restarted) the nfs share becomes reconnected.
>>
>> regards,
>> Whoop
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* RE: nfs client performance while server is down
       [not found]       ` <d7f0b3a81001230827y52727993nf60210ae610643b7-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2010-01-24 21:34         ` Muntz, Daniel
       [not found]           ` <7A24DF798E223B4C9864E8F92E8C93EC0527810C-hX7t0kiaRRpT+ZUat5FNkAK/GNPrWCqfQQ4Iyu8u01E@public.gmane.org>
  0 siblings, 1 reply; 24+ messages in thread
From: Muntz, Daniel @ 2010-01-24 21:34 UTC (permalink / raw)
  To: Whoop Whouzer, Peter Chacko; +Cc: linux-nfs

Perhaps something in your $PATH is in the NFS mount?  Do a network trac=
e and maybe you can see if, in fact, there are actually NFS operations =
being attempted that you weren't expecting.  Then try to figure out why=
=2E

  -Dan=20

> -----Original Message-----
> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]=20
> Sent: Saturday, January 23, 2010 8:28 AM
> To: Peter Chacko
> Cc: linux-nfs@vger.kernel.org
> Subject: Re: nfs client performance while server is down
>=20
> I don't remember all the different set-ups I tried it on, but I just
> confirmed this with the following combinations:
>=20
> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu desktop
> 10.04 (alpha 2), fedora 12
> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
> (alpha 2), fedora 12
>=20
> I'll be happy to test it on another client machine (distro) even
> another server (although it would require a little more time)
>=20
> Here are some examples on the bugreports I noticed and how they do no=
t
> seem to get solved:
> https://bugzilla.redhat.com/show_bug.cgi?id=3D175283
> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>=20
> regards,
> Whoop
>=20
> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko=20
> <peterchacko35@gmail.com> wrote:
> > Which client OS you observed this behavior ? =A0This has nothing to=
 do
> > NFS design, and its purely stateless...Its upto the client OS
> > implementation about aspects like how to deal with local=20
> IO, when NFS
> > share gets =A0disconnected..
> >
> > May be a VFS bug on the local OS you found this problem ..
> >
> > thanks
> >
> > On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer=20
> <tiredandnumb@gmail.com> wrote:
> >> Howdy,
> >>
> >> I was wondering why nfs is designed in such a way that the=20
> performance
> >> of an nfs client machine gets very bad when the nfs server=20
> is offline?
> >> This is even the case with a soft mount (either via mount=20
> or fstab).
> >> Just about every application that requires disk access (not talkin=
g
> >> about nfs share acces) gets really slow to unresponsive.=20
> For instance
> >> nautilus becomes unresponsive when displaying the contents of any
> >> folder on the local disk,
> >> playing movie files (stored on local disk) let totem or=20
> vlc get stuck
> >> on set intervals, even the terminal becomes unresponsive at times.
> >>
> >> I could understand that these problems would occur while=20
> accessing the
> >> nfs share directoiourry while the server is offline, but=20
> why for totally
> >> unrelated directories?
> >>
> >> I have experienced this behaviour on various distro's, and=20
> also found
> >> various bug reports on this issue, they don't seem to get solved a=
s
> >> this is viewed as nfs design.
> >> I see this as a flaw because clients are totally dependent on the
> >> server. This would be less of a deal if the entire home directory
> >> would be stored on nfs (although I even think some sort of
> >> synchronisation technology could and should be implemented in this
> >> case). It is a bit odd that (technically) one machine serving some
> >> "useless" files to a non-trivial directory on client=20
> machines can take
> >> down these client machines.
> >>
> >> For me the preferred functionality would be:
> >> *If an nfs server gets offline the client's nfs share becomes
> >> unaccessible, but local directories and applications (that only
> >> require local disk access) stay responsive.
> >> *If an nfs server gets online (after being offline while the clien=
t
> >> has not been restarted) the nfs share becomes reconnected.
> >>
> >> regards,
> >> Whoop
> >> --
> >> To unsubscribe from this list: send the line "unsubscribe=20
> linux-nfs" in
> >> the body of a message to majordomo@vger.kernel.org
> >> More majordomo info at =A0http://vger.kernel.org/majordomo-info.ht=
ml
> >>
> >
> --
> To unsubscribe from this list: send the line "unsubscribe=20
> linux-nfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>=20

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found]           ` <7A24DF798E223B4C9864E8F92E8C93EC0527810C-hX7t0kiaRRpT+ZUat5FNkAK/GNPrWCqfQQ4Iyu8u01E@public.gmane.org>
@ 2010-01-24 22:03             ` Whoop Whouzer
  2010-01-25  0:09             ` Whoop Whouzer
  1 sibling, 0 replies; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-24 22:03 UTC (permalink / raw)
  To: Muntz, Daniel; +Cc: Peter Chacko, linux-nfs

I don't think so, I added this to fstab:
hpserver:/home/whoop/share /nfsshare nfs
rsize=3D8192,wsize=3D8192,timeo=3D14,soft    0       0

and created the /nfsshare directory just for that.
echo $PATH gives me this:
/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games


On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp.com> =
wrote:
> Perhaps something in your $PATH is in the NFS mount? =A0Do a network =
trace and maybe you can see if, in fact, there are actually NFS operati=
ons being attempted that you weren't expecting. =A0Then try to figure o=
ut why.
>
> =A0-Dan
>
>> -----Original Message-----
>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>> Sent: Saturday, January 23, 2010 8:28 AM
>> To: Peter Chacko
>> Cc: linux-nfs@vger.kernel.org
>> Subject: Re: nfs client performance while server is down
>>
>> I don't remember all the different set-ups I tried it on, but I just
>> confirmed this with the following combinations:
>>
>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu deskto=
p
>> 10.04 (alpha 2), fedora 12
>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
>> (alpha 2), fedora 12
>>
>> I'll be happy to test it on another client machine (distro) even
>> another server (although it would require a little more time)
>>
>> Here are some examples on the bugreports I noticed and how they do n=
ot
>> seem to get solved:
>> https://bugzilla.redhat.com/show_bug.cgi?id=3D175283
>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>
>> regards,
>> Whoop
>>
>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>> <peterchacko35@gmail.com> wrote:
>> > Which client OS you observed this behavior ? =A0This has nothing t=
o do
>> > NFS design, and its purely stateless...Its upto the client OS
>> > implementation about aspects like how to deal with local
>> IO, when NFS
>> > share gets =A0disconnected..
>> >
>> > May be a VFS bug on the local OS you found this problem ..
>> >
>> > thanks
>> >
>> > On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>> <tiredandnumb@gmail.com> wrote:
>> >> Howdy,
>> >>
>> >> I was wondering why nfs is designed in such a way that the
>> performance
>> >> of an nfs client machine gets very bad when the nfs server
>> is offline?
>> >> This is even the case with a soft mount (either via mount
>> or fstab).
>> >> Just about every application that requires disk access (not talki=
ng
>> >> about nfs share acces) gets really slow to unresponsive.
>> For instance
>> >> nautilus becomes unresponsive when displaying the contents of any
>> >> folder on the local disk,
>> >> playing movie files (stored on local disk) let totem or
>> vlc get stuck
>> >> on set intervals, even the terminal becomes unresponsive at times=
=2E
>> >>
>> >> I could understand that these problems would occur while
>> accessing the
>> >> nfs share directoiourry while the server is offline, but
>> why for totally
>> >> unrelated directories?
>> >>
>> >> I have experienced this behaviour on various distro's, and
>> also found
>> >> various bug reports on this issue, they don't seem to get solved =
as
>> >> this is viewed as nfs design.
>> >> I see this as a flaw because clients are totally dependent on the
>> >> server. This would be less of a deal if the entire home directory
>> >> would be stored on nfs (although I even think some sort of
>> >> synchronisation technology could and should be implemented in thi=
s
>> >> case). It is a bit odd that (technically) one machine serving som=
e
>> >> "useless" files to a non-trivial directory on client
>> machines can take
>> >> down these client machines.
>> >>
>> >> For me the preferred functionality would be:
>> >> *If an nfs server gets offline the client's nfs share becomes
>> >> unaccessible, but local directories and applications (that only
>> >> require local disk access) stay responsive.
>> >> *If an nfs server gets online (after being offline while the clie=
nt
>> >> has not been restarted) the nfs share becomes reconnected.
>> >>
>> >> regards,
>> >> Whoop
>> >> --
>> >> To unsubscribe from this list: send the line "unsubscribe
>> linux-nfs" in
>> >> the body of a message to majordomo@vger.kernel.org
>> >> More majordomo info at =A0http://vger.kernel.org/majordomo-info.h=
tml
>> >>
>> >
>> --
>> To unsubscribe from this list: send the line "unsubscribe
>> linux-nfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found]           ` <7A24DF798E223B4C9864E8F92E8C93EC0527810C-hX7t0kiaRRpT+ZUat5FNkAK/GNPrWCqfQQ4Iyu8u01E@public.gmane.org>
  2010-01-24 22:03             ` Whoop Whouzer
@ 2010-01-25  0:09             ` Whoop Whouzer
  2010-01-25 16:48               ` Chuck Lever
  1 sibling, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-25  0:09 UTC (permalink / raw)
  To: Muntz, Daniel; +Cc: Peter Chacko, linux-nfs

I did some network traces and there is nothing strange happening as
far as I can tell. I shut down the server (some network traffic
occurred as is to be expected). It got quiet again, I launched
nautilus, it got stuck without displaying anything and there was no
real network activity except 3 broadcasts using the ARP protocol
asking where the server was (could be just coincidence). Closing
nautilus and launching it again will let it hang again but I see no
additional network traffic. After a while nautilus will display the
contents of the folder without any network traffic.

On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp.com> =
wrote:
> Perhaps something in your $PATH is in the NFS mount? =A0Do a network =
trace and maybe you can see if, in fact, there are actually NFS operati=
ons being attempted that you weren't expecting. =A0Then try to figure o=
ut why.
>
> =A0-Dan
>
>> -----Original Message-----
>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>> Sent: Saturday, January 23, 2010 8:28 AM
>> To: Peter Chacko
>> Cc: linux-nfs@vger.kernel.org
>> Subject: Re: nfs client performance while server is down
>>
>> I don't remember all the different set-ups I tried it on, but I just
>> confirmed this with the following combinations:
>>
>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu deskto=
p
>> 10.04 (alpha 2), fedora 12
>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
>> (alpha 2), fedora 12
>>
>> I'll be happy to test it on another client machine (distro) even
>> another server (although it would require a little more time)
>>
>> Here are some examples on the bugreports I noticed and how they do n=
ot
>> seem to get solved:
>> https://bugzilla.redhat.com/show_bug.cgi?id=3D175283
>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>
>> regards,
>> Whoop
>>
>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>> <peterchacko35@gmail.com> wrote:
>> > Which client OS you observed this behavior ? =A0This has nothing t=
o do
>> > NFS design, and its purely stateless...Its upto the client OS
>> > implementation about aspects like how to deal with local
>> IO, when NFS
>> > share gets =A0disconnected..
>> >
>> > May be a VFS bug on the local OS you found this problem ..
>> >
>> > thanks
>> >
>> > On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>> <tiredandnumb@gmail.com> wrote:
>> >> Howdy,
>> >>
>> >> I was wondering why nfs is designed in such a way that the
>> performance
>> >> of an nfs client machine gets very bad when the nfs server
>> is offline?
>> >> This is even the case with a soft mount (either via mount
>> or fstab).
>> >> Just about every application that requires disk access (not talki=
ng
>> >> about nfs share acces) gets really slow to unresponsive.
>> For instance
>> >> nautilus becomes unresponsive when displaying the contents of any
>> >> folder on the local disk,
>> >> playing movie files (stored on local disk) let totem or
>> vlc get stuck
>> >> on set intervals, even the terminal becomes unresponsive at times=
=2E
>> >>
>> >> I could understand that these problems would occur while
>> accessing the
>> >> nfs share directoiourry while the server is offline, but
>> why for totally
>> >> unrelated directories?
>> >>
>> >> I have experienced this behaviour on various distro's, and
>> also found
>> >> various bug reports on this issue, they don't seem to get solved =
as
>> >> this is viewed as nfs design.
>> >> I see this as a flaw because clients are totally dependent on the
>> >> server. This would be less of a deal if the entire home directory
>> >> would be stored on nfs (although I even think some sort of
>> >> synchronisation technology could and should be implemented in thi=
s
>> >> case). It is a bit odd that (technically) one machine serving som=
e
>> >> "useless" files to a non-trivial directory on client
>> machines can take
>> >> down these client machines.
>> >>
>> >> For me the preferred functionality would be:
>> >> *If an nfs server gets offline the client's nfs share becomes
>> >> unaccessible, but local directories and applications (that only
>> >> require local disk access) stay responsive.
>> >> *If an nfs server gets online (after being offline while the clie=
nt
>> >> has not been restarted) the nfs share becomes reconnected.
>> >>
>> >> regards,
>> >> Whoop
>> >> --
>> >> To unsubscribe from this list: send the line "unsubscribe
>> linux-nfs" in
>> >> the body of a message to majordomo@vger.kernel.org
>> >> More majordomo info at =A0http://vger.kernel.org/majordomo-info.h=
tml
>> >>
>> >
>> --
>> To unsubscribe from this list: send the line "unsubscribe
>> linux-nfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at =A0http://vger.kernel.org/majordomo-info.html
>>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-25  0:09             ` Whoop Whouzer
@ 2010-01-25 16:48               ` Chuck Lever
  2010-01-25 19:02                 ` Whoop Whouzer
  0 siblings, 1 reply; 24+ messages in thread
From: Chuck Lever @ 2010-01-25 16:48 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
> I did some network traces and there is nothing strange happening as
> far as I can tell. I shut down the server (some network traffic
> occurred as is to be expected). It got quiet again, I launched
> nautilus, it got stuck without displaying anything and there was no
> real network activity except 3 broadcasts using the ARP protocol
> asking where the server was (could be just coincidence).

That sounds like the client does want to reconnect with the server.

You could try enabling debug tracing on your client (sudo rpcdebug -m  
nfs -s all) after shutting down your server, then try to start  
nautilus.  The kernel log would then contain NFS-related messages that  
might indicate where to look next.

> Closing
> nautilus and launching it again will let it hang again but I see no
> additional network traffic. After a while nautilus will display the
> contents of the folder without any network traffic.
>
> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel  
> <Dan.Muntz@netapp.com> wrote:
>> Perhaps something in your $PATH is in the NFS mount?  Do a network  
>> trace and maybe you can see if, in fact, there are actually NFS  
>> operations being attempted that you weren't expecting.  Then try to  
>> figure out why.
>>
>>  -Dan
>>
>>> -----Original Message-----
>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>> Sent: Saturday, January 23, 2010 8:28 AM
>>> To: Peter Chacko
>>> Cc: linux-nfs@vger.kernel.org
>>> Subject: Re: nfs client performance while server is down
>>>
>>> I don't remember all the different set-ups I tried it on, but I just
>>> confirmed this with the following combinations:
>>>
>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu  
>>> desktop
>>> 10.04 (alpha 2), fedora 12
>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
>>> (alpha 2), fedora 12
>>>
>>> I'll be happy to test it on another client machine (distro) even
>>> another server (although it would require a little more time)
>>>
>>> Here are some examples on the bugreports I noticed and how they do  
>>> not
>>> seem to get solved:
>>> https://bugzilla.redhat.com/show_bug.cgi?id=175283
>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>>
>>> regards,
>>> Whoop
>>>
>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>> <peterchacko35@gmail.com> wrote:
>>>> Which client OS you observed this behavior ?  This has nothing to  
>>>> do
>>>> NFS design, and its purely stateless...Its upto the client OS
>>>> implementation about aspects like how to deal with local
>>> IO, when NFS
>>>> share gets  disconnected..
>>>>
>>>> May be a VFS bug on the local OS you found this problem ..
>>>>
>>>> thanks
>>>>
>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>> <tiredandnumb@gmail.com> wrote:
>>>>> Howdy,
>>>>>
>>>>> I was wondering why nfs is designed in such a way that the
>>> performance
>>>>> of an nfs client machine gets very bad when the nfs server
>>> is offline?
>>>>> This is even the case with a soft mount (either via mount
>>> or fstab).
>>>>> Just about every application that requires disk access (not  
>>>>> talking
>>>>> about nfs share acces) gets really slow to unresponsive.
>>> For instance
>>>>> nautilus becomes unresponsive when displaying the contents of any
>>>>> folder on the local disk,
>>>>> playing movie files (stored on local disk) let totem or
>>> vlc get stuck
>>>>> on set intervals, even the terminal becomes unresponsive at times.
>>>>>
>>>>> I could understand that these problems would occur while
>>> accessing the
>>>>> nfs share directoiourry while the server is offline, but
>>> why for totally
>>>>> unrelated directories?
>>>>>
>>>>> I have experienced this behaviour on various distro's, and
>>> also found
>>>>> various bug reports on this issue, they don't seem to get solved  
>>>>> as
>>>>> this is viewed as nfs design.
>>>>> I see this as a flaw because clients are totally dependent on the
>>>>> server. This would be less of a deal if the entire home directory
>>>>> would be stored on nfs (although I even think some sort of
>>>>> synchronisation technology could and should be implemented in this
>>>>> case). It is a bit odd that (technically) one machine serving some
>>>>> "useless" files to a non-trivial directory on client
>>> machines can take
>>>>> down these client machines.
>>>>>
>>>>> For me the preferred functionality would be:
>>>>> *If an nfs server gets offline the client's nfs share becomes
>>>>> unaccessible, but local directories and applications (that only
>>>>> require local disk access) stay responsive.
>>>>> *If an nfs server gets online (after being offline while the  
>>>>> client
>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>
>>>>> regards,
>>>>> Whoop
>>>>> --
>>>>> To unsubscribe from this list: send the line "unsubscribe
>>> linux-nfs" in
>>>>> the body of a message to majordomo@vger.kernel.org
>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>
>>>>
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe
>>> linux-nfs" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>
>>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs"  
> in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

-- 
Chuck Lever
chuck[dot]lever[at]oracle[dot]com





^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-25 16:48               ` Chuck Lever
@ 2010-01-25 19:02                 ` Whoop Whouzer
       [not found]                   ` <d7f0b3a81001251102p5e631706jfd9f147a00487061-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-25 19:02 UTC (permalink / raw)
  To: Chuck Lever; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

Ok, I did that, after shutting down the server and enabling debug
trace I tried to open the home folder of the current account (totally
unrelated to the nfsshare), it wouldn't open at all, I got no nautilus
at all. During the time my cursor was in busy mode I got the following
messages in kern.log (for ubuntu 10.04 client):
Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call  fsstat
Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
permission(0:16/74386), mask=0x10, res=0
Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call  access
Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
192.168.1.130 not responding, timed out
Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply statfs: -5
Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
statfs error = 5
These series of traces are repeating over and over again at a set
interval (there is no flooding of the logs), even if I do nothing.
It's even worse than I thought because when I tried to shutdown, the
machine wouldn't shutdown because it claimed
the "File manager" was still running (although it was not visible on
screen); so I had to kill that before I could shutdown (properly).

In Fedora 12 I had a similar user experience (nautilus did show up
without showing any contents and it was hanging). I had enabled
tracing and it seems to be logged to /var/log/messages. I got this
output in fedora:
Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error = 5
Jan 25 20:48:38 localhost kernel: NFS call  fsstat
Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
responding, timed out
Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode: (0:14/74386)
getattr failed, error=-5
Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
Jan 25 20:49:25 localhost kernel: NFS call  getattr
Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
responding, timed out
Jan 25 20:50:14 localhost kernel: NFS reply access: -5
Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386), mask=0x1, res=-5
Jan 25 20:50:14 localhost kernel: NFS call  access
Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
responding, timed out
Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error = 5
Jan 25 20:51:14 localhost kernel: NFS call  fsstat
Most of the trace is repeating in set intervals as well, there is no
flooding of the logs...
Fedora would not shutdown normally either


On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever <chuck.lever@oracle.com> wrote:
> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>
>> I did some network traces and there is nothing strange happening as
>> far as I can tell. I shut down the server (some network traffic
>> occurred as is to be expected). It got quiet again, I launched
>> nautilus, it got stuck without displaying anything and there was no
>> real network activity except 3 broadcasts using the ARP protocol
>> asking where the server was (could be just coincidence).
>
> That sounds like the client does want to reconnect with the server.
>
> You could try enabling debug tracing on your client (sudo rpcdebug -m nfs -s
> all) after shutting down your server, then try to start nautilus.  The
> kernel log would then contain NFS-related messages that might indicate where
> to look next.
>
>> Closing
>> nautilus and launching it again will let it hang again but I see no
>> additional network traffic. After a while nautilus will display the
>> contents of the folder without any network traffic.
>>
>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp.com>
>> wrote:
>>>
>>> Perhaps something in your $PATH is in the NFS mount?  Do a network trace
>>> and maybe you can see if, in fact, there are actually NFS operations being
>>> attempted that you weren't expecting.  Then try to figure out why.
>>>
>>>  -Dan
>>>
>>>> -----Original Message-----
>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>> To: Peter Chacko
>>>> Cc: linux-nfs@vger.kernel.org
>>>> Subject: Re: nfs client performance while server is down
>>>>
>>>> I don't remember all the different set-ups I tried it on, but I just
>>>> confirmed this with the following combinations:
>>>>
>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu desktop
>>>> 10.04 (alpha 2), fedora 12
>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
>>>> (alpha 2), fedora 12
>>>>
>>>> I'll be happy to test it on another client machine (distro) even
>>>> another server (although it would require a little more time)
>>>>
>>>> Here are some examples on the bugreports I noticed and how they do not
>>>> seem to get solved:
>>>> https://bugzilla.redhat.com/show_bug.cgi?id=175283
>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>>>
>>>> regards,
>>>> Whoop
>>>>
>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>> <peterchacko35@gmail.com> wrote:
>>>>>
>>>>> Which client OS you observed this behavior ?  This has nothing to do
>>>>> NFS design, and its purely stateless...Its upto the client OS
>>>>> implementation about aspects like how to deal with local
>>>>
>>>> IO, when NFS
>>>>>
>>>>> share gets  disconnected..
>>>>>
>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>
>>>>> thanks
>>>>>
>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>
>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>
>>>>>> Howdy,
>>>>>>
>>>>>> I was wondering why nfs is designed in such a way that the
>>>>
>>>> performance
>>>>>>
>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>
>>>> is offline?
>>>>>>
>>>>>> This is even the case with a soft mount (either via mount
>>>>
>>>> or fstab).
>>>>>>
>>>>>> Just about every application that requires disk access (not talking
>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>
>>>> For instance
>>>>>>
>>>>>> nautilus becomes unresponsive when displaying the contents of any
>>>>>> folder on the local disk,
>>>>>> playing movie files (stored on local disk) let totem or
>>>>
>>>> vlc get stuck
>>>>>>
>>>>>> on set intervals, even the terminal becomes unresponsive at times.
>>>>>>
>>>>>> I could understand that these problems would occur while
>>>>
>>>> accessing the
>>>>>>
>>>>>> nfs share directoiourry while the server is offline, but
>>>>
>>>> why for totally
>>>>>>
>>>>>> unrelated directories?
>>>>>>
>>>>>> I have experienced this behaviour on various distro's, and
>>>>
>>>> also found
>>>>>>
>>>>>> various bug reports on this issue, they don't seem to get solved as
>>>>>> this is viewed as nfs design.
>>>>>> I see this as a flaw because clients are totally dependent on the
>>>>>> server. This would be less of a deal if the entire home directory
>>>>>> would be stored on nfs (although I even think some sort of
>>>>>> synchronisation technology could and should be implemented in this
>>>>>> case). It is a bit odd that (technically) one machine serving some
>>>>>> "useless" files to a non-trivial directory on client
>>>>
>>>> machines can take
>>>>>>
>>>>>> down these client machines.
>>>>>>
>>>>>> For me the preferred functionality would be:
>>>>>> *If an nfs server gets offline the client's nfs share becomes
>>>>>> unaccessible, but local directories and applications (that only
>>>>>> require local disk access) stay responsive.
>>>>>> *If an nfs server gets online (after being offline while the client
>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>
>>>>>> regards,
>>>>>> Whoop
>>>>>> --
>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>
>>>> linux-nfs" in
>>>>>>
>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>
>>>>>
>>>> --
>>>> To unsubscribe from this list: send the line "unsubscribe
>>>> linux-nfs" in
>>>> the body of a message to majordomo@vger.kernel.org
>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>
>>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
> --
> Chuck Lever
> chuck[dot]lever[at]oracle[dot]com
>
>
>
>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found]                   ` <d7f0b3a81001251102p5e631706jfd9f147a00487061-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2010-01-25 19:08                     ` Chuck Lever
       [not found]                       ` <d7f0b3a81001251138h30e25428o25db9bc8c0884636@mail.gmail.com>
  2010-01-26 23:21                       ` J. Bruce Fields
  0 siblings, 2 replies; 24+ messages in thread
From: Chuck Lever @ 2010-01-25 19:08 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
> Ok, I did that, after shutting down the server and enabling debug
> trace I tried to open the home folder of the current account (totally
> unrelated to the nfsshare), it wouldn't open at all, I got no nautilus
> at all. During the time my cursor was in busy mode I got the following
> messages in kern.log (for ubuntu 10.04 client):
> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call  fsstat
> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
> permission(0:16/74386), mask=0x10, res=0
> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call  access
> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
> 192.168.1.130 not responding, timed out
> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply  
> statfs: -5
> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
> statfs error = 5
> These series of traces are repeating over and over again at a set
> interval (there is no flooding of the logs), even if I do nothing.
> It's even worse than I thought because when I tried to shutdown, the
> machine wouldn't shutdown because it claimed
> the "File manager" was still running (although it was not visible on
> screen); so I had to kill that before I could shutdown (properly).
>
> In Fedora 12 I had a similar user experience (nautilus did show up
> without showing any contents and it was hanging). I had enabled
> tracing and it seems to be logged to /var/log/messages. I got this
> output in fedora:
> Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
> Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error = 5
> Jan 25 20:48:38 localhost kernel: NFS call  fsstat
> Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
> responding, timed out
> Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
> Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode: (0:14/74386)
> getattr failed, error=-5
> Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
> Jan 25 20:49:25 localhost kernel: NFS call  getattr
> Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
> responding, timed out
> Jan 25 20:50:14 localhost kernel: NFS reply access: -5
> Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386),  
> mask=0x1, res=-5
> Jan 25 20:50:14 localhost kernel: NFS call  access
> Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
> responding, timed out
> Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
> Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error = 5
> Jan 25 20:51:14 localhost kernel: NFS call  fsstat
> Most of the trace is repeating in set intervals as well, there is no
> flooding of the logs...
> Fedora would not shutdown normally either

This verifies that your client is attempting to access the NFS server,  
but doesn't tell us which file it's attempting to access.  Essentially  
the EIO means "failed to connect".

Maybe try an strace of the nautilus process next?

> On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever  
> <chuck.lever@oracle.com> wrote:
>> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>>
>>> I did some network traces and there is nothing strange happening as
>>> far as I can tell. I shut down the server (some network traffic
>>> occurred as is to be expected). It got quiet again, I launched
>>> nautilus, it got stuck without displaying anything and there was no
>>> real network activity except 3 broadcasts using the ARP protocol
>>> asking where the server was (could be just coincidence).
>>
>> That sounds like the client does want to reconnect with the server.
>>
>> You could try enabling debug tracing on your client (sudo rpcdebug - 
>> m nfs -s
>> all) after shutting down your server, then try to start nautilus.   
>> The
>> kernel log would then contain NFS-related messages that might  
>> indicate where
>> to look next.
>>
>>> Closing
>>> nautilus and launching it again will let it hang again but I see no
>>> additional network traffic. After a while nautilus will display the
>>> contents of the folder without any network traffic.
>>>
>>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp.com 
>>> >
>>> wrote:
>>>>
>>>> Perhaps something in your $PATH is in the NFS mount?  Do a  
>>>> network trace
>>>> and maybe you can see if, in fact, there are actually NFS  
>>>> operations being
>>>> attempted that you weren't expecting.  Then try to figure out why.
>>>>
>>>>  -Dan
>>>>
>>>>> -----Original Message-----
>>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>>> To: Peter Chacko
>>>>> Cc: linux-nfs@vger.kernel.org
>>>>> Subject: Re: nfs client performance while server is down
>>>>>
>>>>> I don't remember all the different set-ups I tried it on, but I  
>>>>> just
>>>>> confirmed this with the following combinations:
>>>>>
>>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu  
>>>>> desktop
>>>>> 10.04 (alpha 2), fedora 12
>>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
>>>>> (alpha 2), fedora 12
>>>>>
>>>>> I'll be happy to test it on another client machine (distro) even
>>>>> another server (although it would require a little more time)
>>>>>
>>>>> Here are some examples on the bugreports I noticed and how they  
>>>>> do not
>>>>> seem to get solved:
>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=175283
>>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>>>>
>>>>> regards,
>>>>> Whoop
>>>>>
>>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>>> <peterchacko35@gmail.com> wrote:
>>>>>>
>>>>>> Which client OS you observed this behavior ?  This has nothing  
>>>>>> to do
>>>>>> NFS design, and its purely stateless...Its upto the client OS
>>>>>> implementation about aspects like how to deal with local
>>>>>
>>>>> IO, when NFS
>>>>>>
>>>>>> share gets  disconnected..
>>>>>>
>>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>>
>>>>>> thanks
>>>>>>
>>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>>
>>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>>
>>>>>>> Howdy,
>>>>>>>
>>>>>>> I was wondering why nfs is designed in such a way that the
>>>>>
>>>>> performance
>>>>>>>
>>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>>
>>>>> is offline?
>>>>>>>
>>>>>>> This is even the case with a soft mount (either via mount
>>>>>
>>>>> or fstab).
>>>>>>>
>>>>>>> Just about every application that requires disk access (not  
>>>>>>> talking
>>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>>
>>>>> For instance
>>>>>>>
>>>>>>> nautilus becomes unresponsive when displaying the contents of  
>>>>>>> any
>>>>>>> folder on the local disk,
>>>>>>> playing movie files (stored on local disk) let totem or
>>>>>
>>>>> vlc get stuck
>>>>>>>
>>>>>>> on set intervals, even the terminal becomes unresponsive at  
>>>>>>> times.
>>>>>>>
>>>>>>> I could understand that these problems would occur while
>>>>>
>>>>> accessing the
>>>>>>>
>>>>>>> nfs share directoiourry while the server is offline, but
>>>>>
>>>>> why for totally
>>>>>>>
>>>>>>> unrelated directories?
>>>>>>>
>>>>>>> I have experienced this behaviour on various distro's, and
>>>>>
>>>>> also found
>>>>>>>
>>>>>>> various bug reports on this issue, they don't seem to get  
>>>>>>> solved as
>>>>>>> this is viewed as nfs design.
>>>>>>> I see this as a flaw because clients are totally dependent on  
>>>>>>> the
>>>>>>> server. This would be less of a deal if the entire home  
>>>>>>> directory
>>>>>>> would be stored on nfs (although I even think some sort of
>>>>>>> synchronisation technology could and should be implemented in  
>>>>>>> this
>>>>>>> case). It is a bit odd that (technically) one machine serving  
>>>>>>> some
>>>>>>> "useless" files to a non-trivial directory on client
>>>>>
>>>>> machines can take
>>>>>>>
>>>>>>> down these client machines.
>>>>>>>
>>>>>>> For me the preferred functionality would be:
>>>>>>> *If an nfs server gets offline the client's nfs share becomes
>>>>>>> unaccessible, but local directories and applications (that only
>>>>>>> require local disk access) stay responsive.
>>>>>>> *If an nfs server gets online (after being offline while the  
>>>>>>> client
>>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>>
>>>>>>> regards,
>>>>>>> Whoop
>>>>>>> --
>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>
>>>>> linux-nfs" in
>>>>>>>
>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>>
>>>>>>
>>>>> --
>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>> linux-nfs" in
>>>>> the body of a message to majordomo@vger.kernel.org
>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>
>>>>
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe linux- 
>>> nfs" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>> --
>> Chuck Lever
>> chuck[dot]lever[at]oracle[dot]com
>>
>>
>>
>>
>>

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found]                         ` <d7f0b3a81001251138h30e25428o25db9bc8c0884636-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2010-01-25 19:48                           ` Whoop Whouzer
  2010-01-25 21:01                           ` Chuck Lever
  1 sibling, 0 replies; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-25 19:48 UTC (permalink / raw)
  To: Chuck Lever; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

Hhm, now that I look closer, maybe there are no real differences at
all. Just different address allocations...

On Mon, Jan 25, 2010 at 8:38 PM, Whoop Whouzer <tiredandnumb@gmail.com>=
 wrote:
> Running =A0"strace nautilus" gives me allot of output. When I run it
> while the server is down it completes the trace without a hiccup, it
> returns and than nautilus is launched and hangs.
> There are differences between the traces (with server up and server
> down). I can't really see where the problem lies in there.
>
> On Mon, Jan 25, 2010 at 8:08 PM, Chuck Lever <chuck.lever@oracle.com>=
 wrote:
>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>>
>>> Ok, I did that, after shutting down the server and enabling debug
>>> trace I tried to open the home folder of the current account (total=
ly
>>> unrelated to the nfsshare), it wouldn't open at all, I got no nauti=
lus
>>> at all. During the time my cursor was in busy mode I got the follow=
ing
>>> messages in kern.log (for ubuntu 10.04 client):
>>> Jan 25 19:30:13 whoop-desktop kernel: [ =A0160.719262] NFS call =A0=
fsstat
>>> Jan 25 19:30:37 whoop-desktop kernel: [ =A0184.458611] NFS:
>>> permission(0:16/74386), mask=3D0x10, res=3D0
>>> Jan 25 19:30:37 whoop-desktop kernel: [ =A0184.458647] NFS call =A0=
access
>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721086] nfs: server
>>> 192.168.1.130 not responding, timed out
>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721113] NFS reply st=
atfs: -5
>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721116] nfs_statfs:
>>> statfs error =3D 5
>>> These series of traces are repeating over and over again at a set
>>> interval (there is no flooding of the logs), even if I do nothing.
>>> It's even worse than I thought because when I tried to shutdown, th=
e
>>> machine wouldn't shutdown because it claimed
>>> the "File manager" was still running (although it was not visible o=
n
>>> screen); so I had to kill that before I could shutdown (properly).
>>>
>>> In Fedora 12 I had a similar user experience (nautilus did show up
>>> without showing any contents and it was hanging). I had enabled
>>> tracing and it seems to be logged to /var/log/messages. I got this
>>> output in fedora:
>>> Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
>>> Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error =3D 5
>>> Jan 25 20:48:38 localhost kernel: NFS call =A0fsstat
>>> Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
>>> responding, timed out
>>> Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
>>> Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode: (0:14/74386=
)
>>> getattr failed, error=3D-5
>>> Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
>>> Jan 25 20:49:25 localhost kernel: NFS call =A0getattr
>>> Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
>>> responding, timed out
>>> Jan 25 20:50:14 localhost kernel: NFS reply access: -5
>>> Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386), mask=
=3D0x1,
>>> res=3D-5
>>> Jan 25 20:50:14 localhost kernel: NFS call =A0access
>>> Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
>>> responding, timed out
>>> Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
>>> Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error =3D 5
>>> Jan 25 20:51:14 localhost kernel: NFS call =A0fsstat
>>> Most of the trace is repeating in set intervals as well, there is n=
o
>>> flooding of the logs...
>>> Fedora would not shutdown normally either
>>
>> This verifies that your client is attempting to access the NFS serve=
r, but
>> doesn't tell us which file it's attempting to access. =A0Essentially=
 the EIO
>> means "failed to connect".
>>
>> Maybe try an strace of the nautilus process next?
>>
>>> On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever <chuck.lever-QHcLZuEGTsthl2p70BpVqQ@public.gmane.org=
m>
>>> wrote:
>>>>
>>>> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>>>>
>>>>> I did some network traces and there is nothing strange happening =
as
>>>>> far as I can tell. I shut down the server (some network traffic
>>>>> occurred as is to be expected). It got quiet again, I launched
>>>>> nautilus, it got stuck without displaying anything and there was =
no
>>>>> real network activity except 3 broadcasts using the ARP protocol
>>>>> asking where the server was (could be just coincidence).
>>>>
>>>> That sounds like the client does want to reconnect with the server=
=2E
>>>>
>>>> You could try enabling debug tracing on your client (sudo rpcdebug=
 -m nfs
>>>> -s
>>>> all) after shutting down your server, then try to start nautilus. =
=A0The
>>>> kernel log would then contain NFS-related messages that might indi=
cate
>>>> where
>>>> to look next.
>>>>
>>>>> Closing
>>>>> nautilus and launching it again will let it hang again but I see =
no
>>>>> additional network traffic. After a while nautilus will display t=
he
>>>>> contents of the folder without any network traffic.
>>>>>
>>>>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp=
=2Ecom>
>>>>> wrote:
>>>>>>
>>>>>> Perhaps something in your $PATH is in the NFS mount? =A0Do a net=
work
>>>>>> trace
>>>>>> and maybe you can see if, in fact, there are actually NFS operat=
ions
>>>>>> being
>>>>>> attempted that you weren't expecting. =A0Then try to figure out =
why.
>>>>>>
>>>>>> =A0-Dan
>>>>>>
>>>>>>> -----Original Message-----
>>>>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>>>>> To: Peter Chacko
>>>>>>> Cc: linux-nfs@vger.kernel.org
>>>>>>> Subject: Re: nfs client performance while server is down
>>>>>>>
>>>>>>> I don't remember all the different set-ups I tried it on, but I=
 just
>>>>>>> confirmed this with the following combinations:
>>>>>>>
>>>>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu d=
esktop
>>>>>>> 10.04 (alpha 2), fedora 12
>>>>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.0=
4
>>>>>>> (alpha 2), fedora 12
>>>>>>>
>>>>>>> I'll be happy to test it on another client machine (distro) eve=
n
>>>>>>> another server (although it would require a little more time)
>>>>>>>
>>>>>>> Here are some examples on the bugreports I noticed and how they=
 do not
>>>>>>> seem to get solved:
>>>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=3D175283
>>>>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>>>>>>
>>>>>>> regards,
>>>>>>> Whoop
>>>>>>>
>>>>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>>>>> <peterchacko35@gmail.com> wrote:
>>>>>>>>
>>>>>>>> Which client OS you observed this behavior ? =A0This has nothi=
ng to do
>>>>>>>> NFS design, and its purely stateless...Its upto the client OS
>>>>>>>> implementation about aspects like how to deal with local
>>>>>>>
>>>>>>> IO, when NFS
>>>>>>>>
>>>>>>>> share gets =A0disconnected..
>>>>>>>>
>>>>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>>>>
>>>>>>>> thanks
>>>>>>>>
>>>>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>>>>
>>>>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>> Howdy,
>>>>>>>>>
>>>>>>>>> I was wondering why nfs is designed in such a way that the
>>>>>>>
>>>>>>> performance
>>>>>>>>>
>>>>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>>>>
>>>>>>> is offline?
>>>>>>>>>
>>>>>>>>> This is even the case with a soft mount (either via mount
>>>>>>>
>>>>>>> or fstab).
>>>>>>>>>
>>>>>>>>> Just about every application that requires disk access (not t=
alking
>>>>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>>>>
>>>>>>> For instance
>>>>>>>>>
>>>>>>>>> nautilus becomes unresponsive when displaying the contents of=
 any
>>>>>>>>> folder on the local disk,
>>>>>>>>> playing movie files (stored on local disk) let totem or
>>>>>>>
>>>>>>> vlc get stuck
>>>>>>>>>
>>>>>>>>> on set intervals, even the terminal becomes unresponsive at t=
imes.
>>>>>>>>>
>>>>>>>>> I could understand that these problems would occur while
>>>>>>>
>>>>>>> accessing the
>>>>>>>>>
>>>>>>>>> nfs share directoiourry while the server is offline, but
>>>>>>>
>>>>>>> why for totally
>>>>>>>>>
>>>>>>>>> unrelated directories?
>>>>>>>>>
>>>>>>>>> I have experienced this behaviour on various distro's, and
>>>>>>>
>>>>>>> also found
>>>>>>>>>
>>>>>>>>> various bug reports on this issue, they don't seem to get sol=
ved as
>>>>>>>>> this is viewed as nfs design.
>>>>>>>>> I see this as a flaw because clients are totally dependent on=
 the
>>>>>>>>> server. This would be less of a deal if the entire home direc=
tory
>>>>>>>>> would be stored on nfs (although I even think some sort of
>>>>>>>>> synchronisation technology could and should be implemented in=
 this
>>>>>>>>> case). It is a bit odd that (technically) one machine serving=
 some
>>>>>>>>> "useless" files to a non-trivial directory on client
>>>>>>>
>>>>>>> machines can take
>>>>>>>>>
>>>>>>>>> down these client machines.
>>>>>>>>>
>>>>>>>>> For me the preferred functionality would be:
>>>>>>>>> *If an nfs server gets offline the client's nfs share becomes
>>>>>>>>> unaccessible, but local directories and applications (that on=
ly
>>>>>>>>> require local disk access) stay responsive.
>>>>>>>>> *If an nfs server gets online (after being offline while the =
client
>>>>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>>>>
>>>>>>>>> regards,
>>>>>>>>> Whoop
>>>>>>>>> --
>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>
>>>>>>> linux-nfs" in
>>>>>>>>>
>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-in=
fo.html
>>>>>>>>>
>>>>>>>>
>>>>>>> --
>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>> linux-nfs" in
>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-info=
=2Ehtml
>>>>>>>
>>>>>>
>>>>> --
>>>>> To unsubscribe from this list: send the line "unsubscribe linux-n=
fs" in
>>>>> the body of a message to majordomo@vger.kernel.org
>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-info.h=
tml
>>>>
>>>> --
>>>> Chuck Lever
>>>> chuck[dot]lever[at]oracle[dot]com
>>>>
>>>>
>>>>
>>>>
>>>>
>>
>> --
>> Chuck Lever
>> chuck[dot]lever[at]oracle[dot]com
>>
>>
>>
>>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found]                         ` <d7f0b3a81001251138h30e25428o25db9bc8c0884636-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  2010-01-25 19:48                           ` Whoop Whouzer
@ 2010-01-25 21:01                           ` Chuck Lever
  2010-01-25 21:18                             ` Whoop Whouzer
  1 sibling, 1 reply; 24+ messages in thread
From: Chuck Lever @ 2010-01-25 21:01 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

On Jan 25, 2010, at 2:38 PM, Whoop Whouzer wrote:
> Running  "strace nautilus" gives me allot of output. When I run it
> while the server is down it completes the trace without a hiccup, it
> returns and than nautilus is launched and hangs.
> There are differences between the traces (with server up and server
> down). I can't really see where the problem lies in there.

I would expect that the command-line nautilus forks when it starts  
up.  If it has some option you can specify to prevent that, it might  
allow a deeper look.  You would need to tell strace to look at the  
children, too.

> On Mon, Jan 25, 2010 at 8:08 PM, Chuck Lever  
> <chuck.lever@oracle.com> wrote:
>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>>
>>> Ok, I did that, after shutting down the server and enabling debug
>>> trace I tried to open the home folder of the current account  
>>> (totally
>>> unrelated to the nfsshare), it wouldn't open at all, I got no  
>>> nautilus
>>> at all. During the time my cursor was in busy mode I got the  
>>> following
>>> messages in kern.log (for ubuntu 10.04 client):
>>> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call   
>>> fsstat
>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
>>> permission(0:16/74386), mask=0x10, res=0
>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call   
>>> access
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
>>> 192.168.1.130 not responding, timed out
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply  
>>> statfs: -5
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
>>> statfs error = 5
>>> These series of traces are repeating over and over again at a set
>>> interval (there is no flooding of the logs), even if I do nothing.
>>> It's even worse than I thought because when I tried to shutdown, the
>>> machine wouldn't shutdown because it claimed
>>> the "File manager" was still running (although it was not visible on
>>> screen); so I had to kill that before I could shutdown (properly).
>>>
>>> In Fedora 12 I had a similar user experience (nautilus did show up
>>> without showing any contents and it was hanging). I had enabled
>>> tracing and it seems to be logged to /var/log/messages. I got this
>>> output in fedora:
>>> Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
>>> Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error = 5
>>> Jan 25 20:48:38 localhost kernel: NFS call  fsstat
>>> Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
>>> responding, timed out
>>> Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
>>> Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode: (0:14/74386)
>>> getattr failed, error=-5
>>> Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
>>> Jan 25 20:49:25 localhost kernel: NFS call  getattr
>>> Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
>>> responding, timed out
>>> Jan 25 20:50:14 localhost kernel: NFS reply access: -5
>>> Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386),  
>>> mask=0x1,
>>> res=-5
>>> Jan 25 20:50:14 localhost kernel: NFS call  access
>>> Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
>>> responding, timed out
>>> Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
>>> Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error = 5
>>> Jan 25 20:51:14 localhost kernel: NFS call  fsstat
>>> Most of the trace is repeating in set intervals as well, there is no
>>> flooding of the logs...
>>> Fedora would not shutdown normally either
>>
>> This verifies that your client is attempting to access the NFS  
>> server, but
>> doesn't tell us which file it's attempting to access.  Essentially  
>> the EIO
>> means "failed to connect".
>>
>> Maybe try an strace of the nautilus process next?
>>
>>> On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever  
>>> <chuck.lever@oracle.com>
>>> wrote:
>>>>
>>>> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>>>>
>>>>> I did some network traces and there is nothing strange happening  
>>>>> as
>>>>> far as I can tell. I shut down the server (some network traffic
>>>>> occurred as is to be expected). It got quiet again, I launched
>>>>> nautilus, it got stuck without displaying anything and there was  
>>>>> no
>>>>> real network activity except 3 broadcasts using the ARP protocol
>>>>> asking where the server was (could be just coincidence).
>>>>
>>>> That sounds like the client does want to reconnect with the server.
>>>>
>>>> You could try enabling debug tracing on your client (sudo  
>>>> rpcdebug -m nfs
>>>> -s
>>>> all) after shutting down your server, then try to start  
>>>> nautilus.  The
>>>> kernel log would then contain NFS-related messages that might  
>>>> indicate
>>>> where
>>>> to look next.
>>>>
>>>>> Closing
>>>>> nautilus and launching it again will let it hang again but I see  
>>>>> no
>>>>> additional network traffic. After a while nautilus will display  
>>>>> the
>>>>> contents of the folder without any network traffic.
>>>>>
>>>>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp.com 
>>>>> >
>>>>> wrote:
>>>>>>
>>>>>> Perhaps something in your $PATH is in the NFS mount?  Do a  
>>>>>> network
>>>>>> trace
>>>>>> and maybe you can see if, in fact, there are actually NFS  
>>>>>> operations
>>>>>> being
>>>>>> attempted that you weren't expecting.  Then try to figure out  
>>>>>> why.
>>>>>>
>>>>>>  -Dan
>>>>>>
>>>>>>> -----Original Message-----
>>>>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>>>>> To: Peter Chacko
>>>>>>> Cc: linux-nfs@vger.kernel.org
>>>>>>> Subject: Re: nfs client performance while server is down
>>>>>>>
>>>>>>> I don't remember all the different set-ups I tried it on, but  
>>>>>>> I just
>>>>>>> confirmed this with the following combinations:
>>>>>>>
>>>>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu  
>>>>>>> desktop
>>>>>>> 10.04 (alpha 2), fedora 12
>>>>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.04
>>>>>>> (alpha 2), fedora 12
>>>>>>>
>>>>>>> I'll be happy to test it on another client machine (distro) even
>>>>>>> another server (although it would require a little more time)
>>>>>>>
>>>>>>> Here are some examples on the bugreports I noticed and how  
>>>>>>> they do not
>>>>>>> seem to get solved:
>>>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=175283
>>>>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164120
>>>>>>>
>>>>>>> regards,
>>>>>>> Whoop
>>>>>>>
>>>>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>>>>> <peterchacko35@gmail.com> wrote:
>>>>>>>>
>>>>>>>> Which client OS you observed this behavior ?  This has  
>>>>>>>> nothing to do
>>>>>>>> NFS design, and its purely stateless...Its upto the client OS
>>>>>>>> implementation about aspects like how to deal with local
>>>>>>>
>>>>>>> IO, when NFS
>>>>>>>>
>>>>>>>> share gets  disconnected..
>>>>>>>>
>>>>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>>>>
>>>>>>>> thanks
>>>>>>>>
>>>>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>>>>
>>>>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>> Howdy,
>>>>>>>>>
>>>>>>>>> I was wondering why nfs is designed in such a way that the
>>>>>>>
>>>>>>> performance
>>>>>>>>>
>>>>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>>>>
>>>>>>> is offline?
>>>>>>>>>
>>>>>>>>> This is even the case with a soft mount (either via mount
>>>>>>>
>>>>>>> or fstab).
>>>>>>>>>
>>>>>>>>> Just about every application that requires disk access (not  
>>>>>>>>> talking
>>>>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>>>>
>>>>>>> For instance
>>>>>>>>>
>>>>>>>>> nautilus becomes unresponsive when displaying the contents  
>>>>>>>>> of any
>>>>>>>>> folder on the local disk,
>>>>>>>>> playing movie files (stored on local disk) let totem or
>>>>>>>
>>>>>>> vlc get stuck
>>>>>>>>>
>>>>>>>>> on set intervals, even the terminal becomes unresponsive at  
>>>>>>>>> times.
>>>>>>>>>
>>>>>>>>> I could understand that these problems would occur while
>>>>>>>
>>>>>>> accessing the
>>>>>>>>>
>>>>>>>>> nfs share directoiourry while the server is offline, but
>>>>>>>
>>>>>>> why for totally
>>>>>>>>>
>>>>>>>>> unrelated directories?
>>>>>>>>>
>>>>>>>>> I have experienced this behaviour on various distro's, and
>>>>>>>
>>>>>>> also found
>>>>>>>>>
>>>>>>>>> various bug reports on this issue, they don't seem to get  
>>>>>>>>> solved as
>>>>>>>>> this is viewed as nfs design.
>>>>>>>>> I see this as a flaw because clients are totally dependent  
>>>>>>>>> on the
>>>>>>>>> server. This would be less of a deal if the entire home  
>>>>>>>>> directory
>>>>>>>>> would be stored on nfs (although I even think some sort of
>>>>>>>>> synchronisation technology could and should be implemented  
>>>>>>>>> in this
>>>>>>>>> case). It is a bit odd that (technically) one machine  
>>>>>>>>> serving some
>>>>>>>>> "useless" files to a non-trivial directory on client
>>>>>>>
>>>>>>> machines can take
>>>>>>>>>
>>>>>>>>> down these client machines.
>>>>>>>>>
>>>>>>>>> For me the preferred functionality would be:
>>>>>>>>> *If an nfs server gets offline the client's nfs share becomes
>>>>>>>>> unaccessible, but local directories and applications (that  
>>>>>>>>> only
>>>>>>>>> require local disk access) stay responsive.
>>>>>>>>> *If an nfs server gets online (after being offline while the  
>>>>>>>>> client
>>>>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>>>>
>>>>>>>>> regards,
>>>>>>>>> Whoop
>>>>>>>>> --
>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>
>>>>>>> linux-nfs" in
>>>>>>>>>
>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>>>>
>>>>>>>>
>>>>>>> --
>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>> linux-nfs" in
>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>>
>>>>>>
>>>>> --
>>>>> To unsubscribe from this list: send the line "unsubscribe linux- 
>>>>> nfs" in
>>>>> the body of a message to majordomo@vger.kernel.org
>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>
>>>> --
>>>> Chuck Lever
>>>> chuck[dot]lever[at]oracle[dot]com
>>>>
>>>>
>>>>
>>>>
>>>>
>>
>> --
>> Chuck Lever
>> chuck[dot]lever[at]oracle[dot]com
>>
>>
>>
>>
> <stracesdiff.log>

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-25 21:01                           ` Chuck Lever
@ 2010-01-25 21:18                             ` Whoop Whouzer
       [not found]                               ` <d7f0b3a81001251318k42de9be2qe54f83bbd86cabb8-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-25 21:18 UTC (permalink / raw)
  To: Chuck Lever; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

Any idea how I could do that?

On Mon, Jan 25, 2010 at 10:01 PM, Chuck Lever <chuck.lever@oracle.com> =
wrote:
> On Jan 25, 2010, at 2:38 PM, Whoop Whouzer wrote:
>>
>> Running =A0"strace nautilus" gives me allot of output. When I run it
>> while the server is down it completes the trace without a hiccup, it
>> returns and than nautilus is launched and hangs.
>> There are differences between the traces (with server up and server
>> down). I can't really see where the problem lies in there.
>
> I would expect that the command-line nautilus forks when it starts up=
=2E =A0If
> it has some option you can specify to prevent that, it might allow a =
deeper
> look. =A0You would need to tell strace to look at the children, too.
>
>> On Mon, Jan 25, 2010 at 8:08 PM, Chuck Lever <chuck.lever@oracle.com=
>
>> wrote:
>>>
>>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>>>
>>>> Ok, I did that, after shutting down the server and enabling debug
>>>> trace I tried to open the home folder of the current account (tota=
lly
>>>> unrelated to the nfsshare), it wouldn't open at all, I got no naut=
ilus
>>>> at all. During the time my cursor was in busy mode I got the follo=
wing
>>>> messages in kern.log (for ubuntu 10.04 client):
>>>> Jan 25 19:30:13 whoop-desktop kernel: [ =A0160.719262] NFS call =A0=
fsstat
>>>> Jan 25 19:30:37 whoop-desktop kernel: [ =A0184.458611] NFS:
>>>> permission(0:16/74386), mask=3D0x10, res=3D0
>>>> Jan 25 19:30:37 whoop-desktop kernel: [ =A0184.458647] NFS call =A0=
access
>>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721086] nfs: server
>>>> 192.168.1.130 not responding, timed out
>>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721113] NFS reply s=
tatfs:
>>>> -5
>>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721116] nfs_statfs:
>>>> statfs error =3D 5
>>>> These series of traces are repeating over and over again at a set
>>>> interval (there is no flooding of the logs), even if I do nothing.
>>>> It's even worse than I thought because when I tried to shutdown, t=
he
>>>> machine wouldn't shutdown because it claimed
>>>> the "File manager" was still running (although it was not visible =
on
>>>> screen); so I had to kill that before I could shutdown (properly).
>>>>
>>>> In Fedora 12 I had a similar user experience (nautilus did show up
>>>> without showing any contents and it was hanging). I had enabled
>>>> tracing and it seems to be logged to /var/log/messages. I got this
>>>> output in fedora:
>>>> Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
>>>> Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error =3D 5
>>>> Jan 25 20:48:38 localhost kernel: NFS call =A0fsstat
>>>> Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
>>>> responding, timed out
>>>> Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
>>>> Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode: (0:14/7438=
6)
>>>> getattr failed, error=3D-5
>>>> Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
>>>> Jan 25 20:49:25 localhost kernel: NFS call =A0getattr
>>>> Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
>>>> responding, timed out
>>>> Jan 25 20:50:14 localhost kernel: NFS reply access: -5
>>>> Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386), mas=
k=3D0x1,
>>>> res=3D-5
>>>> Jan 25 20:50:14 localhost kernel: NFS call =A0access
>>>> Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
>>>> responding, timed out
>>>> Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
>>>> Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error =3D 5
>>>> Jan 25 20:51:14 localhost kernel: NFS call =A0fsstat
>>>> Most of the trace is repeating in set intervals as well, there is =
no
>>>> flooding of the logs...
>>>> Fedora would not shutdown normally either
>>>
>>> This verifies that your client is attempting to access the NFS serv=
er,
>>> but
>>> doesn't tell us which file it's attempting to access. =A0Essentiall=
y the
>>> EIO
>>> means "failed to connect".
>>>
>>> Maybe try an strace of the nautilus process next?
>>>
>>>> On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever <chuck.lever@oracle.c=
om>
>>>> wrote:
>>>>>
>>>>> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>>>>>
>>>>>> I did some network traces and there is nothing strange happening=
 as
>>>>>> far as I can tell. I shut down the server (some network traffic
>>>>>> occurred as is to be expected). It got quiet again, I launched
>>>>>> nautilus, it got stuck without displaying anything and there was=
 no
>>>>>> real network activity except 3 broadcasts using the ARP protocol
>>>>>> asking where the server was (could be just coincidence).
>>>>>
>>>>> That sounds like the client does want to reconnect with the serve=
r.
>>>>>
>>>>> You could try enabling debug tracing on your client (sudo rpcdebu=
g -m
>>>>> nfs
>>>>> -s
>>>>> all) after shutting down your server, then try to start nautilus.=
 =A0The
>>>>> kernel log would then contain NFS-related messages that might ind=
icate
>>>>> where
>>>>> to look next.
>>>>>
>>>>>> Closing
>>>>>> nautilus and launching it again will let it hang again but I see=
 no
>>>>>> additional network traffic. After a while nautilus will display =
the
>>>>>> contents of the folder without any network traffic.
>>>>>>
>>>>>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netap=
p.com>
>>>>>> wrote:
>>>>>>>
>>>>>>> Perhaps something in your $PATH is in the NFS mount? =A0Do a ne=
twork
>>>>>>> trace
>>>>>>> and maybe you can see if, in fact, there are actually NFS opera=
tions
>>>>>>> being
>>>>>>> attempted that you weren't expecting. =A0Then try to figure out=
 why.
>>>>>>>
>>>>>>> =A0-Dan
>>>>>>>
>>>>>>>> -----Original Message-----
>>>>>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>>>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>>>>>> To: Peter Chacko
>>>>>>>> Cc: linux-nfs@vger.kernel.org
>>>>>>>> Subject: Re: nfs client performance while server is down
>>>>>>>>
>>>>>>>> I don't remember all the different set-ups I tried it on, but =
I just
>>>>>>>> confirmed this with the following combinations:
>>>>>>>>
>>>>>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu
>>>>>>>> desktop
>>>>>>>> 10.04 (alpha 2), fedora 12
>>>>>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 10.=
04
>>>>>>>> (alpha 2), fedora 12
>>>>>>>>
>>>>>>>> I'll be happy to test it on another client machine (distro) ev=
en
>>>>>>>> another server (although it would require a little more time)
>>>>>>>>
>>>>>>>> Here are some examples on the bugreports I noticed and how the=
y do
>>>>>>>> not
>>>>>>>> seem to get solved:
>>>>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=3D175283
>>>>>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/16412=
0
>>>>>>>>
>>>>>>>> regards,
>>>>>>>> Whoop
>>>>>>>>
>>>>>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>>>>>> <peterchacko35@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>> Which client OS you observed this behavior ? =A0This has noth=
ing to
>>>>>>>>> do
>>>>>>>>> NFS design, and its purely stateless...Its upto the client OS
>>>>>>>>> implementation about aspects like how to deal with local
>>>>>>>>
>>>>>>>> IO, when NFS
>>>>>>>>>
>>>>>>>>> share gets =A0disconnected..
>>>>>>>>>
>>>>>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>>>>>
>>>>>>>>> thanks
>>>>>>>>>
>>>>>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>>>>>
>>>>>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>>>>>
>>>>>>>>>> Howdy,
>>>>>>>>>>
>>>>>>>>>> I was wondering why nfs is designed in such a way that the
>>>>>>>>
>>>>>>>> performance
>>>>>>>>>>
>>>>>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>>>>>
>>>>>>>> is offline?
>>>>>>>>>>
>>>>>>>>>> This is even the case with a soft mount (either via mount
>>>>>>>>
>>>>>>>> or fstab).
>>>>>>>>>>
>>>>>>>>>> Just about every application that requires disk access (not
>>>>>>>>>> talking
>>>>>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>>>>>
>>>>>>>> For instance
>>>>>>>>>>
>>>>>>>>>> nautilus becomes unresponsive when displaying the contents o=
f any
>>>>>>>>>> folder on the local disk,
>>>>>>>>>> playing movie files (stored on local disk) let totem or
>>>>>>>>
>>>>>>>> vlc get stuck
>>>>>>>>>>
>>>>>>>>>> on set intervals, even the terminal becomes unresponsive at =
times.
>>>>>>>>>>
>>>>>>>>>> I could understand that these problems would occur while
>>>>>>>>
>>>>>>>> accessing the
>>>>>>>>>>
>>>>>>>>>> nfs share directoiourry while the server is offline, but
>>>>>>>>
>>>>>>>> why for totally
>>>>>>>>>>
>>>>>>>>>> unrelated directories?
>>>>>>>>>>
>>>>>>>>>> I have experienced this behaviour on various distro's, and
>>>>>>>>
>>>>>>>> also found
>>>>>>>>>>
>>>>>>>>>> various bug reports on this issue, they don't seem to get so=
lved
>>>>>>>>>> as
>>>>>>>>>> this is viewed as nfs design.
>>>>>>>>>> I see this as a flaw because clients are totally dependent o=
n the
>>>>>>>>>> server. This would be less of a deal if the entire home dire=
ctory
>>>>>>>>>> would be stored on nfs (although I even think some sort of
>>>>>>>>>> synchronisation technology could and should be implemented i=
n this
>>>>>>>>>> case). It is a bit odd that (technically) one machine servin=
g some
>>>>>>>>>> "useless" files to a non-trivial directory on client
>>>>>>>>
>>>>>>>> machines can take
>>>>>>>>>>
>>>>>>>>>> down these client machines.
>>>>>>>>>>
>>>>>>>>>> For me the preferred functionality would be:
>>>>>>>>>> *If an nfs server gets offline the client's nfs share become=
s
>>>>>>>>>> unaccessible, but local directories and applications (that o=
nly
>>>>>>>>>> require local disk access) stay responsive.
>>>>>>>>>> *If an nfs server gets online (after being offline while the
>>>>>>>>>> client
>>>>>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>>>>>
>>>>>>>>>> regards,
>>>>>>>>>> Whoop
>>>>>>>>>> --
>>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>
>>>>>>>> linux-nfs" in
>>>>>>>>>>
>>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-i=
nfo.html
>>>>>>>>>>
>>>>>>>>>
>>>>>>>> --
>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>> linux-nfs" in
>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-inf=
o.html
>>>>>>>>
>>>>>>>
>>>>>> --
>>>>>> To unsubscribe from this list: send the line "unsubscribe linux-=
nfs"
>>>>>> in
>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-info.=
html
>>>>>
>>>>> --
>>>>> Chuck Lever
>>>>> chuck[dot]lever[at]oracle[dot]com
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>
>>> --
>>> Chuck Lever
>>> chuck[dot]lever[at]oracle[dot]com
>>>
>>>
>>>
>>>
>> <stracesdiff.log>
>
> --
> Chuck Lever
> chuck[dot]lever[at]oracle[dot]com
>
>
>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
       [not found]                               ` <d7f0b3a81001251318k42de9be2qe54f83bbd86cabb8-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2010-01-25 21:26                                 ` Chuck Lever
  2010-01-25 23:03                                   ` Whoop Whouzer
  0 siblings, 1 reply; 24+ messages in thread
From: Chuck Lever @ 2010-01-25 21:26 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

On Jan 25, 2010, at 4:18 PM, Whoop Whouzer wrote:
> Any idea how I could do that?

The strace(1) man page says "-f" follows children.  In any event, you  
can strace the running children processes using "strace -p <pid>".

> On Mon, Jan 25, 2010 at 10:01 PM, Chuck Lever  
> <chuck.lever@oracle.com> wrote:
>> On Jan 25, 2010, at 2:38 PM, Whoop Whouzer wrote:
>>>
>>> Running  "strace nautilus" gives me allot of output. When I run it
>>> while the server is down it completes the trace without a hiccup, it
>>> returns and than nautilus is launched and hangs.
>>> There are differences between the traces (with server up and server
>>> down). I can't really see where the problem lies in there.
>>
>> I would expect that the command-line nautilus forks when it starts  
>> up.  If
>> it has some option you can specify to prevent that, it might allow  
>> a deeper
>> look.  You would need to tell strace to look at the children, too.
>>
>>> On Mon, Jan 25, 2010 at 8:08 PM, Chuck Lever  
>>> <chuck.lever@oracle.com>
>>> wrote:
>>>>
>>>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>>>>
>>>>> Ok, I did that, after shutting down the server and enabling debug
>>>>> trace I tried to open the home folder of the current account  
>>>>> (totally
>>>>> unrelated to the nfsshare), it wouldn't open at all, I got no  
>>>>> nautilus
>>>>> at all. During the time my cursor was in busy mode I got the  
>>>>> following
>>>>> messages in kern.log (for ubuntu 10.04 client):
>>>>> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call   
>>>>> fsstat
>>>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
>>>>> permission(0:16/74386), mask=0x10, res=0
>>>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call   
>>>>> access
>>>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
>>>>> 192.168.1.130 not responding, timed out
>>>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply  
>>>>> statfs:
>>>>> -5
>>>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
>>>>> statfs error = 5
>>>>> These series of traces are repeating over and over again at a set
>>>>> interval (there is no flooding of the logs), even if I do nothing.
>>>>> It's even worse than I thought because when I tried to shutdown,  
>>>>> the
>>>>> machine wouldn't shutdown because it claimed
>>>>> the "File manager" was still running (although it was not  
>>>>> visible on
>>>>> screen); so I had to kill that before I could shutdown (properly).
>>>>>
>>>>> In Fedora 12 I had a similar user experience (nautilus did show up
>>>>> without showing any contents and it was hanging). I had enabled
>>>>> tracing and it seems to be logged to /var/log/messages. I got this
>>>>> output in fedora:
>>>>> Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
>>>>> Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error = 5
>>>>> Jan 25 20:48:38 localhost kernel: NFS call  fsstat
>>>>> Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
>>>>> responding, timed out
>>>>> Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
>>>>> Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode:  
>>>>> (0:14/74386)
>>>>> getattr failed, error=-5
>>>>> Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
>>>>> Jan 25 20:49:25 localhost kernel: NFS call  getattr
>>>>> Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
>>>>> responding, timed out
>>>>> Jan 25 20:50:14 localhost kernel: NFS reply access: -5
>>>>> Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386),  
>>>>> mask=0x1,
>>>>> res=-5
>>>>> Jan 25 20:50:14 localhost kernel: NFS call  access
>>>>> Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
>>>>> responding, timed out
>>>>> Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
>>>>> Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error = 5
>>>>> Jan 25 20:51:14 localhost kernel: NFS call  fsstat
>>>>> Most of the trace is repeating in set intervals as well, there  
>>>>> is no
>>>>> flooding of the logs...
>>>>> Fedora would not shutdown normally either
>>>>
>>>> This verifies that your client is attempting to access the NFS  
>>>> server,
>>>> but
>>>> doesn't tell us which file it's attempting to access.   
>>>> Essentially the
>>>> EIO
>>>> means "failed to connect".
>>>>
>>>> Maybe try an strace of the nautilus process next?
>>>>
>>>>> On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever <chuck.lever@oracle.com 
>>>>> >
>>>>> wrote:
>>>>>>
>>>>>> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>>>>>>
>>>>>>> I did some network traces and there is nothing strange  
>>>>>>> happening as
>>>>>>> far as I can tell. I shut down the server (some network traffic
>>>>>>> occurred as is to be expected). It got quiet again, I launched
>>>>>>> nautilus, it got stuck without displaying anything and there  
>>>>>>> was no
>>>>>>> real network activity except 3 broadcasts using the ARP protocol
>>>>>>> asking where the server was (could be just coincidence).
>>>>>>
>>>>>> That sounds like the client does want to reconnect with the  
>>>>>> server.
>>>>>>
>>>>>> You could try enabling debug tracing on your client (sudo  
>>>>>> rpcdebug -m
>>>>>> nfs
>>>>>> -s
>>>>>> all) after shutting down your server, then try to start  
>>>>>> nautilus.  The
>>>>>> kernel log would then contain NFS-related messages that might  
>>>>>> indicate
>>>>>> where
>>>>>> to look next.
>>>>>>
>>>>>>> Closing
>>>>>>> nautilus and launching it again will let it hang again but I  
>>>>>>> see no
>>>>>>> additional network traffic. After a while nautilus will  
>>>>>>> display the
>>>>>>> contents of the folder without any network traffic.
>>>>>>>
>>>>>>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel <Dan.Muntz@netapp.com 
>>>>>>> >
>>>>>>> wrote:
>>>>>>>>
>>>>>>>> Perhaps something in your $PATH is in the NFS mount?  Do a  
>>>>>>>> network
>>>>>>>> trace
>>>>>>>> and maybe you can see if, in fact, there are actually NFS  
>>>>>>>> operations
>>>>>>>> being
>>>>>>>> attempted that you weren't expecting.  Then try to figure out  
>>>>>>>> why.
>>>>>>>>
>>>>>>>>  -Dan
>>>>>>>>
>>>>>>>>> -----Original Message-----
>>>>>>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>>>>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>>>>>>> To: Peter Chacko
>>>>>>>>> Cc: linux-nfs@vger.kernel.org
>>>>>>>>> Subject: Re: nfs client performance while server is down
>>>>>>>>>
>>>>>>>>> I don't remember all the different set-ups I tried it on,  
>>>>>>>>> but I just
>>>>>>>>> confirmed this with the following combinations:
>>>>>>>>>
>>>>>>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubuntu
>>>>>>>>> desktop
>>>>>>>>> 10.04 (alpha 2), fedora 12
>>>>>>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop  
>>>>>>>>> 10.04
>>>>>>>>> (alpha 2), fedora 12
>>>>>>>>>
>>>>>>>>> I'll be happy to test it on another client machine (distro)  
>>>>>>>>> even
>>>>>>>>> another server (although it would require a little more time)
>>>>>>>>>
>>>>>>>>> Here are some examples on the bugreports I noticed and how  
>>>>>>>>> they do
>>>>>>>>> not
>>>>>>>>> seem to get solved:
>>>>>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=175283
>>>>>>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/ 
>>>>>>>>> 164120
>>>>>>>>>
>>>>>>>>> regards,
>>>>>>>>> Whoop
>>>>>>>>>
>>>>>>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>>>>>>> <peterchacko35@gmail.com> wrote:
>>>>>>>>>>
>>>>>>>>>> Which client OS you observed this behavior ?  This has  
>>>>>>>>>> nothing to
>>>>>>>>>> do
>>>>>>>>>> NFS design, and its purely stateless...Its upto the client OS
>>>>>>>>>> implementation about aspects like how to deal with local
>>>>>>>>>
>>>>>>>>> IO, when NFS
>>>>>>>>>>
>>>>>>>>>> share gets  disconnected..
>>>>>>>>>>
>>>>>>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>>>>>>
>>>>>>>>>> thanks
>>>>>>>>>>
>>>>>>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>>>>>>
>>>>>>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>>>>>>
>>>>>>>>>>> Howdy,
>>>>>>>>>>>
>>>>>>>>>>> I was wondering why nfs is designed in such a way that the
>>>>>>>>>
>>>>>>>>> performance
>>>>>>>>>>>
>>>>>>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>>>>>>
>>>>>>>>> is offline?
>>>>>>>>>>>
>>>>>>>>>>> This is even the case with a soft mount (either via mount
>>>>>>>>>
>>>>>>>>> or fstab).
>>>>>>>>>>>
>>>>>>>>>>> Just about every application that requires disk access (not
>>>>>>>>>>> talking
>>>>>>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>>>>>>
>>>>>>>>> For instance
>>>>>>>>>>>
>>>>>>>>>>> nautilus becomes unresponsive when displaying the contents  
>>>>>>>>>>> of any
>>>>>>>>>>> folder on the local disk,
>>>>>>>>>>> playing movie files (stored on local disk) let totem or
>>>>>>>>>
>>>>>>>>> vlc get stuck
>>>>>>>>>>>
>>>>>>>>>>> on set intervals, even the terminal becomes unresponsive  
>>>>>>>>>>> at times.
>>>>>>>>>>>
>>>>>>>>>>> I could understand that these problems would occur while
>>>>>>>>>
>>>>>>>>> accessing the
>>>>>>>>>>>
>>>>>>>>>>> nfs share directoiourry while the server is offline, but
>>>>>>>>>
>>>>>>>>> why for totally
>>>>>>>>>>>
>>>>>>>>>>> unrelated directories?
>>>>>>>>>>>
>>>>>>>>>>> I have experienced this behaviour on various distro's, and
>>>>>>>>>
>>>>>>>>> also found
>>>>>>>>>>>
>>>>>>>>>>> various bug reports on this issue, they don't seem to get  
>>>>>>>>>>> solved
>>>>>>>>>>> as
>>>>>>>>>>> this is viewed as nfs design.
>>>>>>>>>>> I see this as a flaw because clients are totally dependent  
>>>>>>>>>>> on the
>>>>>>>>>>> server. This would be less of a deal if the entire home  
>>>>>>>>>>> directory
>>>>>>>>>>> would be stored on nfs (although I even think some sort of
>>>>>>>>>>> synchronisation technology could and should be implemented  
>>>>>>>>>>> in this
>>>>>>>>>>> case). It is a bit odd that (technically) one machine  
>>>>>>>>>>> serving some
>>>>>>>>>>> "useless" files to a non-trivial directory on client
>>>>>>>>>
>>>>>>>>> machines can take
>>>>>>>>>>>
>>>>>>>>>>> down these client machines.
>>>>>>>>>>>
>>>>>>>>>>> For me the preferred functionality would be:
>>>>>>>>>>> *If an nfs server gets offline the client's nfs share  
>>>>>>>>>>> becomes
>>>>>>>>>>> unaccessible, but local directories and applications (that  
>>>>>>>>>>> only
>>>>>>>>>>> require local disk access) stay responsive.
>>>>>>>>>>> *If an nfs server gets online (after being offline while the
>>>>>>>>>>> client
>>>>>>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>>>>>>
>>>>>>>>>>> regards,
>>>>>>>>>>> Whoop
>>>>>>>>>>> --
>>>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>>
>>>>>>>>> linux-nfs" in
>>>>>>>>>>>
>>>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>> --
>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>> linux-nfs" in
>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>>>>
>>>>>>>>
>>>>>>> --
>>>>>>> To unsubscribe from this list: send the line "unsubscribe  
>>>>>>> linux-nfs"
>>>>>>> in
>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>>>
>>>>>> --
>>>>>> Chuck Lever
>>>>>> chuck[dot]lever[at]oracle[dot]com
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>
>>>> --
>>>> Chuck Lever
>>>> chuck[dot]lever[at]oracle[dot]com
>>>>
>>>>
>>>>
>>>>
>>> <stracesdiff.log>
>>
>> --
>> Chuck Lever
>> chuck[dot]lever[at]oracle[dot]com
>>
>>
>>
>>

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com




^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-25 21:26                                 ` Chuck Lever
@ 2010-01-25 23:03                                   ` Whoop Whouzer
  0 siblings, 0 replies; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-25 23:03 UTC (permalink / raw)
  To: Chuck Lever; +Cc: Muntz, Daniel, Peter Chacko, linux-nfs

strace -f gives me about the same as a normal strace (there is some
extra data concerning childs and there pid's but they do not seem to
give any information about nfs or attempting to access the
inaccessible nfs directory).
strace -p can't really give me any extra output as the processes
finish really fast.

On Mon, Jan 25, 2010 at 10:26 PM, Chuck Lever <chuck.lever@oracle.com> =
wrote:
> On Jan 25, 2010, at 4:18 PM, Whoop Whouzer wrote:
>>
>> Any idea how I could do that?
>
> The strace(1) man page says "-f" follows children. =A0In any event, y=
ou can
> strace the running children processes using "strace -p <pid>".
>
>> On Mon, Jan 25, 2010 at 10:01 PM, Chuck Lever <chuck.lever-QHcLZuEGTsthl2p70BpVqQ@public.gmane.org=
m>
>> wrote:
>>>
>>> On Jan 25, 2010, at 2:38 PM, Whoop Whouzer wrote:
>>>>
>>>> Running =A0"strace nautilus" gives me allot of output. When I run =
it
>>>> while the server is down it completes the trace without a hiccup, =
it
>>>> returns and than nautilus is launched and hangs.
>>>> There are differences between the traces (with server up and serve=
r
>>>> down). I can't really see where the problem lies in there.
>>>
>>> I would expect that the command-line nautilus forks when it starts =
up.
>>> =A0If
>>> it has some option you can specify to prevent that, it might allow =
a
>>> deeper
>>> look. =A0You would need to tell strace to look at the children, too=
=2E
>>>
>>>> On Mon, Jan 25, 2010 at 8:08 PM, Chuck Lever <chuck.lever@oracle.c=
om>
>>>> wrote:
>>>>>
>>>>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>>>>>
>>>>>> Ok, I did that, after shutting down the server and enabling debu=
g
>>>>>> trace I tried to open the home folder of the current account (to=
tally
>>>>>> unrelated to the nfsshare), it wouldn't open at all, I got no na=
utilus
>>>>>> at all. During the time my cursor was in busy mode I got the fol=
lowing
>>>>>> messages in kern.log (for ubuntu 10.04 client):
>>>>>> Jan 25 19:30:13 whoop-desktop kernel: [ =A0160.719262] NFS call =
=A0fsstat
>>>>>> Jan 25 19:30:37 whoop-desktop kernel: [ =A0184.458611] NFS:
>>>>>> permission(0:16/74386), mask=3D0x10, res=3D0
>>>>>> Jan 25 19:30:37 whoop-desktop kernel: [ =A0184.458647] NFS call =
=A0access
>>>>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721086] nfs: serv=
er
>>>>>> 192.168.1.130 not responding, timed out
>>>>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721113] NFS reply=
 statfs:
>>>>>> -5
>>>>>> Jan 25 19:30:43 whoop-desktop kernel: [ =A0190.721116] nfs_statf=
s:
>>>>>> statfs error =3D 5
>>>>>> These series of traces are repeating over and over again at a se=
t
>>>>>> interval (there is no flooding of the logs), even if I do nothin=
g.
>>>>>> It's even worse than I thought because when I tried to shutdown,=
 the
>>>>>> machine wouldn't shutdown because it claimed
>>>>>> the "File manager" was still running (although it was not visibl=
e on
>>>>>> screen); so I had to kill that before I could shutdown (properly=
).
>>>>>>
>>>>>> In Fedora 12 I had a similar user experience (nautilus did show =
up
>>>>>> without showing any contents and it was hanging). I had enabled
>>>>>> tracing and it seems to be logged to /var/log/messages. I got th=
is
>>>>>> output in fedora:
>>>>>> Jan 25 20:48:38 localhost kernel: NFS reply statfs: -5
>>>>>> Jan 25 20:48:38 localhost kernel: nfs_statfs: statfs error =3D 5
>>>>>> Jan 25 20:48:38 localhost kernel: NFS call =A0fsstat
>>>>>> Jan 25 20:49:14 localhost kernel: nfs: server 192.168.1.130 not
>>>>>> responding, timed out
>>>>>> Jan 25 20:49:14 localhost kernel: NFS reply getattr: -5
>>>>>> Jan 25 20:49:14 localhost kernel: nfs_revalidate_inode: (0:14/74=
386)
>>>>>> getattr failed, error=3D-5
>>>>>> Jan 25 20:49:25 localhost kernel: NFS: revalidating (0:14/74386)
>>>>>> Jan 25 20:49:25 localhost kernel: NFS call =A0getattr
>>>>>> Jan 25 20:50:14 localhost kernel: nfs: server 192.168.1.130 not
>>>>>> responding, timed out
>>>>>> Jan 25 20:50:14 localhost kernel: NFS reply access: -5
>>>>>> Jan 25 20:50:14 localhost kernel: NFS: permission(0:14/74386),
>>>>>> mask=3D0x1,
>>>>>> res=3D-5
>>>>>> Jan 25 20:50:14 localhost kernel: NFS call =A0access
>>>>>> Jan 25 20:51:14 localhost kernel: nfs: server 192.168.1.130 not
>>>>>> responding, timed out
>>>>>> Jan 25 20:51:14 localhost kernel: NFS reply statfs: -5
>>>>>> Jan 25 20:51:14 localhost kernel: nfs_statfs: statfs error =3D 5
>>>>>> Jan 25 20:51:14 localhost kernel: NFS call =A0fsstat
>>>>>> Most of the trace is repeating in set intervals as well, there i=
s no
>>>>>> flooding of the logs...
>>>>>> Fedora would not shutdown normally either
>>>>>
>>>>> This verifies that your client is attempting to access the NFS se=
rver,
>>>>> but
>>>>> doesn't tell us which file it's attempting to access. =A0Essentia=
lly the
>>>>> EIO
>>>>> means "failed to connect".
>>>>>
>>>>> Maybe try an strace of the nautilus process next?
>>>>>
>>>>>> On Mon, Jan 25, 2010 at 5:48 PM, Chuck Lever <chuck.lever@oracle=
=2Ecom>
>>>>>> wrote:
>>>>>>>
>>>>>>> On Jan 24, 2010, at 7:09 PM, Whoop Whouzer wrote:
>>>>>>>>
>>>>>>>> I did some network traces and there is nothing strange happeni=
ng as
>>>>>>>> far as I can tell. I shut down the server (some network traffi=
c
>>>>>>>> occurred as is to be expected). It got quiet again, I launched
>>>>>>>> nautilus, it got stuck without displaying anything and there w=
as no
>>>>>>>> real network activity except 3 broadcasts using the ARP protoc=
ol
>>>>>>>> asking where the server was (could be just coincidence).
>>>>>>>
>>>>>>> That sounds like the client does want to reconnect with the ser=
ver.
>>>>>>>
>>>>>>> You could try enabling debug tracing on your client (sudo rpcde=
bug -m
>>>>>>> nfs
>>>>>>> -s
>>>>>>> all) after shutting down your server, then try to start nautilu=
s.
>>>>>>> =A0The
>>>>>>> kernel log would then contain NFS-related messages that might
>>>>>>> indicate
>>>>>>> where
>>>>>>> to look next.
>>>>>>>
>>>>>>>> Closing
>>>>>>>> nautilus and launching it again will let it hang again but I s=
ee no
>>>>>>>> additional network traffic. After a while nautilus will displa=
y the
>>>>>>>> contents of the folder without any network traffic.
>>>>>>>>
>>>>>>>> On Sun, Jan 24, 2010 at 10:34 PM, Muntz, Daniel
>>>>>>>> <Dan.Muntz@netapp.com>
>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>> Perhaps something in your $PATH is in the NFS mount? =A0Do a =
network
>>>>>>>>> trace
>>>>>>>>> and maybe you can see if, in fact, there are actually NFS
>>>>>>>>> operations
>>>>>>>>> being
>>>>>>>>> attempted that you weren't expecting. =A0Then try to figure o=
ut why.
>>>>>>>>>
>>>>>>>>> =A0-Dan
>>>>>>>>>
>>>>>>>>>> -----Original Message-----
>>>>>>>>>> From: Whoop Whouzer [mailto:tiredandnumb@gmail.com]
>>>>>>>>>> Sent: Saturday, January 23, 2010 8:28 AM
>>>>>>>>>> To: Peter Chacko
>>>>>>>>>> Cc: linux-nfs@vger.kernel.org
>>>>>>>>>> Subject: Re: nfs client performance while server is down
>>>>>>>>>>
>>>>>>>>>> I don't remember all the different set-ups I tried it on, bu=
t I
>>>>>>>>>> just
>>>>>>>>>> confirmed this with the following combinations:
>>>>>>>>>>
>>>>>>>>>> ubuntu server 10.04 (alpha 2) --> ubuntu desktop 9.10, ubunt=
u
>>>>>>>>>> desktop
>>>>>>>>>> 10.04 (alpha 2), fedora 12
>>>>>>>>>> ubuntu server 9.10 --> ubuntu desktop 9.10, ubuntu desktop 1=
0.04
>>>>>>>>>> (alpha 2), fedora 12
>>>>>>>>>>
>>>>>>>>>> I'll be happy to test it on another client machine (distro) =
even
>>>>>>>>>> another server (although it would require a little more time=
)
>>>>>>>>>>
>>>>>>>>>> Here are some examples on the bugreports I noticed and how t=
hey do
>>>>>>>>>> not
>>>>>>>>>> seem to get solved:
>>>>>>>>>> https://bugzilla.redhat.com/show_bug.cgi?id=3D175283
>>>>>>>>>> https://bugs.launchpad.net/ubuntu/+source/nfs-utils/+bug/164=
120
>>>>>>>>>>
>>>>>>>>>> regards,
>>>>>>>>>> Whoop
>>>>>>>>>>
>>>>>>>>>> On Sat, Jan 23, 2010 at 4:57 PM, Peter Chacko
>>>>>>>>>> <peterchacko35@gmail.com> wrote:
>>>>>>>>>>>
>>>>>>>>>>> Which client OS you observed this behavior ? =A0This has no=
thing to
>>>>>>>>>>> do
>>>>>>>>>>> NFS design, and its purely stateless...Its upto the client =
OS
>>>>>>>>>>> implementation about aspects like how to deal with local
>>>>>>>>>>
>>>>>>>>>> IO, when NFS
>>>>>>>>>>>
>>>>>>>>>>> share gets =A0disconnected..
>>>>>>>>>>>
>>>>>>>>>>> May be a VFS bug on the local OS you found this problem ..
>>>>>>>>>>>
>>>>>>>>>>> thanks
>>>>>>>>>>>
>>>>>>>>>>> On Sat, Jan 23, 2010 at 9:15 PM, Whoop Whouzer
>>>>>>>>>>
>>>>>>>>>> <tiredandnumb@gmail.com> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>> Howdy,
>>>>>>>>>>>>
>>>>>>>>>>>> I was wondering why nfs is designed in such a way that the
>>>>>>>>>>
>>>>>>>>>> performance
>>>>>>>>>>>>
>>>>>>>>>>>> of an nfs client machine gets very bad when the nfs server
>>>>>>>>>>
>>>>>>>>>> is offline?
>>>>>>>>>>>>
>>>>>>>>>>>> This is even the case with a soft mount (either via mount
>>>>>>>>>>
>>>>>>>>>> or fstab).
>>>>>>>>>>>>
>>>>>>>>>>>> Just about every application that requires disk access (no=
t
>>>>>>>>>>>> talking
>>>>>>>>>>>> about nfs share acces) gets really slow to unresponsive.
>>>>>>>>>>
>>>>>>>>>> For instance
>>>>>>>>>>>>
>>>>>>>>>>>> nautilus becomes unresponsive when displaying the contents=
 of
>>>>>>>>>>>> any
>>>>>>>>>>>> folder on the local disk,
>>>>>>>>>>>> playing movie files (stored on local disk) let totem or
>>>>>>>>>>
>>>>>>>>>> vlc get stuck
>>>>>>>>>>>>
>>>>>>>>>>>> on set intervals, even the terminal becomes unresponsive a=
t
>>>>>>>>>>>> times.
>>>>>>>>>>>>
>>>>>>>>>>>> I could understand that these problems would occur while
>>>>>>>>>>
>>>>>>>>>> accessing the
>>>>>>>>>>>>
>>>>>>>>>>>> nfs share directoiourry while the server is offline, but
>>>>>>>>>>
>>>>>>>>>> why for totally
>>>>>>>>>>>>
>>>>>>>>>>>> unrelated directories?
>>>>>>>>>>>>
>>>>>>>>>>>> I have experienced this behaviour on various distro's, and
>>>>>>>>>>
>>>>>>>>>> also found
>>>>>>>>>>>>
>>>>>>>>>>>> various bug reports on this issue, they don't seem to get =
solved
>>>>>>>>>>>> as
>>>>>>>>>>>> this is viewed as nfs design.
>>>>>>>>>>>> I see this as a flaw because clients are totally dependent=
 on
>>>>>>>>>>>> the
>>>>>>>>>>>> server. This would be less of a deal if the entire home
>>>>>>>>>>>> directory
>>>>>>>>>>>> would be stored on nfs (although I even think some sort of
>>>>>>>>>>>> synchronisation technology could and should be implemented=
 in
>>>>>>>>>>>> this
>>>>>>>>>>>> case). It is a bit odd that (technically) one machine serv=
ing
>>>>>>>>>>>> some
>>>>>>>>>>>> "useless" files to a non-trivial directory on client
>>>>>>>>>>
>>>>>>>>>> machines can take
>>>>>>>>>>>>
>>>>>>>>>>>> down these client machines.
>>>>>>>>>>>>
>>>>>>>>>>>> For me the preferred functionality would be:
>>>>>>>>>>>> *If an nfs server gets offline the client's nfs share beco=
mes
>>>>>>>>>>>> unaccessible, but local directories and applications (that=
 only
>>>>>>>>>>>> require local disk access) stay responsive.
>>>>>>>>>>>> *If an nfs server gets online (after being offline while t=
he
>>>>>>>>>>>> client
>>>>>>>>>>>> has not been restarted) the nfs share becomes reconnected.
>>>>>>>>>>>>
>>>>>>>>>>>> regards,
>>>>>>>>>>>> Whoop
>>>>>>>>>>>> --
>>>>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>>>
>>>>>>>>>> linux-nfs" in
>>>>>>>>>>>>
>>>>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>>>>> More majordomo info at
>>>>>>>>>>>> =A0http://vger.kernel.org/majordomo-info.html
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> To unsubscribe from this list: send the line "unsubscribe
>>>>>>>>>> linux-nfs" in
>>>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-i=
nfo.html
>>>>>>>>>>
>>>>>>>>>
>>>>>>>> --
>>>>>>>> To unsubscribe from this list: send the line "unsubscribe linu=
x-nfs"
>>>>>>>> in
>>>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>>>> More majordomo info at =A0http://vger.kernel.org/majordomo-inf=
o.html
>>>>>>>
>>>>>>> --
>>>>>>> Chuck Lever
>>>>>>> chuck[dot]lever[at]oracle[dot]com
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>
>>>>> --
>>>>> Chuck Lever
>>>>> chuck[dot]lever[at]oracle[dot]com
>>>>>
>>>>>
>>>>>
>>>>>
>>>> <stracesdiff.log>
>>>
>>> --
>>> Chuck Lever
>>> chuck[dot]lever[at]oracle[dot]com
>>>
>>>
>>>
>>>
>
> --
> Chuck Lever
> chuck[dot]lever[at]oracle[dot]com
>
>
>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-25 19:08                     ` Chuck Lever
       [not found]                       ` <d7f0b3a81001251138h30e25428o25db9bc8c0884636@mail.gmail.com>
@ 2010-01-26 23:21                       ` J. Bruce Fields
  2010-01-27  0:40                         ` Whoop Whouzer
  2010-01-27 18:23                         ` Chuck Lever
  1 sibling, 2 replies; 24+ messages in thread
From: J. Bruce Fields @ 2010-01-26 23:21 UTC (permalink / raw)
  To: Chuck Lever; +Cc: Whoop Whouzer, Muntz, Daniel, Peter Chacko, linux-nfs

On Mon, Jan 25, 2010 at 02:08:47PM -0500, Chuck Lever wrote:
> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>> Ok, I did that, after shutting down the server and enabling debug
>> trace I tried to open the home folder of the current account (totally
>> unrelated to the nfsshare), it wouldn't open at all, I got no nautilus
>> at all. During the time my cursor was in busy mode I got the following
>> messages in kern.log (for ubuntu 10.04 client):
>> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call  fsstat
>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
>> permission(0:16/74386), mask=0x10, res=0
>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call  access
>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
>> 192.168.1.130 not responding, timed out
>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply statfs: 
>> -5
>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
>> statfs error = 5
...
> This verifies that your client is attempting to access the NFS server,  
> but doesn't tell us which file it's attempting to access.  Essentially  
> the EIO means "failed to connect".

I wonder if nautilus (or some library it uses) likes to regularly
"statfs" all the filesystems it knows about?

--b.

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-26 23:21                       ` J. Bruce Fields
@ 2010-01-27  0:40                         ` Whoop Whouzer
  2010-01-27 17:10                           ` J. Bruce Fields
  2010-01-27 18:23                         ` Chuck Lever
  1 sibling, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-27  0:40 UTC (permalink / raw)
  To: J. Bruce Fields; +Cc: Chuck Lever, Muntz, Daniel, Peter Chacko, linux-nfs

Could be, although not very likely as it was also reported happening
with thunar (although I have not tested this myself).
But I am also experiencing similar problems with other applications
even gnome-terminal (basically all applications requiring (local) disk
access).
So this would led me to think it is some sub-process, that is used by
all application requiring disk access, that is to blame...

On Wed, Jan 27, 2010 at 12:21 AM, J. Bruce Fields <bfields@fieldses.org> wrote:
> On Mon, Jan 25, 2010 at 02:08:47PM -0500, Chuck Lever wrote:
>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>> Ok, I did that, after shutting down the server and enabling debug
>>> trace I tried to open the home folder of the current account (totally
>>> unrelated to the nfsshare), it wouldn't open at all, I got no nautilus
>>> at all. During the time my cursor was in busy mode I got the following
>>> messages in kern.log (for ubuntu 10.04 client):
>>> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call  fsstat
>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
>>> permission(0:16/74386), mask=0x10, res=0
>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call  access
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
>>> 192.168.1.130 not responding, timed out
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply statfs:
>>> -5
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
>>> statfs error = 5
> ...
>> This verifies that your client is attempting to access the NFS server,
>> but doesn't tell us which file it's attempting to access.  Essentially
>> the EIO means "failed to connect".
>
> I wonder if nautilus (or some library it uses) likes to regularly
> "statfs" all the filesystems it knows about?
>
> --b.
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27  0:40                         ` Whoop Whouzer
@ 2010-01-27 17:10                           ` J. Bruce Fields
  0 siblings, 0 replies; 24+ messages in thread
From: J. Bruce Fields @ 2010-01-27 17:10 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: Chuck Lever, Muntz, Daniel, Peter Chacko, linux-nfs

On Wed, Jan 27, 2010 at 01:40:07AM +0100, Whoop Whouzer wrote:
> Could be, although not very likely as it was also reported happening
> with thunar (although I have not tested this myself).
> But I am also experiencing similar problems with other applications
> even gnome-terminal (basically all applications requiring (local) disk
> access).
> So this would led me to think it is some sub-process, that is used by
> all application requiring disk access, that is to blame...

You could patch the kernel to add printk()'s in statfs showing who is
calling it (and with what path).

But probably there's some tracing infrastructure that would make this
possible without patching.

--b.

> 
> On Wed, Jan 27, 2010 at 12:21 AM, J. Bruce Fields <bfields@fieldses.org> wrote:
> > On Mon, Jan 25, 2010 at 02:08:47PM -0500, Chuck Lever wrote:
> >> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
> >>> Ok, I did that, after shutting down the server and enabling debug
> >>> trace I tried to open the home folder of the current account (totally
> >>> unrelated to the nfsshare), it wouldn't open at all, I got no nautilus
> >>> at all. During the time my cursor was in busy mode I got the following
> >>> messages in kern.log (for ubuntu 10.04 client):
> >>> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call  fsstat
> >>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
> >>> permission(0:16/74386), mask=0x10, res=0
> >>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call  access
> >>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
> >>> 192.168.1.130 not responding, timed out
> >>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply statfs:
> >>> -5
> >>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
> >>> statfs error = 5
> > ...
> >> This verifies that your client is attempting to access the NFS server,
> >> but doesn't tell us which file it's attempting to access.  Essentially
> >> the EIO means "failed to connect".
> >
> > I wonder if nautilus (or some library it uses) likes to regularly
> > "statfs" all the filesystems it knows about?
> >
> > --b.
> >

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-26 23:21                       ` J. Bruce Fields
  2010-01-27  0:40                         ` Whoop Whouzer
@ 2010-01-27 18:23                         ` Chuck Lever
  2010-01-27 18:40                           ` Trond Myklebust
  1 sibling, 1 reply; 24+ messages in thread
From: Chuck Lever @ 2010-01-27 18:23 UTC (permalink / raw)
  To: Whoop Whouzer; +Cc: J. Bruce Fields, Muntz, Daniel, Peter Chacko, linux-nfs

On 01/26/2010 06:21 PM, J. Bruce Fields wrote:
> On Mon, Jan 25, 2010 at 02:08:47PM -0500, Chuck Lever wrote:
>> On Jan 25, 2010, at 2:02 PM, Whoop Whouzer wrote:
>>> Ok, I did that, after shutting down the server and enabling debug
>>> trace I tried to open the home folder of the current account (totally
>>> unrelated to the nfsshare), it wouldn't open at all, I got no nautilus
>>> at all. During the time my cursor was in busy mode I got the following
>>> messages in kern.log (for ubuntu 10.04 client):
>>> Jan 25 19:30:13 whoop-desktop kernel: [  160.719262] NFS call  fsstat
>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458611] NFS:
>>> permission(0:16/74386), mask=0x10, res=0
>>> Jan 25 19:30:37 whoop-desktop kernel: [  184.458647] NFS call  access
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721086] nfs: server
>>> 192.168.1.130 not responding, timed out
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721113] NFS reply statfs:
>>> -5
>>> Jan 25 19:30:43 whoop-desktop kernel: [  190.721116] nfs_statfs:
>>> statfs error = 5
> ...
>> This verifies that your client is attempting to access the NFS server,
>> but doesn't tell us which file it's attempting to access.  Essentially
>> the EIO means "failed to connect".
>
> I wonder if nautilus (or some library it uses) likes to regularly
> "statfs" all the filesystems it knows about?

The NFS client seems to like to send these periodically, but I've never 
looked into why.  It's probably triggered by some cache timeout, and 
gathers recent server file system information.

The ACCESS command is for a particular file, however.  That's probably 
where we will get the most interesting and specific information.  A 
network trace would capture the FH in the ACCESS request.  When the 
server is up, you could match that FH to other requests where the actual 
pathname of the file is known.

Simply run wireshark on your client, and it should automatically sniff 
the FH information.  Wireshark would need to be running while the server 
is up and continue running after the server is taken down.

-- 
chuck[dot]lever[at]oracle[dot]com

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27 18:23                         ` Chuck Lever
@ 2010-01-27 18:40                           ` Trond Myklebust
  2010-01-27 18:47                             ` Whoop Whouzer
  0 siblings, 1 reply; 24+ messages in thread
From: Trond Myklebust @ 2010-01-27 18:40 UTC (permalink / raw)
  To: Chuck Lever
  Cc: Whoop Whouzer, J. Bruce Fields, Muntz, Daniel, Peter Chacko,
	linux-nfs

On Wed, 2010-01-27 at 13:23 -0500, Chuck Lever wrote: 
> On 01/26/2010 06:21 PM, J. Bruce Fields wrote:
> > I wonder if nautilus (or some library it uses) likes to regularly
> > "statfs" all the filesystems it knows about?
> 
> The NFS client seems to like to send these periodically, but I've never 
> looked into why.  It's probably triggered by some cache timeout, and 
> gathers recent server file system information.

No. It is entirely application driven. Furthermore, most of the statfs
data is uncached, since it should not be performance critical in any
sane application environment.

IOW: I agree with Bruce that this is most likely GNOME or nautilus
triggering statfs calls. Indeed, when I do actually open a window on
some directory it also appears to display the free space.

Trond



^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27 18:40                           ` Trond Myklebust
@ 2010-01-27 18:47                             ` Whoop Whouzer
  2010-01-27 19:09                               ` Trond Myklebust
  0 siblings, 1 reply; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-27 18:47 UTC (permalink / raw)
  To: Trond Myklebust
  Cc: Chuck Lever, J. Bruce Fields, Muntz, Daniel, Peter Chacko,
	linux-nfs

ok, but it's not just GNOME/nautilus behaviour. For one, I am
experiencing problems with just about all applications that require
(local) disk access. Furthermore, problems have also been reported
with xfce/thunar and also with KDE.

A bug for this issue has just been created for xfce/thunar:
http://bugzilla.xfce.org/show_bug.cgi?id=6185

On Wed, Jan 27, 2010 at 7:40 PM, Trond Myklebust
<trond.myklebust@fys.uio.no> wrote:
> On Wed, 2010-01-27 at 13:23 -0500, Chuck Lever wrote:
>> On 01/26/2010 06:21 PM, J. Bruce Fields wrote:
>> > I wonder if nautilus (or some library it uses) likes to regularly
>> > "statfs" all the filesystems it knows about?
>>
>> The NFS client seems to like to send these periodically, but I've never
>> looked into why.  It's probably triggered by some cache timeout, and
>> gathers recent server file system information.
>
> No. It is entirely application driven. Furthermore, most of the statfs
> data is uncached, since it should not be performance critical in any
> sane application environment.
>
> IOW: I agree with Bruce that this is most likely GNOME or nautilus
> triggering statfs calls. Indeed, when I do actually open a window on
> some directory it also appears to display the free space.
>
> Trond
>
>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27 18:47                             ` Whoop Whouzer
@ 2010-01-27 19:09                               ` Trond Myklebust
  2010-01-27 19:25                                 ` Whoop Whouzer
  0 siblings, 1 reply; 24+ messages in thread
From: Trond Myklebust @ 2010-01-27 19:09 UTC (permalink / raw)
  To: Whoop Whouzer
  Cc: Chuck Lever, J. Bruce Fields, Muntz, Daniel, Peter Chacko,
	linux-nfs

So? I don't see why that would be an NFS problem.

As far as I can see from this thread, you are basically asking us to fix
these broken applications by implementing a "disconnected NFS" mode.
While that may indeed be a cool thing to support, I haven't seen anybody
so far stepping up and saying that they have the time and resources to
work on it. Are you volunteering?

Trond

On Wed, 2010-01-27 at 19:47 +0100, Whoop Whouzer wrote: 
> ok, but it's not just GNOME/nautilus behaviour. For one, I am
> experiencing problems with just about all applications that require
> (local) disk access. Furthermore, problems have also been reported
> with xfce/thunar and also with KDE.
> 
> A bug for this issue has just been created for xfce/thunar:
> http://bugzilla.xfce.org/show_bug.cgi?id=6185
> 
> On Wed, Jan 27, 2010 at 7:40 PM, Trond Myklebust
> <trond.myklebust@fys.uio.no> wrote:
> > On Wed, 2010-01-27 at 13:23 -0500, Chuck Lever wrote:
> >> On 01/26/2010 06:21 PM, J. Bruce Fields wrote:
> >> > I wonder if nautilus (or some library it uses) likes to regularly
> >> > "statfs" all the filesystems it knows about?
> >>
> >> The NFS client seems to like to send these periodically, but I've never
> >> looked into why.  It's probably triggered by some cache timeout, and
> >> gathers recent server file system information.
> >
> > No. It is entirely application driven. Furthermore, most of the statfs
> > data is uncached, since it should not be performance critical in any
> > sane application environment.
> >
> > IOW: I agree with Bruce that this is most likely GNOME or nautilus
> > triggering statfs calls. Indeed, when I do actually open a window on
> > some directory it also appears to display the free space.
> >
> > Trond
> >
> >
> >




^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27 19:09                               ` Trond Myklebust
@ 2010-01-27 19:25                                 ` Whoop Whouzer
  2010-01-27 19:30                                   ` Ray Van Dolson
  2010-01-27 19:31                                   ` Peter Staubach
  0 siblings, 2 replies; 24+ messages in thread
From: Whoop Whouzer @ 2010-01-27 19:25 UTC (permalink / raw)
  To: Trond Myklebust
  Cc: Chuck Lever, J. Bruce Fields, Muntz, Daniel, Peter Chacko,
	linux-nfs

I am not stating this is an NFS problem at all. I am not asking anybody to fix
anything.
I asked if this issue was by design. I was told it wasn't (as nfs is stateless).
So, therefore I considered it as a bug (which I don't believe to
reside in either nfs or nautilus). I am just trying to figure out
where the problem lies.

I am not talking about implementing "disconnected NFS" mode,
synchronisation or anything like that. There is not something missing,
there is something not working properly, somewhere, and I'm trying to
find out where..

On Wed, Jan 27, 2010 at 8:09 PM, Trond Myklebust
<trond.myklebust@fys.uio.no> wrote:
> So? I don't see why that would be an NFS problem.
>
> As far as I can see from this thread, you are basically asking us to fix
> these broken applications by implementing a "disconnected NFS" mode.
> While that may indeed be a cool thing to support, I haven't seen anybody
> so far stepping up and saying that they have the time and resources to
> work on it. Are you volunteering?
>
> Trond
>
> On Wed, 2010-01-27 at 19:47 +0100, Whoop Whouzer wrote:
>> ok, but it's not just GNOME/nautilus behaviour. For one, I am
>> experiencing problems with just about all applications that require
>> (local) disk access. Furthermore, problems have also been reported
>> with xfce/thunar and also with KDE.
>>
>> A bug for this issue has just been created for xfce/thunar:
>> http://bugzilla.xfce.org/show_bug.cgi?id=6185
>>
>> On Wed, Jan 27, 2010 at 7:40 PM, Trond Myklebust
>> <trond.myklebust@fys.uio.no> wrote:
>> > On Wed, 2010-01-27 at 13:23 -0500, Chuck Lever wrote:
>> >> On 01/26/2010 06:21 PM, J. Bruce Fields wrote:
>> >> > I wonder if nautilus (or some library it uses) likes to regularly
>> >> > "statfs" all the filesystems it knows about?
>> >>
>> >> The NFS client seems to like to send these periodically, but I've never
>> >> looked into why.  It's probably triggered by some cache timeout, and
>> >> gathers recent server file system information.
>> >
>> > No. It is entirely application driven. Furthermore, most of the statfs
>> > data is uncached, since it should not be performance critical in any
>> > sane application environment.
>> >
>> > IOW: I agree with Bruce that this is most likely GNOME or nautilus
>> > triggering statfs calls. Indeed, when I do actually open a window on
>> > some directory it also appears to display the free space.
>> >
>> > Trond
>> >
>> >
>> >
>
>
>
>

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27 19:25                                 ` Whoop Whouzer
@ 2010-01-27 19:30                                   ` Ray Van Dolson
  2010-01-27 19:31                                   ` Peter Staubach
  1 sibling, 0 replies; 24+ messages in thread
From: Ray Van Dolson @ 2010-01-27 19:30 UTC (permalink / raw)
  To: Whoop Whouzer
  Cc: Trond Myklebust, Chuck Lever, J. Bruce Fields, Muntz, Daniel,
	Peter Chacko, linux-nfs@vger.kernel.org

On Wed, Jan 27, 2010 at 11:25:39AM -0800, Whoop Whouzer wrote:
> I am not stating this is an NFS problem at all. I am not asking
> anybody to fix anything.  I asked if this issue was by design. I was
> told it wasn't (as nfs is stateless).  So, therefore I considered it
> as a bug (which I don't believe to reside in either nfs or nautilus).
> I am just trying to figure out where the problem lies.
> 
> I am not talking about implementing "disconnected NFS" mode,
> synchronisation or anything like that. There is not something
> missing, there is something not working properly, somewhere, and I'm
> trying to find out where..

My impression is that this is "by design" in that NFS mounts, when
mounted in "hard" mode (which is the case by default) will "block"
until the remote server responds.

For the most part this is a good thing.  Applications expect their
filesystem calls to behave a certain way regardless of what type of
filesystem is underneath.

In this case, it seems like Nautilus tries to open the mount point and
it just hangs... forever.  This would be expected with an NFS mount in
my view.

One way I could think of getting around it would be to ensure that the
NFS mount is mounted with "intr", and then get Nautilus to monitor how
long it takes to read a mount point and "terminate" after a timeout is
reached, perhaps flagging that mount so future accesses are quicker.

Anyways, that goes beyond the scope of NFS... and good luck convincing
the GNOME developers to make a change like that. :)

Also, even if you mount an NFS share with "intr", you can't always
guarantee that you'll be able to kill a process trying to access said
mount..... at least in my experience.

Ray

^ permalink raw reply	[flat|nested] 24+ messages in thread

* Re: nfs client performance while server is down
  2010-01-27 19:25                                 ` Whoop Whouzer
  2010-01-27 19:30                                   ` Ray Van Dolson
@ 2010-01-27 19:31                                   ` Peter Staubach
  1 sibling, 0 replies; 24+ messages in thread
From: Peter Staubach @ 2010-01-27 19:31 UTC (permalink / raw)
  To: Whoop Whouzer
  Cc: Trond Myklebust, Chuck Lever, J. Bruce Fields, Muntz, Daniel,
	Peter Chacko, linux-nfs

Whoop Whouzer wrote:
> I am not stating this is an NFS problem at all. I am not asking anybody to fix
> anything.
> I asked if this issue was by design. I was told it wasn't (as nfs is stateless).
> So, therefore I considered it as a bug (which I don't believe to
> reside in either nfs or nautilus). I am just trying to figure out
> where the problem lies.
> 

There is a misconception here.  NFS is not stateless.  To
be accurate, the NFSv2 and NFSv3 protocols were defined in
such as to allow the NFS server to be stateless.  The server
was not supposed to be required to remember anything about
what a client was doing from operation to the next.  (In
reality, there are non-idempotent operations, ie. operations
which can not be done twice and get the same results, so it
is very helpful if the server remembers some state.)

NFS clients have always been _very_ stateful.  They have to
know about all mounted file systems, open files, current
directories, etc.

The problem here is some application which is attempting to
touch all of the mounted file systems.  When it tries to
touch one from a non-responsive NFS server, then it hangs.
This represents an architectural problem with the
application making an assumption that is okay and acceptable
to access all file systems which are currently mounted.
This assumption leads to situations such as you are observing.

This isn't new.

		ps

> I am not talking about implementing "disconnected NFS" mode,
> synchronisation or anything like that. There is not something missing,
> there is something not working properly, somewhere, and I'm trying to
> find out where..
> 
> On Wed, Jan 27, 2010 at 8:09 PM, Trond Myklebust
> <trond.myklebust@fys.uio.no> wrote:
>> So? I don't see why that would be an NFS problem.
>>
>> As far as I can see from this thread, you are basically asking us to fix
>> these broken applications by implementing a "disconnected NFS" mode.
>> While that may indeed be a cool thing to support, I haven't seen anybody
>> so far stepping up and saying that they have the time and resources to
>> work on it. Are you volunteering?
>>
>> Trond
>>
>> On Wed, 2010-01-27 at 19:47 +0100, Whoop Whouzer wrote:
>>> ok, but it's not just GNOME/nautilus behaviour. For one, I am
>>> experiencing problems with just about all applications that require
>>> (local) disk access. Furthermore, problems have also been reported
>>> with xfce/thunar and also with KDE.
>>>
>>> A bug for this issue has just been created for xfce/thunar:
>>> http://bugzilla.xfce.org/show_bug.cgi?id=6185
>>>
>>> On Wed, Jan 27, 2010 at 7:40 PM, Trond Myklebust
>>> <trond.myklebust@fys.uio.no> wrote:
>>>> On Wed, 2010-01-27 at 13:23 -0500, Chuck Lever wrote:
>>>>> On 01/26/2010 06:21 PM, J. Bruce Fields wrote:
>>>>>> I wonder if nautilus (or some library it uses) likes to regularly
>>>>>> "statfs" all the filesystems it knows about?
>>>>> The NFS client seems to like to send these periodically, but I've never
>>>>> looked into why.  It's probably triggered by some cache timeout, and
>>>>> gathers recent server file system information.
>>>> No. It is entirely application driven. Furthermore, most of the statfs
>>>> data is uncached, since it should not be performance critical in any
>>>> sane application environment.
>>>>
>>>> IOW: I agree with Bruce that this is most likely GNOME or nautilus
>>>> triggering statfs calls. Indeed, when I do actually open a window on
>>>> some directory it also appears to display the free space.
>>>>
>>>> Trond
>>>>
>>>>
>>>>
>>
>>
>>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


^ permalink raw reply	[flat|nested] 24+ messages in thread

end of thread, other threads:[~2010-01-27 19:32 UTC | newest]

Thread overview: 24+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-01-23 15:45 nfs client performance while server is down Whoop Whouzer
     [not found] ` <d7f0b3a81001230745h18dbb14fi42f28adff0c45294-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-23 15:57   ` Peter Chacko
2010-01-23 16:27     ` Whoop Whouzer
     [not found]       ` <d7f0b3a81001230827y52727993nf60210ae610643b7-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-24 21:34         ` Muntz, Daniel
     [not found]           ` <7A24DF798E223B4C9864E8F92E8C93EC0527810C-hX7t0kiaRRpT+ZUat5FNkAK/GNPrWCqfQQ4Iyu8u01E@public.gmane.org>
2010-01-24 22:03             ` Whoop Whouzer
2010-01-25  0:09             ` Whoop Whouzer
2010-01-25 16:48               ` Chuck Lever
2010-01-25 19:02                 ` Whoop Whouzer
     [not found]                   ` <d7f0b3a81001251102p5e631706jfd9f147a00487061-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-25 19:08                     ` Chuck Lever
     [not found]                       ` <d7f0b3a81001251138h30e25428o25db9bc8c0884636@mail.gmail.com>
     [not found]                         ` <d7f0b3a81001251138h30e25428o25db9bc8c0884636-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-25 19:48                           ` Whoop Whouzer
2010-01-25 21:01                           ` Chuck Lever
2010-01-25 21:18                             ` Whoop Whouzer
     [not found]                               ` <d7f0b3a81001251318k42de9be2qe54f83bbd86cabb8-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2010-01-25 21:26                                 ` Chuck Lever
2010-01-25 23:03                                   ` Whoop Whouzer
2010-01-26 23:21                       ` J. Bruce Fields
2010-01-27  0:40                         ` Whoop Whouzer
2010-01-27 17:10                           ` J. Bruce Fields
2010-01-27 18:23                         ` Chuck Lever
2010-01-27 18:40                           ` Trond Myklebust
2010-01-27 18:47                             ` Whoop Whouzer
2010-01-27 19:09                               ` Trond Myklebust
2010-01-27 19:25                                 ` Whoop Whouzer
2010-01-27 19:30                                   ` Ray Van Dolson
2010-01-27 19:31                                   ` Peter Staubach

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox