* Poor NFS client performance on 2.4.18? @ 2002-04-25 20:49 Dan Yocum 2002-04-26 13:14 ` Trond Myklebust 0 siblings, 1 reply; 11+ messages in thread From: Dan Yocum @ 2002-04-25 20:49 UTC (permalink / raw) To: linux kernel Trond, et al. I'm getting poor NFS performance (~250KBps read and write) on 2.4.18 and am wondering if I'm the only one. There is no performance drop under other OSs or other kernel versions, so I don't think it's the server. Here's the the details: 2.4.18 patched with: NFS client patches (linux-2.4.18-NFS_ALL.dif) xfs-1.1-PR1-2.4.18-all.patch Ingo's Foster IRQ patch (these are dual Xeons) If you need any more details, let me know. Thanks, Dan -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-04-25 20:49 Poor NFS client performance on 2.4.18? Dan Yocum @ 2002-04-26 13:14 ` Trond Myklebust 2002-04-26 14:51 ` Dan Yocum 2002-05-06 22:05 ` Dan Yocum 0 siblings, 2 replies; 11+ messages in thread From: Trond Myklebust @ 2002-04-26 13:14 UTC (permalink / raw) To: Dan Yocum; +Cc: linux kernel >>>>> " " == Dan Yocum <yocum@fnal.gov> writes: > Trond, et al. I'm getting poor NFS performance (~250KBps read > and write) on 2.4.18 and am wondering if I'm the only one. > There is no performance drop under other OSs or other kernel > versions, so I don't think it's the server. > Here's the the details: > 2.4.18 patched with: > NFS client patches (linux-2.4.18-NFS_ALL.dif) > xfs-1.1-PR1-2.4.18-all.patch Ingo's Foster IRQ patch (these > are dual Xeons) > If you need any more details, let me know. The latest NFS_ALL patches include experimental code that changes the UDP congestion control. I'm basically trying to relax the algorithm to what is standard on *BSD (i.e. we follow the standard Van Jacobson). This would mean that we don't wait for the reply from the server before we send off the next request. Unfortunately, there appears to be a lot of setups out there that start to drop packets when this occurs, and I haven't yet finished determining the root cause. If I can manage to get my laptop to work again, I'll try to investigate a bit more this weekend... Cheers, Trond ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-04-26 13:14 ` Trond Myklebust @ 2002-04-26 14:51 ` Dan Yocum 2002-04-28 19:29 ` Trond Myklebust 2002-05-06 22:05 ` Dan Yocum 1 sibling, 1 reply; 11+ messages in thread From: Dan Yocum @ 2002-04-26 14:51 UTC (permalink / raw) To: Trond Myklebust; +Cc: linux kernel Trond, So, would I be correct in assuming that backing out the linux-2.4.18-ping.dif patch would solve the problem in the short term? Thanks, Dan Trond Myklebust wrote: > > >>>>> " " == Dan Yocum <yocum@fnal.gov> writes: > > > Trond, et al. I'm getting poor NFS performance (~250KBps read > > and write) on 2.4.18 and am wondering if I'm the only one. > > There is no performance drop under other OSs or other kernel > > versions, so I don't think it's the server. > > > Here's the the details: > > > 2.4.18 patched with: > > NFS client patches (linux-2.4.18-NFS_ALL.dif) > > xfs-1.1-PR1-2.4.18-all.patch Ingo's Foster IRQ patch (these > > are dual Xeons) > > > If you need any more details, let me know. > > The latest NFS_ALL patches include experimental code that changes the > UDP congestion control. I'm basically trying to relax the algorithm to > what is standard on *BSD (i.e. we follow the standard Van Jacobson). > > This would mean that we don't wait for the reply from the server > before we send off the next request. Unfortunately, there appears to > be a lot of setups out there that start to drop packets when this > occurs, and I haven't yet finished determining the root cause. > > If I can manage to get my laptop to work again, I'll try to > investigate a bit more this weekend... > > Cheers, > Trond -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-04-26 14:51 ` Dan Yocum @ 2002-04-28 19:29 ` Trond Myklebust 0 siblings, 0 replies; 11+ messages in thread From: Trond Myklebust @ 2002-04-28 19:29 UTC (permalink / raw) To: Dan Yocum; +Cc: linux kernel >>>>> " " == Dan Yocum <yocum@fnal.gov> writes: > Trond, So, would I be correct in assuming that backing out the > linux-2.4.18-ping.dif patch would solve the problem in the > short term? Not ping, but linux-2.4.18-rpc_tweaks.dif... Cheers, Trond ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-04-26 13:14 ` Trond Myklebust 2002-04-26 14:51 ` Dan Yocum @ 2002-05-06 22:05 ` Dan Yocum 2002-05-07 7:28 ` Trond Myklebust 1 sibling, 1 reply; 11+ messages in thread From: Dan Yocum @ 2002-05-06 22:05 UTC (permalink / raw) To: Trond Myklebust; +Cc: linux kernel Trond, OK, so backing out the rpc_tweaks dif fixed the performance problem, however, seems to have introduced another problem that appears to be stemming from the seekdir.dif. Attempting to run an app from an IRIX client (that has the 32bitclients option set) freezes the NFS volume - one can't access it from the Linux side, anymore. You can read and write to the NFS volume *before* trying to run something from there, but not after. Ideas? Thanks, Dan Trond Myklebust wrote: > > >>>>> " " == Dan Yocum <yocum@fnal.gov> writes: > > > Trond, et al. I'm getting poor NFS performance (~250KBps read > > and write) on 2.4.18 and am wondering if I'm the only one. > > There is no performance drop under other OSs or other kernel > > versions, so I don't think it's the server. > > > Here's the the details: > > > 2.4.18 patched with: > > NFS client patches (linux-2.4.18-NFS_ALL.dif) > > xfs-1.1-PR1-2.4.18-all.patch Ingo's Foster IRQ patch (these > > are dual Xeons) > > > If you need any more details, let me know. > > The latest NFS_ALL patches include experimental code that changes the > UDP congestion control. I'm basically trying to relax the algorithm to > what is standard on *BSD (i.e. we follow the standard Van Jacobson). > > This would mean that we don't wait for the reply from the server > before we send off the next request. Unfortunately, there appears to > be a lot of setups out there that start to drop packets when this > occurs, and I haven't yet finished determining the root cause. > > If I can manage to get my laptop to work again, I'll try to > investigate a bit more this weekend... > > Cheers, > Trond -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-05-06 22:05 ` Dan Yocum @ 2002-05-07 7:28 ` Trond Myklebust 2002-05-07 15:32 ` Dan Yocum 0 siblings, 1 reply; 11+ messages in thread From: Trond Myklebust @ 2002-05-07 7:28 UTC (permalink / raw) To: Dan Yocum; +Cc: linux kernel On Tuesday 7. May 2002 00:05, Dan Yocum wrote: > Trond, > > OK, so backing out the rpc_tweaks dif fixed the performance problem, > however, seems to have introduced another problem that appears to be > stemming from the seekdir.dif. Attempting to run an app from an IRIX > client (that has the 32bitclients option set) freezes the NFS volume - one > can't access it from the Linux side, anymore. > > You can read and write to the NFS volume *before* trying to run something > from there, but not after. > > Ideas? That smells like another network driver bug. Have you tcpdumped the traffic between client and server? Cheers, Trond ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-05-07 7:28 ` Trond Myklebust @ 2002-05-07 15:32 ` Dan Yocum 2002-05-07 15:54 ` Dan Yocum 0 siblings, 1 reply; 11+ messages in thread From: Dan Yocum @ 2002-05-07 15:32 UTC (permalink / raw) To: Trond Myklebust; +Cc: linux kernel Trond Myklebust wrote: > > On Tuesday 7. May 2002 00:05, Dan Yocum wrote: > > Trond, > > > > OK, so backing out the rpc_tweaks dif fixed the performance problem, > > however, seems to have introduced another problem that appears to be > > stemming from the seekdir.dif. Attempting to run an app from an IRIX > > client (that has the 32bitclients option set) freezes the NFS volume - one > > can't access it from the Linux side, anymore. > > > > You can read and write to the NFS volume *before* trying to run something > > from there, but not after. > > > > Ideas? > > That smells like another network driver bug. Have you tcpdumped the traffic > between client and server? Ah, that may be the case - the problem also exists with a Linux server as well... let me check, and I'll let you know. Thanks, Dan -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Poor NFS client performance on 2.4.18? 2002-05-07 15:32 ` Dan Yocum @ 2002-05-07 15:54 ` Dan Yocum 2002-05-08 20:19 ` ns83820 bug. [was Re: Poor NFS client performance on 2.4.18?] Dan Yocum 0 siblings, 1 reply; 11+ messages in thread From: Dan Yocum @ 2002-05-07 15:54 UTC (permalink / raw) To: Trond Myklebust, linux kernel Dan Yocum wrote: > > Trond Myklebust wrote: > > > > On Tuesday 7. May 2002 00:05, Dan Yocum wrote: > > > Trond, > > > > > > OK, so backing out the rpc_tweaks dif fixed the performance problem, > > > however, seems to have introduced another problem that appears to be > > > stemming from the seekdir.dif. Attempting to run an app from an IRIX > > > client (that has the 32bitclients option set) freezes the NFS volume - one > > > can't access it from the Linux side, anymore. > > > > > > You can read and write to the NFS volume *before* trying to run something > > > from there, but not after. > > > > > > Ideas? > > > > That smells like another network driver bug. Have you tcpdumped the traffic > > between client and server? > > Ah, that may be the case - the problem also exists with a Linux server as > well... let me check, and I'll let you know. I take that back - it's only hanging on the Linux server when the IRIX server is already hung. -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
* ns83820 bug. [was Re: Poor NFS client performance on 2.4.18?] 2002-05-07 15:54 ` Dan Yocum @ 2002-05-08 20:19 ` Dan Yocum 2002-05-08 22:07 ` Benjamin LaHaise 0 siblings, 1 reply; 11+ messages in thread From: Dan Yocum @ 2002-05-08 20:19 UTC (permalink / raw) To: Trond Myklebust, linux kernel Trond, et al. You're right, it's a driver (ns83820) issue. Strange that it only shows up when trying to execute an app that's mounted via NFS, but, whatever. Running apps from the the NFS volumes with the eepro100 adapter that's on the machine works fine with the updated NFS_all patch applied. Thanks, again, Dan Dan Yocum wrote: > > Dan Yocum wrote: > > > > Trond Myklebust wrote: > > > > > > On Tuesday 7. May 2002 00:05, Dan Yocum wrote: > > > > Trond, > > > > > > > > OK, so backing out the rpc_tweaks dif fixed the performance problem, > > > > however, seems to have introduced another problem that appears to be > > > > stemming from the seekdir.dif. Attempting to run an app from an IRIX > > > > client (that has the 32bitclients option set) freezes the NFS volume - one > > > > can't access it from the Linux side, anymore. > > > > > > > > You can read and write to the NFS volume *before* trying to run something > > > > from there, but not after. > > > > > > > > Ideas? > > > > > > That smells like another network driver bug. Have you tcpdumped the traffic > > > between client and server? > > > > Ah, that may be the case - the problem also exists with a Linux server as > > well... let me check, and I'll let you know. > > I take that back - it's only hanging on the Linux server when the IRIX > server is already hung. -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: ns83820 bug. [was Re: Poor NFS client performance on 2.4.18?] 2002-05-08 20:19 ` ns83820 bug. [was Re: Poor NFS client performance on 2.4.18?] Dan Yocum @ 2002-05-08 22:07 ` Benjamin LaHaise 2002-05-09 20:54 ` Dan Yocum 0 siblings, 1 reply; 11+ messages in thread From: Benjamin LaHaise @ 2002-05-08 22:07 UTC (permalink / raw) To: Dan Yocum; +Cc: Trond Myklebust, linux kernel Upgrade to 0.17 (which is in 2.4.19-pre5 or so and later) and you should find the issue resolved. -ben On Wed, May 08, 2002 at 03:19:03PM -0500, Dan Yocum wrote: > Trond, et al. > > You're right, it's a driver (ns83820) issue. Strange that it only shows up > when trying to execute an app that's mounted via NFS, but, whatever. > Running apps from the the NFS volumes with the eepro100 adapter that's on > the machine works fine with the updated NFS_all patch applied. > > Thanks, again, > Dan > > > Dan Yocum wrote: > > > > Dan Yocum wrote: > > > > > > Trond Myklebust wrote: > > > > > > > > On Tuesday 7. May 2002 00:05, Dan Yocum wrote: > > > > > Trond, > > > > > > > > > > OK, so backing out the rpc_tweaks dif fixed the performance problem, > > > > > however, seems to have introduced another problem that appears to be > > > > > stemming from the seekdir.dif. Attempting to run an app from an IRIX > > > > > client (that has the 32bitclients option set) freezes the NFS volume - one > > > > > can't access it from the Linux side, anymore. > > > > > > > > > > You can read and write to the NFS volume *before* trying to run something > > > > > from there, but not after. > > > > > > > > > > Ideas? > > > > > > > > That smells like another network driver bug. Have you tcpdumped the traffic > > > > between client and server? > > > > > > Ah, that may be the case - the problem also exists with a Linux server as > > > well... let me check, and I'll let you know. > > > > I take that back - it's only hanging on the Linux server when the IRIX > > server is already hung. > > > -- > Dan Yocum > Sloan Digital Sky Survey, Fermilab 630.840.6509 > yocum@fnal.gov, http://www.sdss.org > SDSS. Mapping the Universe. > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ -- "You will be reincarnated as a toad; and you will be much happier." ^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: ns83820 bug. [was Re: Poor NFS client performance on 2.4.18?] 2002-05-08 22:07 ` Benjamin LaHaise @ 2002-05-09 20:54 ` Dan Yocum 0 siblings, 0 replies; 11+ messages in thread From: Dan Yocum @ 2002-05-09 20:54 UTC (permalink / raw) To: Benjamin LaHaise; +Cc: Trond Myklebust, linux kernel Ben, Trond, Ugh. It's been a bad week in general. Anyway, yes, you're right, the 0.17 upgrade did the trick - I had actually tried that, but had accidentally re-applied the rpc_tweaks dif which knocked the performance down again. Backing out that patch, updating to v 0.17 of the ns83820 driver and all is well, again. So, it's back in Trond's court, with the rpc_tweaks causing the slow read/write over NFS, problem. Trond, if all I want is 32k block transfers from that dif, can I just apply the following, or is there more to it: diff -u --recursive --new-file linux-2.4.18-svc_tcp/include/linux/nfsd/const.h linux-2.4.18-rpc_cong/include/linux/nfsd/const.h --- linux-2.4.18-svc_tcp/include/linux/nfsd/const.h Sat Apr 1 18:04:27 2000+++ linux-2.4.18-rpc_cong/include/linux/nfsd/const.h Wed Feb 20 17:20:45 2002@@ -21,7 +21,7 @@ /* * Maximum blocksize supported by daemon currently at 8K */ -#define NFSSVC_MAXBLKSIZE 8192 +#define NFSSVC_MAXBLKSIZE (32*1024) #ifdef __KERNEL__ Thanks, Dan Benjamin LaHaise wrote: > > Upgrade to 0.17 (which is in 2.4.19-pre5 or so and later) and you should > find the issue resolved. > > -ben > > On Wed, May 08, 2002 at 03:19:03PM -0500, Dan Yocum wrote: > > Trond, et al. > > > > You're right, it's a driver (ns83820) issue. Strange that it only shows up > > when trying to execute an app that's mounted via NFS, but, whatever. > > Running apps from the the NFS volumes with the eepro100 adapter that's on > > the machine works fine with the updated NFS_all patch applied. > > > > Thanks, again, > > Dan > > > > > > Dan Yocum wrote: > > > > > > Dan Yocum wrote: > > > > > > > > Trond Myklebust wrote: > > > > > > > > > > On Tuesday 7. May 2002 00:05, Dan Yocum wrote: > > > > > > Trond, > > > > > > > > > > > > OK, so backing out the rpc_tweaks dif fixed the performance problem, > > > > > > however, seems to have introduced another problem that appears to be > > > > > > stemming from the seekdir.dif. Attempting to run an app from an IRIX > > > > > > client (that has the 32bitclients option set) freezes the NFS volume - one > > > > > > can't access it from the Linux side, anymore. > > > > > > > > > > > > You can read and write to the NFS volume *before* trying to run something > > > > > > from there, but not after. > > > > > > > > > > > > Ideas? > > > > > > > > > > That smells like another network driver bug. Have you tcpdumped the traffic > > > > > between client and server? > > > > > > > > Ah, that may be the case - the problem also exists with a Linux server as > > > > well... let me check, and I'll let you know. > > > > > > I take that back - it's only hanging on the Linux server when the IRIX > > > server is already hung. > > > > > > -- > > Dan Yocum > > Sloan Digital Sky Survey, Fermilab 630.840.6509 > > yocum@fnal.gov, http://www.sdss.org > > SDSS. Mapping the Universe. > > - > > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > > the body of a message to majordomo@vger.kernel.org > > More majordomo info at http://vger.kernel.org/majordomo-info.html > > Please read the FAQ at http://www.tux.org/lkml/ > > -- > "You will be reincarnated as a toad; and you will be much happier." -- Dan Yocum Sloan Digital Sky Survey, Fermilab 630.840.6509 yocum@fnal.gov, http://www.sdss.org SDSS. Mapping the Universe. ^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2002-05-09 20:54 UTC | newest] Thread overview: 11+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2002-04-25 20:49 Poor NFS client performance on 2.4.18? Dan Yocum 2002-04-26 13:14 ` Trond Myklebust 2002-04-26 14:51 ` Dan Yocum 2002-04-28 19:29 ` Trond Myklebust 2002-05-06 22:05 ` Dan Yocum 2002-05-07 7:28 ` Trond Myklebust 2002-05-07 15:32 ` Dan Yocum 2002-05-07 15:54 ` Dan Yocum 2002-05-08 20:19 ` ns83820 bug. [was Re: Poor NFS client performance on 2.4.18?] Dan Yocum 2002-05-08 22:07 ` Benjamin LaHaise 2002-05-09 20:54 ` Dan Yocum
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox