All of lore.kernel.org
 help / color / mirror / Atom feed
* Stopping NFS, ip address take over, zero-copy NFS for 2.4.21, and misc
@ 2003-09-18 15:59 Chris Worley
  2003-09-18 21:21 ` James Pearson
                   ` (3 more replies)
  0 siblings, 4 replies; 11+ messages in thread
From: Chris Worley @ 2003-09-18 15:59 UTC (permalink / raw)
  To: nfs

Hi,

background...

Configuration: FC SAN serving all luns to multiple dual-cpu 3.0GHz XEON
I/O servers (using Qlogic 24xx HBA's) running GFS, re-exported via NFS
to about a dozen clients per NFS server.  Each I/O server has 96 nfsd
threads running.  NFS is being served over Myrinet over IP.  Any
problems listed below are true for both NFS over Ethernet and over
Myrinet over IP (but Myrinet is a lot more stable, with no frag
problems).  Servers and clients all running RH7.3 w/ a 2.4.21 kernel.
 
Patches: GFS, Direct-I/O and related kernel patches (doesn't seem to
work with IOZone "-I" option), and NFSSVC_MAXBLKSIZE set to 32768. Any
problems listed below are both with and without these changes (except
GFS patches, I gotta have those).  Qla23x0 driver: both SG_SEGMENTS and
MAX_OUTSTANDING_COMMANDS set to 4096. 

Clients mount with options (all performance related):

	bg,nocto,intr,vers=3,rsize=32768,wsize=32768,hard,retrans=1000,timeo=3,nolock,async

1)  NFS won't shutdown. 

No matter the number of nfsd threads, NFS won't shutdown.  It sticks and
eventually times-out trying to kill the nfsd threads.  With only one
client, this isn't a problem.  So it's number-of-clients related.

If NFS doesn't shutdown, then I can't gracefully unmount and shutdown
GFS... which means the only way to reboot an NFS server is take down the
network, and let the lock server fence the I/O server.  Not pretty.

Any ideas on forcing NFS down?

2) IP address takeover between NFS servers.

With NFS stateless, and not running lock servers, I thought a simple IP
address takeover scheme (when an I/O server goes down, another just adds
the failed server's IP address as a virtual interface) would allow
clients to immediately renegotiate with the same IP address pointing to
another NFS server (serving the same partitions).  The take-over is
successful: the clients can communicate with the new I/O server, but I
get "permission denied" (as root or otherwise) on the NFS mounted
partitions most of the time (sometimes it works).

What am I missing?

3) Zero-copy NFS patches had been available for kernels prior to
2.4.21... but are missing from Trond's 2.4.21 patches.  I have to use
2.4.21 for the time being (can't use 2.6).  

Is there hope of getting these patches for this kernel rev?

4) I Need to have more outstanding SCSI requests.

The SAN I'm using can parallelize many more outstanding SCSI requests
than I'm sending it.  The Qlogic scatter-gather list size and
outstanding command queue seem to be big enough to handle more
requests.  I'm seeing, at most, 5 outstanding requests per NFS server.

Is there something at the SCSI layer or driver layer that will allow for
more outstanding I/O requests?  

Is there a way to find out if this is a SCSI layer problem, vs. driver
or NFS or GFS file system problem (i.e. something in proc I can monitor
to see outstanding requests at these different levels)?

5) How come the "retrans" and "timeo" values set on the client mount
don't show up in /proc/mounts?

6) Any performance hints I'm missing?

Thanks,

Chris



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 11+ messages in thread
* Re: An NFS-HA HOWTO anyone? WAS> Re: Stopping NFS, ip address take over, zero-copy NFS for 2.4.21, and misc
@ 2003-09-22 16:33 trmcneal
  0 siblings, 0 replies; 11+ messages in thread
From: trmcneal @ 2003-09-22 16:33 UTC (permalink / raw)
  To: nfs

Hi -

The NFS failover is not that simple, depending on the HA cluster used.
Mission Critical Linux used an IP migration scheme, but had to jump through
hoops to force the client to renegotiate; otherwise the new server would just
look at the file handle, which had MAC address info, and return errors.
They (ok, We) did not publish that in open source, so that the Convolo 
cluster had lock failover capability, but the open source Kimberlite cluster
did not.

Anyway, since the behavior is so product dependent, it hasn't been really
addressed in the FAQ, but I'll look at it again.

Regards -

Tom

--
Tom McNeal
(650)906-0761(cell)
(650)964-8459(fax)
> On Sat, 2003-09-20 at 05:22, ian sison (mailing list) wrote:
> > 
> > On Fri, 19 Sep 2003 rnews@tasking.nl wrote:
> > 
> > > Chris Worley <cworley@lnxi.com> wrote:
> > > | 2) IP address takeover between NFS servers.
> > > |
> > > | With NFS stateless, and not running lock servers, I thought a simple IP
> > > | address takeover scheme (when an I/O server goes down, another just adds
> > > | the failed server's IP address as a virtual interface) would allow
> > > | clients to immediately renegotiate with the same IP address pointing to
> > > | another NFS server (serving the same partitions).  The take-over is
> > > | successful: the clients can communicate with the new I/O server, but I
> > > | get "permission denied" (as root or otherwise) on the NFS mounted
> > > | partitions most of the time (sometimes it works).
> > >
> > > You need to migrate the entries in /var/lib/nfs/rmtab to the new
> > > machine. Also, when the device number on the new machine differs
> > > from the device number on the old machine, you need to use the "fsid"
> > > option in /etc/exports, to make sure the file handles don't change.
> > 
> > This is important information, and it should go into a howto. NFS-HA is a
> > very common requirement nowadays, and there's very little information i've
> > seen on the net except for the Mission Critical Linux site.
> > 
> I need to update info on the page, and quite honestly, i think the info
> i have is buggy.. (I will update this week), but i have a document
> showing how I achieve 'Active-Active' HA NFS over Shared SCSI.. Mainly,
> it's by replicating rmtab entries by storing mountpoint specific rmtab
> entries on the shared storage mountpoint... 
> 
> the address is:
> 
> http://chilli.linuxmds.com/~mschilli/NFS/
> 
> Bookmark the page, and I promise by mid week, i'll have it updated.. I
> will post to the mailing list when the update is complete.
> 
> Matt Schillinger
> mschilli@vss.fsi.com
> > 
> > 
> > 
> > 
> > -------------------------------------------------------
> > This sf.net email is sponsored by:ThinkGeek
> > Welcome to geek heaven.
> > http://thinkgeek.com/sf
> > _______________________________________________
> > NFS maillist  -  NFS@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nfs
> 
> 
> 
> 
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> NFS maillist  -  NFS@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nfs


-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2003-09-22 16:34 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-09-18 15:59 Stopping NFS, ip address take over, zero-copy NFS for 2.4.21, and misc Chris Worley
2003-09-18 21:21 ` James Pearson
2003-09-19  1:11   ` Chris Worley
2003-09-19 13:26   ` Matt Schillinger
2003-09-19 22:41     ` Chris Worley
2003-09-19  9:04 ` rnews
2003-09-20 10:22   ` An NFS-HA HOWTO anyone? WAS> " ian sison (mailing list)
2003-09-22 14:24     ` Matt Schillinger
2003-09-19 12:50 ` Bernd Schubert
2003-09-19 15:35 ` rnews
  -- strict thread matches above, loose matches on Subject: below --
2003-09-22 16:33 An NFS-HA HOWTO anyone? WAS> " trmcneal

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.