All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Martin L. Purschke" <purschke@bnl.gov>
To: nfs@lists.sourceforge.net
Subject: nfs stalls on cp with certain file sizes with jumbo-gigabit
Date: Sun, 28 Dec 2003 15:50:14 -0500	[thread overview]
Message-ID: <3FEF4206.7070300@bnl.gov> (raw)


Hello,

I have a few linux nfs servers, all identical, dual gigabit NIC's, one
on mtu 1500 and one on another jumbo-frame network. I have dozens of nfs
clients on the regular network and never had a problem. I recently
started to move machines to the jumbo-frame network. I came across a
strange problem that a nfs client stalls when cp'ing a file with a
certain file size. (There are many other ways to get the nfs client to
stall that has so far prevented me from deploying machines on the
J-network, but this one is easily reproducible).

I tracked down a file that would cause the problem. I made another file
with the exact same size and could reproduce the error (so it's not the
particular file, but the size that does it). The size happens to be 4698
bytes, here is a strace of the cp command of the special-made file with
that exact length:

> [root@va033 tmp]# strace cp /root/4698.dat .
> execve("/bin/cp", ["cp", "/root/4698.dat", "."], [/* 45 vars */]) = 0
> uname({sys="Linux", node="va033.phenix.bnl.local", ...}) = 0
>
> ....
>
> lstat64(".", {st_mode=S_IFDIR|0777, st_size=4096, ...}) = 0
> stat64(".", {st_mode=S_IFDIR|0777, st_size=4096, ...}) = 0
> stat64("/root/4698.dat", {st_mode=S_IFREG|0644, st_size=4698, ...}) = 0
> stat64("./4698.dat", {st_mode=S_IFREG|0644, st_size=0, ...}) = 0
> open("/root/4698.dat", O_RDONLY|O_LARGEFILE) = 3
> fstat64(3, {st_mode=S_IFREG|0644, st_size=4698, ...}) = 0
> open("./4698.dat", O_WRONLY|O_TRUNC|O_LARGEFILE) = 4
> fstat64(4, {st_mode=S_IFREG|0644, st_size=0, ...}) = 0
> fstat64(3, {st_mode=S_IFREG|0644, st_size=4698, ...}) = 0
> read(3, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...,
> 32768) = 4698
> write(4, "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"...,
> 4698) = 4698
> read(3, "", 32768)                      = 0
> close(4

and there is hangs indefinitely. I have no problem doing the exact same
thing on the mtu=1500 network, in fact, I dropped the jumbo network down
to 1500 and the same operation works.

Both client and server run kernel 2.4.22 with SMP and high-mem
nfs is:  type nfs
(rw,nfsvers=3,hard,wsize=32768,rsize=32768,addr=192.168.30.6)
I played with the sizes, no effect.

Does this ring a bell? I looked in this list and saw some of Trond's
patches, but the errors they address were different, and I'm not sure if
I need to look in the network drivers instead. I'm  planning to install
the patches as advised in
http://sourceforge.net/mailarchive/message.php?msg_id=6283173
but these are production machines, and the patches are tagged
"experimental". Any advice is highly appreciated.

Thanks,
   Martin

-- 
Martin L. Purschke               ;   purschke@bnl.gov
                                 ;   http://www.phenix.bnl.gov/~purschke
                                 ;
Brookhaven National Laboratory   ;   phone: +1-631-344-5244
Physics Department Bldg 510 C    ;   fax:   +1-631-344-3253
Upton, NY 11973-5000             ;
-----------------------------------------------------------------------




-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

             reply	other threads:[~2003-12-28 20:50 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2003-12-28 20:50 Martin L. Purschke [this message]
2003-12-28 21:21 ` nfs stalls on cp with certain file sizes with jumbo-gigabit Trond Myklebust

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3FEF4206.7070300@bnl.gov \
    --to=purschke@bnl.gov \
    --cc=nfs@lists.sourceforge.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.