public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Nix <nix@esperi.org.uk>
To: thockin@hockin.org
Cc: linux-kernel@vger.kernel.org
Subject: Re: 2.6.15.1: UDP fragments >27208 bytes lost with ne2k-pci on DP83815
Date: Mon, 30 Jan 2006 22:18:10 +0000	[thread overview]
Message-ID: <87irs1qs0t.fsf@amaterasu.srvr.nix> (raw)
In-Reply-To: <87ek2pts25.fsf@amaterasu.srvr.nix> (nix@esperi.org.uk's message of "Mon, 30 Jan 2006 19:49:06 +0000")

On Mon, 30 Jan 2006, nix@esperi.org.uk uttered the following:
> On Mon, 30 Jan 2006, thockin@hockin.org said:
>> I can't imagine what kind of hidden bug would lay dormant for 4 years and
>> then pop up for just one person.  I don't know what I can offer except
>> extra eyes  to vet any potential solutions.  Happt Hunting.
> 
> I'll verify that the cause really is the network card shortly (by
> swapping the network cards I use for ADSL and internal networking:
> the ADSL link has very little large UDP traffic). This'll let us
> localize the problem to the NIC, or not, as the case may be.
> 
> More once this is done.

We pretty much have proof now that the problem is that the card and/or
driver can't consume UDP packets at the rate at which 100Mb/s Ethernet
can throw them at it. (This is peculiar, as it seems to have no
difficulty consuming TCP packets at that rate: there are no
reported retransmissions.)

- catting a huge file over NFS *from* the hanging machine works fine.
  Confirmation that it's UDP *reception* at fault.

- catting a huge file over UDP netcat from the hanging machine works fine:

loki $ nc -v -p 20202 -u amaterasu 20202 < ~/Graphics/octo-snake.mov
   on one end, and
amaterasu $ nc -v -l -p 20202 -u > /tmp/octo-snake.mov
   on the other, yields

-rw-r--r-- 1 nix users 3255013 Nov 30  2004 /home/nix/Graphics/octo-snake.mov
   on one end, and
-rw-r--r-- 1 nix users 3255013 Jan 30 21:55 /tmp/octo-snake.mov
   on the other.

- catting a huge file over UDP netcat to the hanging machine doesn't work so
  well: we stall after 40 packets; above the 32K rsize for NFS UDP on this
  connection, but not by far, and you can see how a small change in the
  rate at which the CPU sucks packets from the card could cause the limit
  to fall below 32K:

-rw-r--r-- 1 nix users 57280 Jan 30 21:53 /tmp/octo-snake.mov

  There is a bridge in the way, so I captured both the raw interface and
  the bridge. Both saw only 40 packets.

- this machine also has an RTL-8029 in it, used for chattering
  over ADSL. With this in place, I see rather better results.
  Results from three consecutive runs:

-rw-r--r-- 1 nix users 2214629 Jan 30 22:03 /tmp/octo-snake.mov
-rw-r--r-- 1 nix users 3107557 Jan 30 22:11 /tmp/octo-snake.mov
-rw-r--r-- 1 nix users 3140325 Jan 30 22:11 /tmp/octo-snake.mov

  It's not perfect, but it's *way* more than the other card is capable
  of, and it's way above the 32K limit for NFS UDP. And indeed with this
  card NFS UDP works fine, no matter how vast the files I request.

  (It *is* strange that it's still losing packets even though the
  machine is faster than amaterasu up there, which was receiving packets
  without difficulty and dropping none. Ah well, one can't really
  compare IA32 and UltraSPARC speeds like that, and the failing machine
  has the extra CPU load of a bridge in the way.)


I find myself wondering if receiving consecutive line-saturating UDP
traffic is just more than this cheap little card is capable of :/

It's either the NIC or the driver for it; that much is clear.  Plus it's
really easy to reproduce the problem now: spray stuff from `netcat -u'
at such a card and see how much arrives.

-- 
`I won't make a secret of the fact that your statement/question
 sent a wave of shock and horror through us.' --- David Anderson

  reply	other threads:[~2006-01-30 22:18 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-01-28 22:52 2.6.15.1: persistent nasty hang in sync_page killing NFS (ne2k-pci / DP83815-related?), i686/PIII Nix
2006-01-29  1:59 ` Trond Myklebust
2006-01-29 14:24   ` Nix
2006-01-29 19:56     ` Nix
2006-01-29 20:21       ` Trond Myklebust
2006-01-29 21:54         ` Nix
2006-01-29 22:02           ` Trond Myklebust
2006-01-30 16:55             ` Nix
2006-01-30 17:09               ` Trond Myklebust
2006-01-30 17:31                 ` 2.6.15.1: UDP fragments >27208 bytes lost with ne2k-pci on DP83815 (was Re: persistent nasty hang in sync_page killing NFS (ne2k-pci / DP83815-related?), i686/PIII) Nix
2006-01-30 19:03                   ` thockin
2006-01-30 19:07                     ` 2.6.15.1: UDP fragments >27208 bytes lost with ne2k-pci on DP83815 Nix
2006-01-30 19:32                       ` thockin
2006-01-30 19:49                         ` Nix
2006-01-30 22:18                           ` Nix [this message]
2006-01-30 19:36   ` 2.6.15.1: persistent nasty hang in sync_page killing NFS(ne2k-pci / DP83815-related?), i686/PIII Roger Heflin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87irs1qs0t.fsf@amaterasu.srvr.nix \
    --to=nix@esperi.org.uk \
    --cc=linux-kernel@vger.kernel.org \
    --cc=thockin@hockin.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox