All of lore.kernel.org
 help / color / mirror / Atom feed
From: Bill Fink <billfink@mindspring.com>
To: Urs Thuermann <urs@isnogud.escape.de>
Cc: "Brandeburg, Jesse" <jesse.brandeburg@intel.com>,
	"L F" <lfabio.linux@gmail.com>,
	"Kok, Auke-jan H" <auke-jan.h.kok@intel.com>,
	"James Chapman" <jchapman@katalix.com>, <netdev@vger.kernel.org>
Subject: Re: e1000 driver and samba
Date: Tue, 18 Sep 2007 04:47:02 -0400	[thread overview]
Message-ID: <20070918044702.0b5db963.billfink@mindspring.com> (raw)
In-Reply-To: <ygfps0g8lj0.fsf@janus.isnogud.escape.de>

On 18 Sep 2007, Urs Thuermann wrote:

> Bill Fink <billfink@mindspring.com> writes:
> 
> > It may also be a useful test to disable hardware TSO support
> > via "ethtool -K ethX tso off".
> 
> All suggestions here on the list, i.e. checking for flow control,
> duplex, cable problems, etc. don't explain (at least to me) why LF
> sees file corruption.  How can a corrupted frame pass the TCP checksum
> check?  Does TCP use the hardware checksum of the NIC if available?
> AFAICS, this would be the only way for a corrupt frame to make it into
> the file.  But Bill already suggested this and LF reported that it
> didn't make a difference.
> 
> A few months ago I had hadware problems with an embedded device, where
> transmission from the NIC via the PCI bus to the CPU had some bits
> flipped.  But tcpdump clearly showed the TCP checksum errors and also
> TCP recognized the errors and the connection was stalled.  And, BTW,
> we also observed an increasing percentage of corrupted frames with
> increasing traffic on that interface, i.e. increasing load on the PCI
> bus.
> 
> So I would run tcpdump -s0 and watch for "incorrect checksum" messages.

I agree TSO is an unlikely candidate since it should only affect
transmits and the problem as I understand it is with receives.
But still one of the first things I try doing when dealing with
weird problems is disabling all hardware assists.

But I also agree with you that network errors should normally be
detected by the TCP checksum (unless hardware checksumming was
messed up), and from what I recall there were no receive checksum
errors being seen.  That and the fact that the problem was seen
with two different NICs would lead me to believe that the problem
is elsewhere in the system.

That leaves many possibilities.  It could be a memory problem,
although it was indicated that memory testing was successfully
performed (but we don't know how extensive the memory checking
is enabled via the BIOS).  It could be the PCI bus writes back
to the disk, or a problem with the disk/controller/fs writes
themselves (some kind of disk stress test might be useful).

						-Bill

  reply	other threads:[~2007-09-18  8:47 UTC|newest]

Thread overview: 40+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-09-14  2:04 e1000 driver and samba L F
2007-09-14 17:18 ` Kok, Auke
2007-09-14 18:40   ` L F
2007-09-14 20:59     ` Kok, Auke
2007-09-15  0:37       ` L F
2007-09-15  5:09         ` Bill Fink
2007-09-15 12:27           ` L F
2007-09-15 12:44             ` L F
2007-09-15 17:44       ` James Chapman
2007-09-15 19:07         ` Kok, Auke
2007-09-16  4:06           ` L F
2007-09-16  5:04             ` Kok, Auke
2007-09-17 16:42               ` L F
2007-09-17 17:02                 ` Kok, Auke
2007-09-17 18:58                   ` L F
2007-09-17 21:01                     ` Brandeburg, Jesse
2007-09-18  6:03                       ` Bill Fink
2007-09-18  7:45                         ` Urs Thuermann
2007-09-18  8:47                           ` Bill Fink [this message]
2007-09-18 13:39                           ` Florian Weimer
2007-09-18 16:32                             ` L F
2007-09-18 17:04                               ` Tantilov, Emil S
2007-09-19 14:53                                 ` L F
2007-09-20  2:51                                   ` Bill Fink
2007-09-21 14:08                                     ` L F
2007-09-20  4:53                                   ` Bill Fink
2007-09-18 16:44                             ` Bill Fink
2007-09-17 18:02               ` Rick Jones
2007-09-17 18:51                 ` Kok, Auke
2007-09-16 16:24           ` James Chapman
2007-09-16 20:03             ` Kok, Auke
2007-09-16  4:07         ` L F
2007-09-14 18:26 ` Francois Romieu
2007-09-14 18:41   ` L F
  -- strict thread matches above, loose matches on Subject: below --
2007-09-20 23:30 Bruce Cole
2007-09-21 14:13 ` L F
2007-09-21 18:21   ` Bruce Cole
2007-09-21 21:49 ` Francois Romieu
2007-09-21 22:01   ` Bruce Cole
2007-09-21 22:41     ` Francois Romieu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070918044702.0b5db963.billfink@mindspring.com \
    --to=billfink@mindspring.com \
    --cc=auke-jan.h.kok@intel.com \
    --cc=jchapman@katalix.com \
    --cc=jesse.brandeburg@intel.com \
    --cc=lfabio.linux@gmail.com \
    --cc=netdev@vger.kernel.org \
    --cc=urs@isnogud.escape.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.