* [2.6.0-mm2] e100 driver hangs after period of moderate receive load
@ 2003-12-31 11:02 Lennert Buytenhek
From: Lennert Buytenhek @ 2003-12-31 11:02 UTC (permalink / raw)
  To: linux-kernel

Hi,

After banging on an e100 card for about ten minutes with a ~60kpps stream,
the interface stops receiving packets.  Interrupts come in once every few
seconds (from /proc/interrupts), but no packets are received anymore at all.
The syslog shows lots of slab corruption messages that were generated during
that packet stream (see the other email I sent).  Stopping the packet stream
still leaves the interface unusable.  'ifconfig eth1 down ; ifconfig eth1 up'
seems to fix things.
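
For anyone wanting to put a number on "once every few seconds", here is a
minimal sketch of a helper that sums the per-CPU counters on one line of
/proc/interrupts.  The function name irq_total_for() is made up for
illustration, and the line layout (IRQ label, one counter per CPU, then
controller and device names) is an assumption based on the usual format.

```c
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper: sum the per-CPU interrupt counters on a single
 * /proc/interrupts line.  Returns -1 if the line does not mention ifname.
 * Note: this is a plain substring match, so "eth1" also matches "eth10".
 */
long long irq_total_for(const char *line, const char *ifname)
{
    const char *p;
    long long total = 0;

    if (strstr(line, ifname) == NULL)
        return -1;

    /* Skip the IRQ label, e.g. " 19:". */
    p = strchr(line, ':');
    if (p == NULL)
        return -1;
    p++;

    for (;;) {
        char *end;
        long long n;

        while (*p == ' ' || *p == '\t')
            p++;
        if (!isdigit((unsigned char)*p))
            break;              /* reached the controller/device names */

        n = strtoll(p, &end, 10);
        /* A counter is followed by whitespace; "19-fasteoi" is not one. */
        if (*end != '\0' && !isspace((unsigned char)*end))
            break;

        total += n;
        p = end;
    }
    return total;
}
```

Calling this on the eth1 line twice, a few seconds apart, and subtracting
the two totals gives the interrupt count over that interval.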

Clues?


thanks,
Lennert

* RE: [2.6.0-mm2] e100 driver hangs after period of moderate receive load
@ 2004-01-05  0:25 Feldman, Scott
From: Feldman, Scott @ 2004-01-05  0:25 UTC (permalink / raw)
  To: Lennert Buytenhek, Thomas Molina; +Cc: Jeff Garzik, linux-kernel

Lennert Buytenhek wrote:
> Make sure you have slab debugging enabled to see if you also 
> get the slab corruption messages, and then hit the machine 
> with anything above 50000 packets per second.  pktgen from a 
> different machine on the same subnet works nicely for that.  
> I doubt that downloading a Red Hat iso would give you a load 
> anywhere near that.
> 
> Oh, do you have an SMP box?  This was on a 2-way (4-way HT) 
> SMP box.  Not sure if that matters here.
> 
> I'm just about to try 2.6.0-mm2 without NAPI.

Ok, I've repro'd this (w/ and w/o NAPI, but w/o is much harder).  I
wasted a bunch of time having both page alloc debugging and slab
debugging on.  Seems one masks the other.  (Jeff warned me!)  In any
case, what I know so far is the problem happens when HW runs out of Rx
resources, and SW tries to resume the receiver after supplying new
resources.  Somehow HW is scribbling on resources already given back to
the OS.  It's something about the list management in this new e100.
eepro100 and legacy e100 work fine.  Investigation continuing...
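
To make the symptom concrete, here is a toy model of the general class of
failure described above.  It is not e100 code, and the root cause has not
been identified at this point in the thread; the stale restart pointer
below is purely an assumed mechanism for illustration, and every name in
it is hypothetical.

```c
#include <stdbool.h>

/* Who is allowed to touch a receive buffer right now. */
enum owner { OS_OWNED, HW_OWNED };

struct rx_buf {
    enum owner owner;
    bool scribbled;     /* set when "hardware" writes to an OS-owned buffer */
};

/* Model of the hardware storing one received frame into a buffer. */
static void hw_write(struct rx_buf *b)
{
    if (b->owner != HW_OWNED)
        b->scribbled = true;    /* this is the corruption slab debugging sees */
}

/*
 * Run one "out of Rx resources, then resume" cycle and return how many
 * buffers were scribbled on.  restart_at_stale selects the assumed bug:
 * resuming the receiver at a buffer already given back to the OS.
 */
int count_scribbles(bool restart_at_stale)
{
    struct rx_buf old_buf = { HW_OWNED, false };
    struct rx_buf new_buf = { OS_OWNED, false };
    struct rx_buf *resume;

    /* Hardware fills its only buffer and then runs out of resources. */
    hw_write(&old_buf);

    /* Software hands the filled buffer to the OS and supplies a new one. */
    old_buf.owner = OS_OWNED;
    new_buf.owner = HW_OWNED;

    /* Software resumes the receiver, then the next frame arrives. */
    resume = restart_at_stale ? &old_buf : &new_buf;
    hw_write(resume);

    return (int)old_buf.scribbled + (int)new_buf.scribbled;
}
```

In this model count_scribbles(false) reports no corruption, while
count_scribbles(true) reports one buffer written after it was returned to
the OS, which is the kind of event slab debugging would flag.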

-scott

