public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* Strange problem with e1000 driver - ping packet loss
@ 2008-06-18 12:52 Srivatsa Vaddagiri
  2008-06-18 19:18 ` Brandeburg, Jesse
  0 siblings, 1 reply; 12+ messages in thread
From: Srivatsa Vaddagiri @ 2008-06-18 12:52 UTC (permalink / raw)
  To: linux-kernel, e1000-devel; +Cc: varunc, jbarnes, greg

Hi,
	I happened to look at a system which was exhibiting poor ping
performance with e1000 driver (in 2.6.25) and had some questions regarding that.

Ping test was done between the system and a laptop, which were connected
using a straight ethernet cable. Ping reported round trip times running
into seconds (!) and also packet loss.

Upon some investigation, I found that the interrupt count field in
/proc/interrupts (associated with eth1) is not incrementing as fast as
it should. Moreover eth1 interrupt line is shared with the hard disk
interrupt (ata_piix) as below:

# cat /proc/interrupts

..

 10:       2296    XT-PIC-XT        ata_piix, eth0, eth1

..

IRQ10 is thus being shared by both the hard disk and eth0/eth1.

Here's the strange observation I made:

When I initiate some disk activity (ex: dd if=/dev/zero of=/tmp/file), ping 
performance suddently shot up (round trip time in double digits ms, 0% packet 
loss)! I presume this is because that e1000 intr handler is called
whenever there was a interrupt from hard disk on IRQ10, which polled
NIC and processed packets immediately.

As soon as I kill the background disk-write intensive job, ping
performance again dropped.

This meant that e1000 NIC is having trouble interrupting the OS.

Before I could jump up and say this is a hardware issue, I was told
that Windows works just fine on the server (and as well as 2.4 kernel,
which I couldnt verify) :(


Some more observations:

1. I tried setting e1000 parameters (RxIntDelay=0, RxAbsIntDelay=0,
   TxIntDelay=0, TxAbsIntDelay=0, InterruptThrottleRate=0). None of
   them helped.

2. When ping performance was poor, readprofile showed that system
   is mostly idle. This confirms that OS is not getting very
   frequenty interrupts from eth1 and hence idling.

3. When ping performance was poor, ethtool -S eth1 showed that
   rx_bytes was incrementing at a good pace, showing that the 
   NIC was receiving ping responses back, but not handing them over
   to OS for further processing

4. e1000 chipset is 82546GB

5. e1000e driver didnt work at all (it doesnt recognize the cards).


Any advice on how to fix this problem?


-- 
Regards,
vatsa

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2008-06-26 13:35 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <fa.QBOn2aWyqGnBJJticG4h09lpxD0@ifi.uio.no>
2008-06-19 17:25 ` Strange problem with e1000 driver - ping packet loss Robert Hancock
2008-06-19 20:25   ` Vegard Nossum
2008-06-20 12:30   ` Srivatsa Vaddagiri
2008-06-20 14:40     ` Robert Hancock
2008-06-25 14:32       ` Srivatsa Vaddagiri
2008-06-25 18:09         ` [E1000-devel] " hong zhang
     [not found]     ` <alpine.LFD.1.10.0806251455500.3014@localhost.localdomain>
2008-06-25 19:04       ` Len Brown
2008-06-26 13:37         ` Srivatsa Vaddagiri
2008-06-24  4:33   ` Varun Chandramohan
2008-06-18 12:52 Srivatsa Vaddagiri
2008-06-18 19:18 ` Brandeburg, Jesse
2008-06-19  3:45   ` Srivatsa Vaddagiri

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox