* e1000: eth2: e1000_clean_tx_irq: Detected Tx Unit Hang
@ 2009-04-05 8:11 Jesper Krogh
2009-04-06 17:36 ` Brandeburg, Jesse
0 siblings, 1 reply; 3+ messages in thread
From: Jesper Krogh @ 2009-04-05 8:11 UTC (permalink / raw)
To: Linux Kernel Mailing List, netdev@vger.kernel.org
I have a 2.6.27.20 system in production, the e1000 drivers seem pretty
"noisy" allthough everything appears to work excellent.
dmesg here: http://krogh.cc/~jesper/dmesg-ko-2.6.27.20.txt
[476197.380486] e1000: eth3: e1000_clean_tx_irq: Detected Tx Unit Hang
[476197.380488] Tx Queue <0>
[476197.380489] TDH <c>
[476197.380490] TDT <63>
[476197.380490] next_to_use <63>
[476197.380491] next_to_clean <b>
[476197.380491] buffer_info[next_to_clean]
[476197.380492] time_stamp <10717579a>
[476197.380492] next_to_watch <f>
[476197.380493] jiffies <107175a3e>
[476197.380494] next_to_watch.status <0>
The system has been up for 14 days but the dmesg-buffer has allready
overflown with these.
Configuratoin is a 4 x 1GbitE bond all with Intel NICs
06:01.0 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
Controller (Copper) (rev 03)
06:01.1 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
Controller (Copper) (rev 03)
06:02.0 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
Controller (Copper) (rev 03)
06:02.1 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
Controller (Copper) (rev 03)
--
Jesper
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: e1000: eth2: e1000_clean_tx_irq: Detected Tx Unit Hang
2009-04-05 8:11 e1000: eth2: e1000_clean_tx_irq: Detected Tx Unit Hang Jesper Krogh
@ 2009-04-06 17:36 ` Brandeburg, Jesse
2009-04-06 18:53 ` Jesper Krogh
0 siblings, 1 reply; 3+ messages in thread
From: Brandeburg, Jesse @ 2009-04-06 17:36 UTC (permalink / raw)
To: Jesper Krogh
Cc: Linux Kernel Mailing List, netdev@vger.kernel.org,
jesse.brandeburg, e1000-devel
Hi Jesper,
On Sun, 5 Apr 2009, Jesper Krogh wrote:
> I have a 2.6.27.20 system in production, the e1000 drivers seem pretty
> "noisy" allthough everything appears to work excellent.
well, nice to hear its working, but wierd about the messages.
> dmesg here: http://krogh.cc/~jesper/dmesg-ko-2.6.27.20.txt
>
> [476197.380486] e1000: eth3: e1000_clean_tx_irq: Detected Tx Unit Hang
> [476197.380488] Tx Queue <0>
> [476197.380489] TDH <c>
> [476197.380490] TDT <63>
> [476197.380490] next_to_use <63>
> [476197.380491] next_to_clean <b>
> [476197.380491] buffer_info[next_to_clean]
> [476197.380492] time_stamp <10717579a>
> [476197.380492] next_to_watch <f>
> [476197.380493] jiffies <107175a3e>
> [476197.380494] next_to_watch.status <0>
>
> The system has been up for 14 days but the dmesg-buffer has allready
> overflown with these.
I looked at your dmesg and it appears that there is never a
NETDEV_WATCHDOG message, which would normally indicate that the driver
isn't resetting itself out of the problem. Does ethtool -S eth3 show any
tx_timeout_count ?
> Configuratoin is a 4 x 1GbitE bond all with Intel NICs
>
> 06:01.0 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
> Controller (Copper) (rev 03)
> 06:01.1 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
> Controller (Copper) (rev 03)
> 06:02.0 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
> Controller (Copper) (rev 03)
> 06:02.1 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
> Controller (Copper) (rev 03)
are you doing testing with the remote end of this link? I'm wondering if
something changed in the kernel that is causing remote link down events to
not stop the tx queue (our hardware just completely stops in its tracks
w.r.t tx when link goes down)
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: e1000: eth2: e1000_clean_tx_irq: Detected Tx Unit Hang
2009-04-06 17:36 ` Brandeburg, Jesse
@ 2009-04-06 18:53 ` Jesper Krogh
0 siblings, 0 replies; 3+ messages in thread
From: Jesper Krogh @ 2009-04-06 18:53 UTC (permalink / raw)
To: Brandeburg, Jesse
Cc: Linux Kernel Mailing List, netdev@vger.kernel.org, e1000-devel
Brandeburg, Jesse wrote:
> Hi Jesper,
>
> On Sun, 5 Apr 2009, Jesper Krogh wrote:
>> I have a 2.6.27.20 system in production, the e1000 drivers seem pretty
>> "noisy" allthough everything appears to work excellent.
>
> well, nice to hear its working, but wierd about the messages.
>
>> dmesg here: http://krogh.cc/~jesper/dmesg-ko-2.6.27.20.txt
>>
>> [476197.380486] e1000: eth3: e1000_clean_tx_irq: Detected Tx Unit Hang
>> [476197.380488] Tx Queue <0>
>> [476197.380489] TDH <c>
>> [476197.380490] TDT <63>
>> [476197.380490] next_to_use <63>
>> [476197.380491] next_to_clean <b>
>> [476197.380491] buffer_info[next_to_clean]
>> [476197.380492] time_stamp <10717579a>
>> [476197.380492] next_to_watch <f>
>> [476197.380493] jiffies <107175a3e>
>> [476197.380494] next_to_watch.status <0>
>>
>> The system has been up for 14 days but the dmesg-buffer has allready
>> overflown with these.
>
> I looked at your dmesg and it appears that there is never a
> NETDEV_WATCHDOG message, which would normally indicate that the driver
> isn't resetting itself out of the problem. Does ethtool -S eth3 show any
> tx_timeout_count ?
$ for i in 0 1 2 3; do sudo ethtool -S eth${i} | grep tx_timeout_count; done
tx_timeout_count: 6
tx_timeout_count: 3
tx_timeout_count: 14
tx_timeout_count: 23
>> Configuratoin is a 4 x 1GbitE bond all with Intel NICs
>>
>> 06:01.0 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
>> Controller (Copper) (rev 03)
>> 06:01.1 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
>> Controller (Copper) (rev 03)
>> 06:02.0 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
>> Controller (Copper) (rev 03)
>> 06:02.1 Ethernet controller: Intel Corporation 82546EB Gigabit Ethernet
>> Controller (Copper) (rev 03)
>
> are you doing testing with the remote end of this link? I'm wondering if
> something changed in the kernel that is causing remote link down events to
> not stop the tx queue (our hardware just completely stops in its tracks
> w.r.t tx when link goes down)
They are connected directly to a switch stack. I'd be surprised if there
is anything in there that does magic. I have around 100 other cables
into that one.
--
Jesper
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2009-04-06 18:53 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-04-05 8:11 e1000: eth2: e1000_clean_tx_irq: Detected Tx Unit Hang Jesper Krogh
2009-04-06 17:36 ` Brandeburg, Jesse
2009-04-06 18:53 ` Jesper Krogh
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).