netdev.vger.kernel.org archive mirror
* r8169 crash
From: Denys @ 2007-11-17  5:39 UTC
  To: netdev

Hi

I recently started using r8169 cards and ran into some issues.

How to reproduce (somewhat difficult, and I cannot do it on my live server
anymore):

1)Badly crimped cable.
2)PCI-Express R8169 card

Kernel 2.6.23.1

05:01.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8169 Gigabit Ethernet (rev 10)
        Subsystem: Realtek Semiconductor Co., Ltd. RTL-8169 Gigabit Ethernet
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B-
        Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
        Latency: 64 (8000ns min, 16000ns max), Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 21
        Region 0: I/O ports at 1000 [size=256]
        Region 1: Memory at 55001000 (32-bit, non-prefetchable) [size=256]
        Expansion ROM at fffe0000 [disabled] [size=128K]
        Capabilities: [dc] Power Management version 2
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA PME(D0-,D1+,D2+,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-


r8169 Gigabit Ethernet driver 2.2LK-NAPI loaded
eth1: RTL8169sb/8110sb at 0xf881c000, 00:19:e0:72:49:88, XID 10000000 IRQ 21

To reproduce the issue I need to shake the faulty cable. The card will report:
Nov 15 22:09:36 vzone r8169: eth1: link up
So it reports link up without ever reporting link down!

Normally it is like this:
Nov  1 00:50:33 vzone r8169: eth1: link down
Nov  1 00:50:35 vzone r8169: eth1: link up

After that I get:
Nov 15 22:11:37 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:11:49 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:12:01 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:12:13 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:12:25 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:12:37 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:12:49 vzone NETDEV WATCHDOG: eth1: transmit timed out
Nov 15 22:13:01 vzone NETDEV WATCHDOG: eth1: transmit timed out

And the card is completely non-functional (it doesn't see traffic at all and
doesn't send anything). Probably some functions (like NAPI enabling) have to
be disabled first, and if the interface comes up twice without going down,
the card goes crazy.
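
The pattern described above (a "link up" logged while the link was already
reported up) can be searched for in a syslog file. What follows is only a
minimal sketch and not part of the driver: it assumes the log lines look like
the ones quoted in this message, and the name find_repeated_link_up is made
up for illustration.

```python
import re

# Matches lines such as "Nov 15 22:09:36 vzone r8169: eth1: link up".
# The exact format is an assumption based on the log excerpts in this report.
LINK_RE = re.compile(r"r8169: (?P<dev>\w+): link (?P<state>up|down)")


def find_repeated_link_up(lines):
    """Return the log lines where 'link up' follows 'link up' for the same device."""
    last_state = {}   # device name -> last reported link state
    suspicious = []
    for line in lines:
        match = LINK_RE.search(line)
        if match is None:
            continue  # not a link state message
        dev, state = match.group("dev"), match.group("state")
        # A second "up" without a "down" in between is the case of interest.
        if state == "up" and last_state.get(dev) == "up":
            suspicious.append(line.rstrip("\n"))
        last_state[dev] = state
    return suspicious
```

It could be fed an open log file directly, for example
find_repeated_link_up(open("/var/log/messages")), where the path depends on
the local syslog setup. The first "link up" seen for a device is never
flagged, because the earlier state is unknown.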


--
Denys Fedoryshchenko
Technical Manager
Virtual ISP S.A.L.


* Re: r8169 crash
From: Francois Romieu @ 2007-11-17 22:07 UTC
  To: Denys; +Cc: netdev

Denys <nuclearcat@nuclearcat.com> :
[...]
> After that i have.
> Nov 15 22:11:37 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:11:49 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:12:01 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:12:13 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:12:25 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:12:37 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:12:49 vzone NETDEV WATCHDOG: eth1: transmit timed out
> Nov 15 22:13:01 vzone NETDEV WATCHDOG: eth1: transmit timed out

Is there a chance for you to verify that the network interface does
not recover if a real cable is plugged at this point ?

-- 
Ueimor

* Re: r8169 crash
From: Denys @ 2007-11-17 22:44 UTC
  To: Francois Romieu; +Cc: netdev


Earlier, when it happened on 2.6.22, I tried attaching a good cable,
plugging/unplugging, bringing the interface up/down, whatever - the card
still remained dead.

I tried plugging the cable into a laptop with rtl8139 and a PC with e100 -
they kept working when I shook the bad cable (the interface was going up/down
as well), and a good cable also worked fine and never crashed. On 2.6.22 I
was able to reproduce it easily, but I can't anymore because it is a server
in an internet cafe, and the customers go crazy.



--
Denys Fedoryshchenko
Technical Manager
Virtual ISP S.A.L.


* Re: r8169 crash
From: Francois Romieu @ 2007-11-18 21:00 UTC
  To: Denys; +Cc: netdev

Denys <nuclearcat@nuclearcat.com> :
> 
> Before it happens on 2.6.22, i tried to attach good cable, plug-unplug, 
> whatever, interface up/down - card still remains dead.

A few things have changed since 2.6.22 but I'll take that it is a
real bug and that 2.6.23 would not recover either. Can you file a
PR at http://bugzilla.kernel.org, add a lspci -vvvxxxx and a dmesg ?

A dmesg including the watchdog message would be nice but if you can
not retrieve one, a dmesg from your current 2.6.23.x kernel should be
enough.

I'll take care of it once I am done with the sis190 watchdog failure
(nothing hard, simply ugly).

[...]
> I try to plug cable to laptop with rtl8139 and pc with e100 - it is worked 
> when i shake bad cable(interface was going up/down as well), and good cable 
> also worked fine, and never crashed. On 2.6.22 i was able to reproduce it 
> easily, but later i can't cause it is server in internet cafe, and customers 
> going crazy.

:o/

-- 
Ueimor

* Re: r8169 crash
From: Denys @ 2007-11-18 23:31 UTC
  To: Francois Romieu; +Cc: netdev

Maybe it would be a good idea to print some registers etc. in the watchdog
message?

I will try to reproduce the problem with debug enabled; maybe more debug
output will make it possible to catch the problem.


--
Denys Fedoryshchenko
Technical Manager
Virtual ISP S.A.L.


* Re: r8169 crash
From: Francois Romieu @ 2007-11-19 23:24 UTC
  To: Denys; +Cc: netdev

Denys <nuclearcat@nuclearcat.com> :
> Maybe it is good idea to print some registers/etc in watchdog message?
> 
> I will try to reproduce problem with debug enabled, maybe it will be possible 
> to catch problem with more debug. 

Your customers will not necessarily like it but it could be useful to
trace the Rx/Tx descriptor ring content and the irq status word.

If you make it dependent on tp->msg_enable (settable via
'ethtool -s ethX msglvl 65535'), you can enable it at will and catch
some bit of normal traffic just before you experiment with the
magic cable.
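
For reference, 65535 is 0xffff, i.e. every message-level bit set. A small
sketch that decodes a msglvl value into flag names is given here. The bit
values are written from memory of the NETIF_MSG_* constants in
include/linux/netdevice.h and should be checked against the kernel tree in
use; decode_msglvl is a hypothetical helper, not an ethtool or kernel
function.

```python
# Message-level bits, assumed to mirror NETIF_MSG_* in include/linux/netdevice.h.
# Verify against the kernel source actually in use before relying on them.
NETIF_MSG_BITS = {
    0x0001: "drv",
    0x0002: "probe",
    0x0004: "link",
    0x0008: "timer",
    0x0010: "ifdown",
    0x0020: "ifup",
    0x0040: "rx_err",
    0x0080: "tx_err",
    0x0100: "tx_queued",
    0x0200: "intr",
    0x0400: "tx_done",
    0x0800: "rx_status",
    0x1000: "pktdata",
    0x2000: "hw",
    0x4000: "wol",
}


def decode_msglvl(value):
    """Return the names of the message-level bits set in value, lowest bit first."""
    return [name for bit, name in sorted(NETIF_MSG_BITS.items()) if value & bit]
```

With this table, decode_msglvl(65535) lists all fifteen names, while a
narrower mask such as 0x0084 would select only link and tx_err messages.
Tracing code placed behind one of the less common bits stays quiet until that
bit is set.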


> On Sun, 18 Nov 2007 22:00:38 +0100, Francois Romieu wrote
> > Denys <nuclearcat@nuclearcat.com> :
[...]
> > I'll take care of it once I am done with the sis190 watchdog failure
> > (nothing hard, simply ugly).

Actually I lack a reliable indication that the Tx process is stuck.
My hack seems able to recover but it can be quite slow to trigger.
A contact at SiS would be welcome.

-- 
Ueimor
