netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG
       [not found] <CAHOfOo21+hSwFrXZuoMppcivcOonhx-m1p-yyZsm6c5UCh0joQ@mail.gmail.com>
@ 2013-03-26 13:46 ` Andrew Brooks
  2013-03-28 14:52   ` Andrew Brooks
  2013-06-24 10:20 ` Andrew Brooks
  1 sibling, 1 reply; 5+ messages in thread
From: Andrew Brooks @ 2013-03-26 13:46 UTC (permalink / raw)
  To: Linux Net-Dev Mailing List

Hello

Using niu driver for this card: Oracle/SUN Multithreaded 10-Gigabit
Ethernet Network Controller
after a period (often less than 24 hours) the interface will hang with
errors every 5 seconds
"niu: xxx: eth2: Transmit timed out, resetting"

Sometimes also in syslog are messages
WARNING: at sch_generic:255 dev_watchdog
NETDEV WATCHDOG: eth2 (niu): transmit queue 10 timed out

I've seen this in kernel 3.5.0-26-generic #42~precise1-Ubuntu SMP
but I've not seen it in kernel 3.2.0-38-generic #61-Ubuntu SMP

Is there some change between kernels which has broken the driver
or is the difference elsewhere?

Thanks

Andrew

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG
  2013-03-26 13:46 ` niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG Andrew Brooks
@ 2013-03-28 14:52   ` Andrew Brooks
  2013-04-04 12:53     ` Andrew Brooks
  0 siblings, 1 reply; 5+ messages in thread
From: Andrew Brooks @ 2013-03-28 14:52 UTC (permalink / raw)
  To: Linux Net-Dev Mailing List

Hi folks

This is a show-stopper. Does anyone have any ideas?

Thanks!

On 26 March 2013 13:46, Andrew Brooks <arb@sat.dundee.ac.uk> wrote:
>
> Hello
>
> Using niu driver for this card: Oracle/SUN Multithreaded 10-Gigabit
> Ethernet Network Controller
> after a period (often less than 24 hours) the interface will hang with
> errors every 5 seconds
> "niu: xxx: eth2: Transmit timed out, resetting"
>
> Sometimes also in syslog are messages
> WARNING: at sch_generic:255 dev_watchdog
> NETDEV WATCHDOG: eth2 (niu): transmit queue 10 timed out
>
> I've seen this in kernel 3.5.0-26-generic #42~precise1-Ubuntu SMP
> but I've not seen it in kernel 3.2.0-38-generic #61-Ubuntu SMP
>
> Is there some change between kernels which has broken the driver
> or is the difference elsewhere?

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG
  2013-03-28 14:52   ` Andrew Brooks
@ 2013-04-04 12:53     ` Andrew Brooks
  0 siblings, 0 replies; 5+ messages in thread
From: Andrew Brooks @ 2013-04-04 12:53 UTC (permalink / raw)
  To: Linux Net-Dev Mailing List

Hi

This bug occurs in kernel 3.2.0-39 too.

Is anyone able to offer any advice?  Thanks.



> On 26 March 2013 13:46, Andrew Brooks <arb@sat.dundee.ac.uk> wrote:
> >
> > Using niu driver for this card: Oracle/SUN Multithreaded 10-Gigabit
> > Ethernet Network Controller
> > after a period (often less than 24 hours) the interface will hang with
> > errors every 5 seconds
> > "niu: xxx: eth2: Transmit timed out, resetting"
> >
> > Sometimes also in syslog are messages
> > WARNING: at sch_generic:255 dev_watchdog
> > NETDEV WATCHDOG: eth2 (niu): transmit queue 10 timed out
> >
> > I've seen this in kernel 3.5.0-26-generic #42~precise1-Ubuntu SMP
> > but I've not seen it in kernel 3.2.0-38-generic #61-Ubuntu SMP
> >
> > Is there some change between kernels which has broken the driver
> > or is the difference elsewhere?

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG
       [not found] <CAHOfOo21+hSwFrXZuoMppcivcOonhx-m1p-yyZsm6c5UCh0joQ@mail.gmail.com>
  2013-03-26 13:46 ` niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG Andrew Brooks
@ 2013-06-24 10:20 ` Andrew Brooks
  2013-08-27 15:01   ` Andrew Brooks
  1 sibling, 1 reply; 5+ messages in thread
From: Andrew Brooks @ 2013-06-24 10:20 UTC (permalink / raw)
  To: Linux Net-Dev Mailing List, Linux Kernel Mailing List

On 26 March 2013 13:44, Andrew Brooks <arb@sat.dundee.ac.uk> wrote:
>
> Using niu driver for this card: Oracle/SUN Multithreaded 10-Gigabit
> Ethernet Network Controller and after a period the interface will hang
> with errors every 5 seconds
> "niu: xxx: eth2: Transmit timed out, resetting"
>
> Sometimes also in syslog are messages
> WARNING: at sch_generic:255 dev_watchdog
> NETDEV WATCHDOG: eth2 (niu): transmit queue 10 timed out

Do you think this could be caused by a problem I've seen reported
by other machines on the network
"received unsolicited ack for DL_UNITDATA_REQ on nxge0" ?
Is there some bad packet flying around that causes the
niu driver to lock up the kernel?

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG
  2013-06-24 10:20 ` Andrew Brooks
@ 2013-08-27 15:01   ` Andrew Brooks
  0 siblings, 0 replies; 5+ messages in thread
From: Andrew Brooks @ 2013-08-27 15:01 UTC (permalink / raw)
  To: Linux Net-Dev Mailing List, Linux Kernel Mailing List

Hi

> On 26 March 2013 13:44, Andrew Brooks <arb@sat.dundee.ac.uk> wrote:
>> Using niu driver for this card: Oracle/SUN Multithreaded 10-Gigabit
>> Ethernet Network Controller and after a period the interface will hang
>> with errors every 5 seconds
>> "niu: xxx: eth2: Transmit timed out, resetting"

Here's more information about the problem:
When the interface hangs we see these messages from the driver:

[3408740.816032] niu: niu_interrupt() ldg[ffff8807141d16d0](18)
v0[8000000000] v1[0] v2[0]
[3408740.816036] niu 0000:09:00.0: eth2: niu_txchan_intr() cs[b860b860000c000]
[3408740.816038] niu 0000:09:00.0: eth2: niu_poll_core() v0[0000008000000000]
[3408740.816040] niu 0000:09:00.0: eth2: niu_tx_work() pkt_cnt[0] cons[119]
[3408740.816042] niu: niu_interrupt() ldg[ffff8807141d16d0](18)
v0[8000000000] v1[0] v2[0]
[3408740.820004] [sched_delayed] sched: RT throttling activated
[3408740.824021] niu 0000:09:00.0: eth2: Disable interrupts
[3408740.824044] niu 0000:09:00.0: eth2: Disable RX MAC
[3408740.824048] niu 0000:09:00.0: eth2: Disable IPP
[3408740.824054] niu 0000:09:00.0: eth2: Stop TX channels
[3408740.824641] niu 0000:09:00.0: eth2: Stop RX channels
[3408740.824652] niu 0000:09:00.0: eth2: Reset TX channels
[3408740.825212] niu 0000:09:00.0: eth2: Reset RX channels
[3408740.825999] niu 0000:09:00.0: eth2: Initialize TXC
[3408740.826002] niu 0000:09:00.0: eth2: Initialize TX channels

However the interface doesn't recover :-(
Are there any clues there?

Thanks!

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2013-08-27 15:01 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <CAHOfOo21+hSwFrXZuoMppcivcOonhx-m1p-yyZsm6c5UCh0joQ@mail.gmail.com>
2013-03-26 13:46 ` niu lock-up (Transmit timed out, resetting) and NETDEV WATCHDOG Andrew Brooks
2013-03-28 14:52   ` Andrew Brooks
2013-04-04 12:53     ` Andrew Brooks
2013-06-24 10:20 ` Andrew Brooks
2013-08-27 15:01   ` Andrew Brooks

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).