From: Philip Molter <philip@datafoundry.com>
To: Michael Chan <mchan@broadcom.com>
Cc: Bernd Schubert <bernd-schubert@gmx.de>, netdev@vger.kernel.org
Subject: Re: tg3: tg3_stop_block timed out
Date: Mon, 04 Sep 2006 16:27:01 -0500 [thread overview]
Message-ID: <44FC9A25.9030608@datafoundry.com> (raw)
In-Reply-To: <1551EAE59135BE47B544934E30FC4FC093FAF4@NT-IRVA-0751.brcm.ad.broadcom.com>
Michael Chan wrote:
> Philip Molter wrote:
>
>> Is there any additional information that I can give to help get some
>> more work targeted at this bug? I've been getting this
>> lockup three or
>> four times a week per server (I have four of them exhibiting
>> this behavior).
>>
>> The network setup is fairly complicated, but unfortunately, these are
>> production machines pushing multi-gigabit traffic loads. We're using
>> vlans on top of bonding on top of anywhere from 2-to-6
>> broadcomm NICs,
>> but it appears that the problem is unrelated to the bonding
>> and vlans,
>> as others are reporting similar problems without those enabled.
>>
>> Any assistance would be appreciated. I've left the original
>> information
>> below for reference.
>
> Since you're using a rather old version of tg3, I suggest that you
> upgrade to a newer version first. Your problem is probably
> different from Bernd Schubert's since he has ASF enabled and you
> don't.
>
>> If anyone could even explain what this error means, that would be
>> helpful. Maybe we can change something to work around it.
>>
>
> The stop_block error messages are not too important. The important
> thing is that you're getting a transmit timeout. It means that
> the tx queue is getting full because the NIC is no longer getting
> interrupts. When this condition is detected, the NIC will get reset
> which should normally bring the NIC back to life. It seems that
> in your case, it doesn't come back. Do you get these timeouts on
> both ports at the same time?
It's hard to tell. When the error gets logged, it doesn't say which
interface it's happening on. The box is locked up by the time we get to
it, but I think it's happening on both.
I've had NICs lock up with queue issues before, but I've never had it
lock up a box completely, unresponsive on console even. Normally,
network just breaks, and sure, it requires a reboot, but at least we can
do a controlled reboot.
This only started happening when we moved these NICs to jumbo frames.
We've used the exact same hardware in less demanding applications (up to
500Mbits vs. 750+Mbits) with jumbo without issue, but these particular
machines, these pushers, only started locking up when we switched to jumbo.
> Please try the latest driver. If you still get the timeouts, I'll
> need to send you some debug patches to dump the state when these
> timeouts occur.
Will do.
next prev parent reply other threads:[~2006-09-04 21:27 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-08-07 22:43 tg3: tg3_stop_block timed out Bernd Schubert
2006-08-07 23:07 ` Michael Chan
2006-08-07 23:24 ` Bernd Schubert
2006-08-07 23:46 ` Michael Chan
2006-08-09 14:44 ` Philip Molter
2006-09-03 22:35 ` Philip Molter
2006-09-04 18:25 ` Michael Chan
2006-09-04 21:27 ` Philip Molter [this message]
2006-09-12 17:22 ` Philip Molter
2006-08-09 15:20 ` Bernd Schubert
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=44FC9A25.9030608@datafoundry.com \
--to=philip@datafoundry.com \
--cc=bernd-schubert@gmx.de \
--cc=mchan@broadcom.com \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).