netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Michael Chan" <mchan@broadcom.com>
To: "Josip Rodin" <joy@entuzijast.net>
Cc: "'Bill Fink'" <billfink@mindspring.com>,
	"Ben Hutchings" <bhutchings@solarflare.com>,
	netdev <netdev@vger.kernel.org>,
	"mirrors@debian.org" <mirrors@debian.org>
Subject: Re: bnx2_poll panicking kernel
Date: Mon, 23 Jun 2008 15:48:39 -0700	[thread overview]
Message-ID: <48602847.1020203@broadcom.com> (raw)
In-Reply-To: <20080623213657.GA26447@orion.carnet.hr>

Josip Rodin wrote:
> On Mon, Jun 23, 2008 at 08:04:39PM +0200, Josip Rodin wrote:
>> Oh, duh, yes, I'm a moron. It's back on now, sorry about that.
> 
> There we go, I got the debugging messages:
> 
> [...]
> Jun 23 19:53:18 arrakis kernel: HTB: quantum of class 10100 is big. Consider r2q change.
> Jun 23 22:57:55 arrakis kernel: bnx2: skb->nr_frags=1 is corrupted, should be 4
> Jun 23 22:58:32 arrakis kernel: bnx2: skb->nr_frags=1 is corrupted, should be 2
> Jun 23 22:59:02 arrakis kernel: bnx2: skb->nr_frags=1 is corrupted, should be 3
> Jun 23 22:59:23 arrakis kernel: bnx2: skb->nr_frags=1 is corrupted, should be 9
> Jun 23 22:59:36 arrakis kernel: bnx2: skb->nr_frags=1 is corrupted, should be 3
> Jun 23 23:08:19 arrakis kernel: bnx2: skb->nr_frags=1 is corrupted, should be 3
> 

OK, this definitely confirms the theory that the skb->nr_frags is changed
between ->hard_start_xmit() and tx completion.  Since we rely on nr_frags to
locate the packet boundaries in the tx ring, it would definitely crash.

One possibility is that it is corrupted by the driver and only happens when
there are HTB rules.  I think this is unlikely.

TG3 which operates the same way has also been reported to crash in the
presence of HTB rules.  We were not able to pinpoint the problem at that
time.

Can anyone think of a scenario where the stack can modify the SKB this way?
These SKBs look like they are TSO packets.  

If not, I will send Josip another patch to print more SKB fields.  I can
even save all the SKB fields and see which other ones are modified besides
the nr_frags.  May be that will give us a better clue.

Thanks.


  reply	other threads:[~2008-06-23 22:46 UTC|newest]

Thread overview: 50+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-06-16 12:57 bnx2_poll panicking kernel Josip Rodin
2008-06-16 14:04 ` Ben Hutchings
2008-06-16 15:52   ` Michael Chan
2008-06-16 19:13     ` Josip Rodin
2008-06-16 21:38       ` Josip Rodin
2008-06-16 21:48         ` Josip Rodin
2008-06-16 23:45           ` Michael Chan
2008-06-17 22:37             ` Josip Rodin
2008-06-17 22:47               ` Michael Chan
2008-06-21 11:18                 ` Josip Rodin
2008-06-21 15:34                   ` Bill Fink
2008-06-21 16:11                     ` Michael Chan
2008-06-23 18:04                       ` Josip Rodin
2008-06-23 21:36                         ` Josip Rodin
2008-06-23 22:48                           ` Michael Chan [this message]
2008-06-24 22:58                             ` Michael Chan
2008-06-25  0:04                               ` David Miller
2008-06-26 11:01                               ` Josip Rodin
2008-06-26 18:04                                 ` Michael Chan
2008-07-09 16:46                                   ` Josip Rodin
2008-07-09 16:57                                     ` Michael Chan
2008-07-09 23:46                                       ` David Miller
2008-07-10  9:45                                         ` Aviv Greenberg
2008-07-10 10:09                                           ` David Miller
2008-07-10 21:00                                         ` Michael Chan
2008-07-10 21:00                                           ` David Miller
2008-07-10 21:23                                           ` Josip Rodin
2008-07-10 21:38                                             ` Michael Chan
2008-07-10 22:00                                               ` Josip Rodin
2008-07-10 22:26                                                 ` Michael Chan
2008-07-10 22:31                                                   ` Josip Rodin
2008-07-10 23:20                                                     ` David Miller
2008-07-11  9:24                                                       ` Josip Rodin
2008-07-11  9:56                                                         ` David Miller
2008-07-11 12:19                                                           ` Patrick McHardy
2008-07-12  9:49                                                           ` Jarek Poplawski
2008-07-12 13:21                                                             ` Jarek Poplawski
2008-07-14 15:27                                                               ` Patrick McHardy
2008-07-14 17:20                                                                 ` Jarek Poplawski
2008-07-14 17:25                                                                   ` Jarek Poplawski
2008-07-14 20:21                                                                   ` Josip Rodin
2008-07-14 21:22                                                                     ` Jarek Poplawski
2008-07-14 21:26                                                                       ` Josip Rodin
2008-07-14 21:48                                                                         ` Jarek Poplawski
2008-07-17 21:30                                                                           ` Josip Rodin
2008-07-17 21:44                                                                             ` David Miller
2008-07-18  5:12                                                                               ` Jarek Poplawski
2008-08-02 12:28                                                                               ` bad htb_{en,re}queue return codes causing corrupt data in drivers [was Re: bnx2_poll panicking kernel] Josip Rodin
2008-08-03  7:06                                                                                 ` bad htb_{en,re}queue return codes causing corrupt data in drivers David Miller
2008-07-14 22:05                                                                         ` bnx2_poll panicking kernel Jarek Poplawski

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=48602847.1020203@broadcom.com \
    --to=mchan@broadcom.com \
    --cc=bhutchings@solarflare.com \
    --cc=billfink@mindspring.com \
    --cc=joy@entuzijast.net \
    --cc=mirrors@debian.org \
    --cc=netdev@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).