All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: David Miller <davem@davemloft.net>
Cc: johnpol@2ka.mipt.ru, penberg@cs.helsinki.fi,
	linux-kernel@vger.kernel.org, netdev@vger.kernel.org,
	vegard.nossum@gmail.com, rjw@sisk.pl, cl@linux-foundation.org,
	auke-jan.h.kok@intel.com, jeffrey.t.kirsher@intel.com
Subject: Re: [bug, netconsole, SLUB] BUG skbuff_head_cache: Poison overwritten
Date: Tue, 22 Jul 2008 15:34:15 +0200	[thread overview]
Message-ID: <20080722133415.GA23046@elte.hu> (raw)
In-Reply-To: <20080722075000.GB15807@elte.hu>


* Ingo Molnar <mingo@elte.hu> wrote:

> > > Can you check kind of this patch:
> > 
> > The call even seems pointless, since the caller will call ->poll() 
> > (which is e1000_clean) as the very next action, and that will invoke 
> > e1000_clean_tx_irq() properly.
> > 
> > I would just delete this call from e1000_netpoll() entirely.
> 
> ok, i've added the patch below to tip/out-of-tree.
> 
> Overnight test had about 100 successful bootups on this testbox. 
> (until it stopped on a drivers/net/hp.c build error - which is 
> unrelated to this problem)
> 
> So testing with netconsole disabled is conclusive enough to implicate 
> netconsole strongly. I've now re-enabled netconsole on the testbox and 
> will continue the test with the fix below. Previously it would crash 
> within 10-40 iterations.

it's now past 50 successful iterations with e1000e+netconsole enabled 
again with the fix applied, and not a single crash, hang, memory 
corruption or other weirdness happened. Jeff, please apply the fix 
below.

Thanks,

	Ingo

---------------->
commit 1fd48807674d4f6defe82258c46619ce2c116943
Author: Ingo Molnar <mingo@elte.hu>
Date:   Tue Jul 22 09:44:32 2008 +0200

    e1000e: fix e1000_netpoll(), remove extraneous e1000_clean_tx_irq() call
    
    Evgeniy Polyakov noticed that drivers/net/e1000e/netdev.c:e1000_netpoll()
    was calling e1000_clean_tx_irq() without taking the TX lock.
    
    David Miller suggested to remove the call altogether: since in this
    callpah there's periodic calls to ->poll() anyway which will do
    e1000_clean_tx_irq() and will garbage-collect any finished TX ring
    descriptors.
    
    This fix solved the e1000e+netconsole crashes i've been seeing:
    
    =============================================================================
    BUG skbuff_head_cache: Poison overwritten
    -----------------------------------------------------------------------------
    
    INFO: 0xf658ae9c-0xf658ae9c. First byte 0x6a instead of 0x6b
    INFO: Allocated in __alloc_skb+0x2c/0x110 age=0 cpu=0 pid=5098
    INFO: Freed in __kfree_skb+0x31/0x80 age=0 cpu=1 pid=4440
    INFO: Slab 0xc16cc140 objects=16 used=1 fp=0xf658ae00 flags=0x400000c3
    INFO: Object 0xf658ae00 @offset=3584 fp=0xf658af00
    
    Signed-off-by: Ingo Molnar <mingo@elte.hu>
---
 drivers/net/e1000e/netdev.c |    2 --
 1 files changed, 0 insertions(+), 2 deletions(-)

diff --git a/drivers/net/e1000e/netdev.c b/drivers/net/e1000e/netdev.c
index 869544b..9c0f56b 100644
--- a/drivers/net/e1000e/netdev.c
+++ b/drivers/net/e1000e/netdev.c
@@ -4067,8 +4067,6 @@ static void e1000_netpoll(struct net_device *netdev)
 	disable_irq(adapter->pdev->irq);
 	e1000_intr(adapter->pdev->irq, netdev);
 
-	e1000_clean_tx_irq(adapter);
-
 	enable_irq(adapter->pdev->irq);
 }
 #endif

  reply	other threads:[~2008-07-22 13:35 UTC|newest]

Thread overview: 58+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-07-17 21:42 [bug, netconsole, SLUB] BUG skbuff_head_cache: Poison overwritten Ingo Molnar
2008-07-17 21:45 ` David Miller
2008-07-17 22:06   ` Ingo Molnar
2008-07-17 22:09     ` David Miller
2008-07-17 22:43     ` Ingo Molnar
2008-07-17 23:15 ` Vegard Nossum
2008-07-17 23:35   ` Vegard Nossum
2008-07-17 23:52   ` Ingo Molnar
2008-07-18  0:01     ` Ingo Molnar
2008-07-18  0:05     ` Vegard Nossum
2008-07-18  0:16       ` Ingo Molnar
2008-07-18  2:13     ` David Miller
2008-07-18  2:03   ` David Miller
2008-07-18  7:03     ` Vegard Nossum
2008-07-18  7:12       ` David Miller
2008-07-18  9:05       ` Ingo Molnar
2008-07-18 19:10       ` [bug] Attempt to release alive inet socket f6fac040 Ingo Molnar
2008-07-18 19:55         ` Ingo Molnar
2008-07-17 23:27 ` [bug, netconsole, SLUB] BUG skbuff_head_cache: Poison overwritten Vegard Nossum
2008-07-17 23:56   ` Ingo Molnar
2008-07-21 11:41     ` Vegard Nossum
2008-07-18  5:46 ` Evgeniy Polyakov
2008-07-18  9:02   ` Pekka Enberg
2008-07-18  9:09     ` Ingo Molnar
2008-07-18  9:15       ` Pekka Enberg
2008-07-18 10:16     ` Evgeniy Polyakov
2008-07-18 14:44       ` Pekka Enberg
2008-07-18 14:48         ` Christoph Lameter
2008-07-18 16:07         ` Evgeniy Polyakov
2008-07-18  9:00 ` Pekka J Enberg
2008-07-18  9:11   ` Ingo Molnar
2008-07-18  9:16     ` Pekka Enberg
2008-07-18 13:54       ` Christoph Lameter
2008-07-21  9:41     ` Ingo Molnar
2008-07-21  9:52       ` Pekka Enberg
2008-07-21  9:52         ` Pekka Enberg
2008-07-21 10:06         ` Evgeniy Polyakov
2008-07-21 10:50           ` Ingo Molnar
2008-07-21 11:03             ` Vegard Nossum
2008-07-21 11:13               ` Ingo Molnar
2008-07-21 16:19               ` Christoph Lameter
2008-07-21 20:23                 ` Vegard Nossum
2008-07-21 11:25             ` Evgeniy Polyakov
2008-07-21 11:55               ` Ingo Molnar
2008-07-21 12:57                 ` Evgeniy Polyakov
2008-07-21 14:01                   ` Ingo Molnar
2008-07-21 19:21                 ` Ingo Molnar
2008-07-21 21:24                   ` Evgeniy Polyakov
2008-07-21 23:33                     ` David Miller
2008-07-22  7:50                       ` Ingo Molnar
2008-07-22 13:34                         ` Ingo Molnar [this message]
2008-07-23 22:31                           ` David Miller
2008-07-23 22:40                             ` Jeff Kirsher
2008-07-21 16:22           ` Christoph Lameter
2008-07-21 19:57             ` Evgeniy Polyakov
2008-07-21 20:05               ` Ingo Molnar
2008-07-21 20:22                 ` Vegard Nossum
2008-07-18 13:55   ` Christoph Lameter

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20080722133415.GA23046@elte.hu \
    --to=mingo@elte.hu \
    --cc=auke-jan.h.kok@intel.com \
    --cc=cl@linux-foundation.org \
    --cc=davem@davemloft.net \
    --cc=jeffrey.t.kirsher@intel.com \
    --cc=johnpol@2ka.mipt.ru \
    --cc=linux-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=penberg@cs.helsinki.fi \
    --cc=rjw@sisk.pl \
    --cc=vegard.nossum@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.