All of lore.kernel.org
 help / color / mirror / Atom feed
From: Cong Wang <xiyou.wangcong@gmail.com>
To: netdev@vger.kernel.org
Subject: Re: NULL deref in bnx2 / crashes ? ( was: netconsole leads to stalled CPU task )
Date: Thu, 23 Aug 2012 09:12:31 +0000 (UTC)	[thread overview]
Message-ID: <k14s5v$7bs$2@ger.gmane.org> (raw)
In-Reply-To: k14poh$7bs$1@ger.gmane.org

On Thu, 23 Aug 2012 at 08:31 GMT, Cong Wang <xiyou.wangcong@gmail.com> wrote:
> On Thu, 23 Aug 2012 at 07:57 GMT, Cong Wang <xiyou.wangcong@gmail.com> wrote:
>> On Wed, 22 Aug 2012 at 14:29 GMT, Sylvain Munaut <s.munaut@whatever-company.com> wrote:
>>> Hi,
>>>
>>>
>>> The machine with the intel card still hard freeze (no output / no nothing ...)
>>> The machine with the bnx2 don't crash anymore and no NULL deref, but
>>> the modprobe still hangs and I get this every 180 sec or so :
>>
>> The NULL-deref can be reproduced easily, and Eric's patch could fix it.
>> So, Eric, can you resend your patch with your SOB?
>>
>> I can't reproduce the hang as it is net driver specific, it is
>> probably related with my patch:
>>
>> commit 6bdb7fe31046ac50b47e83c35cd6c6b6160a475d
>> Author: Amerigo Wang <amwang@redhat.com>
>> Date:   Fri Aug 10 01:24:50 2012 +0000
>>
>>     netpoll: re-enable irq in poll_napi()
>>
>
> Could you test the following patch?
>
> diff --git a/net/core/netpoll.c b/net/core/netpoll.c
> index ddc453b..ed4d1e4 100644
> --- a/net/core/netpoll.c
> +++ b/net/core/netpoll.c
> @@ -166,11 +166,18 @@ static int poll_one_napi(struct netpoll_info *npinfo,
>  static void poll_napi(struct net_device *dev)
>  {
>  	struct napi_struct *napi;
> +	LIST_HEAD(napi_list);
>  	int budget = 16;
>  
>  	WARN_ON_ONCE(!irqs_disabled());
>  
> -	list_for_each_entry(napi, &dev->napi_list, dev_list) {
> +	/* After we enable the IRQ, new entries could be added
> +	 * to this list, we need to save it before re-enable
> +	 * IRQ.
> +	 */
> +	list_splice_tail(&dev->napi_list, &napi_list);
> +

This one should be list_splice_init()...


> +	list_for_each_entry(napi, &napi_list, dev_list) {
>  		local_irq_enable();
>  		if (napi->poll_owner != smp_processor_id() &&
>  		    spin_trylock(&napi->poll_lock)) {
> @@ -187,6 +194,7 @@ static void poll_napi(struct net_device *dev)
>  		}
>  		local_irq_disable();
>  	}
> +	list_splice_tail(&napi_list, &dev->napi_list);
>  }
>  
>  static void service_arp_queue(struct netpoll_info *npi)
>
>
>
>
> However, it seems we should take rtnl lock to make sure dev->napi_list
> is really safe, I am not sure if the following one makes sense.
>
>
> diff --git a/net/core/netpoll.c b/net/core/netpoll.c
> index ddc453b..7770e2b 100644
> --- a/net/core/netpoll.c
> +++ b/net/core/netpoll.c
> @@ -170,8 +170,9 @@ static void poll_napi(struct net_device *dev)
>  
>  	WARN_ON_ONCE(!irqs_disabled());
>  
> +	local_irq_enable();
> +	rtnl_lock();
>  	list_for_each_entry(napi, &dev->napi_list, dev_list) {
> -		local_irq_enable();
>  		if (napi->poll_owner != smp_processor_id() &&
>  		    spin_trylock(&napi->poll_lock)) {
>  			rcu_read_lock_bh();
> @@ -180,13 +181,12 @@ static void poll_napi(struct net_device *dev)
>  			rcu_read_unlock_bh();
>  			spin_unlock(&napi->poll_lock);
>  
> -			if (!budget) {
> -				local_irq_disable();
> +			if (!budget)
>  				break;
> -			}
>  		}
> -		local_irq_disable();
>  	}
> +	rtnl_unlock();
> +	local_irq_disable();
>  }
>  
>  static void service_arp_queue(struct netpoll_info *npi)
>

  reply	other threads:[~2012-08-23  9:12 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-08-22 10:53 NULL deref in bnx2 / crashes ? ( was: netconsole leads to stalled CPU task ) Sylvain Munaut
2012-08-22 11:13 ` Eric Dumazet
2012-08-22 12:17   ` Sylvain Munaut
2012-08-22 13:05     ` Eric Dumazet
2012-08-22 14:29       ` Sylvain Munaut
2012-08-22 15:40         ` Cong Wang
2012-08-23  7:57         ` Cong Wang
2012-08-23  8:31           ` Cong Wang
2012-08-23  9:12             ` Cong Wang [this message]
2012-08-24  9:50               ` Sylvain Munaut
2012-08-25  8:01                 ` Cong Wang
2012-08-25  2:20         ` Lin Ming
2012-09-12 11:53       ` Sylvain Munaut
2012-09-12 12:49         ` Cong Wang
2012-09-12 13:05           ` Eric Dumazet
2012-09-13 17:35             ` Sylvain Munaut
2012-09-14 13:22               ` Cong Wang
2012-09-14 15:36                 ` Sylvain Munaut
2012-09-17 10:57                   ` Sylvain Munaut
2012-09-17 15:17                     ` Cong Wang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='k14s5v$7bs$2@ger.gmane.org' \
    --to=xiyou.wangcong@gmail.com \
    --cc=netdev@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.