All of lore.kernel.org
 help / color / mirror / Atom feed
From: Larry Finger <Larry.Finger-tQ5ms3gMjBLk1uMJSBkQmQ@public.gmane.org>
To: Ray Lee <ray-lk-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org>
Cc: netdev-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	Bcm43xx-dev-0fE9KPoRgkgATYTw5x5z8w@public.gmane.org,
	Michael Buesch <mb-fseUSCV1ubazQB+pC5nmwQ@public.gmane.org>
Subject: Re: bcm43xx regression 2.6.19rc3 -> rc5, rtnl_lock trouble?
Date: Thu, 16 Nov 2006 19:13:10 -0600	[thread overview]
Message-ID: <455D0CA6.3030604@lwfinger.net> (raw)
In-Reply-To: <ae017dc00611161526v6bcbddc2ve2c7e10963d25c3b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>

Ray Lee wrote:
> First off, thanks for all your help.
> 
> Second off,
> 
> On 11/16/06, Larry Finger <Larry.Finger-tQ5ms3gMjBLk1uMJSBkQmQ@public.gmane.org> wrote:
>> Ray Lee wrote:
>> >
>> > If I could figure out a way to make it repeatable, I'd happily do a 
>> blind
>> > bisect.
> [...]
>> > I'm open to suggestions on how to make the problem trigger more than 
>> once
>> > every two days...
>>
>> I don't know what might be causing the lock problems. I'm more 
>> concerned with the NETDEV WATCHDOG
>> timeouts. AFAIK, you are the only one still reporting this error. On 
>> my system, I get an occasional
>> MAC suspend failure, sometimes followed by an BCM43xx_IRQ_XMIT_ERROR.
> 
> Last time I had trouble with 2.6.18-rcX, I wasn't the only one, just
> the only one reporting it. Can you tell me why reverting the likely
> culprit isn't an option? rc6 is out, and Linus is really pushing to
> finalize 2.6.19 here soon.
> 
>>  From what I read in your post, the timeouts happen a lot more often 
>> than once every two days. Once
>> we get those fixed, then we can concentrate on the locking.
> 
> It's becoming clear that I wasn't so clear :-). No, it doesn't happen
> more than once every two (three, now) days. I'm saying that it's only
> happened twice, as once the first timeout message starts, the timeouts
> don't stop short of a reboot.
> 
> Or, in other words, it happened occasionally under 2.6.19-rc3, but
> fixed itself. Under 2.6.19-rc5, it's happened less frequently (maybe),
> but once it starts, it goes on solid until I reboot the computer.
> Until I reboot, the laptop is fully unusable as things start hanging
> on the rtnl_lock (X, apparently).
> 
> Please see http://madrabbit.org/~ray/messages.gz for the
> /var/log/messages to understand what I mean by that. (Though, that was
> captured before I'd rebuilt the module with debugging, unfortunately.
> Regardless, it may help clarify what I mean here.)
> 
> So all the NETDEV WATCHDOG timeouts other than the first (of each of
> the two events) appear to be bogus, or side effects of rtnl_lock being
> held after the first time, and not clearing out.
> 
> <thinks...> Maybe I've got the culprit backward here. Perhaps
> something else in my system is locking on rtnl_lock, and bcm43xx can't
> acquire it? Could the NETDEV WATCHDOG timeouts be a side effect of
> someone acquiring and not releasing the rtnl_lock()? Is that possible?
> (ie, would it cause the effect I'm seeing?)

It certainly could. Please remove the new line in the hunk below for 
drivers/net/wireless/bcm43xx/bcm43xx_main.c:

@@ -3569,6 +3586,7 @@ int bcm43xx_select_wireless_core(struct
         bcm43xx_macfilter_clear(bcm, BCM43xx_MACFILTER_ASSOC);
         bcm43xx_macfilter_set(bcm, BCM43xx_MACFILTER_SELF, (u8 *)(bcm->net_dev->dev_addr));
         bcm43xx_security_init(bcm);
+       drain_txstatus_queue(bcm);
         ieee80211softmac_start(bcm->net_dev);

This will effectively remove _ALL_ bcm43xx patches between 2.6.19-rc3 and -rc6. If the rtnl_locks 
still occur, bcm43xx is not causing them. The other patches are not involved for your system.

Larry

  parent reply	other threads:[~2006-11-17  1:13 UTC|newest]

Thread overview: 38+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-11-15 19:01 bcm43xx regression 2.6.19rc3 -> rc5, rtnl_lock trouble? Ray Lee
2006-11-15 19:15 ` Michael Buesch
2006-11-15 19:15   ` Michael Buesch
2006-11-15 19:41   ` Ray Lee
2006-11-16  2:51     ` Larry Finger
2006-11-16  2:51       ` Larry Finger
2006-11-16  5:51       ` Ray Lee
2006-11-16 18:17         ` Larry Finger
     [not found]           ` <455CAB2F.1060709-tQ5ms3gMjBLk1uMJSBkQmQ@public.gmane.org>
2006-11-16 19:16             ` Michael Buesch
2006-11-16 19:36               ` Ray Lee
     [not found]                 ` <455CBDD7.6000507-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org>
2006-11-16 22:40                   ` Larry Finger
2006-11-16 23:26                     ` Ray Lee
     [not found]                       ` <ae017dc00611161526v6bcbddc2ve2c7e10963d25c3b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2006-11-17  1:13                         ` Larry Finger [this message]
2006-11-18 11:24 ` Joseph Fannin
2006-11-18 16:55   ` Johannes Berg
2006-11-18 16:55     ` Johannes Berg
2006-11-18 17:05     ` Larry Finger
2006-11-18 17:05       ` Larry Finger
2006-11-18 17:27       ` Ray Lee
2006-11-18 18:30         ` Adrian Bunk
2006-11-21  6:21           ` Ray Lee
2006-11-18 19:02         ` Larry Finger
2006-11-19 16:01           ` Michael Buesch
2006-12-12  4:06           ` ieee80211 sleeping in invalid context Ray Lee
2006-12-12  9:14             ` Michael Buesch
2006-12-12 17:51               ` Ray Lee
2006-12-12 18:31                 ` Larry Finger
2006-12-12 18:31                   ` Larry Finger
2006-11-19  6:15         ` Problem with DMA on x86_64 with 3 GB RAM Larry Finger
2006-11-21  4:38           ` Ray Lee
2006-11-21 11:28             ` Alan
2006-11-21 16:34               ` Larry Finger
2006-11-21 10:30           ` Andi Kleen
2006-11-21 16:37             ` Larry Finger
2006-11-21 16:46               ` Andi Kleen
2006-11-21 18:27                 ` Alan
2006-11-21 18:31                   ` Andi Kleen
2006-11-21 20:04                     ` Alan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=455D0CA6.3030604@lwfinger.net \
    --to=larry.finger-tq5ms3gmjblk1umjsbkqmq@public.gmane.org \
    --cc=Bcm43xx-dev-0fE9KPoRgkgATYTw5x5z8w@public.gmane.org \
    --cc=mb-fseUSCV1ubazQB+pC5nmwQ@public.gmane.org \
    --cc=netdev-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    --cc=ray-lk-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.