netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Michael Buesch <mb@bu3sch.de>
To: Larry Finger <Larry.Finger@lwfinger.net>
Cc: Johannes Berg <johannes@sipsolutions.net>,
	Joseph Fannin <jhf@columbus.rr.com>,
	Andrew Morton <akpm@osdl.org>,
	netdev@vger.kernel.org, linux-kernel@vger.kernel.org,
	Michael Buesch <mb@bu3sch.de>,
	Bcm43xx-dev@lists.berlios.denunk, Adrian Bunk <bunk@stusta.de>,
	Ray Lee <ray-lk@madrabbit.org>
Subject: Re: bcm43xx regression 2.6.19rc3 -> rc5, rtnl_lock trouble?
Date: Sun, 19 Nov 2006 17:01:21 +0100	[thread overview]
Message-ID: <200611191701.21593.mb@bu3sch.de> (raw)
In-Reply-To: <455F58AC.3030801@lwfinger.net>

On Saturday 18 November 2006 20:02, Larry Finger wrote:
> Ray Lee wrote:
> > Larry Finger wrote:
> >> Johannes Berg wrote:
> >>> Hah, that's a lot more plausible than bcm43xx's drain patch actually
> >>> causing this. So maybe somehow interrupts for bcm43xx aren't routed
> >>> properly or something...
> >>>
> >>> Ray, please check /proc/interrupts when this happens.
> > 
> > When it happens, I can't. The keyboard is entirely dead (I'm in X, perhaps at
> > a console it would be okay). The only thing that works is magic SysRq. even
> > ctrl-alt-f1 to get to a console doesn't work.
> > 
> > That said, /proc/interrupts doesn't show MSI routed things on my AMD64 laptop.
> > 
> >>> I am convinced that the patch in question (drain tx status) is not
> >>> causing this -- the patch should be a no-op in most cases anyway, and in
> >>> those cases where it isn't a no-op it'll run only once at card init and
> >>> remove some things from a hardware-internal FIFO.
> > 
> > Okay, I can buy that.
> > 
> >> I agree that drain tx status should not cause the problem.
> >>
> >> Ray, does -rc6 solve your problem as it did for Joseph?
> > 
> > I can't get it to repeat other than the first two times. However, I
> > accidentally stopped NetworkManager from handling my wireless a few days ago,
> > and haven't restarted it, so that may play into this.
> > 
> > Humor me one last time, I beg. Did you look at the messages file I posted? (Or
> > maybe I didn't include this second bit... Damn, I need to be more careful with
> > cutting and pasting...)
> 
> The locking stuff wasn't in any of the messages that I received.
> 
> > The second sysrq-t shows locking stuff going on, can you tell me if it looks
> > reasonable? It still seems to me that something acquiring and not releasing
> > rtnl_lock explains what I was seeing (rtnl lock is implicated in both sysrq-t
> > backtraces). I don't know if that thing is bcm43xx, though.
> > 
> > Is this part reasonable?:
> >  1 lock held by events/0/4:
> >   #0:  (&bcm->mutex){--..}, at: [mutex_lock+9/16] mutex_lock+0x9/0x10
> >  2 locks held by NetworkManager/4837:
> >   #0:  (rtnl_mutex){--..}, at: [mutex_lock+9/16] mutex_lock+0x9/0x10
> >   #1:  (&bcm->mutex){--..}, at: [mutex_lock+9/16] mutex_lock+0x9/0x10
> >  1 lock held by wpa_supplicant/5953:
> >   #0:  (rtnl_mutex){--..}, at: [mutex_lock+9/16] mutex_lock+0x9/0x10
> 
> I'm not an expert on locking, but it certainly looks as if bcm43xx and wpa_supplicant are OK by 
> themselves, but that NetworkManager interferes. This behavior matches what I see - I don't have 
> NetworkManager on my system, but I do use wpa_supplicant, with no lockups. Of course, I have i386 
> architecture.
> 
> Although NetworkManager may be the catalyst to trigger the bug, I doubt that it is the cause. 
> Strictly as a guess, I would suspect that the locking problem is in SoftMAC, where we know there can 
> be locking difficulties, but no one is fixing them because EOL is near for that component.

Well, this is different. The (known) locking problems are of type "missing locking".
But this seems to be a deadlock or something, so clearly a bug that
must be fixed. Even in softmac.
I don't know yet how this can happen, though.

-- 
Greetings Michael.

  reply	other threads:[~2006-11-19 16:03 UTC|newest]

Thread overview: 23+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-11-15 19:01 bcm43xx regression 2.6.19rc3 -> rc5, rtnl_lock trouble? Ray Lee
     [not found] ` <455B63EC.8070704-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org>
2006-11-15 19:15   ` Michael Buesch
2006-11-15 19:41     ` Ray Lee
     [not found]       ` <455B6D74.2020507-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org>
2006-11-16  2:51         ` Larry Finger
2006-11-16  5:51           ` Ray Lee
2006-11-16 18:17             ` Larry Finger
     [not found]               ` <455CAB2F.1060709-tQ5ms3gMjBLk1uMJSBkQmQ@public.gmane.org>
2006-11-16 19:16                 ` Michael Buesch
2006-11-16 19:36                   ` Ray Lee
     [not found]                     ` <455CBDD7.6000507-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org>
2006-11-16 22:40                       ` Larry Finger
2006-11-16 23:26                         ` Ray Lee
     [not found]                           ` <ae017dc00611161526v6bcbddc2ve2c7e10963d25c3b-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2006-11-17  1:13                             ` Larry Finger
2006-11-18 11:24 ` Joseph Fannin
     [not found]   ` <20061118112438.GB15349-JY2TvBve8Und/56dcus+/6462+2Rg2F6@public.gmane.org>
2006-11-18 16:55     ` Johannes Berg
     [not found]       ` <1163868955.27188.2.camel-YfaajirXv214zXjbi5bjpg@public.gmane.org>
2006-11-18 17:05         ` Larry Finger
2006-11-18 17:27           ` Ray Lee
2006-11-18 18:30             ` Adrian Bunk
2006-11-21  6:21               ` Ray Lee
2006-11-18 19:02             ` Larry Finger
2006-11-19 16:01               ` Michael Buesch [this message]
2006-12-12  4:06               ` ieee80211 sleeping in invalid context Ray Lee
2006-12-12  9:14                 ` Michael Buesch
2006-12-12 17:51                   ` Ray Lee
     [not found]                     ` <457EEC1E.9000806-0Cg02Ec9UG4BXFe83j6qeQ@public.gmane.org>
2006-12-12 18:31                       ` Larry Finger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200611191701.21593.mb@bu3sch.de \
    --to=mb@bu3sch.de \
    --cc=Bcm43xx-dev@lists.berlios.denunk \
    --cc=Larry.Finger@lwfinger.net \
    --cc=akpm@osdl.org \
    --cc=bunk@stusta.de \
    --cc=jhf@columbus.rr.com \
    --cc=johannes@sipsolutions.net \
    --cc=linux-kernel@vger.kernel.org \
    --cc=netdev@vger.kernel.org \
    --cc=ray-lk@madrabbit.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).