* 2.6.23-rc5: possible irq lock inversion dependency detected
@ 2007-09-02 13:11 Christian Kujau
2007-09-10 12:03 ` Peter Zijlstra
2007-09-10 13:00 ` Herbert Xu
0 siblings, 2 replies; 8+ messages in thread
From: Christian Kujau @ 2007-09-02 13:11 UTC
To: linux-kernel; +Cc: netdev
Hi,
after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
was quite noisy when I tried to shape my external (wireless) interface:
[ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
[ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>] netif_receive_skb+0x2d5/0x3c0
[ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the past:
[ 6400.535145] (police_lock){-.--}
This happened when I executed: http://nerdbynature.de/bits/2.6.23-rc5/qos.sh.txt
(using iproute2-ss070313). The system is still running, I just noticed a short
hiccup, probably when it was busy writing the warning to the disk.
More details and .config: http://nerdbynature.de/bits/2.6.23-rc5/
I'm not really sure what the application mentioned in the message above
has to do with this: the application[1] has been running since bootup as
a non-privileged user and did so for earlier kernel versions too.
Christian.
[0] http://lkml.org/lkml/2007/9/2/6
[1] http://folding.stanford.edu/linux.html
--
BOFH excuse #294:
PCMCIA slave driver
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: Peter Zijlstra @ 2007-09-10 12:03 UTC
To: Christian Kujau; +Cc: linux-kernel, netdev

On Sun, 2007-09-02 at 15:11 +0200, Christian Kujau wrote:
> Hi,
>
> after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
> was quite noisy when I tried to shape my external (wireless) interface:
>
> [ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
> [ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>] netif_receive_skb+0x2d5/0x3c0
> [ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the past:
> [ 6400.535145] (police_lock){-.--}
>
> This happened when I executed: http://nerdbynature.de/bits/2.6.23-rc5/qos.sh.txt
> (using iproute2-ss070313). The is still running, I just noticed a short
> hickup, probably when it was busy writing the warning to the disk.
>
> More details and .config: http://nerdbynature.de/bits/2.6.23-rc5/

seems unavailable at this time, please submit the whole lockdep report
if possible.
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: Herbert Xu @ 2007-09-10 13:00 UTC
To: Christian Kujau; +Cc: linux-kernel, netdev, jamal

On Sun, Sep 02, 2007 at 01:11:29PM +0000, Christian Kujau wrote:
>
> after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
> was quite noisy when I tried to shape my external (wireless) interface:
>
> [ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
> [ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>]
> netif_receive_skb+0x2d5/0x3c0
> [ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the
> past:
> [ 6400.535145] (police_lock){-.--}

This is a genuine dead-lock. The police lock can be taken
for reading with softirqs on. If a second CPU tries to take
the police lock for writing, while holding the ingress lock,
then a softirq on the first CPU can dead-lock when it tries
to get the ingress lock.

The minimal fix would be to make sure that we disable BH on
the first CPU.

Jamal, could you take a look at this please?

Thanks,
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
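The rule lockdep is applying in the report above can be modelled in a few lines of userspace C. This is only a sketch under simplifying assumptions, not kernel code: the struct, the function names and the two-lock enum are all hypothetical, and real lockdep tracks far more state (hardirq vs. softirq, read vs. write, full call chains). The model records, per lock, whether it was ever acquired from softirq context and whether it was ever acquired with softirqs enabled, plus "B was taken while A was held" edges. It then flags any lock used in softirq context that can lead to a lock taken with softirqs enabled.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

#define MAX_LOCKS 8

/* Hypothetical lock ids for the two locks named in the lockdep report. */
enum { INGRESS_LOCK, POLICE_LOCK, NR_LOCKS };

struct lock_model {
	bool used_in_softirq[MAX_LOCKS];  /* ever acquired from softirq context */
	bool taken_softirq_on[MAX_LOCKS]; /* ever acquired with softirqs enabled */
	bool dep[MAX_LOCKS][MAX_LOCKS];   /* dep[a][b]: b acquired while a was held */
	int nr_locks;
};

static void model_init(struct lock_model *m, int nr_locks)
{
	assert(nr_locks > 0 && nr_locks <= MAX_LOCKS);
	memset(m, 0, sizeof(*m));
	m->nr_locks = nr_locks;
}

/* Record one acquisition of `lock`; `held` is the lock already held, or -1. */
static void model_acquire(struct lock_model *m, int lock, int held,
			  bool in_softirq, bool softirqs_enabled)
{
	assert(lock >= 0 && lock < m->nr_locks);
	assert(held >= -1 && held < m->nr_locks);

	if (in_softirq)
		m->used_in_softirq[lock] = true;
	else if (softirqs_enabled)
		m->taken_softirq_on[lock] = true;

	if (held >= 0)
		m->dep[held][lock] = true;
}

/* Depth-first search: does `from` lead to a lock taken with softirqs on? */
static bool reaches_unsafe(const struct lock_model *m, int from, bool *seen)
{
	seen[from] = true;
	for (int to = 0; to < m->nr_locks; to++) {
		if (!m->dep[from][to] || seen[to])
			continue;
		if (m->taken_softirq_on[to] || reaches_unsafe(m, to, seen))
			return true;
	}
	return false;
}

/* True if a lock used in softirq context depends on a softirq-unsafe lock. */
static bool has_softirq_inversion(const struct lock_model *m)
{
	for (int a = 0; a < m->nr_locks; a++) {
		bool seen[MAX_LOCKS] = { false };

		if (m->used_in_softirq[a] && reaches_unsafe(m, a, seen))
			return true;
	}
	return false;
}
```

Feeding the model the three acquisitions from the report (police_lock read with softirqs on, police_lock taken under ingress_lock, ingress_lock taken in softirq) makes `has_softirq_inversion()` return true. Recording the first acquisition with softirqs disabled instead, which is what a `_bh` variant would do, makes it return false.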
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: jamal @ 2007-09-11 0:04 UTC
To: Herbert Xu; +Cc: Christian Kujau, linux-kernel, netdev

[-- Attachment #1: Type: text/plain, Size: 336 bytes --]

On Mon, 2007-10-09 at 21:00 +0800, Herbert Xu wrote:

> The minimal fix would be to make sure that we disable BH on
> the first CPU.

disabling BH would make it more symmetric to the way we handle
egress. I couldnt reproduce the issue, but this should hopefully resolve
it.
Christian, can you test with this patch?

cheers,
jamal

[-- Attachment #2: ing1 --]
[-- Type: text/plain, Size: 549 bytes --]

[NET_SCHED] make ingress qlock symmetric to egress

Signed-off-by: Jamal Hadi Salim <hadi@cyberus.ca>

--- a/net/sched/sch_generic.c 2007/09/10 23:19:45 1.1
+++ b/net/sched/sch_generic.c 2007/09/10 23:52:45
@@ -42,12 +42,12 @@
 void qdisc_lock_tree(struct net_device *dev)
 {
 	spin_lock_bh(&dev->queue_lock);
-	spin_lock(&dev->ingress_lock);
+	spin_lock_bh(&dev->ingress_lock);
 }

 void qdisc_unlock_tree(struct net_device *dev)
 {
-	spin_unlock(&dev->ingress_lock);
+	spin_unlock_bh(&dev->ingress_lock);
 	spin_unlock_bh(&dev->queue_lock);
 }
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: Herbert Xu @ 2007-09-11 2:18 UTC
To: jamal; +Cc: Christian Kujau, linux-kernel, netdev

On Mon, Sep 10, 2007 at 08:04:41PM -0400, jamal wrote:
>
> disabling BH would make it more symmetric to the way we handle
> egress. I couldnt reproduce the issue, but this should hopefully resolve
> it.
> Christian, can you test with this patch?

Jamal, it's the police_lock that we need to make _bh. The
ingress_lock is already _bh because of the spin_lock_bh that
directly precedes it.

Oh and I think the same thing applies for the other actions
too.

Cheers,
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
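Herbert's point that ingress_lock is "already _bh" follows from BH disabling being counted per CPU: once the outer spin_lock_bh() on queue_lock has disabled softirqs, a plain spin_lock() taken inside it runs with softirqs still off. A minimal userspace sketch of that nesting follows. The `cpu_model` struct and the `model_*` helpers are hypothetical stand-ins rather than kernel APIs, and the sketch tracks only the disable depth, not the locks themselves.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical per-CPU state: how many times BH has been disabled. */
struct cpu_model {
	int bh_disable_depth;
};

/* Stand-in for spin_lock_bh(): disables softirqs, then takes the lock. */
static void model_spin_lock_bh(struct cpu_model *cpu)
{
	cpu->bh_disable_depth++;
}

/* Stand-in for spin_unlock_bh(): drops the lock, then re-enables softirqs. */
static void model_spin_unlock_bh(struct cpu_model *cpu)
{
	assert(cpu->bh_disable_depth > 0);
	cpu->bh_disable_depth--;
}

/* Stand-ins for plain spin_lock()/spin_unlock(): BH state is untouched. */
static void model_spin_lock(struct cpu_model *cpu)
{
	(void)cpu;
}

static void model_spin_unlock(struct cpu_model *cpu)
{
	(void)cpu;
}

/* Softirqs can only run on this CPU while nothing has them disabled. */
static bool softirq_may_run(const struct cpu_model *cpu)
{
	return cpu->bh_disable_depth == 0;
}
```

Replaying qdisc_lock_tree() against this model (one `model_spin_lock_bh()` for queue_lock, then one `model_spin_lock()` for ingress_lock) leaves `softirq_may_run()` false until the matching `model_spin_unlock_bh()`, which is why changing ingress_lock to the `_bh` variant did not address the report.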
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: jamal @ 2007-09-11 12:01 UTC
To: Herbert Xu; +Cc: Christian Kujau, linux-kernel, netdev

[-- Attachment #1: Type: text/plain, Size: 629 bytes --]

On Tue, 2007-11-09 at 10:18 +0800, Herbert Xu wrote:

> Jamal, it's the police_lock that we need to make _bh. The
> ingress_lock is already _bh because of the spin_lock_bh that
> directly precedes it.
>
> Oh and I think the same thing applies for the other actions
> too.

ga-Dang. Ok, here it is. If you see(?) any more farts let me know. I am
around for another 30 minutes and off for about 18 hours.

Christian, i took your config and qos setup but I cant reproduce the
issue - i think i may need some of that wireless setup to recreate.
So if you can test this and validate it works we can push it forward.

cheers,
jamal

[-- Attachment #2: act_bhl --]
[-- Type: text/plain, Size: 2205 bytes --]

[NET_SCHED] protect action config/dump from irqs

From the sharp laser eyes of Herbert Xu to my slow farting brain...
(with no apologies to C Heston)

On Mon, 2007-10-09 at 21:00 +0800, Herbert Xu wrote:
On Sun, Sep 02, 2007 at 01:11:29PM +0000, Christian Kujau wrote:
> >
> > after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
> > was quite noisy when I tried to shape my external (wireless) interface:
> >
> > [ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
> > [ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>]
> > netif_receive_skb+0x2d5/0x3c0
> > [ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the
> > past:
> > [ 6400.535145] (police_lock){-.--}
>
> This is a genuine dead-lock. The police lock can be taken
> for reading with softirqs on. If a second CPU tries to take
> the police lock for writing, while holding the ingress lock,
> then a softirq on the first CPU can dead-lock when it tries
> to get the ingress lock.

Signed-off-by: Jamal Hadi Salim <hadi@cyberus.ca>

--- a/net/sched/act_police.c 2007/09/11 10:39:36 1.1
+++ b/net/sched/act_police.c 2007/09/11 10:51:47
@@ -56,7 +56,7 @@
 	int err = 0, index = -1, i = 0, s_i = 0, n_i = 0;
 	struct rtattr *r;

-	read_lock(&police_lock);
+	read_lock_bh(&police_lock);

 	s_i = cb->args[0];

@@ -85,7 +85,7 @@
 		}
 	}
 done:
-	read_unlock(&police_lock);
+	read_unlock_bh(&police_lock);
 	if (n_i)
 		cb->args[0] += n_i;
 	return n_i;
--- a/net/sched/act_api.c 2007/09/11 10:47:51 1.1
+++ b/net/sched/act_api.c 2007/09/11 10:50:47
@@ -68,7 +68,7 @@
 	int err = 0, index = -1,i = 0, s_i = 0, n_i = 0;
 	struct rtattr *r ;

-	read_lock(hinfo->lock);
+	read_lock_bh(hinfo->lock);

 	s_i = cb->args[0];

@@ -96,7 +96,7 @@
 		}
 	}
 done:
-	read_unlock(hinfo->lock);
+	read_unlock_bh(hinfo->lock);
 	if (n_i)
 		cb->args[0] += n_i;
 	return n_i;
@@ -156,13 +156,13 @@
 {
 	struct tcf_common *p;

-	read_lock(hinfo->lock);
+	read_lock_bh(hinfo->lock);
 	for (p = hinfo->htab[tcf_hash(index, hinfo->hmask)]; p;
 	     p = p->tcfc_next) {
 		if (p->tcfc_index == index)
 			break;
 	}
-	read_unlock(hinfo->lock);
+	read_unlock_bh(hinfo->lock);

 	return p;
 }
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: Herbert Xu @ 2007-09-11 12:43 UTC
To: jamal, David S. Miller; +Cc: Christian Kujau, linux-kernel, netdev

On Tue, Sep 11, 2007 at 08:01:46AM -0400, jamal wrote:
>
> [NET_SCHED] protect action config/dump from irqs

Looks good! Thanks Jamal.
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
From: David Miller @ 2007-09-12 14:33 UTC
To: herbert; +Cc: hadi, lists, linux-kernel, netdev

From: Herbert Xu <herbert@gondor.apana.org.au>
Date: Tue, 11 Sep 2007 20:43:27 +0800

> On Tue, Sep 11, 2007 at 08:01:46AM -0400, jamal wrote:
> >
> > [NET_SCHED] protect action config/dump from irqs
>
> Looks good! Thanks Jamal.

Applied, I'll try to push this in some time soon.