* 2.6.23-rc5: possible irq lock inversion dependency detected
@ 2007-09-02 13:11 Christian Kujau
2007-09-10 12:03 ` Peter Zijlstra
2007-09-10 13:00 ` Herbert Xu
0 siblings, 2 replies; 8+ messages in thread
From: Christian Kujau @ 2007-09-02 13:11 UTC
To: linux-kernel; +Cc: netdev
Hi,
after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
was quite noisy when I tried to shape my external (wireless) interface:
[ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
[ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>] netif_receive_skb+0x2d5/0x3c0
[ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the past:
[ 6400.535145] (police_lock){-.--}
This happened when I executed: http://nerdbynature.de/bits/2.6.23-rc5/qos.sh.txt
(using iproute2-ss070313). The machine is still running; I just noticed a short
hiccup, probably while it was busy writing the warning to the disk.
More details and .config: http://nerdbynature.de/bits/2.6.23-rc5/
I'm not really sure what the application mentioned in the message above
has to do with this: the application[1] has been running since bootup as
a non-privileged user and did so for earlier kernel versions too.
Christian.
[0] http://lkml.org/lkml/2007/9/2/6
[1] http://folding.stanford.edu/linux.html
--
BOFH excuse #294:
PCMCIA slave driver
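
For readers decoding the report above: the four characters in braces after a lock name are lockdep's per-class usage summary. The sketch below assumes the legend documented for lockdep in kernels of this era: the positions stand for hardirq, softirq, hardirq-read and softirq-read, and '+' means "acquired in that irq context", '-' means "acquired with that irq type enabled", '?' means both (read side only), and '.' means neither was recorded. It is a user-space illustration only; `describe_char`, `shows_softirq_safe`, `shows_softirq_unsafe` and `print_usage` are hypothetical names, not kernel API.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Positions in the four-character usage string (assumed legend, see above). */
enum { HARDIRQ, SOFTIRQ, HARDIRQ_READ, SOFTIRQ_READ, USAGE_LEN };

/* Hypothetical helper, not kernel API: explain one usage character. */
static const char *describe_char(char c)
{
    switch (c) {
    case '+': return "acquired in this irq context";
    case '-': return "acquired with this irq type enabled";
    case '?': return "read-acquired both in this irq context and with it enabled";
    case '.': return "neither of the above recorded";
    default:  return "unrecognized character";
    }
}

/* True if the string shows a (read-)acquisition from softirq context. */
static int shows_softirq_safe(const char *usage)
{
    assert(strlen(usage) == (size_t)USAGE_LEN);
    return usage[SOFTIRQ] == '+' || usage[SOFTIRQ_READ] == '+' ||
           usage[SOFTIRQ_READ] == '?';
}

/* True if the string shows a (read-)acquisition with softirqs enabled.
 * Note: for the write positions a '+' hides a possible '-', so a false
 * result here only means "not visible in the string". */
static int shows_softirq_unsafe(const char *usage)
{
    assert(strlen(usage) == (size_t)USAGE_LEN);
    return usage[SOFTIRQ] == '-' || usage[SOFTIRQ_READ] == '-' ||
           usage[SOFTIRQ_READ] == '?';
}

/* Print a per-position explanation of a usage string such as "-+..". */
static void print_usage(const char *name, const char *usage)
{
    static const char *const pos[USAGE_LEN] = {
        "hardirq", "softirq", "hardirq-read", "softirq-read"
    };
    assert(strlen(usage) == (size_t)USAGE_LEN);
    for (int i = 0; i < USAGE_LEN; i++)
        printf("%s %-12s %c  %s\n", name, pos[i], usage[i],
               describe_char(usage[i]));
}
```

Under that assumed legend, `print_usage("ingress_lock", "-+..")` reports a softirq-context acquisition in the second position, while `print_usage("police_lock", "-.--")` reports a read acquisition with softirqs enabled in the fourth, which matches the "soft-read-irq-unsafe" wording of the warning.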
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-02 13:11 2.6.23-rc5: possible irq lock inversion dependency detected Christian Kujau
@ 2007-09-10 12:03 ` Peter Zijlstra
2007-09-10 13:00 ` Herbert Xu
1 sibling, 0 replies; 8+ messages in thread
From: Peter Zijlstra @ 2007-09-10 12:03 UTC
To: Christian Kujau; +Cc: linux-kernel, netdev
On Sun, 2007-09-02 at 15:11 +0200, Christian Kujau wrote:
> Hi,
>
> after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
> was quite noisy when I tried to shape my external (wireless) interface:
>
> [ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
> [ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>] netif_receive_skb+0x2d5/0x3c0
> [ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the past:
> [ 6400.535145] (police_lock){-.--}
>
> This happened when I executed: http://nerdbynature.de/bits/2.6.23-rc5/qos.sh.txt
> (using iproute2-ss070313). The machine is still running; I just noticed a short
> hiccup, probably while it was busy writing the warning to the disk.
>
> More details and .config: http://nerdbynature.de/bits/2.6.23-rc5/
seems unavailable at this time, please submit the whole lockdep report
if possible.
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-02 13:11 2.6.23-rc5: possible irq lock inversion dependency detected Christian Kujau
2007-09-10 12:03 ` Peter Zijlstra
@ 2007-09-10 13:00 ` Herbert Xu
2007-09-11 0:04 ` jamal
1 sibling, 1 reply; 8+ messages in thread
From: Herbert Xu @ 2007-09-10 13:00 UTC
To: Christian Kujau; +Cc: linux-kernel, netdev, jamal
On Sun, Sep 02, 2007 at 01:11:29PM +0000, Christian Kujau wrote:
>
> after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
> was quite noisy when I tried to shape my external (wireless) interface:
>
> [ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
> [ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>] netif_receive_skb+0x2d5/0x3c0
> [ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the past:
> [ 6400.535145] (police_lock){-.--}
This is a genuine dead-lock. The police lock can be taken
for reading with softirqs on. If a second CPU tries to take
the police lock for writing, while holding the ingress lock,
then a softirq on the first CPU can dead-lock when it tries
to get the ingress lock.
The minimal fix would be to make sure that we disable BH on
the first CPU. Jamal, could you take a look at this please?
Thanks,
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu 许志壬 <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
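
The scenario Herbert describes can be sketched as a toy model of the kind of check lockdep performs. This is only an illustration under stated assumptions: `struct lock_model`, `acquire`, `reaches_unsafe` and `has_softirq_inversion` are hypothetical names, and the real lockdep implementation tracks far more state. The model records, per lock class, whether it was ever taken from softirq context, whether it was ever taken with softirqs enabled, and which locks were taken while it was held. An inversion is flagged when a lock used in softirq context leads, through recorded dependencies, to a lock that has been taken with softirqs enabled.

```c
#include <assert.h>
#include <stdbool.h>

#define MAX_LOCKS 8

/* Toy per-class state; all names here are hypothetical, not kernel API. */
struct lock_model {
    bool in_softirq[MAX_LOCKS];      /* class was acquired from softirq context */
    bool softirqs_on[MAX_LOCKS];     /* class was acquired with softirqs enabled */
    bool dep[MAX_LOCKS][MAX_LOCKS];  /* dep[a][b]: b was acquired while a was held */
};

/* Record one acquisition of `lock` while `held` is held (-1 for none). */
static void acquire(struct lock_model *m, int held, int lock,
                    bool from_softirq, bool softirqs_enabled)
{
    assert(lock >= 0 && lock < MAX_LOCKS);
    assert(held >= -1 && held < MAX_LOCKS);
    if (from_softirq)
        m->in_softirq[lock] = true;
    if (softirqs_enabled)
        m->softirqs_on[lock] = true;
    if (held >= 0)
        m->dep[held][lock] = true;
}

/* Depth-first search: does `from` lead to a softirq-unsafe class? */
static bool reaches_unsafe(const struct lock_model *m, int from, bool *seen)
{
    seen[from] = true;
    for (int to = 0; to < MAX_LOCKS; to++) {
        if (!m->dep[from][to] || seen[to])
            continue;
        if (m->softirqs_on[to] || reaches_unsafe(m, to, seen))
            return true;
    }
    return false;
}

/* True if some softirq-safe class depends on a softirq-unsafe class. */
static bool has_softirq_inversion(const struct lock_model *m)
{
    for (int a = 0; a < MAX_LOCKS; a++) {
        bool seen[MAX_LOCKS] = { false };
        if (m->in_softirq[a] && reaches_unsafe(m, a, seen))
            return true;
    }
    return false;
}
```

Feeding the model the three events from the report (police_lock taken under ingress_lock, police_lock read-taken with softirqs on, then ingress_lock taken from softirq context in netif_receive_skb) makes `has_softirq_inversion` return true only after the last event, which is the moment lockdep printed "just changed the state of lock". If the read side is instead recorded with softirqs disabled, no inversion is reported.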
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-10 13:00 ` Herbert Xu
@ 2007-09-11 0:04 ` jamal
2007-09-11 2:18 ` Herbert Xu
0 siblings, 1 reply; 8+ messages in thread
From: jamal @ 2007-09-11 0:04 UTC
To: Herbert Xu; +Cc: Christian Kujau, linux-kernel, netdev
[-- Attachment #1: Type: text/plain, Size: 336 bytes --]
On Mon, 2007-09-10 at 21:00 +0800, Herbert Xu wrote:
> The minimal fix would be to make sure that we disable BH on
> the first CPU.
Disabling BH would make it more symmetric to the way we handle
egress. I couldn't reproduce the issue, but this should hopefully resolve
it.
Christian, can you test with this patch?
cheers,
jamal
[-- Attachment #2: ing1 --]
[-- Type: text/plain, Size: 549 bytes --]
[NET_SCHED] make ingress qlock symmetric to egress
Signed-off-by: Jamal Hadi Salim <hadi@cyberus.ca>
--- a/net/sched/sch_generic.c 2007/09/10 23:19:45 1.1
+++ b/net/sched/sch_generic.c 2007/09/10 23:52:45
@@ -42,12 +42,12 @@
 void qdisc_lock_tree(struct net_device *dev)
 {
 	spin_lock_bh(&dev->queue_lock);
-	spin_lock(&dev->ingress_lock);
+	spin_lock_bh(&dev->ingress_lock);
 }
 void qdisc_unlock_tree(struct net_device *dev)
 {
-	spin_unlock(&dev->ingress_lock);
+	spin_unlock_bh(&dev->ingress_lock);
 	spin_unlock_bh(&dev->queue_lock);
 }
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-11 0:04 ` jamal
@ 2007-09-11 2:18 ` Herbert Xu
2007-09-11 12:01 ` jamal
0 siblings, 1 reply; 8+ messages in thread
From: Herbert Xu @ 2007-09-11 2:18 UTC
To: jamal; +Cc: Christian Kujau, linux-kernel, netdev
On Mon, Sep 10, 2007 at 08:04:41PM -0400, jamal wrote:
>
> Disabling BH would make it more symmetric to the way we handle
> egress. I couldn't reproduce the issue, but this should hopefully resolve
> it.
> Christian, can you test with this patch?
Jamal, it's the police_lock that we need to make _bh. The
ingress_lock is already _bh because of the spin_lock_bh that
directly precedes it.
Oh and I think the same thing applies for the other actions
too.
Cheers,
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu 许志壬 <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
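
Herbert's point that ingress_lock is "already _bh" can be illustrated with a small user-space model of nested bottom-half disabling. This is a sketch under stated assumptions, not kernel code: every `model_*` name is hypothetical, a simple counter stands in for the per-CPU softirq-disable nesting count, and a plain flag-based lock stands in for both spinlocks and the police rwlock.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy stand-in for the per-CPU softirq-disable nesting count. */
static int bh_disable_depth;

static void model_local_bh_disable(void) { bh_disable_depth++; }

static void model_local_bh_enable(void)
{
    assert(bh_disable_depth > 0);
    bh_disable_depth--;
}

static bool model_softirqs_enabled(void) { return bh_disable_depth == 0; }

/* Toy lock that remembers whether softirqs were on when it was taken. */
struct model_lock {
    bool held;
    bool taken_with_bh_on;
};

static void model_spin_lock(struct model_lock *l)
{
    assert(!l->held);
    l->held = true;
    l->taken_with_bh_on = model_softirqs_enabled();
}

static void model_spin_unlock(struct model_lock *l)
{
    assert(l->held);
    l->held = false;
}

static void model_spin_lock_bh(struct model_lock *l)
{
    model_local_bh_disable();
    model_spin_lock(l);
}

static void model_spin_unlock_bh(struct model_lock *l)
{
    model_spin_unlock(l);
    model_local_bh_enable();
}

static struct model_lock queue_lock, ingress_lock, police_lock;

/* Mirrors the shape of the unpatched qdisc_lock_tree(): only the outer lock uses _bh. */
static void model_qdisc_lock_tree(void)
{
    model_spin_lock_bh(&queue_lock);
    model_spin_lock(&ingress_lock);
}

static void model_qdisc_unlock_tree(void)
{
    model_spin_unlock(&ingress_lock);
    model_spin_unlock_bh(&queue_lock);
}
```

In this model, after `model_qdisc_lock_tree()` the flag `ingress_lock.taken_with_bh_on` is false, because the preceding `model_spin_lock_bh(&queue_lock)` already raised the nesting count. Calling `model_spin_lock(&police_lock)` on its own, as a stand-in for a plain `read_lock()` in process context, leaves `police_lock.taken_with_bh_on` true, which is the acquisition that needs the `_bh` variant.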
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-11 2:18 ` Herbert Xu
@ 2007-09-11 12:01 ` jamal
2007-09-11 12:43 ` Herbert Xu
0 siblings, 1 reply; 8+ messages in thread
From: jamal @ 2007-09-11 12:01 UTC
To: Herbert Xu; +Cc: Christian Kujau, linux-kernel, netdev
[-- Attachment #1: Type: text/plain, Size: 629 bytes --]
On Tue, 2007-09-11 at 10:18 +0800, Herbert Xu wrote:
> Jamal, it's the police_lock that we need to make _bh. The
> ingress_lock is already _bh because of the spin_lock_bh that
> directly precedes it.
>
> Oh and I think the same thing applies for the other actions
> too.
ga-Dang. Ok, here it is. If you see(?) any more farts let me know.
I am around for another 30 minutes and off for about 18 hours.
Christian, I took your config and qos setup but I can't reproduce the
issue - I think I may need some of that wireless setup to recreate it. So
if you can test this and validate that it works, we can push it forward.
cheers,
jamal
[-- Attachment #2: act_bhl --]
[-- Type: text/plain, Size: 2205 bytes --]
[NET_SCHED] protect action config/dump from irqs
From the sharp laser eyes of Herbert Xu to my slow farting brain...
(with no apologies to C Heston)
On Mon, 2007-09-10 at 21:00 +0800, Herbert Xu wrote:
> On Sun, Sep 02, 2007 at 01:11:29PM +0000, Christian Kujau wrote:
> >
> > after upgrading to 2.6.23-rc5 (and applying davem's fix [0]), lockdep
> > was quite noisy when I tried to shape my external (wireless) interface:
> >
> > [ 6400.534545] FahCore_78.exe/3552 just changed the state of lock:
> > [ 6400.534713] (&dev->ingress_lock){-+..}, at: [<c038d595>] netif_receive_skb+0x2d5/0x3c0
> > [ 6400.534941] but this lock took another, soft-read-irq-unsafe lock in the past:
> > [ 6400.535145] (police_lock){-.--}
>
> This is a genuine dead-lock. The police lock can be taken
> for reading with softirqs on. If a second CPU tries to take
> the police lock for writing, while holding the ingress lock,
> then a softirq on the first CPU can dead-lock when it tries
> to get the ingress lock.
Signed-off-by: Jamal Hadi Salim <hadi@cyberus.ca>
--- a/net/sched/act_police.c 2007/09/11 10:39:36 1.1
+++ b/net/sched/act_police.c 2007/09/11 10:51:47
@@ -56,7 +56,7 @@
 	int err = 0, index = -1, i = 0, s_i = 0, n_i = 0;
 	struct rtattr *r;
-	read_lock(&police_lock);
+	read_lock_bh(&police_lock);
 	s_i = cb->args[0];
@@ -85,7 +85,7 @@
 		}
 	}
 done:
-	read_unlock(&police_lock);
+	read_unlock_bh(&police_lock);
 	if (n_i)
 		cb->args[0] += n_i;
 	return n_i;
--- a/net/sched/act_api.c 2007/09/11 10:47:51 1.1
+++ b/net/sched/act_api.c 2007/09/11 10:50:47
@@ -68,7 +68,7 @@
 	int err = 0, index = -1,i = 0, s_i = 0, n_i = 0;
 	struct rtattr *r ;
-	read_lock(hinfo->lock);
+	read_lock_bh(hinfo->lock);
 	s_i = cb->args[0];
@@ -96,7 +96,7 @@
 		}
 	}
 done:
-	read_unlock(hinfo->lock);
+	read_unlock_bh(hinfo->lock);
 	if (n_i)
 		cb->args[0] += n_i;
 	return n_i;
@@ -156,13 +156,13 @@
 {
 	struct tcf_common *p;
-	read_lock(hinfo->lock);
+	read_lock_bh(hinfo->lock);
 	for (p = hinfo->htab[tcf_hash(index, hinfo->hmask)]; p;
 	     p = p->tcfc_next) {
 		if (p->tcfc_index == index)
 			break;
 	}
-	read_unlock(hinfo->lock);
+	read_unlock_bh(hinfo->lock);
 	return p;
 }
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-11 12:01 ` jamal
@ 2007-09-11 12:43 ` Herbert Xu
2007-09-12 14:33 ` David Miller
0 siblings, 1 reply; 8+ messages in thread
From: Herbert Xu @ 2007-09-11 12:43 UTC
To: jamal, David S. Miller; +Cc: Christian Kujau, linux-kernel, netdev
On Tue, Sep 11, 2007 at 08:01:46AM -0400, jamal wrote:
>
> [NET_SCHED] protect action config/dump from irqs
Looks good! Thanks Jamal.
--
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu 许志壬 <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt
* Re: 2.6.23-rc5: possible irq lock inversion dependency detected
2007-09-11 12:43 ` Herbert Xu
@ 2007-09-12 14:33 ` David Miller
0 siblings, 0 replies; 8+ messages in thread
From: David Miller @ 2007-09-12 14:33 UTC
To: herbert; +Cc: hadi, lists, linux-kernel, netdev
From: Herbert Xu <herbert@gondor.apana.org.au>
Date: Tue, 11 Sep 2007 20:43:27 +0800
> On Tue, Sep 11, 2007 at 08:01:46AM -0400, jamal wrote:
> >
> > [NET_SCHED] protect action config/dump from irqs
>
> Looks good! Thanks Jamal.
Applied, I'll try to push this in some time soon.