* "Badness in local_bh_enable" - a reasonable fix?
From: Steve Snyder @ 2005-10-26 19:34 UTC (permalink / raw)
To: Linux Kernel Mailing List
[ I observed the following on a Fedora Core 3 system, running kernel
2.6.12-1.1380_FC3. I am posting this here because a quick Googling
indicates that the problem is not specific to this environment. ]
Today I found my system log filled with the error shown below.
Reading a 366MB file across an NFS mount results in over 6300
occurrences of the error being written to the system log of the NFS
server.
I have 2 network interfaces in the NFS server machine: one driven by a
standard kernel Ethernet device driver and one driven by my own
Ultra-Wide Band (UWB) device driver. (In the error shown below the
references to "fsuwbpci" are my driver.) This problem is not seen when
using the Ethernet interface, but is perfectly consistent when reading
an NFS-mounted file across the UWB interface. Therefore there is a
problem with my code.
I quickly established that the error came from within this routine:
void netdev_tx_ack(struct net_device *dev, struct sk_buff *skb)
{
	struct netdev_priv *priv = (struct netdev_priv *) dev->priv;

	priv->stats.tx_packets++;
	priv->stats.tx_bytes += skb->len;

	netdev_resume(dev);
	dev_kfree_skb(skb);
}
Googling told me that a) other people had seen the same problem,
around the 2.6.11 timeframe; and b) use of dev_kfree_skb_irq() had
fixed it. I replaced the "dev_kfree_skb(skb)" above with
"dev_kfree_skb_irq(skb)" and now all seems well. I can transfer a
file across the NFS mount via the UWB interface with no errors seen in
the system log.
I am uncomfortable with this modification, though, because I don't know
what it actually does and what ramifications it will have for earlier
kernel versions. This driver code, with appropriate #ifdefs, is
used on a wide range of kernel versions, including 2.4.x kernels.
Please educate me. Is replacing dev_kfree_skb() with
dev_kfree_skb_irq() a reasonable solution to this problem? Did I
break backward-compatibility with this modification to my driver?
Thank you.
-------------------------------------------
kernel: Badness in local_bh_enable at kernel/softirq.c:140 (Tainted: P )
kernel: [<c0127e50>] local_bh_enable+0x66/0x78
kernel: [<f8c57c3f>] svc_write_space+0x21/0x85 [sunrpc]
kernel: [<c02f3819>] sock_wfree+0x34/0x36
kernel: [<c02f5067>] __kfree_skb+0x52/0x142
kernel: [<f8a9442f>] card_send_packet+0x23/0xb6 [fsuwbpci]
kernel: [<f8a94258>] card_send_request+0x14c/0x300 [fsuwbpci]
kernel: [<f8a94013>] card_send+0x42/0x13b [fsuwbpci]
kernel: [<f8a95ec1>] netdev_tx+0x34/0x5d [fsuwbpci]
kernel: [<f8a40280>] ipt_do_table+0x254/0x321 [ip_tables]
kernel: [<c030c346>] qdisc_restart+0x84/0x640
kernel: [<c030cf29>] pfifo_fast_enqueue+0x0/0x89
kernel: [<c02fb2c2>] dev_queue_xmit+0xba/0x589
kernel: [<f89c6091>] ipt_local_out_hook+0x66/0x6d [iptable_filter]
kernel: [<c03019f6>] neigh_connected_output+0x8a/0xd1
kernel: [<c031ce31>] ip_finish_output+0x13d/0x211
kernel: [<c031ccdb>] dst_output+0x0/0x19
kernel: [<c031efd8>] ip_push_pending_frames+0x30c/0x4b0
kernel: [<c031ccdb>] dst_output+0x0/0x19
kernel: [<c033a91e>] udp_push_pending_frames+0x139/0x258
kernel: [<c033af74>] udp_sendmsg+0x4fb/0x6be
kernel: [<c02f3bae>] sock_alloc_send_skb+0x16/0x1b
kernel: [<c031e5eb>] ip_append_data+0x55d/0x80b
kernel: [<c033ad69>] udp_sendmsg+0x2f0/0x6be
kernel: [<c0341fb1>] inet_sendmsg+0x3e/0x4a
kernel: [<c02f09ce>] sock_sendmsg+0xf3/0x10e
kernel: [<c0341fb1>] inet_sendmsg+0x3e/0x4a
kernel: [<c013e9b9>] autoremove_wake_function+0x0/0x37
kernel: [<c02f09ce>] sock_sendmsg+0xf3/0x10e
kernel: [<c02f0a0f>] kernel_sendmsg+0x26/0x2c
kernel: [<c02f404a>] sock_no_sendpage+0x56/0x69
kernel: [<c033b217>] udp_sendpage+0xe0/0x11e
kernel: [<c034202e>] inet_sendpage+0x71/0x8f
kernel: [<f8c578e2>] svc_sendto+0x99/0x25a [sunrpc]
kernel: [<c0192d5f>] dput+0xca/0x5dc
kernel: [<f8c57f2f>] svc_udp_sendto+0xe/0x22 [sunrpc]
kernel: [<f8c598d5>] svc_send+0xc4/0x107 [sunrpc]
kernel: [<f8cc1011>] fh_put+0x136/0x180 [nfsd]
kernel: [<f8c5bb22>] svcauth_unix_release+0x38/0x4e [sunrpc]
kernel: [<f8ccd06a>] nfs3svc_encode_attrstat+0x0/0x238 [nfsd]
kernel: [<f8c56c04>] svc_process+0x3a3/0x61f [sunrpc]
kernel: [<c0131fe5>] sigprocmask+0xc0/0x29b
kernel: [<f8cbe478>] nfsd+0x1c4/0x459 [nfsd]
kernel: [<f8cbe2b4>] nfsd+0x0/0x459 [nfsd]
kernel: [<c01012c1>] kernel_thread_helper+0x5/0xb
-------------------------------------------
* Re: "Badness in local_bh_enable" - a reasonable fix?
From: Arjan van de Ven @ 2005-10-26 19:40 UTC (permalink / raw)
To: Steve Snyder; +Cc: Linux Kernel Mailing List
On Wed, 2005-10-26 at 15:34 -0400, Steve Snyder wrote:
> [ I observed the following on a Fedora Core 3 system, running kernel
> 2.6.12-1.1380_FC3. I am posting this here because a quick Googling
> indicates that the problem is not specific to this environment. ]
>
> Today I found my system log filled with the error shown below.
> Reading a 366MB file across a NFS mount results in over 6300
> occurrences of the error being written to the system log of the NFS
> server.
>
> I have 2 network interfaces in the NFS server machine, a standard
> kernel Ethernet device driver and my own Ultra-Wide Band (UWB) device
> driver. (In the error shown below the references to "fsuwbpci" are my
> driver.) This problem is not seen when using the Ethernet interface,
> but is perfectly consistent when reading a NFS-mounted file across the
> UWB interface. Therefore there is a problem with my code.
>
> I quickly established that the error came from within this routine:
>
> -------------------------------------------
>
> kernel: Badness in local_bh_enable at kernel/softirq.c:140 (Tainted: P )
hmmm binary module? please try without.
* Re: "Badness in local_bh_enable" - a reasonable fix?
From: Steve Snyder @ 2005-10-26 20:01 UTC (permalink / raw)
To: Arjan van de Ven; +Cc: Linux Kernel Mailing List
On Wednesday 26 October 2005 15:40, Arjan van de Ven wrote:
> On Wed, 2005-10-26 at 15:34 -0400, Steve Snyder wrote:
> > [ I observed the following on a Fedora Core 3 system, running kernel
> > 2.6.12-1.1380_FC3. I am posting this here because a quick Googling
> > indicates that the problem is not specific to this environment. ]
> >
> > Today I found my system log filled with the error shown below.
> > Reading a 366MB file across a NFS mount results in over 6300
> > occurrences of the error being written to the system log of the NFS
> > server.
> >
> > I have 2 network interfaces in the NFS server machine, a standard
> > kernel Ethernet device driver and my own Ultra-Wide Band (UWB) device
> > driver. (In the error shown below the references to "fsuwbpci" are my
> > driver.) This problem is not seen when using the Ethernet interface,
> > but is perfectly consistent when reading a NFS-mounted file across the
> > UWB interface. Therefore there is a problem with my code.
> >
> > I quickly established that the error came from within this routine:
> >
>
> > -------------------------------------------
> >
> > kernel: Badness in local_bh_enable at kernel/softirq.c:140 (Tainted: P )
>
>
>
> hmmm binary module? please try without.
What, you mean the driver? No, it is built from source against the
installed & running Fedora Core 3 kernel version 2.6.12-1.1380_FC3.
* Re: "Badness in local_bh_enable" - a reasonable fix?
From: Lee Revell @ 2005-10-26 20:24 UTC (permalink / raw)
To: Steve Snyder; +Cc: Arjan van de Ven, Linux Kernel Mailing List
On Wed, 2005-10-26 at 16:01 -0400, Steve Snyder wrote:
> What, you mean the driver? No, it is built from source against the
> installed & running Fedora Core 3 kernel version 2.6.12-1.1380_FC3.
No, your kernel is tainted because you loaded some other proprietary
module. Maybe nvidia?
Lee
* Re: "Badness in local_bh_enable" - a reasonable fix?
From: Steve Snyder @ 2005-10-26 21:20 UTC (permalink / raw)
To: Lee Revell; +Cc: Arjan van de Ven, Linux Kernel Mailing List
On Wednesday 26 October 2005 16:24, Lee Revell wrote:
> On Wed, 2005-10-26 at 16:01 -0400, Steve Snyder wrote:
> > What, you mean the driver? No, it is built from source against the
> > installed & running Fedora Core 3 kernel version 2.6.12-1.1380_FC3.
>
> No, your kernel is tainted because you loaded some other proprietary
> module. Maybe nvidia?
Yes, I did have the nvidia binary kernel module loaded. After removing
it (not uninstalling; I just specified use of the X.org vesa driver
instead and rebooted) I get the same behavior - including the Tainted
notation.
Thanks.
* Re: "Badness in local_bh_enable" - a reasonable fix?
From: Lee Revell @ 2005-10-26 22:00 UTC (permalink / raw)
To: Steve Snyder; +Cc: Arjan van de Ven, Linux Kernel Mailing List
On Wed, 2005-10-26 at 17:20 -0400, Steve Snyder wrote:
> On Wednesday 26 October 2005 16:24, Lee Revell wrote:
> > On Wed, 2005-10-26 at 16:01 -0400, Steve Snyder wrote:
> > > What, you mean the driver? No, it is built from source against the
> > > installed & running Fedora Core 3 kernel version 2.6.12-1.1380_FC3.
> >
> > No, your kernel is tainted because you loaded some other proprietary
> > module. Maybe nvidia?
>
> Yes, I did have the nvidia binary kernel module loaded. After removing
> it (not uninstalling; I just specified use of the X.org vesa driver
> instead and rebooted) I get the same behavior - including the Tainted
> notation.
Some other binary-only module must have been loaded. The "P" in the
Tainted line indicates that the kernel was tainted by loading a
proprietary module.
Maybe you didn't do MODULE_LICENSE("GPL") in your driver?
Lee
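
The licence check suggested above can be sketched as a small user-space
model. This is not kernel code: the model_* names are hypothetical, and
the list of accepted licence strings is an assumption based on what
2.6-era kernels are generally understood to treat as GPL-compatible. The
point it illustrates is that a module with no MODULE_LICENSE() at all is
treated like a proprietary one, so a driver built from source can still
produce the "P" flag.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical model of the kernel's licence check.  The string list
 * below is an assumption, not copied from any particular kernel tree.
 */
static bool model_license_is_gpl_compatible(const char *license)
{
	static const char *const compatible[] = {
		"GPL", "GPL v2", "GPL and additional rights",
		"Dual BSD/GPL", "Dual MPL/GPL",
	};
	size_t i;

	/* NULL stands in for a module that never used MODULE_LICENSE(). */
	if (license == NULL)
		return false;

	for (i = 0; i < sizeof(compatible) / sizeof(compatible[0]); i++) {
		if (strcmp(license, compatible[i]) == 0)
			return true;
	}
	return false;
}

/* Returns the letter the model would show after "Tainted:". */
char model_taint_letter(const char *license)
{
	return model_license_is_gpl_compatible(license) ? 'G' : 'P';
}
```

In this model, model_taint_letter("GPL") yields 'G', while a NULL or
unrecognised licence string yields 'P'.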
* Re: "Badness in local_bh_enable" - a reasonable fix?
From: Stephen Hemminger @ 2005-10-26 20:44 UTC (permalink / raw)
To: linux-kernel
On Wed, 26 Oct 2005 15:34:38 -0400
Steve Snyder <R00020C@freescale.com> wrote:
> [ I observed the following on a Fedora Core 3 system, running kernel
> 2.6.12-1.1380_FC3. I am posting this here because a quick Googling
> indicates that the problem is not specific to this environment. ]
>
> Today I found my system log filled with the error shown below.
> Reading a 366MB file across a NFS mount results in over 6300
> occurrences of the error being written to the system log of the NFS
> server.
>
> I have 2 network interfaces in the NFS server machine, a standard
> kernel Ethernet device driver and my own Ultra-Wide Band (UWB) device
> driver. (In the error shown below the references to "fsuwbpci" are my
> driver.) This problem is not seen when using the Ethernet interface,
> but is perfectly consistent when reading a NFS-mounted file across the
> UWB interface. Therefore there is a problem with my code.
>
> I quickly established that the error came from within this routine:
>
> void netdev_tx_ack(struct net_device *dev, struct sk_buff *skb)
> {
> 	struct netdev_priv *priv = (struct netdev_priv *) dev->priv;
>
> 	priv->stats.tx_packets++;
> 	priv->stats.tx_bytes += skb->len;
>
> 	netdev_resume(dev);
> 	dev_kfree_skb(skb);
> }
Your driver is calling dev_kfree_skb with interrupts disabled.
Call dev_kfree_skb_any instead.
--
Stephen Hemminger <shemminger@osdl.org>
OSDL http://developer.osdl.org/~shemminger
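
A user-space sketch of why the advice above helps. It assumes that in
2.6-era kernels dev_kfree_skb_any() hands the buffer to
dev_kfree_skb_irq() when in_irq() or irqs_disabled() is true, that
dev_kfree_skb_irq() only queues the buffer for the TX softirq to free
later, and that local_bh_enable() warns when called with interrupts
disabled. Every model_* name is hypothetical and nothing here is kernel
code; it only mimics the control flow seen in the backtrace, where the
skb destructor ends up in local_bh_enable().

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

struct model_skb {
	struct model_skb *next;
};

static bool model_irqs_disabled;	/* stands in for irqs_disabled() */
static bool model_in_hardirq;		/* stands in for in_irq() */
static int model_badness_count;		/* simulated "Badness" warnings */
static struct model_skb *model_completion_queue;

static struct model_skb *model_alloc_skb(void)
{
	struct model_skb *skb = calloc(1, sizeof(*skb));

	assert(skb != NULL);
	return skb;
}

/* Models the destructor path that reaches local_bh_enable(). */
static void model_destructor(void)
{
	if (model_irqs_disabled)
		model_badness_count++;
}

/* Models dev_kfree_skb(): frees now, in the caller's context. */
static void model_kfree_skb(struct model_skb *skb)
{
	model_destructor();
	free(skb);
}

/* Models dev_kfree_skb_irq(): only queues; no destructor runs yet. */
static void model_kfree_skb_irq(struct model_skb *skb)
{
	skb->next = model_completion_queue;
	model_completion_queue = skb;
}

/* Models dev_kfree_skb_any(): picks a path from the current context. */
static void model_kfree_skb_any(struct model_skb *skb)
{
	if (model_in_hardirq || model_irqs_disabled)
		model_kfree_skb_irq(skb);
	else
		model_kfree_skb(skb);
}

/* Models the TX softirq draining the queue later. */
static void model_run_tx_softirq(void)
{
	while (model_completion_queue != NULL) {
		struct model_skb *skb = model_completion_queue;

		model_completion_queue = skb->next;
		model_kfree_skb(skb);
	}
}

/* Immediate free with interrupts "disabled"; returns warnings seen. */
int model_demo_direct_free(void)
{
	model_badness_count = 0;
	model_irqs_disabled = true;
	model_kfree_skb(model_alloc_skb());
	model_irqs_disabled = false;
	return model_badness_count;
}

/* Same situation via the _any variant; the free is deferred. */
int model_demo_any_free(void)
{
	model_badness_count = 0;
	model_irqs_disabled = true;
	model_kfree_skb_any(model_alloc_skb());
	model_irqs_disabled = false;
	model_run_tx_softirq();
	return model_badness_count;
}
```

In this model the direct free records one warning and the _any variant
records none, because the destructor only runs once interrupts are
enabled again. Under the same assumptions, calling dev_kfree_skb_irq()
unconditionally would also avoid the warning; the _any variant simply
keeps the immediate free for contexts where it is safe.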