* Possible race/deadlock in netdev_unregister
@ 2005-01-28 22:13 Daniele Orlandi
2005-01-28 23:38 ` Stephen Hemminger
0 siblings, 1 reply; 3+ messages in thread
From: Daniele Orlandi @ 2005-01-28 22:13 UTC (permalink / raw)
To: netdev
Hello,
First of all, please excuse me if this happens to be a bug in my code :)
Unfortunately I'm still a newbie with netdev and his interactions with
netlink, hotplug, etc... please help me understand what's happening and who
is wrong.
The scenario is this:
- A device driver module (written by me) has two netdevices registered.
- Another module (written by me, too) provides sockets implementation for the
protocol spoken by the device.
- An application has one socket bound to one netdevice
- I rmmod the device driver module
- The exit function in the module calls netdev_unregister
- The event dispatcher notifies the socket layer that a device is going down
- The socket is marked errored but the application keeps it open for a while
- The application ends, the socket is destroyed, the remaining reference to
netdevice is released but netdev_unregister keeps sleeping forever with this
backtrace:
Jan 28 19:02:58 bastard kernel: Call Trace:
Jan 28 19:02:58 bastard kernel: [<c03053ee>] __down+0x6e/0xd0
Jan 28 19:02:58 bastard kernel: [<c0119c70>] default_wake_function+0x0/0x10
Jan 28 19:02:58 bastard kernel: [<c02a8f46>] netlink_dump+0x66/0x180
Jan 28 19:02:58 bastard kernel: [<c0119cb5>] __wake_up_common+0x35/0x60
Jan 28 19:02:58 bastard kernel: [<c0305554>] __down_failed+0x8/0xc
Jan 28 19:02:58 bastard kernel: [<c0297108>] .text.lock.dev+0x91/0xb9
Jan 28 19:02:58 bastard kernel: [<c029b660>] rtnetlink_dump_ifinfo+0x0/0x70
Jan 28 19:02:58 bastard kernel: [<c029bc18>] rtnetlink_rcv+0x1d8/0x3f0
Jan 28 19:02:58 bastard kernel: [<c029ba40>] rtnetlink_rcv+0x0/0x3f0
Jan 28 19:02:58 bastard kernel: [<c02a8df8>] netlink_data_ready+0x28/0x50
Jan 28 19:02:58 bastard kernel: [<c0296b81>] netdev_wait_allrefs+0xf1/0x100
Jan 28 19:02:58 bastard kernel: [<c01d0410>] kobject_release+0x0/0x10
Jan 28 19:02:58 bastard kernel: [<c0296c8c>] netdev_run_todo+0xfc/0x1c0
Jan 28 19:02:58 bastard kernel: [<c029b660>] rtnetlink_dump_ifinfo+0x0/0x70
Jan 28 19:02:58 bastard kernel: [<c029bc18>] rtnetlink_rcv+0x1d8/0x3f0
Jan 28 19:02:58 bastard kernel: [<c011cabf>] printk+0xf/0x20
Jan 28 19:02:58 bastard kernel: [<c012ab00>] wakeme_after_rcu+0x0/0x10
Jan 28 19:02:58 bastard kernel: [<c029ba40>] rtnetlink_rcv+0x0/0x3f0
Jan 28 19:02:58 bastard kernel: [<c02a8df8>] netlink_data_ready+0x28/0x50
Jan 28 19:02:58 bastard kernel: [<c029aea1>] rtnl_unlock+0x31/0x40
Jan 28 19:02:58 bastard kernel: [<e12f115e>] fake_module_exit+0x2e/0x7e
[fake_isdn]
Jan 28 19:02:58 bastard kernel: [<c012eaaa>] sys_delete_module+0x15a/0x170
Jan 28 19:02:58 bastard kernel: [<c014653e>] unmap_vma_list+0xe/0x20
Jan 28 19:02:58 bastard kernel: [<c0146838>] do_munmap+0xd8/0x120
Jan 28 19:02:58 bastard kernel: [<c01468bc>] sys_munmap+0x3c/0x60
Jan 28 19:02:58 bastard kernel: [<c0105c69>] sysenter_past_esp+0x52/0x79
Note that disabling hotplug avoids the problem.
Bye,
--
Daniele Orlandi
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Possible race/deadlock in netdev_unregister
2005-01-28 22:13 Possible race/deadlock in netdev_unregister Daniele Orlandi
@ 2005-01-28 23:38 ` Stephen Hemminger
2005-01-29 0:25 ` Daniele Orlandi
0 siblings, 1 reply; 3+ messages in thread
From: Stephen Hemminger @ 2005-01-28 23:38 UTC (permalink / raw)
To: Daniele Orlandi; +Cc: netdev
On Fri, 28 Jan 2005 23:13:29 +0100
Daniele Orlandi <daniele@orlandi.com> wrote:
>
> Hello,
>
> First of all, please excuse me if this happens to be a bug in my code :)
> Unfortunately I'm still a newbie with netdev and his interactions with
> netlink, hotplug, etc... please help me understand what's happening and who
> is wrong.
>
> The scenario is this:
>
> - A device driver module (written by me) has two netdevices registered.
> - Another module (written by me, too) provides sockets implementation for the
> protocol spoken by the device.
> - An application has one socket bound to one netdevice
> - I rmmod the device driver module
> - The exit function in the module calls netdev_unregister
> - The event dispatcher notifies the socket layer that a device is going down
> - The socket is marked errored but the application keeps it open for a while
> - The application ends, the socket is destroyed, the remaining reference to
> netdevice is released but netdev_unregister keeps sleeping forever with this
> backtrace:
You are probably self-deadlocking on the netlink mutex (rtnl).
Is your netdevice_notifier handler calling rtnl_lock?
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Possible race/deadlock in netdev_unregister
2005-01-28 23:38 ` Stephen Hemminger
@ 2005-01-29 0:25 ` Daniele Orlandi
0 siblings, 0 replies; 3+ messages in thread
From: Daniele Orlandi @ 2005-01-29 0:25 UTC (permalink / raw)
To: Stephen Hemminger; +Cc: netdev
On Saturday 29 January 2005 00:38, you wrote:
>
> You are probably self-deadlocking on the netlink mutex (rtnl).
> Is your netdevice_notifier handler calling rtnl_lock?
No, I'm just setting an error on all sockets, and sk_state_change()ing them,
nothing that might sleep, AFAIK.
Even in that case, that wouldn't explain the interaction with hotplug.
I also forgot to say that if the application exists immediately after seeing
the error on the socket, the problem doesn't show.
Thanks anyway,
Bye!
--
Daniele Orlandi
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2005-01-29 0:25 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-01-28 22:13 Possible race/deadlock in netdev_unregister Daniele Orlandi
2005-01-28 23:38 ` Stephen Hemminger
2005-01-29 0:25 ` Daniele Orlandi
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).