netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
@ 2006-08-31 15:17 Jesper Juhl
  2006-08-31 15:30 ` Ben Greear
  0 siblings, 1 reply; 8+ messages in thread
From: Jesper Juhl @ 2006-08-31 15:17 UTC (permalink / raw)
  To: Linux Kernel Mailing List
  Cc: Mark Evans, Fred N. van Kempen, Ross Biro, davem, yoshfuji,
	Ben Greear, netdev

Hi,

I've got a small problem with 2.6.18-rc5-git2.

I've got a vlan setup on eth0.20, eth0 does not have an IP.

When I attempt to reboot or halt the machine I get the following
message from the loop in net/core/dev.c::netdev_wait_allrefs() where
it waits for the ref-count to drop to zero.
Unfortunately the ref-count stays at 1 forever and the server never
gets any further.

  unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1

I googled a bit and found that people have had similar problems in the
past and could work around them by shutting down the vlan interface
before the 'lo' interface. I tried that and indeed, it works.

Any idea how we can get this fixed?


-- 
Jesper Juhl <jesper.juhl@gmail.com>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-08-31 15:17 Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1 Jesper Juhl
@ 2006-08-31 15:30 ` Ben Greear
  2006-08-31 15:33   ` Jesper Juhl
  0 siblings, 1 reply; 8+ messages in thread
From: Ben Greear @ 2006-08-31 15:30 UTC (permalink / raw)
  To: Jesper Juhl
  Cc: Linux Kernel Mailing List, Mark Evans, Fred N. van Kempen,
	Ross Biro, davem, yoshfuji, netdev

Jesper Juhl wrote:
> Hi,
> 
> I've got a small problem with 2.6.18-rc5-git2.
> 
> I've got a vlan setup on eth0.20, eth0 does not have an IP.
> 
> When I attempt to reboot or halt the machine I get the following
> message from the loop in net/core/dev.c::netdev_wait_allrefs() where
> it waits for the ref-count to drop to zero.
> Unfortunately the ref-count stays at 1 forever and the server never
> gets any further.
> 
>  unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
> 
> I googled a bit and found that people have had similar problems in the
> past and could work around them by shutting down the vlan interface
> before the 'lo' interface. I tried that and indeed, it works.
> 
> Any idea how we can get this fixed?

This is usually a ref-count leak somewhere.  Used to be IPv6 had 
issues..then there were some neighbor leaks...but these were fixed as 
far as I know.

Can you reproduce this on older kernels?

Ben

> 
> 


-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-08-31 15:30 ` Ben Greear
@ 2006-08-31 15:33   ` Jesper Juhl
  2006-09-01  9:59     ` Jesper Juhl
  0 siblings, 1 reply; 8+ messages in thread
From: Jesper Juhl @ 2006-08-31 15:33 UTC (permalink / raw)
  To: Ben Greear
  Cc: Linux Kernel Mailing List, Mark Evans, Fred N. van Kempen,
	Ross Biro, davem, yoshfuji, netdev

On 31/08/06, Ben Greear <greearb@candelatech.com> wrote:
> Jesper Juhl wrote:
> > Hi,
> >
> > I've got a small problem with 2.6.18-rc5-git2.
> >
> > I've got a vlan setup on eth0.20, eth0 does not have an IP.
> >
> > When I attempt to reboot or halt the machine I get the following
> > message from the loop in net/core/dev.c::netdev_wait_allrefs() where
> > it waits for the ref-count to drop to zero.
> > Unfortunately the ref-count stays at 1 forever and the server never
> > gets any further.
> >
> >  unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
> >
> > I googled a bit and found that people have had similar problems in the
> > past and could work around them by shutting down the vlan interface
> > before the 'lo' interface. I tried that and indeed, it works.
> >
> > Any idea how we can get this fixed?
>
> This is usually a ref-count leak somewhere.  Used to be IPv6 had
> issues..then there were some neighbor leaks...but these were fixed as
> far as I know.
>
Using IPv4 here.


> Can you reproduce this on older kernels?
>
I've not actively tried, but I do have several servers running various
older kernel releases with similar vlan setups and I'm not aware of
any problems with those. Only this new box that I'm using for testing
new kernels (currently) shows the problem, and I've only tried 2.6.8
and 2.6.18-rc5-git2 on the box so far (2.6.8 doesn't have the
problem).

-- 
Jesper Juhl <jesper.juhl@gmail.com>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-08-31 15:33   ` Jesper Juhl
@ 2006-09-01  9:59     ` Jesper Juhl
  2006-09-01 10:41       ` Herbert Xu
  0 siblings, 1 reply; 8+ messages in thread
From: Jesper Juhl @ 2006-09-01  9:59 UTC (permalink / raw)
  To: Ben Greear
  Cc: Linux Kernel Mailing List, Fred N. van Kempen, Ross Biro, davem,
	yoshfuji, netdev

On 31/08/06, Jesper Juhl <jesper.juhl@gmail.com> wrote:
> On 31/08/06, Ben Greear <greearb@candelatech.com> wrote:
> > Jesper Juhl wrote:
> > > Hi,
> > >
> > > I've got a small problem with 2.6.18-rc5-git2.
> > >
> > > I've got a vlan setup on eth0.20, eth0 does not have an IP.
> > >
> > > When I attempt to reboot or halt the machine I get the following
> > > message from the loop in net/core/dev.c::netdev_wait_allrefs() where
> > > it waits for the ref-count to drop to zero.
> > > Unfortunately the ref-count stays at 1 forever and the server never
> > > gets any further.
> > >
> > >  unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
> > >
> > > I googled a bit and found that people have had similar problems in the
> > > past and could work around them by shutting down the vlan interface
> > > before the 'lo' interface. I tried that and indeed, it works.
> > >
> > > Any idea how we can get this fixed?
> >
> > This is usually a ref-count leak somewhere.  Used to be IPv6 had
> > issues..then there were some neighbor leaks...but these were fixed as
> > far as I know.
> >
> Using IPv4 here.
>
>
> > Can you reproduce this on older kernels?
> >
> I've not actively tried, but I do have several servers running various
> older kernel releases with similar vlan setups and I'm not aware of
> any problems with those. Only this new box that I'm using for testing
> new kernels (currently) shows the problem, and I've only tried 2.6.8
> and 2.6.18-rc5-git2 on the box so far (2.6.8 doesn't have the
> problem).
>
I've just encountered the problem on a different server with an
identical vlan setup. That server is running 2.6.13.4

-- 
Jesper Juhl <jesper.juhl@gmail.com>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-09-01  9:59     ` Jesper Juhl
@ 2006-09-01 10:41       ` Herbert Xu
  2006-09-01 10:51         ` Jesper Juhl
  0 siblings, 1 reply; 8+ messages in thread
From: Herbert Xu @ 2006-09-01 10:41 UTC (permalink / raw)
  To: Jesper Juhl
  Cc: greearb, linux-kernel, waltje, ross.biro, davem, yoshfuji, netdev

Jesper Juhl <jesper.juhl@gmail.com> wrote:
>
> I've just encountered the problem on a different server with an
> identical vlan setup. That server is running 2.6.13.4

Do you have a simple recipe to reproduce this? Ideally it'd be a
script that anyone can execute in a freshly booted system that
exhibits the problem.

Thanks,
-- 
Visit Openswan at http://www.openswan.org/
Email: Herbert Xu ~{PmV>HI~} <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-09-01 10:41       ` Herbert Xu
@ 2006-09-01 10:51         ` Jesper Juhl
  2006-09-01 11:35           ` Jesper Juhl
  0 siblings, 1 reply; 8+ messages in thread
From: Jesper Juhl @ 2006-09-01 10:51 UTC (permalink / raw)
  To: Herbert Xu
  Cc: greearb, linux-kernel, waltje, ross.biro, davem, yoshfuji, netdev

On 01/09/06, Herbert Xu <herbert@gondor.apana.org.au> wrote:
> Jesper Juhl <jesper.juhl@gmail.com> wrote:
> >
> > I've just encountered the problem on a different server with an
> > identical vlan setup. That server is running 2.6.13.4
>
> Do you have a simple recipe to reproduce this? Ideally it'd be a
> script that anyone can execute in a freshly booted system that
> exhibits the problem.
>
Well, the first server I saw this on only had a base install of debian
stable on it, then I replaced the kernel, configured the vlan
interface in /etc/network/interfaces typed 'reboot' and it failed -
and it seems to fail reliably on reboot every time.

-- 
Jesper Juhl <jesper.juhl@gmail.com>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-09-01 10:51         ` Jesper Juhl
@ 2006-09-01 11:35           ` Jesper Juhl
  2006-09-06 18:01             ` Ben Greear
  0 siblings, 1 reply; 8+ messages in thread
From: Jesper Juhl @ 2006-09-01 11:35 UTC (permalink / raw)
  To: Herbert Xu
  Cc: greearb, linux-kernel, waltje, ross.biro, davem, yoshfuji, netdev

On 01/09/06, Jesper Juhl <jesper.juhl@gmail.com> wrote:
> On 01/09/06, Herbert Xu <herbert@gondor.apana.org.au> wrote:
> > Jesper Juhl <jesper.juhl@gmail.com> wrote:
> > >
> > > I've just encountered the problem on a different server with an
> > > identical vlan setup. That server is running 2.6.13.4
> >
> > Do you have a simple recipe to reproduce this? Ideally it'd be a
> > script that anyone can execute in a freshly booted system that
> > exhibits the problem.
> >
> Well, the first server I saw this on only had a base install of debian
> stable on it, then I replaced the kernel, configured the vlan
> interface in /etc/network/interfaces typed 'reboot' and it failed -
> and it seems to fail reliably on reboot every time.
>
Ok, I've done some more testing and it seems, unfortunately, that I
can't trigger the problem reliably. I guess I was just "lucky" with my
first few reboots.
It now seems that uptime and/or amount of data that has flowed over
the vlan interface impacts the probability of hitting the problem.

-- 
Jesper Juhl <jesper.juhl@gmail.com>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1
  2006-09-01 11:35           ` Jesper Juhl
@ 2006-09-06 18:01             ` Ben Greear
  0 siblings, 0 replies; 8+ messages in thread
From: Ben Greear @ 2006-09-06 18:01 UTC (permalink / raw)
  To: Jesper Juhl
  Cc: Herbert Xu, linux-kernel, waltje, ross.biro, davem, yoshfuji,
	netdev

Jesper Juhl wrote:
> Ok, I've done some more testing and it seems, unfortunately, that I
> can't trigger the problem reliably. I guess I was just "lucky" with my
> first few reboots.
> It now seems that uptime and/or amount of data that has flowed over
> the vlan interface impacts the probability of hitting the problem.

Back when I was chasing the neighbor table leak, I wrote a patch to
catch ref-count leaks for net devices.  It was against 2.6.13 or so,
but if nothing else is helping, it might be worth dusting off.

I put what I believe was the last iteration of that patch here:

http://www.candelatech.com/oss/rfcnt.patch

Thanks,
Ben

-- 
Ben Greear <greearb@candelatech.com>
Candela Technologies Inc  http://www.candelatech.com


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2006-09-06 18:01 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-08-31 15:17 Unable to halt or reboot due to - unregister_netdevice: waiting for eth0.20 to become free. Usage count = 1 Jesper Juhl
2006-08-31 15:30 ` Ben Greear
2006-08-31 15:33   ` Jesper Juhl
2006-09-01  9:59     ` Jesper Juhl
2006-09-01 10:41       ` Herbert Xu
2006-09-01 10:51         ` Jesper Juhl
2006-09-01 11:35           ` Jesper Juhl
2006-09-06 18:01             ` Ben Greear

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).