All of lore.kernel.org
 help / color / mirror / Atom feed
* automount in status D
@ 2008-06-27  8:05 Carsten Aulbert
  2008-06-27 11:06 ` Ian Kent
  0 siblings, 1 reply; 6+ messages in thread
From: Carsten Aulbert @ 2008-06-27  8:05 UTC (permalink / raw)
  To: autofs

Hi all,

we have a weird problem here and I fear I need to reboot the box
(painful, many users involved), but maybe there is a way out:

I'm automounting many nodes and usually everything seems to work nicely.
However, I'm seeing this here right now:

# ps aux|grep auto.node
root      2022  0.0  0.0  30304  1240 ?        Ds   Jun19   0:21 /usr/sbin/automount --pid-file=/var/run/autofs/_atlas_node.pid --timeout=5 /atlas/node yp auto.node_local
root     11489  0.0  0.0  30516  1108 ?        D    Jun26   0:00 /usr/sbin/automount --pid-file=/var/run/autofs/_atlas_node.pid --timeout=5 /atlas/node yp auto.node_local
root     28610  0.0  0.0   2736   656 pts/31   S+   10:03   0:00 grep auto.node


As you can see there are two automounter processes in D (or Ds) state sitting there for the same mount point (the PID-file contains 2022). 

My questions:

Any idea how this happened?

Any idea how to get this resolved (possibly without a reboot)?

TIA

Carsten

PS: 
# ypcat auto.node_local
-fstype=nfs,nfsvers=3,soft,intr,rsize=32768,wsize=32768,tcp       &:/local

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: automount in status D
  2008-06-27  8:05 automount in status D Carsten Aulbert
@ 2008-06-27 11:06 ` Ian Kent
  2008-06-27 11:35   ` Carsten Aulbert
  0 siblings, 1 reply; 6+ messages in thread
From: Ian Kent @ 2008-06-27 11:06 UTC (permalink / raw)
  To: Carsten Aulbert; +Cc: autofs


On Fri, 2008-06-27 at 10:05 +0200, Carsten Aulbert wrote:
> Hi all,
> 
> we have a weird problem here and I fear I need to reboot the box
> (painful, many users involved), but maybe there is a way out:
> 
> I'm automounting many nodes and usually everything seems to work nicely.
> However, I'm seeing this here right now:
> 
> # ps aux|grep auto.node
> root      2022  0.0  0.0  30304  1240 ?        Ds   Jun19   0:21 /usr/sbin/automount --pid-file=/var/run/autofs/_atlas_node.pid --timeout=5 /atlas/node yp auto.node_local
> root     11489  0.0  0.0  30516  1108 ?        D    Jun26   0:00 /usr/sbin/automount --pid-file=/var/run/autofs/_atlas_node.pid --timeout=5 /atlas/node yp auto.node_local
> root     28610  0.0  0.0   2736   656 pts/31   S+   10:03   0:00 grep auto.node
> 
> 
> As you can see there are two automounter processes in D (or Ds) state sitting there for the same mount point (the PID-file contains 2022). 
> 
> My questions:
> 
> Any idea how this happened?

It's been a long time since I've seen this reported.
What distro and kernel?

> 
> Any idea how to get this resolved (possibly without a reboot)?

There is nothing you can do if the process is stuck in an
uninterruptible state in the kernel.

Ian

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: automount in status D
  2008-06-27 11:06 ` Ian Kent
@ 2008-06-27 11:35   ` Carsten Aulbert
  2008-06-27 12:45     ` Ian Kent
  0 siblings, 1 reply; 6+ messages in thread
From: Carsten Aulbert @ 2008-06-27 11:35 UTC (permalink / raw)
  To: autofs



Ian Kent wrote:

>> Any idea how this happened?
> 
> It's been a long time since I've seen this reported.
> What distro and kernel?
> 

Sorry, Debian Etch with self compiled kernel:
h2:~# uname -a
Linux h2 2.6.24.4-storage #1 SMP Sun Apr 20 09:33:42 CEST 2008 x86_64
x86_64 x86_64 GNU/Linux
h2:~# apt-cache show autofs|grep Version
Version: 4.1.4-13

>> Any idea how to get this resolved (possibly without a reboot)?
> 
> There is nothing you can do if the process is stuck in an
> uninterruptible state in the kernel.

Well, as bad as I feared it to be :(

Thanks already, reboot is already announced to the users. Anything I
might help you with to corner that bug?

Cheers

Carsten

-- 
Dr. Carsten Aulbert - Max Planck Institut für Gravitationsphysik
Callinstraße 38, 30167 Hannover, Germany
Fon: +49 511 762 17185, Fax: +49 511 762 17193
http://www.top500.org/system/9234 | http://www.top500.org/connfam/6/list/31

_______________________________________________
autofs mailing list
autofs@linux.kernel.org
http://linux.kernel.org/mailman/listinfo/autofs

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: automount in status D
  2008-06-27 11:35   ` Carsten Aulbert
@ 2008-06-27 12:45     ` Ian Kent
  2008-06-27 12:59       ` Carsten Aulbert
  2008-06-27 13:46       ` Jeff Moyer
  0 siblings, 2 replies; 6+ messages in thread
From: Ian Kent @ 2008-06-27 12:45 UTC (permalink / raw)
  To: Carsten Aulbert; +Cc: autofs


On Fri, 2008-06-27 at 13:35 +0200, Carsten Aulbert wrote:
> 
> Ian Kent wrote:
> 
> >> Any idea how this happened?
> > 
> > It's been a long time since I've seen this reported.
> > What distro and kernel?
> > 
> 
> Sorry, Debian Etch with self compiled kernel:
> h2:~# uname -a
> Linux h2 2.6.24.4-storage #1 SMP Sun Apr 20 09:33:42 CEST 2008 x86_64
> x86_64 x86_64 GNU/Linux
> h2:~# apt-cache show autofs|grep Version
> Version: 4.1.4-13
> 
> >> Any idea how to get this resolved (possibly without a reboot)?
> > 
> > There is nothing you can do if the process is stuck in an
> > uninterruptible state in the kernel.
> 
> Well, as bad as I feared it to be :(
> 
> Thanks already, reboot is already announced to the users. Anything I
> might help you with to corner that bug?

Aggh .. it's been soooo long since I've heard of this happening I can't
remember.

One thing that is interesting is that the autofs4 kernel module doesn't
do an uninterruptible sleep "anywhere". It hasn't since long before
2.6.24. So if user space can't recover by doing a control-c (maybe even
a couple of times) then it's possibly not actually autofs.

Ian

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: automount in status D
  2008-06-27 12:45     ` Ian Kent
@ 2008-06-27 12:59       ` Carsten Aulbert
  2008-06-27 13:46       ` Jeff Moyer
  1 sibling, 0 replies; 6+ messages in thread
From: Carsten Aulbert @ 2008-06-27 12:59 UTC (permalink / raw)
  To: autofs

Hi Ian,

Ian Kent wrote:

> One thing that is interesting is that the autofs4 kernel module doesn't
> do an uninterruptible sleep "anywhere". It hasn't since long before
> 2.6.24. So if user space can't recover by doing a control-c (maybe even
> a couple of times) then it's possibly not actually autofs.

Well, when I tried to umount it manually (with options -f -l -i) stopped
indefinitely when doing a readlink on the mount point itself and never
returned from its own account.

I was able to CTRL-C this one but not getting the mount point removed. A
nasty side effect was that df never returned either.

I did a bit more log reading and it seems that the node got a mount as
well as a umount request yesterday evening and from the nodes
perspective everything looked fine.

In the end I guess this falls into the category of: Hopefully it won't
happen again too soon ;)

If you come up with an idea, please let me know. I'll stay on the list,
reading quietly.

Cheers

Carsten

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: automount in status D
  2008-06-27 12:45     ` Ian Kent
  2008-06-27 12:59       ` Carsten Aulbert
@ 2008-06-27 13:46       ` Jeff Moyer
  1 sibling, 0 replies; 6+ messages in thread
From: Jeff Moyer @ 2008-06-27 13:46 UTC (permalink / raw)
  To: Ian Kent; +Cc: autofs

Ian Kent <raven@themaw.net> writes:

> On Fri, 2008-06-27 at 13:35 +0200, Carsten Aulbert wrote:
>> 
>> Ian Kent wrote:
>> 
>> >> Any idea how this happened?
>> > 
>> > It's been a long time since I've seen this reported.
>> > What distro and kernel?
>> > 
>> 
>> Sorry, Debian Etch with self compiled kernel:
>> h2:~# uname -a
>> Linux h2 2.6.24.4-storage #1 SMP Sun Apr 20 09:33:42 CEST 2008 x86_64
>> x86_64 x86_64 GNU/Linux
>> h2:~# apt-cache show autofs|grep Version
>> Version: 4.1.4-13
>> 
>> >> Any idea how to get this resolved (possibly without a reboot)?
>> > 
>> > There is nothing you can do if the process is stuck in an
>> > uninterruptible state in the kernel.
>> 
>> Well, as bad as I feared it to be :(
>> 
>> Thanks already, reboot is already announced to the users. Anything I
>> might help you with to corner that bug?
>
> Aggh .. it's been soooo long since I've heard of this happening I can't
> remember.

A sysrq-t would be a good starting point.

-Jeff

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2008-06-27 13:46 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-06-27  8:05 automount in status D Carsten Aulbert
2008-06-27 11:06 ` Ian Kent
2008-06-27 11:35   ` Carsten Aulbert
2008-06-27 12:45     ` Ian Kent
2008-06-27 12:59       ` Carsten Aulbert
2008-06-27 13:46       ` Jeff Moyer

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.