* Problem with mount.nfs4 on latest Fedora 10 updates
@ 2009-08-13 16:50 Howard Wilkinson
  2009-08-13 18:04 ` Chuck Lever
  0 siblings, 1 reply; 3+ messages in thread
From: Howard Wilkinson @ 2009-08-13 16:50 UTC (permalink / raw)
  To: autofs, For users of Fedora Core releases, nfsv4

I have just upgraded a couple of servers from FC9 to FC10 and I am 
seeing a major problem with mount.nfs4. This occurs when autofs calls 
the mount program. It then runs at 100% CPU and never terminates.

I have VMs that are running a similar configuration successfully, so 
this appears to be specific to running on bare metal.

Kernel is 2.6.27.29-170.2.78.fc10.i686.PAE
nfs-utils is nfs-utils-1.1.4-8.fc10.i386
autofs is autofs-5.0.3-41.i386

Command running is

/sbin/mount.nfs4 battleaxe:/ /hosts/battleaxe -s -o 
rw,nosuid,nodev,tcp,rsize=32768,wsize=32768,hard,intr

The autofs mount has worked and the directories under /hosts/battleaxe 
have been successfully accessed prior to the problem occurring - I 
suspect this is a remount after an expire has occurred.

Anybody seen this before?
Anybody know what I can do to get round this? [I am on the way to FC11 
but will have to live with FC10 for a while (a week or so)]
Any extra information I can acquire to diagnose this?

There is nothing in the log files to indicate anything going wrong. I 
could turn debug on if I knew what to set and which messages to strip 
once I do.

Any help appreciated. Howard.


* Re: Problem with mount.nfs4 on latest Fedora 10 updates
  2009-08-13 16:50 Problem with mount.nfs4 on latest Fedora 10 updates Howard Wilkinson
@ 2009-08-13 18:04 ` Chuck Lever
  2009-08-14  7:20   ` Howard Wilkinson
  0 siblings, 1 reply; 3+ messages in thread
From: Chuck Lever @ 2009-08-13 18:04 UTC (permalink / raw)
  To: Howard Wilkinson; +Cc: autofs, For users of Fedora Core releases, nfsv4


On Aug 13, 2009, at 12:50 PM, Howard Wilkinson wrote:

> I have just upgraded a couple of servers from FC9 to FC10 and I am  
> seeing a major problem with mount.nfs4. This occurs when autofs  
> calls the mount program. It then runs at 100% CPU and never  
> terminates.
>
> I have VMs that are running similar configuration successfully, so  
> this is something driven by being on bare metal.
>
> Kernel is 2.6.27.29-170.2.78.fc10.i686.PAE
> nfs-utils is nfs-utils-1.1.4-8.fc10.i386
> autofs is autofs-5.0.3-41.i386
>
> Command running is
>
> /sbin/mount.nfs4 battleaxe:/ /hosts/battleaxe -s -o  
> rw,nosuid,nodev,tcp,rsize=32768,wsize=32768,hard,intr
>
> The autofs mount has worked and the directories under
> /hosts/battleaxe have been successfully accessed prior to the problem
> occurring - I suspect this is a remount after an expire has occurred.
>
> Anybody seen this before?
> Anybody know what I can do to get round this? [I am on the way to  
> FC11 but will have to live with FC10 for a while (a week or so)]
> Any extra information I can acquire to diagnose this?
>
> There is nothing in the log files to indicate anything going wrong,  
> I could turn debug on if I knew what to set and which messages to  
> strip once I do.

You could start with "sudo rpcdebug -m nfs -s mount" and look in
/var/log/messages, or you can strace the running mount command.
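
A minimal sketch of both suggestions follows, assuming root access and
that rpcdebug (from nfs-utils) and strace are installed. The function
names, the 60-second default, and the /tmp trace paths are hypothetical
choices for illustration, not part of any tool.

```shell
#!/bin/sh
# Sketch only: helper names and output paths are assumptions.

# Read "PID COMMAND" lines on stdin and print the PIDs whose command
# name equals $1 exactly.
pids_named() {
    awk -v name="$1" '$2 == name { print $1 }'
}

# Set the NFS client "mount" debug flag, wait while the problem is
# reproduced, then clear the flag so the log stops filling up.
# Debug output goes to the kernel log (e.g. /var/log/messages).
nfs_mount_debug() {
    seconds="${1:-60}"
    rpcdebug -m nfs -s mount || return 1
    sleep "$seconds"
    rpcdebug -m nfs -c mount
}

# Attach strace to every running mount.nfs4, one trace file per PID.
# Each strace runs in the background; stop them with kill when done.
trace_mounts() {
    for pid in $(ps -e -o pid= -o comm= | pids_named mount.nfs4); do
        strace -f -tt -p "$pid" -o "/tmp/mount.nfs4.$pid.trace" &
    done
}
```

Typical use would be to call nfs_mount_debug in one terminal, trigger the
automount in another, and then read the log. If strace shows no system
calls at all while the process is at 100% CPU, the process is most likely
spinning inside a single system call in the kernel.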

--
Chuck Lever
chuck[dot]lever[at]oracle[dot]com


* Re: Problem with mount.nfs4 on latest Fedora 10 updates
  2009-08-13 18:04 ` Chuck Lever
@ 2009-08-14  7:20   ` Howard Wilkinson
  0 siblings, 0 replies; 3+ messages in thread
From: Howard Wilkinson @ 2009-08-14  7:20 UTC (permalink / raw)
  To: Chuck Lever; +Cc: autofs, For users of Fedora Core releases, nfsv4

Chuck Lever wrote:
>
> On Aug 13, 2009, at 12:50 PM, Howard Wilkinson wrote:
>
>> I have just upgraded a couple of servers from FC9 to FC10 and I am 
>> seeing a major problem with mount.nfs4. This occurs when autofs calls 
>> the mount program. It then runs at 100% CPU and never terminates.
>>
>> I have VMs that are running similar configuration successfully, so 
>> this is something driven by being on bare metal.
>>
>> Kernel is 2.6.27.29-170.2.78.fc10.i686.PAE
>> nfs-utils is nfs-utils-1.1.4-8.fc10.i386
>> autofs is autofs-5.0.3-41.i386
>>
>> Command running is
>>
>> /sbin/mount.nfs4 battleaxe:/ /hosts/battleaxe -s -o 
>> rw,nosuid,nodev,tcp,rsize=32768,wsize=32768,hard,intr
>>
>> The autofs mount has worked and the directories under 
>> /hosts/battleaxe have been successfully accessed prior to the problem 
>> occurring - I suspect this is a remount after an expire has occurred.
>>
>> Anybody seen this before?
>> Anybody know what I can do to get round this? [I am on the way to 
>> FC11 but will have to live with FC10 for a while (a week or so)]
>> Any extra information I can acquire to diagnose this?
>>
>> There is nothing in the log files to indicate anything going wrong, I 
>> could turn debug on if I knew what to set and which messages to strip 
>> once I do.
>
> You could start with "sudo rpcdebug -m nfs -s mount" and look in 
> /var/log/messages, or you can strace the running mount command.
>
> -- 
> Chuck Lever
> chuck[dot]lever[at]oracle[dot]com

The mount.nfs4 involvement is a red herring! It would seem that the 
problem is in the kernel - probably in the NFS4 code path. I have now 
seen bash, df, and cfagent all exhibit the same failure. The processes 
go to 100% CPU and hang, probably in a kernel thread. This happens some 
time after the kernel has booted, so it may still involve autofs timing 
out the mount.

If I revert the kernel (and nothing else) to the latest FC9 version then 
everything goes back to working as it was.

Does anybody recognise these symptoms?

I am going to see if an strace will work, but once the system has failed 
it is difficult to get other processes to run to completion.
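
If strace cannot attach, one possible fallback is to list the processes
that are burning CPU and ask the kernel to dump task states to its log.
The sketch below assumes root and a procps-style ps; the function names
and the 90% threshold are hypothetical.

```shell
#!/bin/sh
# Sketch only: helper names and the threshold are assumptions.

# Read "PID STAT %CPU COMMAND" lines on stdin and print those whose
# %CPU is at or above the threshold given as $1.
busy_procs() {
    awk -v min="$1" '$3 + 0 >= min + 0 { print $0 }'
}

# List processes at or above the threshold (default 90).
# Note: ps reports %CPU averaged over the process lifetime, so a
# long-lived process that only recently started spinning may show a
# lower figure than top does.
list_busy() {
    ps -e -o pid= -o stat= -o pcpu= -o comm= | busy_procs "${1:-90}"
}

# Ask the kernel to write the state of every task to the kernel log
# (SysRq "t"). Requires root; read the result with dmesg or in
# /var/log/messages.
dump_task_states() {
    echo t > /proc/sysrq-trigger
}
```

Running list_busy first and then dump_task_states should show where in
the kernel each listed PID is sitting, which may help confirm or rule out
the NFS4 code path.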

Howard.
