From: Orion Poplawski <orion@cora.nwra.com>
To: "Myklebust, Trond" <Trond.Myklebust@netapp.com>
Cc: "linux-nfs@vger.kernel.org" <linux-nfs@vger.kernel.org>
Subject: Re: umount(,MNT_DETACH) for nfsv4 hangs when using sec=krb5 and network is down
Date: Wed, 13 Feb 2013 10:58:32 -0700 [thread overview]
Message-ID: <511BD448.5050901@cora.nwra.com> (raw)
In-Reply-To: <4FA345DA4F4AE44899BD2B03EEEC2FA911970163@SACEXCMBX04-PRD.hq.netapp.com>
On 12/20/2012 03:01 PM, Myklebust, Trond wrote:
> On Thu, 2012-12-20 at 14:52 -0700, Orion Poplawski wrote:
>> On 12/20/2012 01:47 PM, Myklebust, Trond wrote:
>>> On Thu, 2012-12-20 at 13:31 -0700, Orion Poplawski wrote:
>>>> On 12/19/2012 03:19 PM, Myklebust, Trond wrote:
>>>>>
>>>>> Commit eb96d5c97b0825d542e9c4ba5e0a22b519355166 (SUNRPC handle
>>>>> EKEYEXPIRED in call_refreshresult), which will be in 3.8-rc1 when Linus
>>>>> releases it, may help.
>>>>>
>>>>
>>>> FWIW - I cherry picked that into the latest Fedora rawhide kernel but no
>>>> effect. Sounds like a nice patch though, the current hang forever behavior
>>>> doesn't seem the trigger the needed "ah, need a new ticket" response.
>>>>
>>>
>>> So does simply killing the rpc.gssd process help?
>>>
>>
>> Yes, if automount is already stopped (these are automounted directories). If
>> automount is running, it still seems to hang. I think I'm going to need to
>> spend some time talking to Ian.
>>
>
> I'd suggest also taking a long hard look at rpc.gssd and making sure
> that it handles ENETUNREACH, ECONNREFUSED and friends correctly. I
> suspect right now it is just baling out of the upcall instead of
> completing it by propagating the error reply to the kernel.
>
Actually, I take that back - I'm not sure it's directly involved and killing
rpc.gssd doesn't seem to be helping me now. I connected to rpc.gssd with
strace, dropped the interface and tried to umount.nfs4 -l but rpc.gssd is
still in poll and doesn't do anyting. kernel process trace shows:
[ 2788.807017] umount.nfs4 D ffff88007cc13d40 0 3001 3000 0x00000084
[ 2788.807017] ffff8800361319a8 0000000000000082 ffff880036131fd8
0000000000013d40
[ 2788.807017] ffff880036131fd8 0000000000013d40 ffff8800773add80
ffff8800773add80
[ 2788.807017] ffff88007cfe2cb8 0000000000000082 ffffffffa0009cc0
ffff880036131a20
[ 2788.807017] Call Trace:
[ 2788.807017] [<ffffffffa0009cc0>] ?
__rpc_wait_for_completion_task팝䱷 [sunrpc]
[ 2788.807017] [<ffffffff81634b39>] schedule팝뷿䱻
[ 2788.807017] [<ffffffffa0009cf5>] rpc_wait_bit_killable팝刺䱽 [sunrpc]
[ 2788.807017] [<ffffffff816335a0>] __wait_on_bit팞듿䱽
[ 2788.807017] [<ffffffffa0001c50>] ? call_connect팟瓿䱽 [sunrpc]
[ 2788.807017] [<ffffffffa0009cc0>] ?
__rpc_wait_for_completion_task팝䱷 [sunrpc]
[ 2788.807017] [<ffffffff81633707>] out_of_line_wait_on_bit팞ﯿ䱽
[ 2788.807017] [<ffffffff81080560>] ? autoremove_wake_function팞㓿䱸
[ 2788.807017] [<ffffffffa0001c50>] ? call_connect팟瓿䱽 [sunrpc]
[ 2788.807017] [<ffffffffa0001c50>] ? call_connect팟瓿䱽 [sunrpc]
[ 2788.807017] [<ffffffffa000ac7a>] __rpc_execute팝睫ﴱ� [sunrpc]
[ 2788.807017] [<ffffffffa000bd65>] rpc_execute팞秿䱽 [sunrpc]
[ 2788.807017] [<ffffffffa0002e60>] rpc_run_task팞䱽 [sunrpc]
[ 2788.807017] [<ffffffffa0002ec3>] rpc_call_sync팞㟿䱚 [sunrpc]
[ 2788.807017] [<ffffffffa01f5653>] _nfs4_call_sync팝矿䱶 [nfsv4]
[ 2788.807017] [<ffffffffa01f4e50>] _nfs4_proc_getattr팖䱜 [nfsv4]
[ 2788.807017] [<ffffffffa01f9b9e>] nfs4_proc_getattr팞ỿ䱻 [nfsv4]
[ 2788.807017] [<ffffffffa01b67bc>] __nfs_revalidate_inode팟䱶 [nfs]
[ 2788.807017] [<ffffffffa01b69a3>] nfs_revalidate_inode팞䱚 [nfs]
[ 2788.807017] [<ffffffffa01afc60>] nfs_check_verifier팞瓿䱼 [nfs]
[ 2788.807017] [<ffffffffa01b255b>] nfs_lookup_revalidate팝齯ﴱ [nfs]
[ 2788.807017] [<ffffffffa01b2705>] nfs4_lookup_revalidate팝刺䱞 [nfs]
[ 2788.807017] [<ffffffff811a18fb>] complete_walk팖�䱵
[ 2788.807017] [<ffffffff811a3310>] path_lookupat팞䱻
[ 2788.807017] [<ffffffff811a216f>] ? getname_flags팞䱵
[ 2788.807017] [<ffffffff811a3abb>] filename_lookup팝鯿䱜
[ 2788.807017] [<ffffffff811a67c4>] user_path_at_empty팞磿䱽
[ 2788.807017] [<ffffffff8117e4e6>] ? kmem_cache_free팞㫿䱵
[ 2788.807017] [<ffffffff8115d4e3>] ? remove_vma팞럿䱻
[ 2788.807017] [<ffffffff811a6811>] user_path_at팝痿䱶
[ 2788.807017] [<ffffffff811b57af>] sys_umount팝�䱷
[ 2788.807017] [<ffffffff81639f7e>] ? do_page_fault팗뿓ᵴ
[ 2788.807017] [<ffffffff8163e419>] system_call_fastpath팝竿䱵
But every other process in schedule.
The mount point gets "deleted":
# grep mnt /proc/mounts
earth:/export/home/orion /mnt\040(deleted) nfs4
rw,relatime,vers=4.0,rsize=1048576,wsize=1048576,namlen=255,hard,proto=tcp,port=0,timeo=600,retrans=2,sec=krb5,clientaddr=10.10.11.101,local_lock=none,addr=10.10.10.1
0 0
but that's it.
--
Orion Poplawski
Technical Manager 303-415-9701 x222
NWRA, Boulder Office FAX: 303-415-9702
3380 Mitchell Lane orion@nwra.com
Boulder, CO 80301 http://www.nwra.com
prev parent reply other threads:[~2013-02-13 19:02 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-12-19 20:47 umount(,MNT_DETACH) for nfsv4 hangs when using sec=krb5 and network is down Orion Poplawski
2012-12-19 21:08 ` Myklebust, Trond
2012-12-19 21:50 ` Orion Poplawski
2012-12-19 22:19 ` Myklebust, Trond
2012-12-20 20:31 ` Orion Poplawski
2012-12-20 20:47 ` Myklebust, Trond
2012-12-20 21:52 ` Orion Poplawski
2012-12-20 22:01 ` Myklebust, Trond
2013-02-13 17:58 ` Orion Poplawski [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=511BD448.5050901@cora.nwra.com \
--to=orion@cora.nwra.com \
--cc=Trond.Myklebust@netapp.com \
--cc=linux-nfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.