public inbox for linux-nfs@vger.kernel.org
From: "Carlos André" <candrecn@gmail.com>
To: Chuck Lever <chuck.lever@oracle.com>
Cc: Ian Kent <ikent@redhat.com>, NFS list <linux-nfs@vger.kernel.org>,
	Linux NFSv4 mailing list <nfsv4@linux-nfs.org>
Subject: Re: AutoFS+NFSv4 server down = LOOOOONG timeout.
Date: Thu, 17 Sep 2009 09:58:45 -0300
Message-ID: <f6ce31e30909170558i184b2197ye65d7e04fd2fad17@mail.gmail.com>
In-Reply-To: <0823762D-BD01-4C34-B550-AEB7F838FF1A@oracle.com>

Hi all,

Any news on this problem? :)

Thanks.

2009/8/27 Chuck Lever <chuck.lever@oracle.com>:
> On Aug 27, 2009, at 11:00 AM, Trond Myklebust wrote:
>>
>> On Thu, 2009-08-27 at 10:54 -0400, Chuck Lever wrote:
>>>
>>> On Aug 27, 2009, at 10:52 AM, Trond Myklebust wrote:
>>>>
>>>> On Thu, 2009-08-27 at 10:38 -0400, Chuck Lever wrote:
>>>>>
>>>>> On Aug 27, 2009, at 4:54 AM, Ian Kent wrote:
>>>>>>
>>>>>> Ian Kent wrote:
>>>>>>>
>>>>>>> Carlos André wrote:
>>>>>>>>
>>>>>>>> Hi Ian,
>>>>>>>>
>>>>>>>> Thanks for the patch, and sorry for the delay (I was expecting to
>>>>>>>> receive your reply on the bug tracker, not here) :)
>>>>>>>>
>>>>>>>> But this patch didn't work for me as expected... :(
>>>>>>>>
>>>>>>>>
>>>>>>>> First I changed "#MOUNT_WAIT=-1" to "MOUNT_WAIT=10",
>>>>>>>> and later changed "10" to "2", with the same results...
>>>>>>>> (always restarting the service, of course :)
>>>>>>>>
>>>>>>>> Then I tried removing "sec=krb5p", and later removed "nfs4", but I got
>>>>>>>> the same results again.
>>>>>>>>
>>>>>>>> Or am I doing something wrong?
>>>>>>>>
>>>>>>>>
>>>>>>>> [root@KSTATION areas]# automount -V
>>>>>>>>
>>>>>>>> Linux automount version 5.0.1-0.rc2.131.bz517349.1
>>>>>>>> [...]
>>>>>>>>
>>>>>>>> [root@KSTATION areas]# time ls -la testdown
>>>>>>>> ls: testedown: No such file or directory
>>>>>>>>
>>>>>>>> real    3m9.006s
>>>>>>>> user    0m0.002s
>>>>>>>> sys     0m0.000s
>>>>>>>
>>>>>>> OK, that isn't behaving the way I expect, I'll have a look.
>>>>>>>
>>>>>>>>
>>>>>>>> LOGGING:
>>>>>>>> -----------------------------------------
>>>>>>>> Aug 24 09:23:51 KSTATION automount[20803]: mount_mount: mount(nfs): calling mount -t nfs4 -s -o rw,acl,sec=krb5p 1.2.3.4:/areas/testdown /misc/areas/testdown
>>>>>>>> Aug 24 09:27:00 KSTATION automount[20803]: mount(nfs): nfs: mount failure 1.2.3.4:/areas/testdown on /misc/areas/testdown
>>>>>>>> Aug 24 09:27:00 KSTATION automount[20803]: ioctl_send_fail: token = 91
>>>>>>>> Aug 24 09:27:00 KSTATION automount[20803]: failed to mount /misc/areas/testdown
>>>>>>>> -----------------------------------------
>>>>>>
>>>>>> Having a look at this, I suspect the reason it doesn't work as expected
>>>>>> is that the waitpid(2) we do after sending the TERM signal to the mount
>>>>>> process (which we have to do) is not returning. This is likely because
>>>>>> the mount process isn't giving up as quickly as it used to.
>>>>>
>>>>> You're thinking maybe mount(2) should be as interruptible as the
>>>>> socket calls that the mount command used to do?  That might be
>>>>> reasonable, and I can take a look at that.
>>>>
>>>> In recent kernels, all those RPC calls should be using TASK_KILLABLE
>>>> sleep states. SIGTERM should cause them to abort, provided that some
>>>> process isn't blocking it.
>>>>
>>>> Perhaps TASK_KILLABLE could be backported to RHEL-5?
>>>
>>> That's pretty extensive, with hooks in the page cache.  I doubt RH
>>> would go for that.
>>
>> You don't have to add the hooks in the page cache in order to make mount
>> interruptible. You just need to replace the sigmask-manipulation in
>> net/sunrpc and fs/nfs (a.k.a. rpc_clnt_sigmask()/rpc_clnt_sigunmask())
>> with TASK_KILLABLE.
>
> That sounds like a schlep.
>
>> Alternatively, it might suffice to just turn on the 'intr' flag
>> temporarily while doing the mount path walk, and then switch it to
>> whatever default the user actually specified afterwards.
>
> That sounds easy, especially for an EL5 kernel.  Maybe "soft" too for the
> first few requests?
>
> --
> Chuck Lever
> chuck[dot]lever[at]oracle[dot]com
>
>
>
>
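
Ian's diagnosis quoted above (the waitpid(2) that follows the TERM signal never returns) can be illustrated with a small sketch. This is not autofs code, which is written in C; the names run_with_deadline, wait_secs and grace_secs are hypothetical, and the approach shown (a bounded wait after SIGTERM, then SIGKILL) is only one possible way to keep the caller from blocking for minutes.

```python
import signal
import subprocess
import sys


def run_with_deadline(argv, wait_secs, grace_secs=1.0):
    """Run argv and return (exit_status, how).

    how is "exited" if the child finished on its own, "term" if it
    died after SIGTERM, and "kill" if SIGKILL was needed.
    All names here are illustrative, not taken from autofs.
    """
    proc = subprocess.Popen(argv)

    # Step 1: give the child up to wait_secs to finish normally.
    try:
        return proc.wait(timeout=wait_secs), "exited"
    except subprocess.TimeoutExpired:
        pass

    # Step 2: ask it to stop, but bound the wait instead of
    # blocking indefinitely as an unbounded waitpid(2) would.
    proc.send_signal(signal.SIGTERM)
    try:
        return proc.wait(timeout=grace_secs), "term"
    except subprocess.TimeoutExpired:
        pass

    # Step 3: escalate. A child stuck in an uninterruptible kernel
    # sleep will not react even to SIGKILL until it wakes up, so
    # this final wait can still block on kernels where the mount
    # path is not killable.
    proc.kill()
    return proc.wait(), "kill"
```

With a child that ignores SIGTERM, the function returns after roughly wait_secs + grace_secs rather than hanging. It does not help when the child is blocked inside the kernel, which is why the rest of the thread turns to making the mount path itself interruptible.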



-- 

Best regards,
Carlos André
LPIC-1 / LPIC-2 / CCNA / CCNP

candrecn.at.gmail.dot.com
