Linux NFS development
 help / color / mirror / Atom feed
From: Wendy Cheng <wcheng@redhat.com>
To: Neil Brown <neilb@suse.de>
Cc: Linux NFS Mailing List <nfs@lists.sourceforge.net>
Subject: Re: Question about f_count in struct nlm_file
Date: Fri, 23 Mar 2007 18:11:04 -0400	[thread overview]
Message-ID: <46045078.4090809@redhat.com> (raw)
In-Reply-To: <17923.23007.774087.48762@notabene.brown>

Neil Brown wrote:
> On Thursday March 22, wcheng@redhat.com wrote:
>   
>> Wendy Cheng wrote:
>>
>>     
>>> client does posix lock -->
>>>     server calls nlm4svc_proc_lock() ->
>>>         * server lookup file (f_count++)
>>>         * server lock the file
>>>         * server calls nlm_release_host
>>>         * server calls nlm_release_file (f_count--)
>>>         * server return to client with status 0
>>>
>>> This will cause any call into nlm_traverse_files() to crash in the 
>>> following path, if the file happens to be of "no interest" of the search 
>>> (for example, the "match" function returns FALSE in all cases). Is this 
>>> intentional or oversight ? Would 2.6.21-rc4 be a good base to do NLM 
>>> development work ?
>>>       
>
> Hmmm.... definitely seems to be a bug in there!!
>
> The key to the issue seems to be to make sure we take account of all
> the current locks.
> One way might be to replace
> 			if (list_empty(&file->f_blocks) && !file->f_locks
> 			 && !file->f_shares && !file->f_count) {
> in nlm_traverse_files with a call to nlm_file_inuse(file).
>   

Look reasonable ... an easy and quick fix. This one has my vote.
> Another way would be to try a bit harder to keep f_count uptodate by
> incrementing it in nlmsvc_lock:
>
> diff .prev/fs/lockd/svclock.c ./fs/lockd/svclock.c
> --- .prev/fs/lockd/svclock.c	2007-03-23 15:30:37.000000000 +1100
> +++ ./fs/lockd/svclock.c	2007-03-23 15:33:41.000000000 +1100
> @@ -370,6 +370,7 @@ again:
>  
>  	switch(error) {
>  		case 0:
> +			file->f_locks ++;
>  			ret = nlm_granted;
>  			goto out;
>  		case -EAGAIN:
>
> The former is probably bit safer... but I wish I know what we have
> that f_locks count.  It is not at all clear what it is needed for.
> Maybe some legacy issue that doesn't exist any more...
>   

This f_locks is definitely another awkward counter - better not messing 
with it. My vote is "no". How about removing it from nlm_file for good ?
>
>   
>> I should have made it clear... after nlm_inspect_file(), the logic 
>> unconditionally checks for possible removing of this file. Since the 
>> file is not blocked, nothing to do with shares, and f_count is zero, it 
>> will get removed from hash and fclose() invoked (even it still owns a 
>> plock). This will make VFS very unhappy and BUG() in fs/locks.c:1988 in 
>> the middle of __fput -> locks_remove_flock.
>>
>> On the other hand, the more I think (about this issue), maybe just 
>> looping back after nlm_inspect_file finds no match would be good enough. 
>> Anyway, that's what I'm going to do. Any objection ? Please let me know.
>>     
>
> Could you explain this possible fix a bit more.  I'm not sure what you
> are proposing..
>
>   
No, scratch what I said here (since it is not related) ... will submit 
the first set of failover patches shortly after this mail.

Great thanks (as always) for looking into this.

-- Wendy


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
NFS maillist  -  NFS@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs

      reply	other threads:[~2007-03-23 22:30 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-03-23  3:58 Question about f_count in struct nlm_file Wendy Cheng
2007-03-23  4:29 ` Wendy Cheng
2007-03-23  4:38   ` Neil Brown
2007-03-23 22:11     ` Wendy Cheng [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=46045078.4090809@redhat.com \
    --to=wcheng@redhat.com \
    --cc=neilb@suse.de \
    --cc=nfs@lists.sourceforge.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox