public inbox for linux-nfs@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists()
@ 2014-09-17 14:50 Steve Dickson
  2014-09-17 14:55 ` Trond Myklebust
  0 siblings, 1 reply; 5+ messages in thread
From: Steve Dickson @ 2014-09-17 14:50 UTC (permalink / raw)
  To: Trond Myklebust; +Cc: Linux NFS Mailing list, Stable Tree

There is a race between nfs4_state_manager() and
nfs_server_remove_lists() that happens during a nfsv3 mount.

The v3 mount notices there is already a supper block so
nfs_server_remove_lists() called which uses the nfs_client_lock
spin lock to synchronize access to the client list.

At the same time nfs4_state_manager() is running through
the client list looking for work to do, using the same
lock. When nfs4_state_manager() wins the race to the
list, a v3 client pointer is found and not ignored
properly which causes the panic.

Moving some protocol checks before the state checking
avoids the panic.

CC: Stable Tree <stable@vger.kernel.org>
Signed-off-by: Steve Dickson <steved@redhat.com>
---
 fs/nfs/nfs4client.c | 19 ++++++++++---------
 1 file changed, 10 insertions(+), 9 deletions(-)

diff --git a/fs/nfs/nfs4client.c b/fs/nfs/nfs4client.c
index 53e435a..7ff4c02 100644
--- a/fs/nfs/nfs4client.c
+++ b/fs/nfs/nfs4client.c
@@ -622,6 +622,16 @@ int nfs41_walk_client_list(struct nfs_client *new,
 
 	spin_lock(&nn->nfs_client_lock);
 	list_for_each_entry(pos, &nn->nfs_client_list, cl_share_link) {
+
+		if (pos->rpc_ops != new->rpc_ops)
+			continue;
+
+		if (pos->cl_proto != new->cl_proto)
+			continue;
+
+		if (pos->cl_minorversion != new->cl_minorversion)
+			continue;
+
 		/* If "pos" isn't marked ready, we can't trust the
 		 * remaining fields in "pos", especially the client
 		 * ID and serverowner fields.  Wait for CREATE_SESSION
@@ -647,15 +657,6 @@ int nfs41_walk_client_list(struct nfs_client *new,
 		if (pos->cl_cons_state != NFS_CS_READY)
 			continue;
 
-		if (pos->rpc_ops != new->rpc_ops)
-			continue;
-
-		if (pos->cl_proto != new->cl_proto)
-			continue;
-
-		if (pos->cl_minorversion != new->cl_minorversion)
-			continue;
-
 		if (!nfs4_match_clientids(pos, new))
 			continue;
 
-- 
1.8.3.1


^ permalink raw reply related	[flat|nested] 5+ messages in thread

* Re: [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists()
  2014-09-17 14:50 [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists() Steve Dickson
@ 2014-09-17 14:55 ` Trond Myklebust
  2014-09-17 15:00   ` Anna Schumaker
  2014-09-18 13:13   ` Steve Dickson
  0 siblings, 2 replies; 5+ messages in thread
From: Trond Myklebust @ 2014-09-17 14:55 UTC (permalink / raw)
  To: Steve Dickson; +Cc: Linux NFS Mailing list

On Wed, Sep 17, 2014 at 10:50 AM, Steve Dickson <steved@redhat.com> wrote:
> There is a race between nfs4_state_manager() and
> nfs_server_remove_lists() that happens during a nfsv3 mount.
>
> The v3 mount notices there is already a supper block so
> nfs_server_remove_lists() called which uses the nfs_client_lock
> spin lock to synchronize access to the client list.
>
> At the same time nfs4_state_manager() is running through
> the client list looking for work to do, using the same
> lock. When nfs4_state_manager() wins the race to the
> list, a v3 client pointer is found and not ignored
> properly which causes the panic.
>
> Moving some protocol checks before the state checking
> avoids the panic.
>
> CC: Stable Tree <stable@vger.kernel.org>
> Signed-off-by: Steve Dickson <steved@redhat.com>
> ---
>  fs/nfs/nfs4client.c | 19 ++++++++++---------
>  1 file changed, 10 insertions(+), 9 deletions(-)
>
> diff --git a/fs/nfs/nfs4client.c b/fs/nfs/nfs4client.c
> index 53e435a..7ff4c02 100644
> --- a/fs/nfs/nfs4client.c
> +++ b/fs/nfs/nfs4client.c
> @@ -622,6 +622,16 @@ int nfs41_walk_client_list(struct nfs_client *new,
>
>         spin_lock(&nn->nfs_client_lock);
>         list_for_each_entry(pos, &nn->nfs_client_list, cl_share_link) {
> +
> +               if (pos->rpc_ops != new->rpc_ops)
> +                       continue;
> +
> +               if (pos->cl_proto != new->cl_proto)
> +                       continue;
> +
> +               if (pos->cl_minorversion != new->cl_minorversion)
> +                       continue;
> +
>                 /* If "pos" isn't marked ready, we can't trust the
>                  * remaining fields in "pos", especially the client
>                  * ID and serverowner fields.  Wait for CREATE_SESSION
> @@ -647,15 +657,6 @@ int nfs41_walk_client_list(struct nfs_client *new,
>                 if (pos->cl_cons_state != NFS_CS_READY)
>                         continue;
>
> -               if (pos->rpc_ops != new->rpc_ops)
> -                       continue;
> -
> -               if (pos->cl_proto != new->cl_proto)
> -                       continue;
> -
> -               if (pos->cl_minorversion != new->cl_minorversion)
> -                       continue;
> -
>                 if (!nfs4_match_clientids(pos, new))
>                         continue;
>
> --
> 1.8.3.1
>

Don't we need the same fix in nfs40_walk_client_list?

Cheers
  Trond

-- 
Trond Myklebust

Linux NFS client maintainer, PrimaryData

trond.myklebust@primarydata.com

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists()
  2014-09-17 14:55 ` Trond Myklebust
@ 2014-09-17 15:00   ` Anna Schumaker
  2014-09-18 13:17     ` Steve Dickson
  2014-09-18 13:13   ` Steve Dickson
  1 sibling, 1 reply; 5+ messages in thread
From: Anna Schumaker @ 2014-09-17 15:00 UTC (permalink / raw)
  To: Trond Myklebust, Steve Dickson; +Cc: Linux NFS Mailing list

On 09/17/2014 10:55 AM, Trond Myklebust wrote:
> On Wed, Sep 17, 2014 at 10:50 AM, Steve Dickson <steved@redhat.com> wrote:
>> There is a race between nfs4_state_manager() and
>> nfs_server_remove_lists() that happens during a nfsv3 mount.
>>
>> The v3 mount notices there is already a supper block so
>> nfs_server_remove_lists() called which uses the nfs_client_lock
>> spin lock to synchronize access to the client list.
>>
>> At the same time nfs4_state_manager() is running through
>> the client list looking for work to do, using the same
>> lock. When nfs4_state_manager() wins the race to the
>> list, a v3 client pointer is found and not ignored
>> properly which causes the panic.
>>
>> Moving some protocol checks before the state checking
>> avoids the panic.
>>
>> CC: Stable Tree <stable@vger.kernel.org>
>> Signed-off-by: Steve Dickson <steved@redhat.com>
>> ---
>>  fs/nfs/nfs4client.c | 19 ++++++++++---------
>>  1 file changed, 10 insertions(+), 9 deletions(-)
>>
>> diff --git a/fs/nfs/nfs4client.c b/fs/nfs/nfs4client.c
>> index 53e435a..7ff4c02 100644
>> --- a/fs/nfs/nfs4client.c
>> +++ b/fs/nfs/nfs4client.c
>> @@ -622,6 +622,16 @@ int nfs41_walk_client_list(struct nfs_client *new,
>>
>>         spin_lock(&nn->nfs_client_lock);
>>         list_for_each_entry(pos, &nn->nfs_client_list, cl_share_link) {
>> +
>> +               if (pos->rpc_ops != new->rpc_ops)
>> +                       continue;
>> +
>> +               if (pos->cl_proto != new->cl_proto)
>> +                       continue;
>> +
>> +               if (pos->cl_minorversion != new->cl_minorversion)
>> +                       continue;
>> +
>>                 /* If "pos" isn't marked ready, we can't trust the
>>                  * remaining fields in "pos", especially the client
>>                  * ID and serverowner fields.  Wait for CREATE_SESSION
>> @@ -647,15 +657,6 @@ int nfs41_walk_client_list(struct nfs_client *new,
>>                 if (pos->cl_cons_state != NFS_CS_READY)
>>                         continue;
>>
>> -               if (pos->rpc_ops != new->rpc_ops)
>> -                       continue;
>> -
>> -               if (pos->cl_proto != new->cl_proto)
>> -                       continue;
>> -
>> -               if (pos->cl_minorversion != new->cl_minorversion)
>> -                       continue;
>> -
>>                 if (!nfs4_match_clientids(pos, new))
>>                         continue;
>>
>> --
>> 1.8.3.1
>>
> Don't we need the same fix in nfs40_walk_client_list?
Bonus points for finding a way to merge these functions, since they do similar comparisons in the beginning :)

Anna

>
> Cheers
>   Trond
>


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists()
  2014-09-17 14:55 ` Trond Myklebust
  2014-09-17 15:00   ` Anna Schumaker
@ 2014-09-18 13:13   ` Steve Dickson
  1 sibling, 0 replies; 5+ messages in thread
From: Steve Dickson @ 2014-09-18 13:13 UTC (permalink / raw)
  To: Trond Myklebust; +Cc: Linux NFS Mailing list



On 09/17/2014 10:55 AM, Trond Myklebust wrote:
> On Wed, Sep 17, 2014 at 10:50 AM, Steve Dickson <steved@redhat.com> wrote:
>> There is a race between nfs4_state_manager() and
>> nfs_server_remove_lists() that happens during a nfsv3 mount.
>>
>> The v3 mount notices there is already a supper block so
>> nfs_server_remove_lists() called which uses the nfs_client_lock
>> spin lock to synchronize access to the client list.
>>
>> At the same time nfs4_state_manager() is running through
>> the client list looking for work to do, using the same
>> lock. When nfs4_state_manager() wins the race to the
>> list, a v3 client pointer is found and not ignored
>> properly which causes the panic.
>>
>> Moving some protocol checks before the state checking
>> avoids the panic.
>>
>> CC: Stable Tree <stable@vger.kernel.org>
>> Signed-off-by: Steve Dickson <steved@redhat.com>
>> ---
>>  fs/nfs/nfs4client.c | 19 ++++++++++---------
>>  1 file changed, 10 insertions(+), 9 deletions(-)
>>
>> diff --git a/fs/nfs/nfs4client.c b/fs/nfs/nfs4client.c
>> index 53e435a..7ff4c02 100644
>> --- a/fs/nfs/nfs4client.c
>> +++ b/fs/nfs/nfs4client.c
>> @@ -622,6 +622,16 @@ int nfs41_walk_client_list(struct nfs_client *new,
>>
>>         spin_lock(&nn->nfs_client_lock);
>>         list_for_each_entry(pos, &nn->nfs_client_list, cl_share_link) {
>> +
>> +               if (pos->rpc_ops != new->rpc_ops)
>> +                       continue;
>> +
>> +               if (pos->cl_proto != new->cl_proto)
>> +                       continue;
>> +
>> +               if (pos->cl_minorversion != new->cl_minorversion)
>> +                       continue;
>> +
>>                 /* If "pos" isn't marked ready, we can't trust the
>>                  * remaining fields in "pos", especially the client
>>                  * ID and serverowner fields.  Wait for CREATE_SESSION
>> @@ -647,15 +657,6 @@ int nfs41_walk_client_list(struct nfs_client *new,
>>                 if (pos->cl_cons_state != NFS_CS_READY)
>>                         continue;
>>
>> -               if (pos->rpc_ops != new->rpc_ops)
>> -                       continue;
>> -
>> -               if (pos->cl_proto != new->cl_proto)
>> -                       continue;
>> -
>> -               if (pos->cl_minorversion != new->cl_minorversion)
>> -                       continue;
>> -
>>                 if (!nfs4_match_clientids(pos, new))
>>                         continue;
>>
>> --
>> 1.8.3.1
>>
> 
> Don't we need the same fix in nfs40_walk_client_list?
Yes... Just posted version 2...

steved.
 
> 
> Cheers
>   Trond
> 

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists()
  2014-09-17 15:00   ` Anna Schumaker
@ 2014-09-18 13:17     ` Steve Dickson
  0 siblings, 0 replies; 5+ messages in thread
From: Steve Dickson @ 2014-09-18 13:17 UTC (permalink / raw)
  To: Anna Schumaker, Trond Myklebust; +Cc: Linux NFS Mailing list



On 09/17/2014 11:00 AM, Anna Schumaker wrote:
> On 09/17/2014 10:55 AM, Trond Myklebust wrote:
>> On Wed, Sep 17, 2014 at 10:50 AM, Steve Dickson <steved@redhat.com> wrote:
>>> There is a race between nfs4_state_manager() and
>>> nfs_server_remove_lists() that happens during a nfsv3 mount.
>>>
>>> The v3 mount notices there is already a supper block so
>>> nfs_server_remove_lists() called which uses the nfs_client_lock
>>> spin lock to synchronize access to the client list.
>>>
>>> At the same time nfs4_state_manager() is running through
>>> the client list looking for work to do, using the same
>>> lock. When nfs4_state_manager() wins the race to the
>>> list, a v3 client pointer is found and not ignored
>>> properly which causes the panic.
>>>
>>> Moving some protocol checks before the state checking
>>> avoids the panic.
>>>
>>> CC: Stable Tree <stable@vger.kernel.org>
>>> Signed-off-by: Steve Dickson <steved@redhat.com>
>>> ---
>>>  fs/nfs/nfs4client.c | 19 ++++++++++---------
>>>  1 file changed, 10 insertions(+), 9 deletions(-)
>>>
>>> diff --git a/fs/nfs/nfs4client.c b/fs/nfs/nfs4client.c
>>> index 53e435a..7ff4c02 100644
>>> --- a/fs/nfs/nfs4client.c
>>> +++ b/fs/nfs/nfs4client.c
>>> @@ -622,6 +622,16 @@ int nfs41_walk_client_list(struct nfs_client *new,
>>>
>>>         spin_lock(&nn->nfs_client_lock);
>>>         list_for_each_entry(pos, &nn->nfs_client_list, cl_share_link) {
>>> +
>>> +               if (pos->rpc_ops != new->rpc_ops)
>>> +                       continue;
>>> +
>>> +               if (pos->cl_proto != new->cl_proto)
>>> +                       continue;
>>> +
>>> +               if (pos->cl_minorversion != new->cl_minorversion)
>>> +                       continue;
>>> +
>>>                 /* If "pos" isn't marked ready, we can't trust the
>>>                  * remaining fields in "pos", especially the client
>>>                  * ID and serverowner fields.  Wait for CREATE_SESSION
>>> @@ -647,15 +657,6 @@ int nfs41_walk_client_list(struct nfs_client *new,
>>>                 if (pos->cl_cons_state != NFS_CS_READY)
>>>                         continue;
>>>
>>> -               if (pos->rpc_ops != new->rpc_ops)
>>> -                       continue;
>>> -
>>> -               if (pos->cl_proto != new->cl_proto)
>>> -                       continue;
>>> -
>>> -               if (pos->cl_minorversion != new->cl_minorversion)
>>> -                       continue;
>>> -
>>>                 if (!nfs4_match_clientids(pos, new))
>>>                         continue;
>>>
>>> --
>>> 1.8.3.1
>>>
>> Don't we need the same fix in nfs40_walk_client_list?
> Bonus points for finding a way to merge these functions, since they do similar comparisons in the beginning :)
I did talk a look at merging these functions... The start
of the functions are similar but the do differ after the state
check enough to keep them separate... IMHO... 

steved.


^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2014-09-18 13:17 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-09-17 14:50 [PATCH] NFSv4: nfs4_state_manager() vs. nfs_server_remove_lists() Steve Dickson
2014-09-17 14:55 ` Trond Myklebust
2014-09-17 15:00   ` Anna Schumaker
2014-09-18 13:17     ` Steve Dickson
2014-09-18 13:13   ` Steve Dickson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox