All of lore.kernel.org
 help / color / mirror / Atom feed
From: Benny Halevy <bhalevy@panasas.com>
To: Daniel.Muntz@emc.com
Cc: andros@netapp.com, sjoshi@bluearc.com, linux-nfs@vger.kernel.org
Subject: Re: 4.1 client - LAYOUTCOMMIT & close
Date: Tue, 06 Jul 2010 16:35:56 +0300	[thread overview]
Message-ID: <4C33313C.5070708@panasas.com> (raw)
In-Reply-To: <B9A709F368FAAF4DB4B33870F72A141DFB88F3@CORPUSMX30A.corp.emc.com>

On Jul. 03, 2010, 0:46 +0300, <Daniel.Muntz@emc.com> wrote:
> By "extremely lame server" I assume you mean any pNFS server that
> doesn't have a cluster FS on the back end.  So while this might work
> well for NetApp (as long as NetApp never ships a non-clustered pNFS), it
> might break others, or at least severely impact their performance.  For
> example, will the Solaris pNFS server work correctly without
> LAYOUTCOMMIT?  IMHO, the client MUST issue the appropriate LAYOUTCOMMIT,
> but the server is free to handle it as a no-op if the server
> implementation does not need to utilize the payload.

I completely agree.

Only with Dave Noveck suggestion of adding a "LAYOUT_{DATA,FILE}_SYNC4"
stable_how4 values (or maybe a LAYOUT_SYNC4=4 or higher power of 2 flag)
to be returned by a DS on WRITE, the DS can say that it ensures metadata
synchronization with the MDS in a cluster coherent way and the client can relax
and avoid sending LAYOUTCOMMIT to the MDS.

Otherwise, the linux implementation can potentially support a mount option
telling the client to not send a LAYOUTCOMMIT to the MDS as an optimization
if the admin is sure that the server doesn't require it.

Benny


> 
>   -Dan
> 
>> -----Original Message-----
>> From: linux-nfs-owner@vger.kernel.org 
>> [mailto:linux-nfs-owner@vger.kernel.org] On Behalf Of Andy Adamson
>> Sent: Friday, July 02, 2010 8:41 AM
>> To: Sandeep Joshi
>> Cc: linux-nfs@vger.kernel.org; bhalevy@panasas.com
>> Subject: Re: 4.1 client - LAYOUTCOMMIT & close
>>
>>
>> On Jul 1, 2010, at 8:07 PM, Sandeep Joshi wrote:
>>
>> Hi Sandeep
>>
>>>
>>> In certain cases, I don't see layoutcommit on a file at all even  
>>> after doing many writes.
>>
>> FYI:
>>
>> You should not be paying attention to layoutcommits  - they have no  
>> value for the file layout type.
>>
>>  From RFC 5661:
>>
>> "The LAYOUTCOMMIT operation commits chages in the layout represented  
>> by the current filehandle, client ID (derived from the session ID in  
>> the preceding SEQUENCE operation), byte-range, and stateid."
>>
>> For the block layout type, this sentence has meaning in that 
>> there is  
>> a layoutupdate4 payload that enumerates the blocks that have changed  
>> state from being 'handed out' to being 'written'.
>>
>> The file layout type has no layoutupdate4 payload, and the 
>> layout does  
>> not change due to writes, and thus the LAYOUTCOMMIT call is useless.
>>
>> The only field in the LAYOUTCOMMIT4args that might possibly 
>> be useful  
>> is the loca_last_write_offset which tells the server what the client  
>> thinks is the EOF of the file after WRITE. It is an extremely lame  
>> server (file layout type server) that depends upon clients for this  
>> info.
>>
>>>
>>>
>>>
>>> Client side operations:
>>>
>>> open
>>> write(s)
>>> close
>>>
>>>
>>> On server side (observed operations):
>>>
>>> open
>>> layoutget's
>>> close
>>>
>>>
>>> But, I do not see laycommit at all. In terms data written 
>> by client  
>>> it is about 4-5MB.
>>>
>>> When does client issue laycommit?
>>
>> The latest linux client sends a layout commit when the VFS does a  
>> super_operations.write_inode call which happens when the metadata of  
>> an inode needs updating. We are seriously considering removing the  
>> layoutcommit call from the file layout client.
>>
>> -->Andy
>>
>>>
>>>
>>> regards,
>>>
>>> Sandeep
>>>
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe 
>> linux-nfs"  
>>> in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe 
>> linux-nfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>>

  reply	other threads:[~2010-07-06 13:36 UTC|newest]

Thread overview: 38+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-07-01 23:47 4.1 client - LAYOUTCOMMIT Sandeep Joshi
2010-07-02  0:07 ` 4.1 client - LAYOUTCOMMIT & close Sandeep Joshi
     [not found]   ` <A062FCC8662DA848949F7C3046B9BEAE01F3A6EE-e1HlL03umel79urLq6li5IWksG4c/lV9Sp/tIRYA5EM@public.gmane.org>
2010-07-02 15:41     ` Andy Adamson
2010-07-02 17:08       ` 4.1 client - LAYOUTCOMMIT &amp; close Suchit Kaura
     [not found]         ` <loom.20100702T190300-538-eS7Uydv5nfjZ+VzJOa5vwg@public.gmane.org>
2010-07-06 13:12           ` Andy Adamson
2010-07-06 13:23             ` Benny Halevy
2010-07-02 21:46       ` 4.1 client - LAYOUTCOMMIT & close Daniel.Muntz
2010-07-06 13:35         ` Benny Halevy [this message]
2010-07-06 13:37         ` Andy Adamson
2010-07-06 14:04           ` Boaz Harrosh
2010-07-06 19:20           ` Daniel.Muntz
2010-07-06 20:40             ` Trond Myklebust
2010-07-06 22:50               ` Daniel.Muntz
2010-07-06 23:23                 ` Trond Myklebust
2010-07-07 12:05               ` Benny Halevy
2010-07-07 13:06                 ` Trond Myklebust
2010-07-07 13:18                   ` [nfsv4] " Trond Myklebust
2010-07-07 13:51                     ` Benny Halevy
2010-07-07 14:03                       ` Trond Myklebust
2010-07-07 17:45                         ` Dean Hildebrand
2010-07-07 20:39                         ` Daniel.Muntz
2010-07-07 21:01                           ` Trond Myklebust
2010-07-07 22:04                             ` Noveck_David
2010-07-07 22:27                               ` Trond Myklebust
2010-07-07 22:44                               ` david.black
2010-07-07 22:52                                 ` Trond Myklebust
2010-07-07 23:09                                   ` Trond Myklebust
     [not found]                                     ` <1278544497.15524.17.camel@heimdal.trondhje! m .org>
     [not found]                                       ` < 4C35F5E3.3000604@panasas.com>
2010-07-07 23:14                                     ` Trond Myklebust
2010-07-08 15:59                                       ` Benny Halevy
2010-07-08 20:30                                         ` [nfsv4] " david.black
2010-07-08 21:16                                           ` Trond Myklebust
2010-07-08 23:51                                             ` Daniel.Muntz
     [not found]                                             ` <1278623771.13551.54.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2010-07-09  0:03                                               ` [nfsv4] " Sandeep Joshi
2010-07-08 22:12                                           ` sfaibish
2010-07-08 23:01                                             ` Tom Haynes
2010-07-08 23:57                                               ` sfaibish
2010-07-09  0:41                                               ` [nfsv4] " Trond Myklebust
2010-07-06 13:20 ` 4.1 client - LAYOUTCOMMIT Benny Halevy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4C33313C.5070708@panasas.com \
    --to=bhalevy@panasas.com \
    --cc=Daniel.Muntz@emc.com \
    --cc=andros@netapp.com \
    --cc=linux-nfs@vger.kernel.org \
    --cc=sjoshi@bluearc.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.