linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Aastha Mehta <aasthakm@gmail.com>
To: Josef Bacik <jbacik@fusionio.com>
Cc: linux-btrfs <linux-btrfs@vger.kernel.org>
Subject: Re: basic questions regarding COW in Btrfs
Date: Sat, 23 Feb 2013 10:33:03 +0100	[thread overview]
Message-ID: <CAEx9m45O2XMg_foaDP5HmXTF3NnRY3WJTP6YLwv94XdbeKzyxg@mail.gmail.com> (raw)
In-Reply-To: <CAEx9m46b9rW4xDiQi8QiTmyC5Wa9hNcasL_nXF4Y2qdyPE+D6g@mail.gmail.com>

A gentle reminder on this one.

Thanks,
Aastha.

On 21 February 2013 18:32, Aastha Mehta <aasthakm@gmail.com> wrote:
> Thanks a lot for the prompt response. I had seen that, but I am still
> not sure of where it really
> happens within fill_delalloc. Could you help me a little further in that path?
>
> Secondly, now I am confused between the btree_writepages and
> btrfs_writepages/btrfs_writepage
> methods. I thought btrfs_writepages was for writing the pages holding
> inodes and btree_writepages
> for writing the other indirect and leaf extents of the btree. Then, it
> seems that the write operations
> lead to update of the file system data structures in a top-down
> manner, i.e. first changing the inode
> and then the data extents. Is that correct?
>
> Thirdly, it seems that the old extents maybe dropped before the new
> extents are flushed to the disk.
> What would happen if the write fails before the disk commit? What am I
> missing here?
>
> Thanks,
> Aastha.
>
> On 20 February 2013 18:54, Josef Bacik <jbacik@fusionio.com> wrote:
>> On Wed, Feb 20, 2013 at 10:28:10AM -0700, Aastha Mehta wrote:
>>> Hello,
>>>
>>> I am trying to understand the COW mechanism in Btrfs. Is it correct to
>>> say that unless nodatacow option is specified, Btrfs always performs
>>> COW for all the data+metadata extents used in the system?
>>>
>>
>> So we always cow the metadata, but yes nodatacow means we don't cow the actual
>> data in the data extents.
>>
>>> I saw that COWing is implemented in btrfs_cow_block() function, which
>>> is called at the time of searching a slot for a particular item, while
>>> inserting into a new slot, committing transactions, while creating
>>> pending snapshots and few other places.
>>>
>>> However, while tracing through the complete write path, I could not
>>> quite figure out when extents actually get COWed. Could you please
>>> point me to the place where COWing takes place? Is there any time
>>> when, for performance or any other reasons, the extents are not COWed
>>> but overwritten in place (apart from the explicit nodatacow flag being
>>> set during mount)?
>>
>> You'll want to look at the tree operation ->fill_delalloc().  Thats where we do
>> cow_file_range().  We allocate new space and write.  When we finish the ordered
>> io we do btrfs_drop_extents() on the range we just wrote which will free up any
>> existing extents that exist, and then insert our new file extent.  Thanks,
>>
>> Josef



--
Aastha Mehta
MPI-SWS, Germany
E-mail: aasthakm@mpi-sws.org

  reply	other threads:[~2013-02-23  9:33 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-02-20 17:28 basic questions regarding COW in Btrfs Aastha Mehta
2013-02-20 17:54 ` Josef Bacik
2013-02-21 17:32   ` Aastha Mehta
2013-02-23  9:33     ` Aastha Mehta [this message]
     [not found]     ` <CAEzrpqcGeiq=tHEvcH9625vZpCZaD94PrpjYO1_r4nS2GP9ByA@mail.gmail.com>
2013-02-25 15:15       ` Aastha Mehta
2013-02-25 17:27         ` Josef Bacik
2013-02-25 19:00           ` Aastha Mehta
2013-03-02 21:07             ` Alex Lyakas
2013-03-03 23:42               ` Josef Bacik
2013-03-03 15:41             ` Aastha Mehta
2013-03-03 23:52               ` Josef Bacik
2013-03-05  0:57                 ` Aastha Mehta
2013-03-05  1:51                   ` Josef Bacik
2013-03-05  1:55                     ` Aastha Mehta

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CAEx9m45O2XMg_foaDP5HmXTF3NnRY3WJTP6YLwv94XdbeKzyxg@mail.gmail.com \
    --to=aasthakm@gmail.com \
    --cc=jbacik@fusionio.com \
    --cc=linux-btrfs@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).