From: Tao Ma <tao.ma@oracle.com>
To: ocfs2-devel@oss.oracle.com
Subject: [Ocfs2-devel] [PATCH 19/41] ocfs2: Integrate CoW in file write.
Date: Sat, 22 Aug 2009 07:17:55 +0800 [thread overview]
Message-ID: <4A8F2B23.9060903@oracle.com> (raw)
In-Reply-To: <20090821211259.GD4330@mail.oracle.com>
Joel Becker wrote:
> On Tue, Aug 18, 2009 at 02:19:20PM +0800, Tao Ma wrote:
>> + if (ret == -ETXTBSY) {
>> + BUG_ON(refcounted_cpos == UINT_MAX);
>> + cow_len = wc->w_clen - (refcounted_cpos - wc->w_cpos);
>> +
>> + ret = ocfs2_refcount_cow(inode, di_bh,
>> + refcounted_cpos, cow_len);
>> + if (ret) {
>> + mlog_errno(ret);
>> + goto out;
>> + }
>
> I've just realized two more problems. Well, one is a bug;
> the other is merely inefficient.
> First, the inefficiency. We've cooked up an
> ocfs2_refcount_cow() that can handle any cpos+write_len. But we call it
> from ocfs2_write_begin_nolock(), which only goes a page at a time. So
> even for a 1GB write, we're going to CoW 1MB at a time. For the first
> page of the I/O, we'll call ocfs2_refcount_cow(). This will try to CoW
> just the page. We'll pad that out to 1MB in cal_cow_clusters(). For
> the next few pages up to 1MB of I/O it will see the now-CoWed clusters.
> But then it gets to the first page of the second MB. It will CoW the
> second MB, and so on. We've just split the 1GB range into 1MB hunks on
> disk.
yes, that is anticipated. We CoW 1MB at most at a time.
> Now, we have to check REFCOUNTED in write_begin() (well,
> populate_write_desc()) because that's how we trap mmap(). So we leave
> it here. But for a regular write, we know the entire length up in
> ocfs2_file_aio_write(). So in ocfs2_prepare_inode_for_write(), right
> before the direct_io checks, why don't we just CoW the entire write
> there? Create a check_for_refcount just like check_for_holes, except
> instead of filling holes you CoW. The function can easily skip out if
> there's no refcount tree on the inode. This gives us large CoW regions.
> We're going to have to do the CoW anyway. When a regular write gets
> into populate_write_desc(), it won't find any refcounted records, so
> there's no more work at that level.
yes, we can put a check there, but we can't resolve the 1MB issue you
mentioned above either. Maybe we can make ocfs2_refcount_cow more
intelligent? But I would say let us leave it as-is and this can be a
future improvement.
> Even better, this fixes the bug. What's the bug? The current
> code doesn't CoW O_DIRECT writes! We only check in prepare_write_desc,
> which we don't use for O_DIRECT! And ocfs2_direct_IO_get_blocks()
> doesn't trigger buffered fallback either! Well, we don't want buffered
> fallback. We want CoW followed by real O_DIRECT. ANd if we do the CoW
> up in prepare_inode_for_write(), we get it. Plus, we can put a
> BUG_ON(ext_flags & REFCOUNTED) in direct_IO_get_blocks().
oh, yes, this is really a bug. I don't think of O_DIRECT when I created
this patch set. So I may really need to add a check in
ocfs2_prepare_inode_for_write(I guess just need to call
ocfs2_refcount_cow and make write_len<=1MB).
This also make me think that we can cal ocfs2_refcount_cow right before
we populate_write_desc, so that we don't need to call it twice and we
can directly BUG_ON(ext_flags & REFCOUNTED) in it.
Regards,
Tao
next prev parent reply other threads:[~2009-08-21 23:17 UTC|newest]
Thread overview: 75+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-08-18 6:19 [Ocfs2-devel] [PATCH 00/41] ocfs2: Add reflink file support. V4 Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 01/41] ocfs2: Define refcount tree structure Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 02/41] ocfs2: Add metaecc for ocfs2_refcount_block Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 03/41] ocfs2: Add ocfs2_read_refcount_block Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 04/41] ocfs2: Abstract caching info checkpoint Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 05/41] ocfs2: Add new refcount tree lock resource in dlmglue Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 06/41] ocfs2: Add caching info for refcount tree Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 07/41] ocfs2: Add refcount tree lock mechanism Tao Ma
2009-08-19 23:25 ` Joel Becker
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 08/41] ocfs2: Basic tree root operation Tao Ma
2009-08-19 23:30 ` Joel Becker
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 09/41] ocfs2: Wrap ocfs2_extent_contig in ocfs2_extent_tree Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 10/41] ocfs2: Abstract extent split process Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 11/41] ocfs2: Add refcount b-tree as a new extent tree Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 12/41] ocfs2: move tree path functions to alloc.h Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 13/41] ocfs2: Add support for incrementing refcount in the tree Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 14/41] ocfs2: Add support of decrementing refcount for delete Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 15/41] ocfs2: Add functions for extents refcounted Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 16/41] ocfs2: Decrement refcount when truncating refcounted extents Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 17/41] ocfs2: Add CoW support Tao Ma
2009-08-21 0:59 ` Joel Becker
2009-08-21 2:04 ` Tao Ma
2009-08-21 2:51 ` Joel Becker
2009-08-21 3:04 ` Tao Ma
2009-08-21 7:10 ` Joel Becker
2009-08-21 3:55 ` Joel Becker
2009-08-21 6:25 ` Tao Ma
2009-08-21 7:07 ` Joel Becker
2009-08-21 8:24 ` Tao Ma
2009-08-21 18:39 ` Joel Becker
2009-08-21 20:58 ` Joel Becker
2009-08-24 15:04 ` Tao Ma
2009-08-24 18:20 ` Joel Becker
2009-08-25 19:30 ` Joel Becker
2009-08-26 8:17 ` TaoMa
2009-08-21 23:07 ` Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 18/41] ocfs2: CoW refcount tree improvement Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 19/41] ocfs2: Integrate CoW in file write Tao Ma
2009-08-21 1:04 ` Joel Becker
2009-08-21 2:12 ` Tao Ma
2009-08-21 14:55 ` Tao Ma
2009-08-21 20:43 ` Joel Becker
2009-08-21 21:12 ` Joel Becker
2009-08-21 23:17 ` Tao Ma [this message]
2009-08-21 23:42 ` Joel Becker
2009-08-22 0:31 ` Tao Ma
2009-08-24 15:06 ` Tao Ma
2009-08-24 18:32 ` Joel Becker
2009-08-25 0:12 ` [Ocfs2-devel] [PATCH 19/41] ocfs2: Integrate CoW in file write(add refcount check) Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 20/41] ocfs2: CoW a reflinked cluster when it is truncated Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 21/41] ocfs2: Add normal functions for reflink a normal file's extents Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 22/41] ocfs2: handle file attributes issue for reflink Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 23/41] ocfs2: Return extent flags for xattr value tree Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 24/41] ocfs2: Abstract duplicate clusters process in CoW Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 25/41] ocfs2: Add CoW support for xattr Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 26/41] ocfs2: Remove inode from ocfs2_xattr_bucket_get_name_value Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 27/41] ocfs2: Abstract the creation of xattr block Tao Ma
2009-08-21 1:22 ` Joel Becker
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 28/41] ocfs2: Abstract ocfs2 xattr tree extend rec iteration process Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 29/41] ocfs2: Attach xattr clusters to refcount tree Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 30/41] ocfs2: Call refcount tree remove process properly Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 31/41] ocfs2: Create an xattr indexed block if needed Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 32/41] ocfs2: Add reflink support for xattr Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 33/41] ocfs2: Modify removing xattr process for refcount Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 34/41] ocfs2: Don't merge in 1st refcount ops of reflink Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 35/41] ocfs2: Make transaction extend more efficient Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 36/41] ocfs2: Use proper parameter for some inode operation Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 37/41] ocfs2: Create reflinked file in orphan dir Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 38/41] ocfs2: Add preserve to reflink Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 39/41] ocfs2: Implement ocfs2_reflink Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 40/41] ocfs2: Enable refcount tree support Tao Ma
2009-08-18 6:19 ` [Ocfs2-devel] [PATCH 41/41] ocfs2: Add ioctl for reflink Tao Ma
2009-08-21 1:24 ` [Ocfs2-devel] [PATCH 00/41] ocfs2: Add reflink file support. V4 Joel Becker
2009-08-21 1:39 ` Tao Ma
2009-08-24 23:11 ` TaoMa
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4A8F2B23.9060903@oracle.com \
--to=tao.ma@oracle.com \
--cc=ocfs2-devel@oss.oracle.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.