From: Joel Becker <Joel.Becker@oracle.com>
To: ocfs2-devel@oss.oracle.com
Subject: [Ocfs2-devel] [PATCH 19/41] ocfs2: Integrate CoW in file write.
Date: Fri, 21 Aug 2009 14:12:59 -0700
Message-ID: <20090821211259.GD4330@mail.oracle.com>
In-Reply-To: <1250576382-27080-19-git-send-email-tao.ma@oracle.com>
On Tue, Aug 18, 2009 at 02:19:20PM +0800, Tao Ma wrote:
> +	if (ret == -ETXTBSY) {
> +		BUG_ON(refcounted_cpos == UINT_MAX);
> +		cow_len = wc->w_clen - (refcounted_cpos - wc->w_cpos);
> +
> +		ret = ocfs2_refcount_cow(inode, di_bh,
> +					 refcounted_cpos, cow_len);
> +		if (ret) {
> +			mlog_errno(ret);
> +			goto out;
> +		}
I've just realized two more problems. Well, one is a bug;
the other is merely inefficient.
First, the inefficiency. We've cooked up an
ocfs2_refcount_cow() that can handle any cpos+write_len. But we call it
from ocfs2_write_begin_nolock(), which only goes a page at a time. So
even for a 1GB write, we're going to CoW 1MB at a time. For the first
page of the I/O, we'll call ocfs2_refcount_cow(). This will try to CoW
just the page. We'll pad that out to 1MB in cal_cow_clusters(). For
the next few pages up to 1MB of I/O it will see the now-CoWed clusters.
But then it gets to the first page of the second MB. It will CoW the
second MB, and so on. We've just split the 1GB range into 1MB hunks on
disk.
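To put a number on the fragmentation described above, here is a minimal user-space model in C. It is not ocfs2 code: count_cow_ops(), PAGE_BYTES and COW_PAD_BYTES are hypothetical names, the page size of 4KB is an assumption, the whole range is assumed to be refcounted, and the padding done by cal_cow_clusters() is simplified to rounding the end of each request up to the next 1MB boundary.

```c
#include <stdint.h>

#define PAGE_BYTES    (4ULL * 1024)      /* assumed page size */
#define COW_PAD_BYTES (1024ULL * 1024)   /* 1MB padding, as described in the text */

/*
 * Count the separate CoW operations triggered by a write of `len` bytes
 * starting at offset 0, when the caller asks for CoW `chunk` bytes at a
 * time and every request is padded up to a COW_PAD_BYTES boundary.
 */
static uint64_t count_cow_ops(uint64_t len, uint64_t chunk)
{
	uint64_t cowed_end = 0;	/* everything below this offset is already CoWed */
	uint64_t ops = 0;

	for (uint64_t pos = 0; pos < len; pos += chunk) {
		uint64_t want_end = pos + chunk < len ? pos + chunk : len;

		if (want_end <= cowed_end)
			continue;	/* this chunk sees the now-CoWed clusters */

		/* Pad the request out to the next 1MB boundary and CoW it. */
		cowed_end = (want_end + COW_PAD_BYTES - 1) / COW_PAD_BYTES
			    * COW_PAD_BYTES;
		ops++;
	}
	return ops;
}
```

Under these assumptions, calling count_cow_ops() for a 1GB write with chunk set to PAGE_BYTES yields 1024 separate CoW operations, one per 1MB hunk, while passing the full 1GB as the chunk yields a single operation.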
Now, we have to check REFCOUNTED in write_begin() (well,
populate_write_desc()) because that's how we trap mmap(). So we leave
it here. But for a regular write, we know the entire length up in
ocfs2_file_aio_write(). So in ocfs2_prepare_inode_for_write(), right
before the direct_io checks, why don't we just CoW the entire write
there? Create a check_for_refcount just like check_for_holes, except
instead of filling holes you CoW. The function can easily skip out if
there's no refcount tree on the inode. This gives us large CoW regions.
We're going to have to do the CoW anyway. When a regular write gets
into populate_write_desc(), it won't find any refcounted records, so
there's no more work at that level.
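A rough sketch of the shape such a check_for_refcount could take, written as a user-space model in C rather than kernel code. Every name here (model_extent, model_inode, model_refcount_cow, model_check_for_refcount) is a hypothetical stand-in, and the CoW itself is reduced to clearing a flag on any extent that overlaps the range, where real code would have to split extents and allocate new clusters.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-ins; none of these are real ocfs2 types. */
struct model_extent {
	uint32_t cpos;		/* first cluster covered by the extent */
	uint32_t clusters;	/* length in clusters */
	bool refcounted;	/* shared with another inode */
};

struct model_inode {
	bool has_refcount_tree;
	struct model_extent *extents;
	size_t nr_extents;
	unsigned int cow_calls;	/* number of CoW operations issued */
};

static bool overlaps(const struct model_extent *e, uint32_t cpos, uint32_t len)
{
	return e->cpos < cpos + len && cpos < e->cpos + e->clusters;
}

/* Stand-in for ocfs2_refcount_cow(): un-share every overlapping extent. */
static int model_refcount_cow(struct model_inode *inode,
			      uint32_t cpos, uint32_t len)
{
	inode->cow_calls++;
	for (size_t i = 0; i < inode->nr_extents; i++)
		if (overlaps(&inode->extents[i], cpos, len))
			inode->extents[i].refcounted = false;
	return 0;
}

/*
 * Called once per write with the full cluster range, before any
 * page-at-a-time work starts. Returns 0 when there is nothing to do.
 */
static int model_check_for_refcount(struct model_inode *inode,
				    uint32_t cpos, uint32_t len)
{
	if (!inode->has_refcount_tree)
		return 0;	/* no refcount tree: skip out immediately */

	for (size_t i = 0; i < inode->nr_extents; i++) {
		const struct model_extent *e = &inode->extents[i];

		if (e->refcounted && overlaps(e, cpos, len))
			return model_refcount_cow(inode, cpos, len);
	}
	return 0;
}
```

In this model a write issues at most one CoW call for its whole range, and a second pass over the same range (the role populate_write_desc() plays for a regular write) finds nothing refcounted and does no further work.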
Even better, this fixes the bug. What's the bug? The current
code doesn't CoW O_DIRECT writes! We only check in populate_write_desc(),
which we don't use for O_DIRECT! And ocfs2_direct_IO_get_blocks()
doesn't trigger buffered fallback either! Well, we don't want buffered
fallback. We want CoW followed by real O_DIRECT. And if we do the CoW
up in prepare_inode_for_write(), we get it. Plus, we can put a
BUG_ON(ext_flags & REFCOUNTED) in direct_IO_get_blocks().
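The ordering argued for above (CoW first, then real O_DIRECT, with an assertion in the block-mapping step) can be illustrated with a small user-space model in C. This is only a sketch under stated assumptions: model_file, model_cow_range, model_direct_get_blocks and model_direct_write are hypothetical names, the file is a fixed array of per-cluster flags, and assert() stands in for the kernel's BUG_ON().

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define MODEL_CLUSTERS 16	/* size of the toy file, in clusters */

/* Hypothetical model of a file: one "refcounted" flag per cluster. */
struct model_file {
	bool refcounted[MODEL_CLUSTERS];
};

/* Stand-in for the CoW done up front in prepare_inode_for_write(). */
static void model_cow_range(struct model_file *f, uint32_t cpos, uint32_t len)
{
	for (uint32_t c = cpos; c < cpos + len && c < MODEL_CLUSTERS; c++)
		f->refcounted[c] = false;
}

/*
 * Stand-in for direct_IO_get_blocks(): by the time the direct I/O path
 * maps clusters, none of them may still be shared.
 */
static uint32_t model_direct_get_blocks(const struct model_file *f,
					uint32_t cpos, uint32_t len)
{
	uint32_t mapped = 0;

	for (uint32_t c = cpos; c < cpos + len && c < MODEL_CLUSTERS; c++) {
		assert(!f->refcounted[c]);	/* plays the role of BUG_ON() */
		mapped++;
	}
	return mapped;
}

/* CoW the whole range first, then do the direct write with no fallback. */
static uint32_t model_direct_write(struct model_file *f,
				   uint32_t cpos, uint32_t len)
{
	model_cow_range(f, cpos, len);
	return model_direct_get_blocks(f, cpos, len);
}
```

In this model, clusters outside the written range keep their shared status, and calling model_direct_get_blocks() on a range that was not CoWed first would trip the assertion, which is the situation the proposed BUG_ON() is meant to catch.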
Joel
--
"There is no more evil thing on earth than race prejudice, none at
all. I write deliberately -- it is the worst single thing in life
now. It justifies and holds together more baseness, cruelty and
abomination than any other sort of error in the world."
- H. G. Wells
Joel Becker
Principal Software Developer
Oracle
E-mail: joel.becker at oracle.com
Phone: (650) 506-8127