From: Chris Mason <clm@fb.com>
To: Christoph Hellwig <hch@infradead.org>
Cc: robbieko <robbieko@synology.com>, <linux-btrfs@vger.kernel.org>
Subject: Re: [PATCH] Btrfs: implement unlocked buffered write
Date: Tue, 22 May 2018 14:31:36 -0400 [thread overview]
Message-ID: <D87124BA-3EEC-480F-8520-EBD3B5A33C04@fb.com> (raw)
In-Reply-To: <20180522180828.GA8340@infradead.org>
On 22 May 2018, at 14:08, Christoph Hellwig wrote:
> On Wed, May 16, 2018 at 11:52:37AM +0800, robbieko wrote:
>> From: Robbie Ko <robbieko@synology.com>
>>
>> This idea is from direct io. By this patch, we can make the buffered
>> write parallel, and improve the performance and latency. But because
>> we
>> can not update isize without i_mutex, the unlocked buffered write
>> just
>> can be done in front of the EOF.
>>
>> We needn't worry about the race between buffered write and truncate,
>> because the truncate need wait until all the buffered write end.
>>
>> And we also needn't worry about the race between dio write and punch
>> hole,
>> because we have extent lock to protect our operation.
>>
>> I ran fio to test the performance of this feature.
>
> And what protects two writes from interleaving their results now?
page locks...ish, we at least won't have results interleaved in a single
page. For btrfs it'll actually be multiple pages since we try to do
more than one at a time.
I haven't verified all the assumptions around truncate and fallocate and
friends expecting the dio special locking to be inside i_size. In
general this makes me a little uncomfortable.
But we're not avoiding the inode lock completely, we're just dropping it
for the expensive parts of writing to the file. A quick guess about
what the expensive parts are:
1) balance_dirty_pages()
2) btrfs_btree_balance_dirty()
3) metadata reservations/enospc waiting.
Can I bribe you to benchmark how much each of those things is impacting
the iops/latency benefits?
-chris
next prev parent reply other threads:[~2018-05-22 18:32 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-05-16 3:52 [PATCH] Btrfs: implement unlocked buffered write robbieko
2018-05-22 17:11 ` David Sterba
2018-05-22 17:28 ` Omar Sandoval
2018-05-23 7:07 ` robbieko
2018-05-22 18:08 ` Christoph Hellwig
2018-05-22 18:31 ` Chris Mason [this message]
2018-05-23 6:37 ` Christoph Hellwig
2018-05-23 7:58 ` Nikolay Borisov
2018-05-23 18:01 ` Chris Mason
2018-05-23 7:26 ` robbieko
2018-05-23 15:56 ` Chris Mason
2018-05-24 8:46 ` robbieko
2018-05-24 15:05 ` Chris Mason
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=D87124BA-3EEC-480F-8520-EBD3B5A33C04@fb.com \
--to=clm@fb.com \
--cc=hch@infradead.org \
--cc=linux-btrfs@vger.kernel.org \
--cc=robbieko@synology.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).