From: "Maria Wikström" <maria@ponstudios.se>
To: johannes.hirte@fem.tu-ilmenau.de, xin.zhong@intel.com
Cc: linux-btrfs@vger.kernel.org
Subject: Re: [PATCH v2]Btrfs: pwrite blocked when writing from the mmaped buffer of the same page
Date: Thu, 27 Jan 2011 23:12:25 +0100 [thread overview]
Message-ID: <1296166345.11397.16.camel@mainframe> (raw)
In-Reply-To: <201101271409.27877.johannes.hirte@fem.tu-ilmenau.de>
tor 2011-01-27 klockan 14:09 +0100 skrev Johannes Hirte:
> On Thursday 09 December 2010 10:30:14 Zhong, Xin wrote:
> > This problem is found in meego testing:
> > http://bugs.meego.com/show_bug.cgi?id=6672
> > A file in btrfs is mmaped and the mmaped buffer is passed to pwrite to
> > write to the same page of the same file. In btrfs_file_aio_write(), the
> > pages is locked by prepare_pages(). So when btrfs_copy_from_user() is
> > called, page fault happens and the same page needs to be locked again in
> > filemap_fault(). The fix is to move iov_iter_fault_in_readable() before
> > prepage_pages() to make page fault happen before pages are locked. And
> > also disable page fault in critical region in btrfs_copy_from_user().
> >
> > Reviewed-by: Yan, Zheng<zheng.z.yan@intel.com>
> > Signed-off-by: Zhong, Xin <xin.zhong@intel.com>
> > ---
> > fs/btrfs/file.c | 92
> > ++++++++++++++++++++++++++++++++++++------------------- 1 files changed,
> > 60 insertions(+), 32 deletions(-)
> >
> > diff --git a/fs/btrfs/file.c b/fs/btrfs/file.c
> > index c1faded..66836d8 100644
> > --- a/fs/btrfs/file.c
> > +++ b/fs/btrfs/file.c
> > @@ -48,30 +48,34 @@ static noinline int btrfs_copy_from_user(loff_t pos,
> > int num_pages, struct page **prepared_pages,
> > struct iov_iter *i)
> > {
> > - size_t copied;
> > + size_t copied = 0;
> > int pg = 0;
> > int offset = pos & (PAGE_CACHE_SIZE - 1);
> > + int total_copied = 0;
> >
> > while (write_bytes > 0) {
> > size_t count = min_t(size_t,
> > PAGE_CACHE_SIZE - offset, write_bytes);
> > struct page *page = prepared_pages[pg];
> > -again:
> > - if (unlikely(iov_iter_fault_in_readable(i, count)))
> > - return -EFAULT;
> > -
> > - /* Copy data from userspace to the current page */
> > - copied = iov_iter_copy_from_user(page, i, offset, count);
> > + /*
> > + * Copy data from userspace to the current page
> > + *
> > + * Disable pagefault to avoid recursive lock since
> > + * the pages are already locked
> > + */
> > + pagefault_disable();
> > + copied = iov_iter_copy_from_user_atomic(page, i, offset, count);
> > + pagefault_enable();
> >
> > /* Flush processor's dcache for this page */
> > flush_dcache_page(page);
> > iov_iter_advance(i, copied);
> > write_bytes -= copied;
> > + total_copied += copied;
> >
> > + /* Return to btrfs_file_aio_write to fault page */
> > if (unlikely(copied == 0)) {
> > - count = min_t(size_t, PAGE_CACHE_SIZE - offset,
> > - iov_iter_single_seg_count(i));
> > - goto again;
> > + break;
> > }
> >
> > if (unlikely(copied < PAGE_CACHE_SIZE - offset)) {
> > @@ -81,7 +85,7 @@ again:
> > offset = 0;
> > }
> > }
> > - return 0;
> > + return total_copied;
> > }
> >
> > /*
> > @@ -854,6 +858,8 @@ static ssize_t btrfs_file_aio_write(struct kiocb *iocb,
> > unsigned long last_index;
> > int will_write;
> > int buffered = 0;
> > + int copied = 0;
> > + int dirty_pages = 0;
> >
> > will_write = ((file->f_flags & O_DSYNC) || IS_SYNC(inode) ||
> > (file->f_flags & O_DIRECT));
> > @@ -970,7 +976,17 @@ static ssize_t btrfs_file_aio_write(struct kiocb
> > *iocb, WARN_ON(num_pages > nrptrs);
> > memset(pages, 0, sizeof(struct page *) * nrptrs);
> >
> > - ret = btrfs_delalloc_reserve_space(inode, write_bytes);
> > + /*
> > + * Fault pages before locking them in prepare_pages
> > + * to avoid recursive lock
> > + */
> > + if (unlikely(iov_iter_fault_in_readable(&i, write_bytes))) {
> > + ret = -EFAULT;
> > + goto out;
> > + }
> > +
> > + ret = btrfs_delalloc_reserve_space(inode,
> > + num_pages << PAGE_CACHE_SHIFT);
> > if (ret)
> > goto out;
> >
> > @@ -978,37 +994,49 @@ static ssize_t btrfs_file_aio_write(struct kiocb
> > *iocb, pos, first_index, last_index,
> > write_bytes);
> > if (ret) {
> > - btrfs_delalloc_release_space(inode, write_bytes);
> > + btrfs_delalloc_release_space(inode,
> > + num_pages << PAGE_CACHE_SHIFT);
> > goto out;
> > }
> >
> > - ret = btrfs_copy_from_user(pos, num_pages,
> > + copied = btrfs_copy_from_user(pos, num_pages,
> > write_bytes, pages, &i);
> > - if (ret == 0) {
> > + dirty_pages = (copied + PAGE_CACHE_SIZE - 1) >>
> > + PAGE_CACHE_SHIFT;
> > +
> > + if (num_pages > dirty_pages) {
> > + if (copied > 0)
> > + atomic_inc(
> > + &BTRFS_I(inode)->outstanding_extents);
> > + btrfs_delalloc_release_space(inode,
> > + (num_pages - dirty_pages) <<
> > + PAGE_CACHE_SHIFT);
> > + }
> > +
> > + if (copied > 0) {
> > dirty_and_release_pages(NULL, root, file, pages,
> > - num_pages, pos, write_bytes);
> > + dirty_pages, pos, copied);
> > }
> >
> > btrfs_drop_pages(pages, num_pages);
> > - if (ret) {
> > - btrfs_delalloc_release_space(inode, write_bytes);
> > - goto out;
> > - }
> >
> > - if (will_write) {
> > - filemap_fdatawrite_range(inode->i_mapping, pos,
> > - pos + write_bytes - 1);
> > - } else {
> > - balance_dirty_pages_ratelimited_nr(inode->i_mapping,
> > - num_pages);
> > - if (num_pages <
> > - (root->leafsize >> PAGE_CACHE_SHIFT) + 1)
> > - btrfs_btree_balance_dirty(root, 1);
> > - btrfs_throttle(root);
> > + if (copied > 0) {
> > + if (will_write) {
> > + filemap_fdatawrite_range(inode->i_mapping, pos,
> > + pos + copied - 1);
> > + } else {
> > + balance_dirty_pages_ratelimited_nr(
> > + inode->i_mapping,
> > + dirty_pages);
> > + if (dirty_pages <
> > + (root->leafsize >> PAGE_CACHE_SHIFT) + 1)
> > + btrfs_btree_balance_dirty(root, 1);
> > + btrfs_throttle(root);
> > + }
> > }
> >
> > - pos += write_bytes;
> > - num_written += write_bytes;
> > + pos += copied;
> > + num_written += copied;
> >
> > cond_resched();
> > }
>
> This patch breaks one of my Gentoo boxes. When I try to install/update via
> emerge, some packages hang. It seems that it's always a "svn info" process
> that is stuck in kernel eating 100% CPU. I don't know how svn is involved
> here, but reverting this patch makes the system work again. I'll try to get a
> simple testcase.
>
> regards,
> Johannes
The same thing happens here. The only way to kill the process (svn info)
is to restart the computer. Running vanilla 2.6.37 without patches.
#strace emerge libgcrypt
-- cut --
open("/var/tmp/portage/dev-libs/libgcrypt-1.4.6/temp/build.log",
O_WRONLY|O_CREAT|O_APPEND|O_LARGEFILE, 0666) = 7
fstat64(7, {st_mode=S_IFREG|0660, st_size=416, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb713a000
fstat64(7, {st_mode=S_IFREG|0660, st_size=416, ...}) = 0
_llseek(7, 416, [416], SEEK_SET) = 0
fstat64(7, {st_mode=S_IFREG|0660, st_size=416, ...}) = 0
stat64("/var/tmp/portage/dev-libs/libgcrypt-1.4.6/temp/build.log",
{st_mode=S_IFREG|0660, st_size=416, ...}) = 0
dup(1) = 8
fcntl64(8, F_GETFL) = 0x2 (flags O_RDWR)
fstat64(8, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 3), ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb7005000
_llseek(8, 0, 0xbff80cfc, SEEK_CUR) = -1 ESPIPE (Illegal seek)
fstat64(8, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 3), ...}) = 0
stat64("/var/tmp/portage/dev-libs/libgcrypt-1.4.6/temp/environment",
{st_mode=S_IFREG|0664, st_size=105970, ...}) = 0
clone(child_stack=0, flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|
SIGCHLD, child_tidptr=0xb7430728) = 11665
close(6) = 0
gettimeofday({1296162836, 126192}, NULL) = 0
poll([{fd=3, events=POLLIN|POLLERR|POLLHUP|POLLNVAL}, {fd=5,
events=POLLIN|POLLERR|POLLHUP|POLLNVAL}], 2, 3000) = 1 ([{fd=5,
revents=POLLIN}])
read(5, ">>> Preparing source in /var/tmp"..., 4096) = 91
read(5, 0xb713b000, 4096) = -1 EAGAIN (Resource
temporarily unavailable)
write(8, ">>> Preparing source in /var/tmp"..., 91>>> Preparing source
in /var/tmp/portage/dev-libs/libgcrypt-1.4.6/work/libgcrypt-1.4.6 ...
) = 91
write(7, ">>> Preparing source in /var/tmp"..., 91) = 91
poll([{fd=3, events=POLLIN|POLLERR|POLLHUP|POLLNVAL}, {fd=5,
events=POLLIN|POLLERR|POLLHUP|POLLNVAL}], 2, 3000) = 0 (Timeout)
poll([{fd=3, events=POLLIN|POLLERR|POLLHUP|POLLNVAL}, {fd=5,
events=POLLIN|POLLERR|POLLHUP|POLLNVAL}], 2, 3000) = 0 (Timeout)
poll([{fd=3, events=POLLIN|POLLERR|POLLHUP|POLLNVAL}, {fd=5,
events=POLLIN|POLLERR|POLLHUP|POLLNVAL}], 2, 3000) = 0 (Timeout)
The last line is repeated every 3sec.
// Maria
next prev parent reply other threads:[~2011-01-27 22:12 UTC|newest]
Thread overview: 43+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-12-09 9:30 [PATCH v2]Btrfs: pwrite blocked when writing from the mmaped buffer of the same page Zhong, Xin
2011-01-27 13:09 ` Johannes Hirte
2011-01-27 22:12 ` Maria Wikström [this message]
2011-01-28 1:26 ` Zhong, Xin
2011-01-28 2:54 ` Johannes Hirte
2011-01-28 3:53 ` Zhong, Xin
2011-02-01 23:34 ` Johannes Hirte
2011-02-11 4:39 ` Zhong, Xin
2011-02-18 11:31 ` Maria Wikström
2011-02-21 1:51 ` Zhong, Xin
2011-02-24 14:51 ` Maria Wikström
2011-02-24 15:55 ` Mitch Harder
2011-02-24 16:00 ` Chris Mason
2011-02-24 16:03 ` Mitch Harder
2011-02-24 16:19 ` Chris Mason
2011-02-24 16:32 ` Mitch Harder
[not found] ` <AANLkTinvyb-bTVVignd1KGojvh-QrYCFmCnwYKBsYC_2@mail.gmail.com>
2011-02-25 17:11 ` Mitch Harder
2011-02-25 18:43 ` Mitch Harder
2011-02-25 19:19 ` Chris Mason
2011-02-28 1:46 ` [PATCH] btrfs file write debugging patch Chris Mason
2011-02-28 8:56 ` Zhong, Xin
2011-02-28 14:02 ` Chris Mason
2011-02-28 10:13 ` Johannes Hirte
2011-02-28 14:00 ` Chris Mason
2011-02-28 16:10 ` Josef Bacik
2011-02-28 16:45 ` Maria Wikström
2011-02-28 17:47 ` Mitch Harder
2011-02-28 20:20 ` Mitch Harder
2011-03-01 5:09 ` Mitch Harder
2011-03-01 10:14 ` Zhong, Xin
2011-03-01 11:56 ` Zhong, Xin
2011-03-01 14:54 ` Mitch Harder
2011-03-01 14:51 ` Mitch Harder
2011-03-01 21:56 ` Piotr Szymaniak
2011-02-24 23:35 ` [PATCH v2]Btrfs: pwrite blocked when writing from the mmaped buffer of the same page Piotr Szymaniak
2011-02-22 22:27 ` Johannes Hirte
2011-02-23 7:27 ` Zhong, Xin
2011-02-23 21:56 ` Chris Mason
2011-02-23 23:02 ` Johannes Hirte
2011-02-24 15:23 ` Chris Mason
2011-01-28 16:47 ` Maria Wikström
2011-01-28 18:27 ` Rui Miguel Silva
2011-01-29 15:38 ` Maria Wikström
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1296166345.11397.16.camel@mainframe \
--to=maria@ponstudios.se \
--cc=johannes.hirte@fem.tu-ilmenau.de \
--cc=linux-btrfs@vger.kernel.org \
--cc=xin.zhong@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.