From: Josef Bacik <jbacik@fusionio.com>
To: Liu Bo <bo.li.liu@oracle.com>
Cc: Josef Bacik <jbacik@fusionio.com>, <linux-btrfs@vger.kernel.org>
Subject: Re: [PATCH] Btrfs: fix crash of compressed writes
Date: Tue, 1 Oct 2013 08:38:30 -0400 [thread overview]
Message-ID: <20131001123830.GE27490@localhost.localdomain> (raw)
In-Reply-To: <20131001033721.GA32575@localhost.localdomain>
On Tue, Oct 01, 2013 at 11:37:22AM +0800, Liu Bo wrote:
> On Mon, Sep 30, 2013 at 01:02:49PM -0400, Josef Bacik wrote:
> > On Mon, Sep 30, 2013 at 08:39:57PM +0800, Liu Bo wrote:
> > > The crash[1] is found by xfstests/generic/208 with "-o compress",
> > > it's not reproduced everytime, but it does panic.
> > >
> > > The bug is quite interesting, it's actually introduced by a recent commit
> > > (573aecafca1cf7a974231b759197a1aebcf39c2a,
> > > Btrfs: actually limit the size of delalloc range).
> > >
> > > Btrfs implements delay allocation, so during writeback, we
> > > (1) get a page A and lock it
> > > (2) search the state tree for delalloc bytes and lock all pages within the range
> > > (3) process the delalloc range, including find disk space and create
> > > ordered extent and so on.
> > > (4) submit the page A.
> > >
> > > It runs well in normal cases, but if we're in a racy case, eg.
> > > buffered compressed writes and aio-dio writes,
> > > sometimes we may fail to lock all pages in the 'delalloc' range,
> > > in which case, we need to fall back to search the state tree again with
> > > a smaller range limit(max_bytes = PAGE_CACHE_SIZE - offset).
> > >
> > > The mentioned commit has a side effect, that is, in the fallback case,
> > > we can find delalloc bytes before the index of the page we already have locked,
> > > so we're in the case of (delalloc_end <= *start) and return with (found > 0).
> > >
> > > This ends with not locking delalloc pages but making ->writepage still
> > > process them, and the crash happens.
> > >
> > > This fixes it by not enforcing the 'max_bytes' limit in the special fallback
> > > case.
> > >
> >
> > Great analysis, thank you for that, however I don't like the fix ;). Instead We
> > need to change the
> >
> > if (!found || delalloc_end <= *start)
> >
> > to always return 0 since we should not call fill delalloc if the delalloc range
> > doesn't start at our current offset. Secondly the
>
> Setting 'found = 0' is also the first idea into my mind :), but I'm not sure if
> it's right.
>
> I've check it out, commit 70b99e6959a4c28ae1b314985eca731f3db72f1d
> (Btrfs: Compression corner fixes,
> Signed-off-by: Chris Mason <chris.mason@oracle.com>)
> introduced the (delalloc_end <= *start).
>
> The commit log says,
> "Make sure we lock pages in the correct order during delalloc. The
> search into the state tree for delalloc bytes can return bytes
> before the page we already have locked."
>
> But I think if we find bytes before the page we locked, then we
> should get (found = 0).
>
> So I don't know why we need the check (delalloc_end <= *start),
> maybe some corner cases need that? Chris can explain a bit?
>
No we needed his other addition below which is important, for when the delalloc
range starts before but still encompasses our page. The other part is not
needed, we should be returning 0 if delalloc_end lands before or on our start,
which should fix your bug and make it a bit more sane.
> >
> > max_bytes = PAGE_CACHE_SIZE - offset;
> >
> > is completely crap since we are talking about the entire page/sector. We should
> > simply set max_bytes to sectorszie or PAGE_CACHE_SIZE that way we avoid this
> > issue. Thanks,
>
> This might be due to historical reasons, but for now, it doesn't cause
> problems as the passed @start is page aligned and @offset is always 0.
>
Yeah you are right, still I'd kill it anyway since it doesn't make sense.
Thanks,
Josef
next prev parent reply other threads:[~2013-10-01 12:38 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-09-30 12:39 [PATCH] Btrfs: fix crash of compressed writes Liu Bo
2013-09-30 17:02 ` Josef Bacik
2013-10-01 3:37 ` Liu Bo
2013-10-01 12:38 ` Josef Bacik [this message]
2013-10-01 15:42 ` Liu Bo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20131001123830.GE27490@localhost.localdomain \
--to=jbacik@fusionio.com \
--cc=bo.li.liu@oracle.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).