From: Josef Bacik <jbacik@fusionio.com>
To: Miao Xie <miaox@cn.fujitsu.com>
Cc: Josef Bacik <JBacik@fusionio.com>,
"linux-btrfs@vger.kernel.org" <linux-btrfs@vger.kernel.org>
Subject: Re: [PATCH] Btrfs: delay block group item insertion
Date: Thu, 13 Sep 2012 08:35:27 -0400 [thread overview]
Message-ID: <20120913123527.GA12994@localhost.localdomain> (raw)
In-Reply-To: <5051549B.7070206@cn.fujitsu.com>
On Wed, Sep 12, 2012 at 09:35:55PM -0600, Miao Xie wrote:
> On wed, 12 Sep 2012 14:04:13 -0400, Josef Bacik wrote:
> > So we have lots of places where we try to preallocate chunks in order to
> > make sure we have enough space as we make our allocations. This has
> > historically meant that we're constantly tweaking when we should allocate a
> > new chunk, and historically we have gotten this horribly wrong so we way
> > over allocate either metadata or data. To try and keep this from happening
> > we are going to make it so that the block group item insertion is done out
> > of band at the end of a transaction. This will allow us to create chunks
> > even if we are trying to make an allocation for the extent tree. With this
> > patch my enospc tests run faster (didn't expect this) and more efficiently
> > use the disk space (this is what I wanted). Thanks,
>
> This patch also fixes a deadlock problem that happened when we add two or
> more small devices(< 4G) into a seed fs(the profile of metadata is RAID1),
> and a enospc problem when we add a small device (< 256M) into a big empty
> seed fs(> 60G).
>
> (My fix patch which is similar to this one is on the way, I'm a bit slow :) )
>
> > @@ -1400,6 +1407,9 @@ int btrfs_commit_transaction(struct btrfs_trans_handle *trans,
> > */
> > cur_trans->delayed_refs.flushing = 1;
> >
> > + if (!list_empty(&trans->new_bgs))
> > + btrfs_create_pending_block_groups(trans, root);
> > +
> > ret = btrfs_run_delayed_refs(trans, root, 0);
> > if (ret)
> > goto cleanup_transaction;
>
> I think we can not make sure we won't allocate new chunks when we
> create the pending snapshots and write out the space cache and inode
> cache, so we should check ->new_bgs and call btrfs_create_pending_block_groups()
> when committing the cowonly tree roots.
>
> And beside that, We'd better add a BUG_ON() after we update the root tree to
> make sure there is no pending block group item left in the list.
>
We're also running this in run_delayed_refs when we want to run all delayed refs
so we should be pretty safe, but a BUG_ON() would definitely make sure we are.
Thanks,
Josef
prev parent reply other threads:[~2012-09-13 12:35 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-09-12 18:04 [PATCH] Btrfs: delay block group item insertion Josef Bacik
2012-09-13 1:32 ` Liu Bo
2012-09-13 2:57 ` Miao Xie
2012-09-13 12:37 ` Josef Bacik
2012-09-13 3:35 ` Miao Xie
2012-09-13 12:35 ` Josef Bacik [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20120913123527.GA12994@localhost.localdomain \
--to=jbacik@fusionio.com \
--cc=linux-btrfs@vger.kernel.org \
--cc=miaox@cn.fujitsu.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).