From: "Aneesh Kumar K. V" <aneesh.kumar@linux.vnet.ibm.com>
To: "Amir G." <amir73il@users.sourceforge.net>, Theodore Tso <tytso@mit.edu>
Cc: Ext4 Developers List <linux-ext4@vger.kernel.org>
Subject: Re: [PATCH][RFC] ext4: avoid taking down_read(&grp->alloc_sem)
Date: Mon, 14 Feb 2011 15:04:05 +0530 [thread overview]
Message-ID: <8739nrc642.fsf@linux.vnet.ibm.com> (raw)
In-Reply-To: <AANLkTimUmo5v14oKodaBKFtZPEHHuR9kZS5OSVtEFj2c@mail.gmail.com>
On Wed, 9 Feb 2011 12:05:11 +0200, "Amir G." <amir73il@users.sourceforge.net> wrote:
> Hi Aneesh,
>
> As you are signed off on most of the recent alloc_sem related code changes,
> can you please comment on the patch below, which tries to avoid taking
> the read lock most of the times on a 4K block fs.
>
> Can anyone tell what performance impact (if any) will be caused by avoiding
> the read lock on most allocations? group spin lock will still be taken, but for
> much shorter periods of time (cycles).
>
> Any ideas how this patch can be properly tested?
A quick check says the changes are correct. But i am not sure whether we
want to conditionalize these locks unless they appear as highly
contented locks in a profile.
>
> Thanks,
> Amir.
>
> grp->alloc_sem is used to synchronize buddy cache users with buddy cache init
> of other groups that use the same buddy cache page and with adding blocks to
> group on online resize.
>
> When blocks_per_page <= 2, each group has it's own private buddy cache page
> so taking the read lock for every allocation is futile and can be avoided for
> every group, but the last one.
>
> The write lock is taken in ext4_mb_init_group() and in ext4_add_groupblocks()
> to synchronize the buddy cache init of a group on first time allocation after
> mount and after extending the last group.
>
> Signed-off-by: Amir Goldstein <amir73il@users.sf.net>
> ---
> fs/ext4/mballoc.c | 19 +++++++++++++++----
> 1 files changed, 15 insertions(+), 4 deletions(-)
>
> diff --git a/fs/ext4/mballoc.c b/fs/ext4/mballoc.c
> index 1b3256b..22a5251 100644
> --- a/fs/ext4/mballoc.c
> +++ b/fs/ext4/mballoc.c
> @@ -1160,7 +1160,15 @@ ext4_mb_load_buddy(struct super_block *sb,
> ext4_group_t group,
> e4b->bd_group = group;
> e4b->bd_buddy_page = NULL;
> e4b->bd_bitmap_page = NULL;
> - e4b->alloc_semp = &grp->alloc_sem;
> + /*
> + * We only need to take the read lock if other groups share the buddy
> + * page with this group or if blocks may be added to this (last) group
> + * by ext4_group_extend().
> + */
> + if (blocks_per_page > 2 || group == sbi->s_groups_count - 1)
If we can say groups_per_page > 1 that would make it more clear.
> + e4b->alloc_semp = &grp->alloc_sem;
> + else
> + e4b->alloc_semp = NULL;
>
> /* Take the read lock on the group alloc
> * sem. This would make sure a parallel
> @@ -1169,7 +1177,8 @@ ext4_mb_load_buddy(struct super_block *sb,
> ext4_group_t group,
> * till we are done with allocation
> */
> repeat_load_buddy:
> - down_read(e4b->alloc_semp);
> + if (e4b->alloc_semp)
> + down_read(e4b->alloc_semp);
>
> if (unlikely(EXT4_MB_GRP_NEED_INIT(grp))) {
> /* we need to check for group need init flag
> @@ -1177,7 +1186,8 @@ repeat_load_buddy:
> * that new blocks didn't get added to the group
> * when we are loading the buddy cache
> */
> - up_read(e4b->alloc_semp);
> + if (e4b->alloc_semp)
> + up_read(e4b->alloc_semp);
> /*
> * we need full data about the group
> * to make a good selection
> @@ -1277,7 +1287,8 @@ err:
> e4b->bd_bitmap = NULL;
>
> /* Done with the buddy cache */
> - up_read(e4b->alloc_semp);
> + if (e4b->alloc_semp)
> + up_read(e4b->alloc_semp);
> return ret;
> }
>
-aneesh
next prev parent reply other threads:[~2011-02-14 9:34 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-02-09 10:05 [PATCH][RFC] ext4: avoid taking down_read(&grp->alloc_sem) Amir G.
2011-02-14 7:52 ` Amir G.
2011-02-14 16:30 ` Andreas Dilger
2011-02-14 18:18 ` Amir G.
2011-02-14 9:34 ` Aneesh Kumar K. V [this message]
2011-02-14 12:08 ` Amir G.
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=8739nrc642.fsf@linux.vnet.ibm.com \
--to=aneesh.kumar@linux.vnet.ibm.com \
--cc=amir73il@users.sourceforge.net \
--cc=linux-ext4@vger.kernel.org \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).