From: Chao Yu <chao2.yu@samsung.com>
To: jaegeuk.kim@samsung.com
Cc: linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
linux-f2fs-devel@lists.sourceforge.net
Subject: Re: [PATCH V2 2/2 RESEND] f2fs: read contiguous sit entry pages by merging for mount performance
Date: Thu, 21 Nov 2013 11:18:39 +0800 [thread overview]
Message-ID: <000001cee668$872ec6a0$958c53e0$@samsung.com> (raw)
In-Reply-To: <1384997517.26319.60.camel@kjgkr>
Hi,
> -----Original Message-----
> From: Jaegeuk Kim [mailto:jaegeuk.kim@samsung.com]
> Sent: Thursday, November 21, 2013 9:32 AM
> To: Chao Yu
> Cc: linux-fsdevel@vger.kernel.org; linux-kernel@vger.kernel.org; linux-f2fs-devel@lists.sourceforge.net; 谭姝
> Subject: Re: [f2fs-dev] [PATCH V2 2/2 RESEND] f2fs: read contiguous sit entry pages by merging for mount performance
>
> Hi,
>
> It seems that ra_sit_pages() is too tightly coupled with
> build_sit_entries().
This code could be improved.
> Is there another way not to use *is_order?
Previously the code is like this:
-build_sit_entries()
next_setp:
for(start = 0; start < TOTAL_SEGS(sbi); start++)
/*step#1 readahead all sit entries blocks*/
if(start % SIT_ENTRY_PER_BLOCK == 0) {
blk_addr = current_sit_addr(sbi, start);
/* grab and submit_read_page */
}
if(start == TOTAL_SEGS(sbi) - 1)
f2fs_submit_read_bio();
continue;
/*step#2 fill sit entries info*/
/*step#3 cover sit entries with journal*/
But I think its weakness is that it will cost lots of memory to read
ahead all sit entry pages when f2fs mount, and also it's serious waste
that we read them again after these pages are released by VM when
out of memory.
>
> The ra_sit_pages() tries to read consecutive sit pages as many as
> possible.
> So then, what about just checking whether its block address is
> contiguous or not?
>
> Something like this:
> -ra_sit_pages()
> blkno = start;
> while (blkno < sit_i->sit_blocks) {
> blk_addr = current_sit_addr(sbi, blkno);
> if (blkno != start && prev_blk_addr + 1 != blk_addr)
> break;
>
> /* grab and submit_read_page */
>
> prev_blk_addr = blk_addr;
> blkno++;
> }
Agreed, this method could remove *order.
Shouldn't we add nrpages for readahead policy as VM?
>
> Thanks,
>
> 2013-11-20 (수), 14:47 +0800, Chao Yu:
> > Previously we read sit entries page one by one, this method lost the chance
> > of reading contiguous page together. So we read pages as contiguous as
> > possible for better mount performance.
> >
> > v1-->v2:
> > o merge judgements/use 'Continue' or 'Break' instead of 'Goto' as Gu Zheng
> > suggested.
> > o add mark_page_accessed() before release page to delay VM reclaiming them.
> >
> > Signed-off-by: Chao Yu <chao2.yu@samsung.com>
> > ---
> > fs/f2fs/segment.c | 108 ++++++++++++++++++++++++++++++++++++++++-------------
> > fs/f2fs/segment.h | 2 +
> > 2 files changed, 84 insertions(+), 26 deletions(-)
> >
> > diff --git a/fs/f2fs/segment.c b/fs/f2fs/segment.c
> > index 8149eba..52c88de 100644
> > --- a/fs/f2fs/segment.c
> > +++ b/fs/f2fs/segment.c
> > @@ -14,6 +14,7 @@
> > #include <linux/blkdev.h>
> > #include <linux/prefetch.h>
> > #include <linux/vmalloc.h>
> > +#include <linux/swap.h>
> >
> > #include "f2fs.h"
> > #include "segment.h"
> > @@ -1488,41 +1489,96 @@ static int build_curseg(struct f2fs_sb_info *sbi)
> > return restore_curseg_summaries(sbi);
> > }
> >
> > +static int ra_sit_pages(struct f2fs_sb_info *sbi, int start,
> > + int nrpages, bool *is_order)
>
> Why do you use nrpages?
nrpages point out expectation of caller, caller cloud control how many
pages want to read this time. It solves the weakness of previous coed I
give above.
>
> > +{
> > + struct address_space *mapping = sbi->meta_inode->i_mapping;
> > + struct sit_info *sit_i = SIT_I(sbi);
> > + struct page *page;
> > + block_t blk_addr;
> > + int blkno = start, readcnt = 0;
> > + int sit_blk_cnt = SIT_BLK_CNT(sbi);
> > +
> > + for (; blkno < start + nrpages && blkno < sit_blk_cnt; blkno++) {
> > +
> > + if ((!f2fs_test_bit(blkno, sit_i->sit_bitmap) ^ !*is_order)) {
> > + *is_order = !*is_order;
> > + break;
> > + }
> > +
> > + blk_addr = sit_i->sit_base_addr + blkno;
> > + if (*is_order)
> > + blk_addr += sit_i->sit_blocks;
> > +repeat:
> > + page = grab_cache_page(mapping, blk_addr);
> > + if (!page) {
> > + cond_resched();
> > + goto repeat;
> > + }
> > + if (PageUptodate(page)) {
> > + mark_page_accessed(page);
> > + f2fs_put_page(page, 1);
> > + readcnt++;
> > + continue;
> > + }
> > +
> > + submit_read_page(sbi, page, blk_addr, READ_SYNC);
> > +
> > + mark_page_accessed(page);
> > + f2fs_put_page(page, 0);
> > + readcnt++;
> > + }
> > +
> > + f2fs_submit_read_bio(sbi, READ_SYNC);
> > + return readcnt;
> > +}
> > +
> > static void build_sit_entries(struct f2fs_sb_info *sbi)
> > {
> > struct sit_info *sit_i = SIT_I(sbi);
> > struct curseg_info *curseg = CURSEG_I(sbi, CURSEG_COLD_DATA);
> > struct f2fs_summary_block *sum = curseg->sum_blk;
> > - unsigned int start;
> > -
> > - for (start = 0; start < TOTAL_SEGS(sbi); start++) {
> > - struct seg_entry *se = &sit_i->sentries[start];
> > - struct f2fs_sit_block *sit_blk;
> > - struct f2fs_sit_entry sit;
> > - struct page *page;
> > - int i;
> > + bool is_order = f2fs_test_bit(0, sit_i->sit_bitmap) ? true : false;
> > + int sit_blk_cnt = SIT_BLK_CNT(sbi);
> > + unsigned int i, start, end;
> > + unsigned int readed, start_blk = 0;
> >
> > - mutex_lock(&curseg->curseg_mutex);
> > - for (i = 0; i < sits_in_cursum(sum); i++) {
> > - if (le32_to_cpu(segno_in_journal(sum, i)) == start) {
> > - sit = sit_in_journal(sum, i);
> > - mutex_unlock(&curseg->curseg_mutex);
> > - goto got_it;
> > + do {
> > + readed = ra_sit_pages(sbi, start_blk, sit_blk_cnt, &is_order);
> > +
> > + start = start_blk * sit_i->sents_per_block;
> > + end = (start_blk + readed) * sit_i->sents_per_block;
> > +
> > + for (; start < end && start < TOTAL_SEGS(sbi); start++) {
> > + struct seg_entry *se = &sit_i->sentries[start];
> > + struct f2fs_sit_block *sit_blk;
> > + struct f2fs_sit_entry sit;
> > + struct page *page;
> > +
> > + mutex_lock(&curseg->curseg_mutex);
> > + for (i = 0; i < sits_in_cursum(sum); i++) {
> > + if (le32_to_cpu(segno_in_journal(sum, i)) == start) {
> > + sit = sit_in_journal(sum, i);
> > + mutex_unlock(&curseg->curseg_mutex);
> > + goto got_it;
> > + }
> > }
> > - }
> > - mutex_unlock(&curseg->curseg_mutex);
> > - page = get_current_sit_page(sbi, start);
> > - sit_blk = (struct f2fs_sit_block *)page_address(page);
> > - sit = sit_blk->entries[SIT_ENTRY_OFFSET(sit_i, start)];
> > - f2fs_put_page(page, 1);
> > + mutex_unlock(&curseg->curseg_mutex);
> > +
> > + page = get_current_sit_page(sbi, start);
> > + sit_blk = (struct f2fs_sit_block *)page_address(page);
> > + sit = sit_blk->entries[SIT_ENTRY_OFFSET(sit_i, start)];
> > + f2fs_put_page(page, 1);
> > got_it:
> > - check_block_count(sbi, start, &sit);
> > - seg_info_from_raw_sit(se, &sit);
> > - if (sbi->segs_per_sec > 1) {
> > - struct sec_entry *e = get_sec_entry(sbi, start);
> > - e->valid_blocks += se->valid_blocks;
> > + check_block_count(sbi, start, &sit);
> > + seg_info_from_raw_sit(se, &sit);
> > + if (sbi->segs_per_sec > 1) {
> > + struct sec_entry *e = get_sec_entry(sbi, start);
> > + e->valid_blocks += se->valid_blocks;
> > + }
> > }
> > - }
> > + start_blk += readed;
> > + } while (start_blk < sit_blk_cnt);
> > }
> >
> > static void init_free_segmap(struct f2fs_sb_info *sbi)
> > diff --git a/fs/f2fs/segment.h b/fs/f2fs/segment.h
> > index 269f690..ad5b9f1 100644
> > --- a/fs/f2fs/segment.h
> > +++ b/fs/f2fs/segment.h
> > @@ -83,6 +83,8 @@
> > (segno / SIT_ENTRY_PER_BLOCK)
> > #define START_SEGNO(sit_i, segno) \
> > (SIT_BLOCK_OFFSET(sit_i, segno) * SIT_ENTRY_PER_BLOCK)
> > +#define SIT_BLK_CNT(sbi) \
> > + ((TOTAL_SEGS(sbi) + SIT_ENTRY_PER_BLOCK - 1) / SIT_ENTRY_PER_BLOCK)
> > #define f2fs_bitmap_size(nr) \
> > (BITS_TO_LONGS(nr) * sizeof(unsigned long))
> > #define TOTAL_SEGS(sbi) (SM_I(sbi)->main_segments)
>
> --
> Jaegeuk Kim
> Samsung
------------------------------------------------------------------------------
Shape the Mobile Experience: Free Subscription
Software experts and developers: Be at the forefront of tech innovation.
Intel(R) Software Adrenaline delivers strategic insight and game-changing
conversations that shape the rapidly evolving mobile landscape. Sign up now.
http://pubads.g.doubleclick.net/gampad/clk?id=63431311&iu=/4140/ostg.clktrk
_______________________________________________
Linux-f2fs-devel mailing list
Linux-f2fs-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel
next prev parent reply other threads:[~2013-11-21 3:18 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-11-20 6:47 [f2fs-dev] [PATCH V2 2/2 RESEND] f2fs: read contiguous sit entry pages by merging for mount performance Chao Yu
2013-11-21 1:31 ` Jaegeuk Kim
2013-11-21 3:18 ` Chao Yu [this message]
2013-11-21 4:45 ` Jaegeuk Kim
2013-11-21 6:37 ` [f2fs-dev] " Chao Yu
2013-11-21 9:35 ` Jaegeuk Kim
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='000001cee668$872ec6a0$958c53e0$@samsung.com' \
--to=chao2.yu@samsung.com \
--cc=jaegeuk.kim@samsung.com \
--cc=linux-f2fs-devel@lists.sourceforge.net \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).