All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jaegeuk Kim <jaegeuk@kernel.org>
To: Gu Zheng <guz.fnst@cn.fujitsu.com>
Cc: f2fs <linux-f2fs-devel@lists.sourceforge.net>,
	fsdevel <linux-fsdevel@vger.kernel.org>,
	이창만 <cm224.lee@samsung.com>, 俞� <chao2.yu@samsung.com>
Subject: Re: [PATCH 3/4] f2fs: use find_next_bit_le rather than test_bit_le in, find_in_block
Date: Thu, 3 Jul 2014 22:36:19 -0700	[thread overview]
Message-ID: <20140704053619.GA32528@jmac.local> (raw)
In-Reply-To: <53B52CEA.8080407@cn.fujitsu.com>

Well, how about testing with many ones in the bit streams?
Thanks,

On Thu, Jul 03, 2014 at 06:14:02PM +0800, Gu Zheng wrote:
> Hi Jaegeuk, Changman
> 
> Just a simple test, not very sure it can address
> our qualm.
> 
> Bitmap size:216(the same as f2fs dentry_bits).
> CPU: Intel i5 x86_64.
> 
> Time counting based on tsc(the less the fast).
> [Index of 1]	find_next_bit_le	test_bit_le
> 0		20			117
> 1		20			114
> 2		20			113
> 3		20			139
> 4		22			121
> 5		22			118
> 6		22			115
> 8		22			112
> 9		22			106
> 10		22			105
> 11		22			100
> 16		22			98
> 48		22			97
> 80		27			95	
> 104		27			92
> 136		32			95
> 160		32			92
> 184		32			90
> 200		27			87
> 208		35			84
> 
> According to the result, find_next_bit_le is always
> better than test_bit_le, though there may be some
> noise, but I think the result is clear.
> Hope it can help us.:)
> ps.The sample is attached too.
> 
> Thanks,
> Gu
> On 07/02/2014 06:30 PM, Jaegeuk Kim wrote:
> 
> > Thanks Changman for reminding this. :)
> > 
> > If there are a lot of ones in the bit stream, find_next_bit_le would cause some
> > overhead to translate the bits.
> > 
> > However, it would be effective to use find_next_bit_le if the bit stream looks
> > like 0000000001.
> > 
> > Well, IMO the former case would be a little bit more common.
> > 
> > Gu,
> > Can you provide some performance numbers wrt this?
> > 
> > On Tue, Jun 24, 2014 at 06:20:41PM +0800, Gu Zheng wrote:
> >> Use find_next_bit_le rather than test_bit_le to improve search speed
> >> lightly.
> >>
> >> Signed-off-by: Gu Zheng <guz.fnst@cn.fujitsu.com>
> >> ---
> >>  fs/f2fs/dir.c |   43 +++++++++++++++++++++----------------------
> >>  1 files changed, 21 insertions(+), 22 deletions(-)
> >>
> >> diff --git a/fs/f2fs/dir.c b/fs/f2fs/dir.c
> >> index 3edd561..ba510fb 100644
> >> --- a/fs/f2fs/dir.c
> >> +++ b/fs/f2fs/dir.c
> >> @@ -93,42 +93,41 @@ static struct f2fs_dir_entry *find_in_block(struct page *dentry_page,
> >>  			const char *name, size_t namelen, int *max_slots,
> >>  			f2fs_hash_t namehash, struct page **res_page)
> >>  {
> >> -	struct f2fs_dir_entry *de;
> >> -	unsigned long bit_pos = 0;
> >> +	unsigned long bit_pos = 0, bit_start = 0;
> >>  	struct f2fs_dentry_block *dentry_blk = kmap(dentry_page);
> >>  	const void *dentry_bits = &dentry_blk->dentry_bitmap;
> >> -	int max_len = 0;
> >>  
> >> -	while (bit_pos < NR_DENTRY_IN_BLOCK) {
> >> -		if (!test_bit_le(bit_pos, dentry_bits)) {
> >> -			if (bit_pos == 0)
> >> -				max_len = 1;
> >> -			else if (!test_bit_le(bit_pos - 1, dentry_bits))
> >> -				max_len++;
> >> -			bit_pos++;
> >> -			continue;
> >> +	while (bit_start < NR_DENTRY_IN_BLOCK) {
> >> +		struct f2fs_dir_entry *de;
> >> +		int max_len = 0;
> >> +
> >> +		bit_pos = find_next_bit_le(dentry_bits,
> >> +					NR_DENTRY_IN_BLOCK, bit_start);
> >> +
> >> +		max_len = bit_pos - bit_start;
> >> +		if (max_len > *max_slots) {
> >> +			*max_slots = max_len;
> >> +			max_len = 0;
> >>  		}
> >> +
> >> +		if (bit_pos >= NR_DENTRY_IN_BLOCK)
> >> +			break;
> >> +
> >>  		de = &dentry_blk->dentry[bit_pos];
> >>  		if (early_match_name(name, namelen, namehash, de)) {
> >>  			if (!memcmp(dentry_blk->filename[bit_pos],
> >>  							name, namelen)) {
> >>  				*res_page = dentry_page;
> >> -				goto found;
> >> +				return de;
> >>  			}
> >>  		}
> >> -		if (max_len > *max_slots) {
> >> -			*max_slots = max_len;
> >> -			max_len = 0;
> >> -		}
> >> -		bit_pos += GET_DENTRY_SLOTS(le16_to_cpu(de->name_len));
> >> +
> >> +		bit_start = bit_pos
> >> +				+ GET_DENTRY_SLOTS(le16_to_cpu(de->name_len));
> >>  	}
> >>  
> >> -	de = NULL;
> >>  	kunmap(dentry_page);
> >> -found:
> >> -	if (max_len > *max_slots)
> >> -		*max_slots = max_len;
> >> -	return de;
> >> +	return NULL;
> >>  }
> >>  
> >>  static struct f2fs_dir_entry *find_in_level(struct inode *dir,
> >> -- 
> >> 1.7.7
> > 
> 
> 

> #include <linux/module.h>
> 
> __u8 bitmaps[20][27] = {
> 		{1, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{2, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{4, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{8, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{16, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 1,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{32, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{64, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 1, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 2, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 4, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 8, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 1, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 1,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 1, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 1,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 1, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 1,
> 		 0, 0, 0, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 1, 0, 0, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 1, 0},
> 		{0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 0, 0,
> 		 0, 0, 0, 0, 0, 1}
> 		};
> 
> uint64_t rdtsc(void) {
> uint32_t lo, hi;
> __asm__ __volatile__ ("rdtsc" : "=a" (lo), "=d" (hi));
> return (uint64_t)hi << 32 | lo;
> }
> 
> static void test_bit_search_speed(void)
> {
> 	unsigned long flags;
> 	uint64_t tsc_s, tsc_diff;
> 	int i, j, pos;
> 	const void *bit_addr;
> 
> 	local_irq_save(flags);
> 	preempt_disable();
> 
> 	for (i = 0; i < 20; i++) {
> 		printk("Test bitmap %d... \n", i);
> 
> 		bit_addr = &bitmaps[i];
> 
> 		tsc_s = rdtsc();
> 
> 		for (j = 0; j < 1000; j++)
> 			find_next_bit_le(bit_addr, 216, 0);
> 
> 		tsc_diff = (rdtsc() - tsc_s)/1000;
> 
> 		printk("find_next_bit_le: %llu \n", tsc_diff);
> 
> 		pos = 0;
> 		tsc_s = rdtsc();
> 
> 		for (j = 0; j < 1000; j++)
> 			while (!test_bit_le(pos++, bit_addr));
> 
> 		tsc_diff = (rdtsc() - tsc_s)/1000;
> 
> 		printk("test_bit_le     : %llu \n", tsc_diff);
> 	}
> 
> 	preempt_enable();
> 	local_irq_restore(flags);
> }
> 
> static int __init start_test(void) {
> 	test_bit_search_speed();
> 	return 0;
> }
> 
> static void __exit exit_test(void) {
> 
> }
> 
> module_init(start_test);
> module_exit(exit_test);
> 
> MODULE_LICENSE("GPL");
> MODULE_AUTHOR("Gu Zheng");


-- 
Jaegeuk Kim

  reply	other threads:[~2014-07-04  5:36 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-06-24 10:20 [PATCH 3/4] f2fs: use find_next_bit_le rather than test_bit_le in, find_in_block Gu Zheng
2014-06-25  2:30 ` [f2fs-dev] " Chao Yu
2014-06-25  2:53   ` Gu Zheng
2014-06-27  9:58 ` [PATCH V2 3/4] f2fs: use find_next_bit_le rather than test_bit_le in find_in_block Gu Zheng
2014-07-02  7:49 ` [PATCH 3/4] f2fs: use find_next_bit_le rather than test_bit_le in, find_in_block Changman Lee
2014-07-03  1:05   ` Gu Zheng
2014-07-02 10:30 ` Jaegeuk Kim
2014-07-03  1:13   ` Gu Zheng
2014-07-03 10:14   ` Gu Zheng
2014-07-04  5:36     ` Jaegeuk Kim [this message]
2014-07-04  6:21       ` Chao Yu
2014-07-04  8:04         ` Gu Zheng
2014-07-05  6:25           ` Jaegeuk Kim
2014-07-05 11:15             ` Chao Yu
2014-07-07  1:45             ` Changman Lee
2014-07-07  2:13               ` Gu Zheng

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140704053619.GA32528@jmac.local \
    --to=jaegeuk@kernel.org \
    --cc=chao2.yu@samsung.com \
    --cc=cm224.lee@samsung.com \
    --cc=guz.fnst@cn.fujitsu.com \
    --cc=linux-f2fs-devel@lists.sourceforge.net \
    --cc=linux-fsdevel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.