From: Jaegeuk Kim <jaegeuk@kernel.org>
To: Gu Zheng <guz.fnst@cn.fujitsu.com>
Cc: f2fs <linux-f2fs-devel@lists.sourceforge.net>,
fsdevel <linux-fsdevel@vger.kernel.org>,
이창만 <cm224.lee@samsung.com>, 俞� <chao2.yu@samsung.com>
Subject: Re: [PATCH 3/4] f2fs: use find_next_bit_le rather than test_bit_le in, find_in_block
Date: Thu, 3 Jul 2014 22:36:19 -0700 [thread overview]
Message-ID: <20140704053619.GA32528@jmac.local> (raw)
In-Reply-To: <53B52CEA.8080407@cn.fujitsu.com>
Well, how about testing with many ones in the bit streams?
Thanks,
On Thu, Jul 03, 2014 at 06:14:02PM +0800, Gu Zheng wrote:
> Hi Jaegeuk, Changman
>
> Just a simple test, not very sure it can address
> our qualm.
>
> Bitmap size:216(the same as f2fs dentry_bits).
> CPU: Intel i5 x86_64.
>
> Time counting based on tsc(the less the fast).
> [Index of 1] find_next_bit_le test_bit_le
> 0 20 117
> 1 20 114
> 2 20 113
> 3 20 139
> 4 22 121
> 5 22 118
> 6 22 115
> 8 22 112
> 9 22 106
> 10 22 105
> 11 22 100
> 16 22 98
> 48 22 97
> 80 27 95
> 104 27 92
> 136 32 95
> 160 32 92
> 184 32 90
> 200 27 87
> 208 35 84
>
> According to the result, find_next_bit_le is always
> better than test_bit_le, though there may be some
> noise, but I think the result is clear.
> Hope it can help us.:)
> ps.The sample is attached too.
>
> Thanks,
> Gu
> On 07/02/2014 06:30 PM, Jaegeuk Kim wrote:
>
> > Thanks Changman for reminding this. :)
> >
> > If there are a lot of ones in the bit stream, find_next_bit_le would cause some
> > overhead to translate the bits.
> >
> > However, it would be effective to use find_next_bit_le if the bit stream looks
> > like 0000000001.
> >
> > Well, IMO the former case would be a little bit more common.
> >
> > Gu,
> > Can you provide some performance numbers wrt this?
> >
> > On Tue, Jun 24, 2014 at 06:20:41PM +0800, Gu Zheng wrote:
> >> Use find_next_bit_le rather than test_bit_le to improve search speed
> >> lightly.
> >>
> >> Signed-off-by: Gu Zheng <guz.fnst@cn.fujitsu.com>
> >> ---
> >> fs/f2fs/dir.c | 43 +++++++++++++++++++++----------------------
> >> 1 files changed, 21 insertions(+), 22 deletions(-)
> >>
> >> diff --git a/fs/f2fs/dir.c b/fs/f2fs/dir.c
> >> index 3edd561..ba510fb 100644
> >> --- a/fs/f2fs/dir.c
> >> +++ b/fs/f2fs/dir.c
> >> @@ -93,42 +93,41 @@ static struct f2fs_dir_entry *find_in_block(struct page *dentry_page,
> >> const char *name, size_t namelen, int *max_slots,
> >> f2fs_hash_t namehash, struct page **res_page)
> >> {
> >> - struct f2fs_dir_entry *de;
> >> - unsigned long bit_pos = 0;
> >> + unsigned long bit_pos = 0, bit_start = 0;
> >> struct f2fs_dentry_block *dentry_blk = kmap(dentry_page);
> >> const void *dentry_bits = &dentry_blk->dentry_bitmap;
> >> - int max_len = 0;
> >>
> >> - while (bit_pos < NR_DENTRY_IN_BLOCK) {
> >> - if (!test_bit_le(bit_pos, dentry_bits)) {
> >> - if (bit_pos == 0)
> >> - max_len = 1;
> >> - else if (!test_bit_le(bit_pos - 1, dentry_bits))
> >> - max_len++;
> >> - bit_pos++;
> >> - continue;
> >> + while (bit_start < NR_DENTRY_IN_BLOCK) {
> >> + struct f2fs_dir_entry *de;
> >> + int max_len = 0;
> >> +
> >> + bit_pos = find_next_bit_le(dentry_bits,
> >> + NR_DENTRY_IN_BLOCK, bit_start);
> >> +
> >> + max_len = bit_pos - bit_start;
> >> + if (max_len > *max_slots) {
> >> + *max_slots = max_len;
> >> + max_len = 0;
> >> }
> >> +
> >> + if (bit_pos >= NR_DENTRY_IN_BLOCK)
> >> + break;
> >> +
> >> de = &dentry_blk->dentry[bit_pos];
> >> if (early_match_name(name, namelen, namehash, de)) {
> >> if (!memcmp(dentry_blk->filename[bit_pos],
> >> name, namelen)) {
> >> *res_page = dentry_page;
> >> - goto found;
> >> + return de;
> >> }
> >> }
> >> - if (max_len > *max_slots) {
> >> - *max_slots = max_len;
> >> - max_len = 0;
> >> - }
> >> - bit_pos += GET_DENTRY_SLOTS(le16_to_cpu(de->name_len));
> >> +
> >> + bit_start = bit_pos
> >> + + GET_DENTRY_SLOTS(le16_to_cpu(de->name_len));
> >> }
> >>
> >> - de = NULL;
> >> kunmap(dentry_page);
> >> -found:
> >> - if (max_len > *max_slots)
> >> - *max_slots = max_len;
> >> - return de;
> >> + return NULL;
> >> }
> >>
> >> static struct f2fs_dir_entry *find_in_level(struct inode *dir,
> >> --
> >> 1.7.7
> >
>
>
> #include <linux/module.h>
>
> __u8 bitmaps[20][27] = {
> {1, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {2, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {4, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {8, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {16, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 1,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {32, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {64, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 1, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 2, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 4, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 8, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 1, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 1,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 1, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 1,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 1, 0, 0, 0,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 1,
> 0, 0, 0, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 1, 0, 0, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 1, 0},
> {0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 0, 0,
> 0, 0, 0, 0, 0, 1}
> };
>
> uint64_t rdtsc(void) {
> uint32_t lo, hi;
> __asm__ __volatile__ ("rdtsc" : "=a" (lo), "=d" (hi));
> return (uint64_t)hi << 32 | lo;
> }
>
> static void test_bit_search_speed(void)
> {
> unsigned long flags;
> uint64_t tsc_s, tsc_diff;
> int i, j, pos;
> const void *bit_addr;
>
> local_irq_save(flags);
> preempt_disable();
>
> for (i = 0; i < 20; i++) {
> printk("Test bitmap %d... \n", i);
>
> bit_addr = &bitmaps[i];
>
> tsc_s = rdtsc();
>
> for (j = 0; j < 1000; j++)
> find_next_bit_le(bit_addr, 216, 0);
>
> tsc_diff = (rdtsc() - tsc_s)/1000;
>
> printk("find_next_bit_le: %llu \n", tsc_diff);
>
> pos = 0;
> tsc_s = rdtsc();
>
> for (j = 0; j < 1000; j++)
> while (!test_bit_le(pos++, bit_addr));
>
> tsc_diff = (rdtsc() - tsc_s)/1000;
>
> printk("test_bit_le : %llu \n", tsc_diff);
> }
>
> preempt_enable();
> local_irq_restore(flags);
> }
>
> static int __init start_test(void) {
> test_bit_search_speed();
> return 0;
> }
>
> static void __exit exit_test(void) {
>
> }
>
> module_init(start_test);
> module_exit(exit_test);
>
> MODULE_LICENSE("GPL");
> MODULE_AUTHOR("Gu Zheng");
--
Jaegeuk Kim
next prev parent reply other threads:[~2014-07-04 5:36 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-06-24 10:20 [PATCH 3/4] f2fs: use find_next_bit_le rather than test_bit_le in, find_in_block Gu Zheng
2014-06-25 2:30 ` [f2fs-dev] " Chao Yu
2014-06-25 2:53 ` Gu Zheng
2014-06-27 9:58 ` [PATCH V2 3/4] f2fs: use find_next_bit_le rather than test_bit_le in find_in_block Gu Zheng
2014-07-02 7:49 ` [PATCH 3/4] f2fs: use find_next_bit_le rather than test_bit_le in, find_in_block Changman Lee
2014-07-03 1:05 ` Gu Zheng
2014-07-02 10:30 ` Jaegeuk Kim
2014-07-03 1:13 ` Gu Zheng
2014-07-03 10:14 ` Gu Zheng
2014-07-04 5:36 ` Jaegeuk Kim [this message]
2014-07-04 6:21 ` Chao Yu
2014-07-04 8:04 ` Gu Zheng
2014-07-05 6:25 ` Jaegeuk Kim
2014-07-05 11:15 ` Chao Yu
2014-07-07 1:45 ` Changman Lee
2014-07-07 2:13 ` Gu Zheng
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140704053619.GA32528@jmac.local \
--to=jaegeuk@kernel.org \
--cc=chao2.yu@samsung.com \
--cc=cm224.lee@samsung.com \
--cc=guz.fnst@cn.fujitsu.com \
--cc=linux-f2fs-devel@lists.sourceforge.net \
--cc=linux-fsdevel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).