From: Timofey Titovets <nefelim4ag@gmail.com>
To: linux-btrfs@vger.kernel.org
Cc: Timofey Titovets <nefelim4ag@gmail.com>
Subject: [PATCH v7 3/6] Btrfs: implement heuristic sampling logic
Date: Fri, 25 Aug 2017 12:18:42 +0300 [thread overview]
Message-ID: <20170825091845.4120-4-nefelim4ag@gmail.com> (raw)
In-Reply-To: <20170825091845.4120-1-nefelim4ag@gmail.com>
Copy sample data from input data range to sample buffer
then calculate byte type count for that sample into bucket.
Signed-off-by: Timofey Titovets <nefelim4ag@gmail.com>
---
fs/btrfs/heuristic.c | 38 +++++++++++++++++++++++++++++++++++++-
1 file changed, 37 insertions(+), 1 deletion(-)
diff --git a/fs/btrfs/heuristic.c b/fs/btrfs/heuristic.c
index e3924c87af08..5192e51ab81e 100644
--- a/fs/btrfs/heuristic.c
+++ b/fs/btrfs/heuristic.c
@@ -69,8 +69,20 @@ static struct list_head *heuristic_alloc_workspace(void)
static int heuristic(struct list_head *ws, struct inode *inode,
u64 start, u64 end)
{
+ struct workspace *workspace = list_entry(ws, struct workspace, list);
struct page *page;
u64 index, index_end;
+ u32 a, b;
+ u8 *in_data, *sample = workspace->sample;
+ u8 byte;
+
+ /*
+ * Compression only handle first 128kb of input range
+ * And just shift over range in loop for compressing it.
+ * Let's do the same.
+ */
+ if (end - start > BTRFS_MAX_UNCOMPRESSED)
+ end = start + BTRFS_MAX_UNCOMPRESSED;
index = start >> PAGE_SHIFT;
index_end = end >> PAGE_SHIFT;
@@ -79,13 +91,37 @@ static int heuristic(struct list_head *ws, struct inode *inode,
if (!IS_ALIGNED(end, PAGE_SIZE))
index_end++;
+ b = 0;
for (; index < index_end; index++) {
page = find_get_page(inode->i_mapping, index);
- kmap(page);
+ in_data = kmap(page);
+ /* Handle case where start unaligned to PAGE_SIZE */
+ a = start%PAGE_SIZE;
+ while (a < PAGE_SIZE - READ_SIZE) {
+ /* Prevent sample overflow */
+ if (b >= MAX_SAMPLE_SIZE)
+ break;
+ /* Don't sample mem trash from last page */
+ if (start > end - READ_SIZE)
+ break;
+ memcpy(&sample[b], &in_data[a], READ_SIZE);
+ a += ITER_SHIFT;
+ start += ITER_SHIFT;
+ b += READ_SIZE;
+ }
kunmap(page);
put_page(page);
}
+ workspace->sample_size = b;
+
+ memset(workspace->bucket, 0, sizeof(*workspace->bucket)*BUCKET_SIZE);
+
+ for (a = 0; a < workspace->sample_size; a++) {
+ byte = sample[a];
+ workspace->bucket[byte].count++;
+ }
+
return 1;
}
--
2.14.1
next prev parent reply other threads:[~2017-08-25 9:19 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-08-25 9:18 [PATCH v7 0/6] Btrfs: populate heuristic with code Timofey Titovets
2017-08-25 9:18 ` [PATCH v7 1/6] Btrfs: heuristic make use compression workspaces Timofey Titovets
2017-09-27 13:12 ` David Sterba
2017-08-25 9:18 ` [PATCH v7 2/6] Btrfs: heuristic workspace add bucket and sample items Timofey Titovets
2017-09-27 13:22 ` David Sterba
2017-08-25 9:18 ` Timofey Titovets [this message]
2017-09-27 13:38 ` [PATCH v7 3/6] Btrfs: implement heuristic sampling logic David Sterba
2017-08-25 9:18 ` [PATCH v7 4/6] Btrfs: heuristic add detection of repeated data patterns Timofey Titovets
2017-09-27 13:47 ` David Sterba
2017-08-25 9:18 ` [PATCH v7 5/6] Btrfs: heuristic add byte set calculation Timofey Titovets
2017-09-27 13:50 ` David Sterba
2017-08-25 9:18 ` [PATCH v7 6/6] Btrfs: heuristic add byte core " Timofey Titovets
2017-09-27 13:54 ` David Sterba
2017-09-27 13:56 ` David Sterba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170825091845.4120-4-nefelim4ag@gmail.com \
--to=nefelim4ag@gmail.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).