linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v4 0/3] Btrfs: populate heuristic with detection logic
@ 2017-08-20 18:11 Timofey Titovets
  2017-08-20 18:11 ` [PATCH v4 1/3] Btrfs: heuristic add simple sampling logic Timofey Titovets
                   ` (2 more replies)
  0 siblings, 3 replies; 4+ messages in thread
From: Timofey Titovets @ 2017-08-20 18:11 UTC (permalink / raw)
  To: linux-btrfs; +Cc: Timofey Titovets

Based on kdave for-next
As heuristic skeleton already merged
Populate heuristic with basic code.

First patch: add simple sampling code
It's get 16 byte samples with 256 bytes shifts
over input data. Collect info about how many
different bytes (symbols) has been found in sample data

Second patch: add code for calculate
how many unique bytes has been
found in sample data
That can fast detect easy compressible data

Third patch: add code for calculate byte core set size
i.e. how many unique bytes use 90% of sample data
That code require that numbers in bucket must be sorted
That can detect easy compressible data with many repeated bytes
That can detect not compressible data with evenly distributed bytes

Changes v1 -> v2:
  - Change input data iterator shift 512 -> 256
  - Replace magic macro numbers with direct values
  - Drop useless symbol population in bucket
    as no one care about where and what symbol stored
    in bucket at now

Changes v2 -> v3 (only update #3 patch):
  - Fix u64 division problem by use u32 for input_size
  - Fix input size calculation start - end -> end - start
  - Add missing sort.h header

Changes v3 -> v4 (only update #1 patch):
  - Change counter type in bucket item u16 -> u32
  - Drop other fields from bucket item for now,
    no one use it

Timofey Titovets (3):
  Btrfs: heuristic add simple sampling logic
  Btrfs: heuristic add byte set calculation
  Btrfs: heuristic add byte core set calculation

 fs/btrfs/compression.c | 109 ++++++++++++++++++++++++++++++++++++++++++++++++-
 fs/btrfs/compression.h |  11 +++++
 2 files changed, 118 insertions(+), 2 deletions(-)

--
2.14.1

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2017-08-20 18:11 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-08-20 18:11 [PATCH v4 0/3] Btrfs: populate heuristic with detection logic Timofey Titovets
2017-08-20 18:11 ` [PATCH v4 1/3] Btrfs: heuristic add simple sampling logic Timofey Titovets
2017-08-20 18:11 ` [PATCH v4 2/3] Btrfs: heuristic add byte set calculation Timofey Titovets
2017-08-20 18:11 ` [PATCH v4 3/3] Btrfs: heuristic add byte core " Timofey Titovets

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).