From: Laszlo Ersek <lersek@redhat.com>
To: Peter Lieven <pl@dlhnet.de>
Cc: Orit Wasserman <owasserm@redhat.com>,
	"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: [Qemu-devel] [PATCH] page_cache: use multiplicative hash for page position calculation
Date: Fri, 01 Mar 2013 13:50:33 +0100
Message-ID: <5130A419.9020903@redhat.com>
In-Reply-To: <513096A1.8020008@dlhnet.de>

On 03/01/13 12:53, Peter Lieven wrote:
> instead of a linear mapping we use a multiplicative hash
> with the golden ratio to derive the cache bucket from the
> address. this helps to reduce collisions if memory positions
> are multiple of the cache size and it avoids a division
> in the position calculation.
> 
> Signed-off-by: Peter Lieven <pl@kamp.de>
> ---
>  page_cache.c |    5 ++++-
>  1 file changed, 4 insertions(+), 1 deletion(-)
> 
> diff --git a/page_cache.c b/page_cache.c
> index 376f1db..45d769a 100644
> --- a/page_cache.c
> +++ b/page_cache.c
> @@ -24,6 +24,7 @@
>  #include <strings.h>
> 
>  #include "qemu-common.h"
> +#include "qemu/host-utils.h"
>  #include "migration/page_cache.h"
> 
>  #ifdef DEBUG_CACHE
> @@ -48,6 +49,7 @@ struct PageCache {
>      int64_t max_num_items;
>      uint64_t max_item_age;
>      int64_t num_items;
> +    uint64_t hash_shift_bits;
>  };
> 
>  PageCache *cache_init(int64_t num_pages, unsigned int page_size)
> @@ -72,6 +74,7 @@ PageCache *cache_init(int64_t num_pages, unsigned int page_size)
>      cache->num_items = 0;
>      cache->max_item_age = 0;
>      cache->max_num_items = num_pages;
> +    cache->hash_shift_bits = clz64(num_pages-1);
> 
>      DPRINTF("Setting cache buckets to %" PRId64 "\n", cache->max_num_items);
> 
> @@ -108,7 +111,7 @@ static size_t cache_get_cache_pos(const PageCache *cache,
>      size_t pos;
> 
>      g_assert(cache->max_num_items);
> -    pos = (address / cache->page_size) & (cache->max_num_items - 1);
> +    pos = (address * 0x9e3779b97f4a7c13) >> cache->hash_shift_bits;
>      return pos;
>  }
> 
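
For reference, the position calculation in the last hunk can be sketched
standalone as below. This is only an illustration, not the QEMU code: the
sketch_* names are made up here, sketch_clz64() is a portable stand-in for
clz64() from "qemu/host-utils.h", and the restriction to power-of-two
num_pages >= 2 is an assumption of the sketch (a shift by 64 would be
undefined in C).

```c
#include <assert.h>
#include <stdint.h>

/* Multiplier exactly as written in the quoted patch. */
#define SKETCH_HASH_MULT UINT64_C(0x9e3779b97f4a7c13)

/* Hypothetical stand-in for clz64(): number of leading zero bits,
 * defined here to return 64 for an input of 0. */
static int sketch_clz64(uint64_t v)
{
    int n = 0;

    if (v == 0) {
        return 64;
    }
    while (!(v & (UINT64_C(1) << 63))) {
        v <<= 1;
        n++;
    }
    return n;
}

/* Shift that keeps the top log2(num_pages) bits of the 64-bit product. */
static unsigned sketch_hash_shift(uint64_t num_pages)
{
    /* Assumption of this sketch: power of two, at least 2 buckets. */
    assert(num_pages >= 2 && (num_pages & (num_pages - 1)) == 0);
    return (unsigned)sketch_clz64(num_pages - 1);
}

/* Multiply modulo 2^64, then keep only the high bits as the bucket. */
static uint64_t sketch_cache_pos(uint64_t address, unsigned shift)
{
    return (address * SKETCH_HASH_MULT) >> shift;
}
```

With num_pages = 1024 the shift is 54, so the bucket is the top 10 bits of
the product and always falls in [0, 1023].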

According to <http://www.brpreiss.com/books/opus4/html/page214.html>,
the multiplier "is chosen as the integer that is relatively prime to"
2^64 "which is closest to" (sqrt(5)-1)/2 * 2^64.

(sqrt(5)-1)/2 * 2^64 ~= 11400714819323198485.86699842797038469120

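Rounding to the nearest integer gives 11400714819323198486, which is even
and so not relatively prime to 2^64; the closest odd integer is
11400714819323198485, hence the constant would be a=0x9e3779b97f4a7c15.
Any reason why a-2 (0x9e3779b97f4a7c13) is used in the patch?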
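
The bracketing of the constant can be checked in pure integer arithmetic,
without trusting floating point. A minimal sketch, assuming only that
phi = (sqrt(5)-1)/2 satisfies phi^2 + phi = 1, so that x <= phi * 2^64
exactly when x^2 <= 2^64 * (2^64 - x); all function and macro names below
are made up for this illustration.

```c
#include <assert.h>
#include <stdint.h>

#define GOLDEN_A       UINT64_C(0x9e3779b97f4a7c15) /* "a" from the text   */
#define GOLDEN_A_PATCH UINT64_C(0x9e3779b97f4a7c13) /* "a-2", in the patch */

/* Full 64x64 -> 128 bit product, built from 32-bit halves. */
static void mul64_full(uint64_t a, uint64_t b, uint64_t *hi, uint64_t *lo)
{
    uint64_t a_lo = a & 0xffffffffu, a_hi = a >> 32;
    uint64_t b_lo = b & 0xffffffffu, b_hi = b >> 32;
    uint64_t p0 = a_lo * b_lo;
    uint64_t p1 = a_lo * b_hi;
    uint64_t p2 = a_hi * b_lo;
    uint64_t p3 = a_hi * b_hi;
    uint64_t mid = (p0 >> 32) + (p1 & 0xffffffffu) + (p2 & 0xffffffffu);

    *lo = (mid << 32) | (p0 & 0xffffffffu);
    *hi = p3 + (p1 >> 32) + (p2 >> 32) + (mid >> 32);
}

/* Nonzero when x <= phi * 2^64, for 0 < x < 2^64.
 * Compares x^2 (128 bits) against 2^64 * (2^64 - x). */
static int at_or_below_phi(uint64_t x)
{
    uint64_t hi, lo;
    uint64_t rhs_hi = UINT64_C(0) - x; /* 2^64 - x, valid for x != 0 */

    mul64_full(x, x, &hi, &lo);
    return hi < rhs_hi || (hi == rhs_hi && lo == 0);
}

/* phi * 2^64 lies strictly between a and a+1, so a is the closest odd
 * integer; the patch constant is the next odd integer below it. */
static void check_golden_constants(void)
{
    assert(at_or_below_phi(GOLDEN_A));
    assert(!at_or_below_phi(GOLDEN_A + 1));
    assert((GOLDEN_A & 1) == 1 && (GOLDEN_A_PATCH & 1) == 1);
    assert(GOLDEN_A - GOLDEN_A_PATCH == 2);
}
```

Both constants are odd, so either one is relatively prime to 2^64; the
check only shows which of the two is closer to (sqrt(5)-1)/2 * 2^64.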

(Note: this is not a review or any suggestion to change the patch; I'm
just curious.)

A google-fight between "a" and "a-2" is inconclusive. So is stackoverflow:

http://stackoverflow.com/questions/4113278/64-bit-multiplicative-hashing
http://stackoverflow.com/questions/8513911/how-to-create-a-good-hash-combine-with-64-bit-output-inspired-by-boosthash-co

Thanks
Laszlo

Thread overview: 3+ messages
2013-03-01 11:53 [Qemu-devel] [PATCH] page_cache: use multiplicative hash for page position calculation Peter Lieven
2013-03-01 12:50 ` Laszlo Ersek [this message]
2013-03-01 13:22   ` Peter Lieven
