From: Mark Nelson <mnelson@redhat.com>
To: Samuel Just <sjust@redhat.com>, Sage Weil <sweil@redhat.com>
Cc: Gleb Borisov <borisov.gleb@gmail.com>, ceph-devel@vger.kernel.org
Subject: Re: Strange issue with CRUSH
Date: Mon, 13 Jul 2015 10:20:00 -0500 [thread overview]
Message-ID: <55A3D720.7070006@redhat.com> (raw)
In-Reply-To: <1736665041.43352077.1436482991363.JavaMail.zimbra@redhat.com>
FWIW,
It would be very interesting to see the output of:
https://github.com/ceph/cbt/blob/master/tools/readpgdump.py
If you see something that looks anomalous. I'd like to make sure that
I'm detecting issues like this.
Mark
On 07/09/2015 06:03 PM, Samuel Just wrote:
> I've seen some odd teuthology in the last week or two which seems to be anomalous rjenkins hash behavior as well.
>
> http://tracker.ceph.com/issues/12231
> -Sam
>
> ----- Original Message -----
> From: "Sage Weil" <sweil@redhat.com>
> To: "Gleb Borisov" <borisov.gleb@gmail.com>
> Cc: ceph-devel@vger.kernel.org
> Sent: Thursday, July 9, 2015 3:06:00 PM
> Subject: Re: Strange issue with CRUSH
>
> On Fri, 10 Jul 2015, Gleb Borisov wrote:
>> Hi Sage,
>>
>> Sorry for mailing you in person, I realize that you're quite busy at redhat,
>> but I wanted you have a look on an issue with CRUSH map.
>
> No problem. I hope you don't mind I've added ceph-devel to the cc list.
>
>> I've described very first insights here:
>> http://lists.ceph.com/pipermail/ceph-users-ceph.com/2015-July/002897.html
>>
>> We are continue our research and found that distribution of PG count by OSD
>> is very strange and after digging into CRUSH source code found rjenkins1
>> hash function.
>>
>> After some testing we realized that rjenkins1's value distribution is
>> exponential, and this can cause our disbalance.
>
> Any issue with rjenkins1's hash function is very interesting and
> concerning. Can you describe your analysis and what you mean by the
> distribution being exponential?
>
>> What do you think about adding additional hashing algorithm to CRUSH? It
>> seems that it could improve distribution.
>
> I am definitely open to adding new hash functions, especially if the
> current ones are flawed. The current hash was created by making ad hoc
> combinations of rjenkins' mix function with various numbers of
> arguments--hardly scientific or methodical. We did an analysis a couple
> years back and found that it effectively modeled a uniform distribution,
> but if we missed something or were wrong we should definitely correct it!
>
> In any case, the important step is to quantify what is wrong with the
> current hash so that we can ensure any new one is not flawed in the same
> way.
>
> Thanks-
> sage
>
>
>> We have also tried to generate some syntetic crushmaps (another bucket
>> types, more OSDs per host, more/less hosts by rack, different cound of
>> racks, linear osd ids, random osd ids, etc), but didn't found any
>> combination with better distribution of PG across OSD.
>>
>> Thanks and one more sorry for bothering you in person.
>> --
>> Best regards,
>> Gleb M Borisov
>>
>>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
next prev parent reply other threads:[~2015-07-13 15:20 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <CAOWwpMJv4N0BvdEy2Y-WpDqqZ6R=f_5_+2j1dWeVe-Xk1fg1bQ@mail.gmail.com>
2015-07-09 22:06 ` Strange issue with CRUSH Sage Weil
2015-07-09 23:03 ` Samuel Just
2015-07-13 15:20 ` Mark Nelson [this message]
[not found] ` <CAOWwpMKN3YtACUj181K2Wqg539EibuUfh5UZPceoYr2rBLxOEQ@mail.gmail.com>
[not found] ` <CAOWwpMKeQyx+PQdNMqGvXAtamg4mMGSZrEw96X9D5KWGnaQa9A@mail.gmail.com>
2015-07-13 16:28 ` Mark Nelson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=55A3D720.7070006@redhat.com \
--to=mnelson@redhat.com \
--cc=borisov.gleb@gmail.com \
--cc=ceph-devel@vger.kernel.org \
--cc=sjust@redhat.com \
--cc=sweil@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.