* OSDMonitor: don't allow creation of pools with > 65535 pgs
@ 2012-12-14 15:41 Jim Schutt
2012-12-14 16:59 ` Joao Eduardo Luis
2012-12-18 0:14 ` Joao Eduardo Luis
0 siblings, 2 replies; 4+ messages in thread
From: Jim Schutt @ 2012-12-14 15:41 UTC (permalink / raw)
To: Joao Eduardo Luis, Greg Farnum; +Cc: ceph-devel@vger.kernel.org
Hi,
I'm looking at commit e3ed28eb2 in the next branch,
and I have a question.
Shouldn't the limit be pg_num > 65536, because
PGs are numbered 0 thru pg_num-1?
If not, what am I missing?
FWIW, up through yesterday I've been using the next branch and this:
ceph osd pool set data pg_num 65536 --allow-experimental-feature
ceph osd pool set metadata pg_num 65536 --allow-experimental-feature
ceph osd pool set data pgp_num 65536 --allow-experimental-feature
ceph osd pool set metadata pgp_num 65536 --allow-experimental-feature
using cephfs clients, and have seen no trouble with
misdirected ops, etc.
-- Jim
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OSDMonitor: don't allow creation of pools with > 65535 pgs
2012-12-14 15:41 OSDMonitor: don't allow creation of pools with > 65535 pgs Jim Schutt
@ 2012-12-14 16:59 ` Joao Eduardo Luis
2012-12-14 17:41 ` [EXTERNAL] " Jim Schutt
2012-12-18 0:14 ` Joao Eduardo Luis
1 sibling, 1 reply; 4+ messages in thread
From: Joao Eduardo Luis @ 2012-12-14 16:59 UTC (permalink / raw)
To: Jim Schutt; +Cc: Greg Farnum, ceph-devel@vger.kernel.org
On 12/14/2012 03:41 PM, Jim Schutt wrote:
> Hi,
>
> I'm looking at commit e3ed28eb2 in the next branch,
> and I have a question.
>
> Shouldn't the limit be pg_num > 65536, because
> PGs are numbered 0 thru pg_num-1?
>
> If not, what am I missing?
>
> FWIW, up through yesterday I've been using the next branch and this:
>
> ceph osd pool set data pg_num 65536 --allow-experimental-feature
> ceph osd pool set metadata pg_num 65536 --allow-experimental-feature
> ceph osd pool set data pgp_num 65536 --allow-experimental-feature
> ceph osd pool set metadata pgp_num 65536 --allow-experimental-feature
>
> using cephfs clients, and have seen no trouble with
> misdirected ops, etc.
>
> -- Jim
>
Hi Jim,
To the best of my knowledge, one of the things that triggered the
required hard cap on the number of pgs was that the kernel side is still
limited to 16 bits, despite that on the osd side this is no longer true.
I'm not familiar with what's going on on the kernel side, but if there's
a slight chance that we are indeed keeping the 'pg_num' on a 16-bit
variable, then that value must be capped to 65535. If that's not the
case and we're just limited by the pg's number/id, then I guess that
accepting up to 65636 would be fine (0..65535).
Just in case I'll look into this and further implications.
Thanks.
-Joao
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [EXTERNAL] Re: OSDMonitor: don't allow creation of pools with > 65535 pgs
2012-12-14 16:59 ` Joao Eduardo Luis
@ 2012-12-14 17:41 ` Jim Schutt
0 siblings, 0 replies; 4+ messages in thread
From: Jim Schutt @ 2012-12-14 17:41 UTC (permalink / raw)
To: Joao Eduardo Luis; +Cc: Greg Farnum, ceph-devel@vger.kernel.org
On 12/14/2012 09:59 AM, Joao Eduardo Luis wrote:
> On 12/14/2012 03:41 PM, Jim Schutt wrote:
>> Hi,
>>
>> I'm looking at commit e3ed28eb2 in the next branch,
>> and I have a question.
>>
>> Shouldn't the limit be pg_num > 65536, because
>> PGs are numbered 0 thru pg_num-1?
>>
>> If not, what am I missing?
>>
>> FWIW, up through yesterday I've been using the next branch and this:
>>
>> ceph osd pool set data pg_num 65536 --allow-experimental-feature
>> ceph osd pool set metadata pg_num 65536 --allow-experimental-feature
>> ceph osd pool set data pgp_num 65536 --allow-experimental-feature
>> ceph osd pool set metadata pgp_num 65536 --allow-experimental-feature
>>
>> using cephfs clients, and have seen no trouble with
>> misdirected ops, etc.
>>
>> -- Jim
>>
>
> Hi Jim,
>
> To the best of my knowledge, one of the things that triggered the
> required hard cap on the number of pgs was that the kernel side is
> still limited to 16 bits, despite that on the osd side this is no
> longer true.
>
I believe the culprit is the "ps" member of struct ceph_pg,
which stores what is eventually used as the PG id as __le16.
> I'm not familiar with what's going on on the kernel side, but if
> there's a slight chance that we are indeed keeping the 'pg_num' on a
> 16-bit variable, then that value must be capped to 65535. If that's
> not the case and we're just limited by the pg's number/id, then I
> guess that accepting up to 65636 would be fine (0..65535).
struct ceph_pg_pool in the kernel stores pg_num and friends
as __le32.
>
> Just in case I'll look into this and further implications.
Cool, thanks.
-- Jim
>
> Thanks.
>
> -Joao
>
>
>
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: OSDMonitor: don't allow creation of pools with > 65535 pgs
2012-12-14 15:41 OSDMonitor: don't allow creation of pools with > 65535 pgs Jim Schutt
2012-12-14 16:59 ` Joao Eduardo Luis
@ 2012-12-18 0:14 ` Joao Eduardo Luis
1 sibling, 0 replies; 4+ messages in thread
From: Joao Eduardo Luis @ 2012-12-18 0:14 UTC (permalink / raw)
To: Jim Schutt; +Cc: Greg Farnum, ceph-devel@vger.kernel.org
On 12/14/2012 03:41 PM, Jim Schutt wrote:
> Hi,
>
> I'm looking at commit e3ed28eb2 in the next branch,
> and I have a question.
>
> Shouldn't the limit be pg_num > 65536, because
> PGs are numbered 0 thru pg_num-1?
>
> If not, what am I missing?
>
> FWIW, up through yesterday I've been using the next branch and this:
>
> ceph osd pool set data pg_num 65536 --allow-experimental-feature
> ceph osd pool set metadata pg_num 65536 --allow-experimental-feature
> ceph osd pool set data pgp_num 65536 --allow-experimental-feature
> ceph osd pool set metadata pgp_num 65536 --allow-experimental-feature
>
> using cephfs clients, and have seen no trouble with
> misdirected ops, etc.
>
> -- Jim
>
Jim,
A new patch went into next that increases the default maximum pg_num to
65536, and also adds an option ('mon max pool pg num') that will allow
this value to be adjusted.
-Joao
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2012-12-18 0:20 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-12-14 15:41 OSDMonitor: don't allow creation of pools with > 65535 pgs Jim Schutt
2012-12-14 16:59 ` Joao Eduardo Luis
2012-12-14 17:41 ` [EXTERNAL] " Jim Schutt
2012-12-18 0:14 ` Joao Eduardo Luis
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.