Trigger to create PGs ?

* Trigger to create PGs ?
@ 2013-02-04 22:50 Yasuhiro Ohara
  2013-02-04 23:26 ` Yasuhiro Ohara
  0 siblings, 1 reply; 8+ messages in thread
From: Yasuhiro Ohara @ 2013-02-04 22:50 UTC (permalink / raw)
  To: ceph-devel

Hi,

I had an incorrect CRUSH map when I first started my system, and
even after fixing it manually, the PGs do not seem to be created
properly. Is there any way to trigger the system to start creating
the PGs again?

Here are my configurations:
ceph.conf: http://pastebin.com/EwwdQrf9
crush map: http://pastebin.com/UYNFvvQx
ceph osd tree: http://pastebin.com/u2Z4Hppn
ceph pg dump: http://pastebin.com/JfE146WJ

FYI, at first I had mistakenly put osd.0 in all host clauses.

regards,
Yasu


* Re: Trigger to create PGs ?
  2013-02-04 22:50 Trigger to create PGs ? Yasuhiro Ohara
@ 2013-02-04 23:26 ` Yasuhiro Ohara
  2013-02-04 23:53   ` Dan Mick
  2013-02-05  2:19   ` Sage Weil
  0 siblings, 2 replies; 8+ messages in thread
From: Yasuhiro Ohara @ 2013-02-04 23:26 UTC (permalink / raw)
  To: ceph-devel


Umm, I mean, my system is stuck like this:

   health HEALTH_WARN 1088 pgs stuck inactive; 1088 pgs stuck unclean
   monmap e1: 5 mons at {0=128.114.52.59:6789/0,1=128.114.52.67:6789/0,2=128.114.52.68:6789/0,3=128.114.52.69:6789/0,4=128.114.52.70:6789/0}, election epoch 72, quorum 0,1,2,3,4 0,1,2,3,4
   osdmap e295: 16 osds: 16 up, 16 in
    pgmap v83184: 3264 pgs: 1088 creating, 2176 active+clean; 672 GB data, 2042 GB used, 5107 GB / 7452 GB avail
   mdsmap e20: 1/1/1 up {0=1=up:active}, 4 up:standby

and I am asking how to bring it to a healthy state.

regards,
Yasu


* Re: Trigger to create PGs ?
  2013-02-04 23:26 ` Yasuhiro Ohara
@ 2013-02-04 23:53   ` Dan Mick
  2013-02-05  0:22     ` Yasuhiro Ohara
  2013-02-05  2:19   ` Sage Weil
  1 sibling, 1 reply; 8+ messages in thread
From: Dan Mick @ 2013-02-04 23:53 UTC (permalink / raw)
  To: Yasuhiro Ohara; +Cc: ceph-devel

On 02/04/2013 03:26 PM, Yasuhiro Ohara wrote:
> 3264 pgs: 1088 creating, 2176 active+clean

This means, I believe, that the cluster has healed 2176 of 3264 PGs, and 
is working on the remaining 1088.  You can use 'ceph -w' to observe the 
progress, but I think your cluster is backfilling the newly-configured 
OSDs as it should be.
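
For readers unfamiliar with the pgmap summary quoted above, the per-state counts can be pulled out mechanically. The following is a minimal Python sketch, not part of Ceph: the function name `parse_pg_summary` is hypothetical, and it assumes the summary keeps the `N pgs: count state, count state; ...` shape shown in this thread.

```python
import re

def parse_pg_summary(line):
    """Split a pgmap summary such as
    '3264 pgs: 1088 creating, 2176 active+clean; 672 GB data, ...'
    into (total, {state: count}). Hypothetical helper, not a Ceph API."""
    match = re.search(r"(\d+) pgs: ([^;]+)", line)
    if match is None:
        raise ValueError("no 'N pgs: ...' summary found")
    total = int(match.group(1))
    states = {}
    # Each comma-separated part looks like '1088 creating'.
    for part in match.group(2).split(","):
        count, state = part.strip().split(None, 1)
        states[state] = int(count)
    return total, states

# The pgmap line reported earlier in this thread.
line = ("pgmap v83184: 3264 pgs: 1088 creating, 2176 active+clean; "
        "672 GB data, 2042 GB used, 5107 GB / 7452 GB avail")
total, states = parse_pg_summary(line)

# Everything that is not active+clean deserves a closer look.
not_clean = {s: n for s, n in states.items() if s != "active+clean"}
print(total, states, not_clean)
```

Comparing the `creating` count across successive samples shows whether PG creation is actually progressing or has stalled.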


* Re: Trigger to create PGs ?
  2013-02-04 23:53   ` Dan Mick
@ 2013-02-05  0:22     ` Yasuhiro Ohara
  2013-02-05  0:43       ` Dan Mick
  0 siblings, 1 reply; 8+ messages in thread
From: Yasuhiro Ohara @ 2013-02-05  0:22 UTC (permalink / raw)
  To: dan.mick; +Cc: ceph-devel

Dan,

In fact, it has been in this state for more than a week.
ceph -w shows other progress (such as backfilling when OSDs go
down/up), but has not shown any progress on the 'creating' PGs.

regards,
Yasu


* Re: Trigger to create PGs ?
  2013-02-05  0:22     ` Yasuhiro Ohara
@ 2013-02-05  0:43       ` Dan Mick
  0 siblings, 0 replies; 8+ messages in thread
From: Dan Mick @ 2013-02-05  0:43 UTC (permalink / raw)
  To: Yasuhiro Ohara; +Cc: ceph-devel

I see now that there are a bunch of PGs in creating state; sorry for 
missing that.

I think you might try "ceph pg send_pg_creates" to kick the cluster in 
the head and get it to create those PGs.


* Re: Trigger to create PGs ?
  2013-02-04 23:26 ` Yasuhiro Ohara
  2013-02-04 23:53   ` Dan Mick
@ 2013-02-05  2:19   ` Sage Weil
  2013-02-05  2:36     ` Yasuhiro Ohara
  1 sibling, 1 reply; 8+ messages in thread
From: Sage Weil @ 2013-02-05  2:19 UTC (permalink / raw)
  To: Yasuhiro Ohara; +Cc: ceph-devel

All of the stuck pgs are in pool 2.  My guess is that that pool is
referencing a broken crush rule.

Your CRUSH map has min and max_size of 3 for all of the rules; it should 
be min 1 and max 10.  Probably the RBD pool is set to 3 replicas, which 
means it matches no existing CRUSH rule and you get no OSDs.  You can fix 
the CRUSH rules (that is a good idea anyway), or also change the pool 2 
(rbd) to 3x replication:

ceph osd pool set rbd size 3

sage
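
The mismatch described above can be illustrated with a small Python sketch. It is a simplified model of the idea that a pool only uses a rule whose min_size..max_size range contains the pool's replica count; it is not Ceph's actual implementation, and the names `CrushRule` and `find_rule` are hypothetical. The ruleset number 2 is an illustrative assumption, and the pool size of 2 follows the correction given later in this thread.

```python
from dataclasses import dataclass

@dataclass
class CrushRule:
    ruleset: int   # assumed identifier linking a pool to its rules
    min_size: int  # smallest replica count the rule accepts
    max_size: int  # largest replica count the rule accepts

def find_rule(rules, ruleset, pool_size):
    """Return the first rule in `ruleset` whose size range contains
    pool_size, or None when nothing matches (the PGs then get no OSDs)."""
    for rule in rules:
        if rule.ruleset == ruleset and rule.min_size <= pool_size <= rule.max_size:
            return rule
    return None

# Rules as described in the thread: min_size and max_size are both 3.
narrow = [CrushRule(ruleset=2, min_size=3, max_size=3)]
# Rules with the suggested range of 1 to 10.
wide = [CrushRule(ruleset=2, min_size=1, max_size=10)]

print(find_rule(narrow, 2, 2))  # pool size 2 against the narrow rule
print(find_rule(narrow, 2, 3))  # pool size 3 against the narrow rule
print(find_rule(wide, 2, 2))    # pool size 2 against the widened rule
```

Under this model, either remedy from the message works: raising the pool size to 3 brings it inside the narrow range, while widening the rule to 1..10 makes size 2 acceptable as well.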



* Re: Trigger to create PGs ?
  2013-02-05  2:19   ` Sage Weil
@ 2013-02-05  2:36     ` Yasuhiro Ohara
  2013-02-05  2:56       ` Dan Mick
  0 siblings, 1 reply; 8+ messages in thread
From: Yasuhiro Ohara @ 2013-02-05  2:36 UTC (permalink / raw)
  To: sage; +Cc: ceph-devel


Thanks Sage, it instantly fixed the problem.
:)

regards,
Yasu


* Re: Trigger to create PGs ?
  2013-02-05  2:36     ` Yasuhiro Ohara
@ 2013-02-05  2:56       ` Dan Mick
  0 siblings, 0 replies; 8+ messages in thread
From: Dan Mick @ 2013-02-05  2:56 UTC (permalink / raw)
  To: Yasuhiro Ohara; +Cc: sage, ceph-devel

(rbd was set to 2, which meant it didn't match, which I'm sure is what 
Sage meant.  Just correcting the record for those scoring at home.)


