All of lore.kernel.org
 help / color / mirror / Atom feed
* problem with removing osd
@ 2016-12-29 15:07 Łukasz Chrustek
  2016-12-29 19:10 ` Sage Weil
  0 siblings, 1 reply; 12+ messages in thread
From: Łukasz Chrustek @ 2016-12-29 15:07 UTC (permalink / raw)
  To: ceph-devel

Hi,

I was trying to delete 3 osds from cluster, deletion procces took very
long  time and I interrupted it. mon process then crushed, and in ceph
osd tree (after restart ceph-mon) I saw:

 ~]# ceph osd tree
ID         WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
        -7  16.89590 root ssd-disks
       -11         0     host ssd1
-231707408         0
     22100         0         osd.22100        DNE        0
        71         0         osd.71           DNE        0


when I tried to delete osd.22100:

[root@cc1 ~]# ceph osd crush remove osd.22100
device 'osd.22100' does not appear in the crush map

then I tried to delete osd.71 and mon proccess crushed:

[root@cc1 ~]# ceph osd crush remove osd.71
2016-12-28 17:52:34.459668 7f426a862700  0 monclient: hunting for new mon

after restart of ceph-mon in ceph osd tree it shows:

# ceph osd tree
ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
       -7  16.89590 root ssd-disks
      -11         0     host ssd1
598798032         0         osd.598798032     DNE        0
    21940         0         osd.21940         DNE        0
       71         0         osd.71            DNE        0

My question is how to delete this osds without direct editing crushmap
? It is production system, I can't affort any service interruption :(,
when I try to ceph osd crush remove then ceph-mon crushes....

I  dumped  crushmap,  but it took 19G (!!) after decompiling (compiled
file  is  very small). So, I cleaned this file with perl (it take very
long  time), and I have now small txt crushmap, which I edited. But is
there  any  chance  that ceph will still remember somewhere about this
huge  numbers  for osds ? Is it safe to apply this cleaned crushmap to
cluster ?

-- 
Regards
Luk


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 15:07 problem with removing osd Łukasz Chrustek
@ 2016-12-29 19:10 ` Sage Weil
  2016-12-29 20:20   ` Łukasz Chrustek
  0 siblings, 1 reply; 12+ messages in thread
From: Sage Weil @ 2016-12-29 19:10 UTC (permalink / raw)
  To: Łukasz Chrustek; +Cc: ceph-devel

[-- Attachment #1: Type: TEXT/PLAIN, Size: 2244 bytes --]

On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
> Hi,
> 
> I was trying to delete 3 osds from cluster, deletion procces took very
> long  time and I interrupted it. mon process then crushed, and in ceph
> osd tree (after restart ceph-mon) I saw:
> 
>  ~]# ceph osd tree
> ID         WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
>         -7  16.89590 root ssd-disks
>        -11         0     host ssd1
> -231707408         0
>      22100         0         osd.22100        DNE        0
>         71         0         osd.71           DNE        0
> 
> 
> when I tried to delete osd.22100:
> 
> [root@cc1 ~]# ceph osd crush remove osd.22100
> device 'osd.22100' does not appear in the crush map
> 
> then I tried to delete osd.71 and mon proccess crushed:
> 
> [root@cc1 ~]# ceph osd crush remove osd.71
> 2016-12-28 17:52:34.459668 7f426a862700  0 monclient: hunting for new mon
> 
> after restart of ceph-mon in ceph osd tree it shows:
> 
> # ceph osd tree
> ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
>        -7  16.89590 root ssd-disks
>       -11         0     host ssd1
> 598798032         0         osd.598798032     DNE        0

Yikes!

>     21940         0         osd.21940         DNE        0
>        71         0         osd.71            DNE        0
> 
> My question is how to delete this osds without direct editing crushmap
> ? It is production system, I can't affort any service interruption :(,
> when I try to ceph osd crush remove then ceph-mon crushes....
> 
> I  dumped  crushmap,  but it took 19G (!!) after decompiling (compiled
> file  is  very small). So, I cleaned this file with perl (it take very
> long  time), and I have now small txt crushmap, which I edited. But is
> there  any  chance  that ceph will still remember somewhere about this
> huge  numbers  for osds ? Is it safe to apply this cleaned crushmap to
> cluster ?

It sounds like the problem is the OSDMap, not CRUSH per se.  Can you 
attach the output from 'ceph osd dump -f json-pretty'?

Do you know how osd.598798032 got created?  Or osd.21940 for that matter.  
OSD ids should be small since they are stored internally by OSDMap as a 
vector.  This is probably why your mon is crashing.

sage

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 19:10 ` Sage Weil
@ 2016-12-29 20:20   ` Łukasz Chrustek
  2016-12-29 20:26     ` Sage Weil
  0 siblings, 1 reply; 12+ messages in thread
From: Łukasz Chrustek @ 2016-12-29 20:20 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Hi,


>> 
>> # ceph osd tree
>> ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
>>        -7  16.89590 root ssd-disks
>>       -11         0     host ssd1
>> 598798032         0         osd.598798032     DNE        0

> Yikes!

Yes... indeed, I don't like this number too...

>>     21940         0         osd.21940         DNE        0
>>        71         0         osd.71            DNE        0
>> 
>> My question is how to delete this osds without direct editing crushmap
>> ? It is production system, I can't affort any service interruption :(,
>> when I try to ceph osd crush remove then ceph-mon crushes....
>> 
>> I  dumped  crushmap,  but it took 19G (!!) after decompiling (compiled
>> file  is  very small). So, I cleaned this file with perl (it take very
>> long  time), and I have now small txt crushmap, which I edited. But is
>> there  any  chance  that ceph will still remember somewhere about this
>> huge  numbers  for osds ? Is it safe to apply this cleaned crushmap to
>> cluster ?

> It sounds like the problem is the OSDMap, not CRUSH per se.  Can you 
> attach the output from 'ceph osd dump -f json-pretty'?

It's quite big so I put it on pastebin:

http://pastebin.com/Unkk2Pa7

> Do you know how osd.598798032 got created?  Or osd.21940 for that matter.
> OSD ids should be small since they are stored internally by OSDMap as a
> vector.  This is probably why your mon is crashing.

[root@cc1 /etc/ceph]# ceph osd tree
ID  WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
 -7  16.89590 root ssd-intel-s3700
-11         0     host ssd-stor1
 69         0         osd.69          down        0          1.00000
 70         0         osd.70          down        0          1.00000
 71         0         osd.71          down        0          1.00000


This the moment, when it happend:
]# for i in `seq 69 71`;do ceph osd crush remove osd.$i;done
removed item id 69 name 'osd.69' from crush map


removed item id 70 name 'osd.70' from crush map

here i press ctrl+c

2016-12-28 17:38:10.055239 7f4576d7a700  0 monclient: hunting for new mon
2016-12-28 17:38:10.055582 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f456c023190 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f456c024470).fault
2016-12-28 17:38:30.550622 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.1:6789/0 pipe(0x7f45600008c0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4560001df0).fault
2016-12-28 17:38:54.551031 7f4574474700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f45600046c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f45600042b0).fault

after restart of ceph-mon:

]# ceph osd tree
ID         WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
        -7  16.89590 root ssd-intel-s3700
       -11         0     host ssd-stor1
-231707408         0
     22100         0         osd.22100        DNE        0
        71         0         osd.71           DNE        0

and later:

[root@cc1 ~]# ceph osd crush remove osd.22100
device 'osd.22100' does not appear in the crush map
[root@cc1 ~]# ceph osd crush remove osd.71
2016-12-28 17:52:34.459668 7f426a862700  0 monclient: hunting for new mon
2016-12-28 17:52:55.238418 7f426a862700  0 monclient: hunting for new mon
2016-12-28 17:52:55.238680 7f4262ebc700  0 -- 192.168.128.1:0/692048545 >> 192.168.128.2:6789/0 pipe(0x7f4254028300 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4254026800).fault

and after another restart of ceph-mon:

]# ceph osd tree
ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
       -7  16.89590 root ssd-intel-s3700
      -11         0     host ssd-stor1
598798032         0         osd.598798032     DNE        0
    21940         0         osd.21940         DNE        0
       71         0         osd.71            DNE        0




-- 
Regards
Luk


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 20:20   ` Łukasz Chrustek
@ 2016-12-29 20:26     ` Sage Weil
  2016-12-29 20:46       ` Łukasz Chrustek
  0 siblings, 1 reply; 12+ messages in thread
From: Sage Weil @ 2016-12-29 20:26 UTC (permalink / raw)
  To: Łukasz Chrustek; +Cc: ceph-devel

[-- Attachment #1: Type: TEXT/PLAIN, Size: 4393 bytes --]

On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
> Hi,
> 
> 
> >> 
> >> # ceph osd tree
> >> ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
> >>        -7  16.89590 root ssd-disks
> >>       -11         0     host ssd1
> >> 598798032         0         osd.598798032     DNE        0
> 
> > Yikes!
> 
> Yes... indeed, I don't like this number too...
> 
> >>     21940         0         osd.21940         DNE        0
> >>        71         0         osd.71            DNE        0
> >> 
> >> My question is how to delete this osds without direct editing crushmap
> >> ? It is production system, I can't affort any service interruption :(,
> >> when I try to ceph osd crush remove then ceph-mon crushes....
> >> 
> >> I  dumped  crushmap,  but it took 19G (!!) after decompiling (compiled
> >> file  is  very small). So, I cleaned this file with perl (it take very
> >> long  time), and I have now small txt crushmap, which I edited. But is
> >> there  any  chance  that ceph will still remember somewhere about this
> >> huge  numbers  for osds ? Is it safe to apply this cleaned crushmap to
> >> cluster ?
> 
> > It sounds like the problem is the OSDMap, not CRUSH per se.  Can you 
> > attach the output from 'ceph osd dump -f json-pretty'?
> 
> It's quite big so I put it on pastebin:
> 
> http://pastebin.com/Unkk2Pa7
> 
> > Do you know how osd.598798032 got created?  Or osd.21940 for that matter.
> > OSD ids should be small since they are stored internally by OSDMap as a
> > vector.  This is probably why your mon is crashing.
> 
> [root@cc1 /etc/ceph]# ceph osd tree
> ID  WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
>  -7  16.89590 root ssd-intel-s3700
> -11         0     host ssd-stor1
>  69         0         osd.69          down        0          1.00000
>  70         0         osd.70          down        0          1.00000
>  71         0         osd.71          down        0          1.00000
> 
> 
> This the moment, when it happend:
> ]# for i in `seq 69 71`;do ceph osd crush remove osd.$i;done
> removed item id 69 name 'osd.69' from crush map
> 
> 
> removed item id 70 name 'osd.70' from crush map
> 
> here i press ctrl+c
> 
> 2016-12-28 17:38:10.055239 7f4576d7a700  0 monclient: hunting for new mon
> 2016-12-28 17:38:10.055582 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f456c023190 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f456c024470).fault
> 2016-12-28 17:38:30.550622 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.1:6789/0 pipe(0x7f45600008c0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4560001df0).fault
> 2016-12-28 17:38:54.551031 7f4574474700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f45600046c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f45600042b0).fault

What version is this?

Can you attach the crush map too?  (ceph osd crush dump -f json-pretty)

Thanks!
sage

> after restart of ceph-mon:
> 
> ]# ceph osd tree
> ID         WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
>         -7  16.89590 root ssd-intel-s3700
>        -11         0     host ssd-stor1
> -231707408         0
>      22100         0         osd.22100        DNE        0
>         71         0         osd.71           DNE        0
> 
> and later:
> 
> [root@cc1 ~]# ceph osd crush remove osd.22100
> device 'osd.22100' does not appear in the crush map
> [root@cc1 ~]# ceph osd crush remove osd.71
> 2016-12-28 17:52:34.459668 7f426a862700  0 monclient: hunting for new mon
> 2016-12-28 17:52:55.238418 7f426a862700  0 monclient: hunting for new mon
> 2016-12-28 17:52:55.238680 7f4262ebc700  0 -- 192.168.128.1:0/692048545 >> 192.168.128.2:6789/0 pipe(0x7f4254028300 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4254026800).fault
> 
> and after another restart of ceph-mon:
> 
> ]# ceph osd tree
> ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
>        -7  16.89590 root ssd-intel-s3700
>       -11         0     host ssd-stor1
> 598798032         0         osd.598798032     DNE        0
>     21940         0         osd.21940         DNE        0
>        71         0         osd.71            DNE        0
> 
> 
> 
> 
> -- 
> Regards
> Luk
> 
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 20:26     ` Sage Weil
@ 2016-12-29 20:46       ` Łukasz Chrustek
  2016-12-29 20:49         ` Sage Weil
  0 siblings, 1 reply; 12+ messages in thread
From: Łukasz Chrustek @ 2016-12-29 20:46 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Cześć,

> On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
>> Hi,
>> 
>> 
>> >> 
>> >> # ceph osd tree
>> >> ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
>> >>        -7  16.89590 root ssd-disks
>> >>       -11         0     host ssd1
>> >> 598798032         0         osd.598798032     DNE        0
>> 
>> > Yikes!
>> 
>> Yes... indeed, I don't like this number too...
>> 
>> >>     21940         0         osd.21940         DNE        0
>> >>        71         0         osd.71            DNE        0
>> >> 
>> >> My question is how to delete this osds without direct editing crushmap
>> >> ? It is production system, I can't affort any service interruption :(,
>> >> when I try to ceph osd crush remove then ceph-mon crushes....
>> >> 
>> >> I  dumped  crushmap,  but it took 19G (!!) after decompiling (compiled
>> >> file  is  very small). So, I cleaned this file with perl (it take very
>> >> long  time), and I have now small txt crushmap, which I edited. But is
>> >> there  any  chance  that ceph will still remember somewhere about this
>> >> huge  numbers  for osds ? Is it safe to apply this cleaned crushmap to
>> >> cluster ?
>> 
>> > It sounds like the problem is the OSDMap, not CRUSH per se.  Can you 
>> > attach the output from 'ceph osd dump -f json-pretty'?
>> 
>> It's quite big so I put it on pastebin:
>> 
>> http://pastebin.com/Unkk2Pa7
>> 
>> > Do you know how osd.598798032 got created?  Or osd.21940 for that matter.
>> > OSD ids should be small since they are stored internally by OSDMap as a
>> > vector.  This is probably why your mon is crashing.
>> 
>> [root@cc1 /etc/ceph]# ceph osd tree
>> ID  WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
>>  -7  16.89590 root ssd-intel-s3700
>> -11         0     host ssd-stor1
>>  69         0         osd.69          down        0          1.00000
>>  70         0         osd.70          down        0          1.00000
>>  71         0         osd.71          down        0          1.00000
>> 
>> 
>> This the moment, when it happend:
>> ]# for i in `seq 69 71`;do ceph osd crush remove osd.$i;done
>> removed item id 69 name 'osd.69' from crush map
>> 
>> 
>> removed item id 70 name 'osd.70' from crush map
>> 
>> here i press ctrl+c
>> 
>> 2016-12-28 17:38:10.055239 7f4576d7a700  0 monclient: hunting for new mon
>> 2016-12-28 17:38:10.055582 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f456c023190 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f456c024470).fault
>> 2016-12-28 17:38:30.550622 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.1:6789/0 pipe(0x7f45600008c0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4560001df0).fault
>> 2016-12-28 17:38:54.551031 7f4574474700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f45600046c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f45600042b0).fault

> What version is this?

infernalis

> Can you attach the crush map too?  (ceph osd crush dump -f json-pretty)

I can't - ceph-mons are crushing on diffrent ceph-mon hosts:

]# ceph osd crush dump -f json-pretty

2016-12-29 21:42:47.172097 7f0c36061700  0 monclient: hunting for new mon
^CTraceback (most recent call last):
  File "/usr/bin/ceph", line 948, in <module>
    retval = main()
  File "/usr/bin/ceph", line 881, in main
    sigdict, inbuf, verbose)
  File "/usr/bin/ceph", line 479, in new_style_command
    inbuf=inbuf)
  File "/usr/lib/python2.7/dist-packages/ceph_argparse.py", line 1291, in json_command
    raise RuntimeError('"{0}": exception {1}'.format(argdict, e))
RuntimeError: "{'prefix': u'osd crush dump', 'format': 'json-pretty'}": exception "['{"prefix": "osd crush dump", "format": "json-pretty"}']": exception 'int' object is not iterable

-- 
Pozdrowienia,
 Łukasz Chrustek


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 20:46       ` Łukasz Chrustek
@ 2016-12-29 20:49         ` Sage Weil
  2016-12-29 20:55           ` Łukasz Chrustek
  0 siblings, 1 reply; 12+ messages in thread
From: Sage Weil @ 2016-12-29 20:49 UTC (permalink / raw)
  To: Łukasz Chrustek; +Cc: ceph-devel

[-- Attachment #1: Type: TEXT/PLAIN, Size: 3336 bytes --]

On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
> Cześć,
> 
> > On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
> >> Hi,
> >> 
> >> 
> >> >> 
> >> >> # ceph osd tree
> >> >> ID        WEIGHT    TYPE NAME             UP/DOWN REWEIGHT PRIMARY-AFFINITY
> >> >>        -7  16.89590 root ssd-disks
> >> >>       -11         0     host ssd1
> >> >> 598798032         0         osd.598798032     DNE        0
> >> 
> >> > Yikes!
> >> 
> >> Yes... indeed, I don't like this number too...
> >> 
> >> >>     21940         0         osd.21940         DNE        0
> >> >>        71         0         osd.71            DNE        0
> >> >> 
> >> >> My question is how to delete this osds without direct editing crushmap
> >> >> ? It is production system, I can't affort any service interruption :(,
> >> >> when I try to ceph osd crush remove then ceph-mon crushes....
> >> >> 
> >> >> I  dumped  crushmap,  but it took 19G (!!) after decompiling (compiled
> >> >> file  is  very small). So, I cleaned this file with perl (it take very
> >> >> long  time), and I have now small txt crushmap, which I edited. But is
> >> >> there  any  chance  that ceph will still remember somewhere about this
> >> >> huge  numbers  for osds ? Is it safe to apply this cleaned crushmap to
> >> >> cluster ?
> >> 
> >> > It sounds like the problem is the OSDMap, not CRUSH per se.  Can you 
> >> > attach the output from 'ceph osd dump -f json-pretty'?
> >> 
> >> It's quite big so I put it on pastebin:
> >> 
> >> http://pastebin.com/Unkk2Pa7
> >> 
> >> > Do you know how osd.598798032 got created?  Or osd.21940 for that matter.
> >> > OSD ids should be small since they are stored internally by OSDMap as a
> >> > vector.  This is probably why your mon is crashing.
> >> 
> >> [root@cc1 /etc/ceph]# ceph osd tree
> >> ID  WEIGHT    TYPE NAME            UP/DOWN REWEIGHT PRIMARY-AFFINITY
> >>  -7  16.89590 root ssd-intel-s3700
> >> -11         0     host ssd-stor1
> >>  69         0         osd.69          down        0          1.00000
> >>  70         0         osd.70          down        0          1.00000
> >>  71         0         osd.71          down        0          1.00000
> >> 
> >> 
> >> This the moment, when it happend:
> >> ]# for i in `seq 69 71`;do ceph osd crush remove osd.$i;done
> >> removed item id 69 name 'osd.69' from crush map
> >> 
> >> 
> >> removed item id 70 name 'osd.70' from crush map
> >> 
> >> here i press ctrl+c
> >> 
> >> 2016-12-28 17:38:10.055239 7f4576d7a700  0 monclient: hunting for new mon
> >> 2016-12-28 17:38:10.055582 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f456c023190 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f456c024470).fault
> >> 2016-12-28 17:38:30.550622 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.1:6789/0 pipe(0x7f45600008c0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4560001df0).fault
> >> 2016-12-28 17:38:54.551031 7f4574474700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f45600046c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f45600042b0).fault
> 
> > What version is this?
> 
> infernalis
> 
> > Can you attach the crush map too?  (ceph osd crush dump -f json-pretty)
> 
> I can't - ceph-mons are crushing on diffrent ceph-mon hosts:
> 
> ]# ceph osd crush dump -f json-pretty

Hmm, in that case, 'ceph osd getcrushmap -o cm' and post that somewhere?

sage

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 20:49         ` Sage Weil
@ 2016-12-29 20:55           ` Łukasz Chrustek
  2016-12-29 21:41             ` Sage Weil
  0 siblings, 1 reply; 12+ messages in thread
From: Łukasz Chrustek @ 2016-12-29 20:55 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

[-- Attachment #1: Type: text/plain, Size: 1045 bytes --]

Cześć,


>> >> 
>> >> here i press ctrl+c
>> >> 
>> >> 2016-12-28 17:38:10.055239 7f4576d7a700  0 monclient: hunting for new mon
>> >> 2016-12-28 17:38:10.055582 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f456c023190 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f456c024470).fault
>> >> 2016-12-28 17:38:30.550622 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.1:6789/0 pipe(0x7f45600008c0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4560001df0).fault
>> >> 2016-12-28 17:38:54.551031 7f4574474700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f45600046c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f45600042b0).fault
>> 
>> > What version is this?
>> 
>> infernalis
>> 
>> > Can you attach the crush map too?  (ceph osd crush dump -f json-pretty)
>> 
>> I can't - ceph-mons are crushing on diffrent ceph-mon hosts:
>> 
>> ]# ceph osd crush dump -f json-pretty

> Hmm, in that case, 'ceph osd getcrushmap -o cm' and post that somewhere?

In attachment.


> sage



-- 
Regards
Luk

[-- Attachment #2: cm --]
[-- Type: application/octet-stream, Size: 3559 bytes --]

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 20:55           ` Łukasz Chrustek
@ 2016-12-29 21:41             ` Sage Weil
  2016-12-29 22:02               ` Łukasz Chrustek
  0 siblings, 1 reply; 12+ messages in thread
From: Sage Weil @ 2016-12-29 21:41 UTC (permalink / raw)
  To: Łukasz Chrustek; +Cc: ceph-devel

[-- Attachment #1: Type: TEXT/PLAIN, Size: 1660 bytes --]

On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
> Cześć,
> 
> 
> >> >> 
> >> >> here i press ctrl+c
> >> >> 
> >> >> 2016-12-28 17:38:10.055239 7f4576d7a700  0 monclient: hunting for new mon
> >> >> 2016-12-28 17:38:10.055582 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f456c023190 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f456c024470).fault
> >> >> 2016-12-28 17:38:30.550622 7f4574233700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.1:6789/0 pipe(0x7f45600008c0 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f4560001df0).fault
> >> >> 2016-12-28 17:38:54.551031 7f4574474700  0 -- 192.168.128.1:0/1201679761 >> 192.168.128.2:6789/0 pipe(0x7f45600046c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f45600042b0).fault
> >> 
> >> > What version is this?
> >> 
> >> infernalis
> >> 
> >> > Can you attach the crush map too?  (ceph osd crush dump -f json-pretty)
> >> 
> >> I can't - ceph-mons are crushing on diffrent ceph-mon hosts:
> >> 
> >> ]# ceph osd crush dump -f json-pretty
> 
> > Hmm, in that case, 'ceph osd getcrushmap -o cm' and post that somewhere?
> 
> In attachment.

The encoded tree bucket -11 had bad values.  I don't really trust the tree 
bucket code in crush... it's not well tested (and is a poor balance 
computation and efficiency anyway).  We should probably try to remove tree 
entirely.

I've attached a fixed map that you can inject with

 ceph osd setcrushmap -i <filename>

Bucket -11 is now empty; not sure what was supposed to be in it.

I suggest switching all of your tree buckets over to straw2 as soon as 
possible.  Note that this will result in some rebalancing.  You could do 
it one bucket a time if that's concerning.

sage

[-- Attachment #2: Type: APPLICATION/octet-stream, Size: 3627 bytes --]

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 21:41             ` Sage Weil
@ 2016-12-29 22:02               ` Łukasz Chrustek
  2016-12-29 22:18                 ` Sage Weil
  0 siblings, 1 reply; 12+ messages in thread
From: Łukasz Chrustek @ 2016-12-29 22:02 UTC (permalink / raw)
  To: Sage Weil; +Cc: ceph-devel

Hi,

Thank  You  very  much for analize and the file ! I had similar :) but
wasn't sure if it wan't distroy something in cluster.

> The encoded tree bucket -11 had bad values.  I don't really trust the tree
> bucket code in crush... it's not well tested (and is a poor balance 
> computation and efficiency anyway).  We should probably try to remove tree
> entirely.

> I've attached a fixed map that you can inject with

>  ceph osd setcrushmap -i <filename>

Now it works, and also ceph osd crush dump -f json-pretty runs OK.

> Bucket -11 is now empty; not sure what was supposed to be in it.

this server will be reinstalled, there where three osds.

> I suggest switching all of your tree buckets over to straw2 as soon as
> possible.  Note that this will result in some rebalancing.  You could do
> it one bucket a time if that's concerning.

OK,  changing  alg  to straw2 will rebalance ale PGs on all nodes ?
-- 
Regards,
Luk


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 22:02               ` Łukasz Chrustek
@ 2016-12-29 22:18                 ` Sage Weil
  2016-12-29 23:13                   ` Brad Hubbard
  0 siblings, 1 reply; 12+ messages in thread
From: Sage Weil @ 2016-12-29 22:18 UTC (permalink / raw)
  To: Łukasz Chrustek; +Cc: ceph-devel

[-- Attachment #1: Type: TEXT/PLAIN, Size: 1366 bytes --]

On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
> Hi,
> 
> Thank  You  very  much for analize and the file ! I had similar :) but
> wasn't sure if it wan't distroy something in cluster.
> 
> > The encoded tree bucket -11 had bad values.  I don't really trust the tree
> > bucket code in crush... it's not well tested (and is a poor balance 
> > computation and efficiency anyway).  We should probably try to remove tree
> > entirely.
> 
> > I've attached a fixed map that you can inject with
> 
> >  ceph osd setcrushmap -i <filename>
> 
> Now it works, and also ceph osd crush dump -f json-pretty runs OK.

Great news!
 
> > Bucket -11 is now empty; not sure what was supposed to be in it.
> 
> this server will be reinstalled, there where three osds.
> 
> > I suggest switching all of your tree buckets over to straw2 as soon as
> > possible.  Note that this will result in some rebalancing.  You could do
> > it one bucket a time if that's concerning.
> 
> OK,  changing  alg  to straw2 will rebalance ale PGs on all nodes ?

For any bucket you change from tree -> straw2, you'll see PGs shuffle 
between the children of that bucket.  So for hosts, you'll see data move 
between the disks.  And fixing ssd-intel-s3700 will shuffle data between 
hosts.

I'd also switch the straw buckets to straw2, although that will move a 
comparatively small amount of data.

sage

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 22:18                 ` Sage Weil
@ 2016-12-29 23:13                   ` Brad Hubbard
  2016-12-29 23:33                     ` Łukasz Chrustek
  0 siblings, 1 reply; 12+ messages in thread
From: Brad Hubbard @ 2016-12-29 23:13 UTC (permalink / raw)
  To: Sage Weil; +Cc: Łukasz Chrustek, ceph-devel

Have you thought through your use of '"min_size": 1' on many of these pools?


See recent discussions on this mailing list on the subject for background.

Just FYI.


On Fri, Dec 30, 2016 at 8:18 AM, Sage Weil <sage@newdream.net> wrote:
> On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
>> Hi,
>>
>> Thank  You  very  much for analize and the file ! I had similar :) but
>> wasn't sure if it wan't distroy something in cluster.
>>
>> > The encoded tree bucket -11 had bad values.  I don't really trust the tree
>> > bucket code in crush... it's not well tested (and is a poor balance
>> > computation and efficiency anyway).  We should probably try to remove tree
>> > entirely.
>>
>> > I've attached a fixed map that you can inject with
>>
>> >  ceph osd setcrushmap -i <filename>
>>
>> Now it works, and also ceph osd crush dump -f json-pretty runs OK.
>
> Great news!
>
>> > Bucket -11 is now empty; not sure what was supposed to be in it.
>>
>> this server will be reinstalled, there where three osds.
>>
>> > I suggest switching all of your tree buckets over to straw2 as soon as
>> > possible.  Note that this will result in some rebalancing.  You could do
>> > it one bucket a time if that's concerning.
>>
>> OK,  changing  alg  to straw2 will rebalance ale PGs on all nodes ?
>
> For any bucket you change from tree -> straw2, you'll see PGs shuffle
> between the children of that bucket.  So for hosts, you'll see data move
> between the disks.  And fixing ssd-intel-s3700 will shuffle data between
> hosts.
>
> I'd also switch the straw buckets to straw2, although that will move a
> comparatively small amount of data.
>
> sage



-- 
Cheers,
Brad

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: problem with removing osd
  2016-12-29 23:13                   ` Brad Hubbard
@ 2016-12-29 23:33                     ` Łukasz Chrustek
  0 siblings, 0 replies; 12+ messages in thread
From: Łukasz Chrustek @ 2016-12-29 23:33 UTC (permalink / raw)
  To: Brad Hubbard, Sage Weil; +Cc: ceph-devel

Hi,

Thank You for suggestion - I will read about it.

Regards
Luk

> Have you thought through your use of '"min_size": 1' on many of these pools?


> See recent discussions on this mailing list on the subject for background.

> Just FYI.


> On Fri, Dec 30, 2016 at 8:18 AM, Sage Weil <sage@newdream.net> wrote:
>> On Thu, 29 Dec 2016, Łukasz Chrustek wrote:
>>> Hi,
>>>
>>> Thank  You  very  much for analize and the file ! I had similar :) but
>>> wasn't sure if it wan't distroy something in cluster.
>>>
>>> > The encoded tree bucket -11 had bad values.  I don't really trust the tree
>>> > bucket code in crush... it's not well tested (and is a poor balance
>>> > computation and efficiency anyway).  We should probably try to remove tree
>>> > entirely.
>>>
>>> > I've attached a fixed map that you can inject with
>>>
>>> >  ceph osd setcrushmap -i <filename>
>>>
>>> Now it works, and also ceph osd crush dump -f json-pretty runs OK.
>>
>> Great news!
>>
>>> > Bucket -11 is now empty; not sure what was supposed to be in it.
>>>
>>> this server will be reinstalled, there where three osds.
>>>
>>> > I suggest switching all of your tree buckets over to straw2 as soon as
>>> > possible.  Note that this will result in some rebalancing.  You could do
>>> > it one bucket a time if that's concerning.
>>>
>>> OK,  changing  alg  to straw2 will rebalance ale PGs on all nodes ?
>>
>> For any bucket you change from tree -> straw2, you'll see PGs shuffle
>> between the children of that bucket.  So for hosts, you'll see data move
>> between the disks.  And fixing ssd-intel-s3700 will shuffle data between
>> hosts.
>>
>> I'd also switch the straw buckets to straw2, although that will move a
>> comparatively small amount of data.
>>
>> sage






-- 
Pozdrowienia,
 Łukasz Chrustek


^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2016-12-29 23:33 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-12-29 15:07 problem with removing osd Łukasz Chrustek
2016-12-29 19:10 ` Sage Weil
2016-12-29 20:20   ` Łukasz Chrustek
2016-12-29 20:26     ` Sage Weil
2016-12-29 20:46       ` Łukasz Chrustek
2016-12-29 20:49         ` Sage Weil
2016-12-29 20:55           ` Łukasz Chrustek
2016-12-29 21:41             ` Sage Weil
2016-12-29 22:02               ` Łukasz Chrustek
2016-12-29 22:18                 ` Sage Weil
2016-12-29 23:13                   ` Brad Hubbard
2016-12-29 23:33                     ` Łukasz Chrustek

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.