From: "Łukasz Chrustek" <skidoo@tlen.pl>
To: Sage Weil <sage@newdream.net>
Cc: ceph-devel@vger.kernel.org
Subject: Re: Problem with query and any operation on PGs
Date: Wed, 24 May 2017 15:19:52 +0200 [thread overview]
Message-ID: <135176900.20170524151952@tlen.pl> (raw)
In-Reply-To: <alpine.DEB.2.11.1705232146500.3646@piezo.novalocal>
Cześć,
> On Tue, 23 May 2017, Łukasz Chrustek wrote:
>> Cześć,
>>
>> > On Tue, 23 May 2017, Łukasz Chrustek wrote:
>> >> I'm not sleeping for over 30 hours, and still can't find solution. I
>> >> did, as You wrote, but turning off this
>> >> (https://pastebin.com/1npBXeMV) osds didn't resolve issue...
>>
>> > The important bit is:
>>
>> > "blocked": "peering is blocked due to down osds",
>> > "down_osds_we_would_probe": [
>> > 6,
>> > 10,
>> > 33,
>> > 37,
>> > 72
>> > ],
>> > "peering_blocked_by": [
>> > {
>> > "osd": 6,
>> > "current_lost_at": 0,
>> > "comment": "starting or marking this osd lost may let
>> > us proceed"
>> > },
>> > {
>> > "osd": 10,
>> > "current_lost_at": 0,
>> > "comment": "starting or marking this osd lost may let
>> > us proceed"
>> > },
>> > {
>> > "osd": 37,
>> > "current_lost_at": 0,
>> > "comment": "starting or marking this osd lost may let
>> > us proceed"
>> > },
>> > {
>> > "osd": 72,
>> > "current_lost_at": 113771,
>> > "comment": "starting or marking this osd lost may let
>> > us proceed"
>> > }
>> > ]
>> > },
>>
>> > Are any of those OSDs startable?
>>
>> They were all up and running - but I decided to shut them down and out
>> them from ceph, now it looks like ceph working ok, but still two PGs
>> are in down state, how to get rid of it ?
> If you haven't deleted the data, you should start the OSDs back up.
> If they are partially damanged you can use ceph-objectstore-tool to
> extract just the PGs in question to make sure you haven't lost anything,
> inject them on some other OSD(s) and restart those, and *then* mark the
> bad OSDs as 'lost'.
> If all else fails, you can just mark those OSDs 'lost', but in doing so
> you might be telling the cluster to lose data.
> The best thing to do is definitely to get those OSDs started again.
Now situation looks like this:
[root@cc1 ~]# rbd info volumes/volume-ccc5d976-cecf-4938-a452-1bee6188987b
rbd image 'volume-ccc5d976-cecf-4938-a452-1bee6188987b':
size 500 GB in 128000 objects
order 22 (4096 kB objects)
block_name_prefix: rbd_data.ed9d394a851426
format: 2
features: layering
flags:
[root@cc1 ~]# rados -p volumes ls | grep rbd_data.ed9d394a851426
(output cutted)
rbd_data.ed9d394a851426.000000000000447c
rbd_data.ed9d394a851426.0000000000010857
rbd_data.ed9d394a851426.000000000000ec8b
rbd_data.ed9d394a851426.000000000000fa43
rbd_data.ed9d394a851426.000000000001ef2d
^C
it hangs on this object and isn't going further. rbd cp also hangs...
rbd map - also...
can You advice what can be solution for this case ?
--
Regards,
Łukasz Chrustek
next prev parent reply other threads:[~2017-05-24 13:30 UTC|newest]
Thread overview: 35+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <175484591.20170523135449@tlen.pl>
2017-05-23 12:48 ` Problem with query and any operation on PGs Łukasz Chrustek
2017-05-23 14:17 ` Sage Weil
2017-05-23 14:43 ` Łukasz Chrustek
[not found] ` <1464688590.20170523185052@tlen.pl>
2017-05-23 17:40 ` Sage Weil
2017-05-23 21:43 ` Łukasz Chrustek
2017-05-23 21:48 ` Sage Weil
2017-05-24 13:19 ` Łukasz Chrustek [this message]
2017-05-24 13:37 ` Sage Weil
2017-05-24 13:58 ` Łukasz Chrustek
2017-05-24 14:02 ` Sage Weil
2017-05-24 14:18 ` Łukasz Chrustek
2017-05-24 14:47 ` Sage Weil
2017-05-24 15:00 ` Łukasz Chrustek
2017-05-24 15:07 ` Łukasz Chrustek
2017-05-24 15:11 ` Sage Weil
2017-05-24 15:24 ` Łukasz Chrustek
2017-05-24 15:54 ` Łukasz Chrustek
2017-05-24 16:02 ` Łukasz Chrustek
2017-05-24 17:07 ` Łukasz Chrustek
2017-05-24 17:16 ` Sage Weil
2017-05-24 17:28 ` Łukasz Chrustek
2017-05-24 18:16 ` Sage Weil
2017-05-24 19:47 ` Łukasz Chrustek
2017-05-24 17:30 ` Łukasz Chrustek
2017-05-24 17:35 ` Łukasz Chrustek
2017-05-24 21:38 ` Łukasz Chrustek
2017-05-24 21:53 ` Sage Weil
2017-05-24 22:09 ` Łukasz Chrustek
2017-05-24 22:27 ` Sage Weil
2017-05-24 22:46 ` Łukasz Chrustek
2017-05-25 2:06 ` Sage Weil
2017-05-25 11:22 ` Łukasz Chrustek
2017-05-29 15:31 ` Łukasz Chrustek
2017-05-30 13:21 ` Sage Weil
2017-06-10 22:45 ` Łukasz Chrustek
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=135176900.20170524151952@tlen.pl \
--to=skidoo@tlen.pl \
--cc=ceph-devel@vger.kernel.org \
--cc=sage@newdream.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.