From: Martin Wilderoth <martin.wilderoth@linserv.se>
To: ceph-devel@vger.kernel.org
Subject: Re: HEALTH_WARNING
Date: Sat, 2 Apr 2011 12:55:38 +0200 (CEST) [thread overview]
Message-ID: <718796783.13438.1301741738011.JavaMail.root@mail.linserv.se> (raw)
In-Reply-To: <1463999357.13436.1301740919511.JavaMail.root@mail.linserv.se>
Hello,
I have seperate partitions for my osd and the btrfs file system.
I also use SSD-disk for journaling.
But I got problem when the root system was filled up with logfiles on one host,
the file system reported out of diskspace.
But the osd's were not filled to 100%. Later I realised that the root system on one of the osd hosts (osd2 and osd3) had no space left, to much logging.
The only way I know to recover is to create a new filesystem in the cluster :-)
But it's bad fot the data :-)
When i get problems with one osd it seems as if they are crashing one by one.
And i dont know how to get them up again whitout deleting all the data.
Hi,
On Sat, 2011-04-02 at 05:59 +0200, Martin Wilderoth wrote:
> Hello,
>
> One of my hosts run out of diskspace on the root file system (logfiles)
> So I restared ceph. Discoverd the low diskspace during the restart. osd2 and osd3
>
Do you have separate partitions for your OSD data? Or do you have one
big / partition? I'd recommend a separate partition for your OSD's.
> ceph health gives a message like this
>
> HEALTH_WARN osdmonitor: num_osds = 6, num_up_osds = 4, num_in_osds = 4 Some PGs are: degraded,peering
>
> now osd.1 is dead all the other are running
>
> How do I get the running one up and in ? and how do I know which ods it is ?
>
$ ceph osd dump -o -
That should tell you which OSD is down/out.
> how do I recover the dead one ?
>
Normally starting the OSD would be enough. Look closely though, you
might have hit a bug which caused the OSD to crash. If so, there should
be a file called "core" in / which has a core-dump and could tell why
the OSD crashed:
$ gdb /usr/bin/cosd /core
Make sure you have the debug symbols (-dbg packages) installed when
doing so.
If you monitor 'ceph -w' then, you should see the cluster recover and
all OSD's should be up & in.
Wido
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next parent reply other threads:[~2011-04-02 11:02 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <1463999357.13436.1301740919511.JavaMail.root@mail.linserv.se>
2011-04-02 10:55 ` Martin Wilderoth [this message]
2011-04-02 15:04 ` HEALTH_WARNING Henry Chang
2011-04-02 18:23 ` HEALTH_WARNING Martin Wilderoth
2011-04-03 23:38 ` HEALTH_WARNING Gregory Farnum
[not found] <290366553.13874.1302029956409.JavaMail.root@mail.linserv.se>
2011-04-05 19:07 ` HEALTH_WARNING Martin Wilderoth
2011-04-06 17:13 ` HEALTH_WARNING Josh Durgin
[not found] <835540127.13427.1301716690785.JavaMail.root@mail.linserv.se>
2011-04-02 3:59 ` HEALTH_WARNING Martin Wilderoth
2011-04-02 8:22 ` HEALTH_WARNING Wido den Hollander
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=718796783.13438.1301741738011.JavaMail.root@mail.linserv.se \
--to=martin.wilderoth@linserv.se \
--cc=ceph-devel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.