All of lore.kernel.org
 help / color / mirror / Atom feed
* Unfixable corruption in ceph cluster
@ 2014-02-07 21:19 Daniel Poelzleithner
  0 siblings, 0 replies; only message in thread
From: Daniel Poelzleithner @ 2014-02-07 21:19 UTC (permalink / raw)
  To: ceph-devel

Hi,

we experience a strange corruption in the ceph cluster that makes it
impossible to restart all nodes of it. Always one node crashes when some
pg gets replicated.
As much as I understood the admin, if the node is cleared completely,
the node synces, but some other node crashes then.

I think there was a similar bug
http://tracker.ceph.com/issues/6101#note-7 already filed.

Removing the rados block did not fix the problem.

In my opinion the bug is severe, as it shows that some internal
corruption seems to be triggered by network failure and causes a
permanent unfixable broken cluster.

Could someone please take a look at it ?
I will try to provide additional information when required.


kind regards
 Daniel

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2014-02-07 21:29 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-02-07 21:19 Unfixable corruption in ceph cluster Daniel Poelzleithner

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.