All of lore.kernel.org
 help / color / mirror / Atom feed
* SMART monitoring
@ 2013-12-27  0:26 James Harper
  2013-12-27  2:17 ` Sage Weil
  0 siblings, 1 reply; 5+ messages in thread
From: James Harper @ 2013-12-27  0:26 UTC (permalink / raw)
  To: ceph-devel@vger.kernel.org

What would be the best approach to integrate SMART with ceph, for the predictive failure case?

Assuming you agree with SMART diagnosis of an impending failure, would it be better to automatically start migrating data off the OSD (reduce the weight to 0?), or to just prompt the user to replace the disk (which requires no monitoring on ceph's part)? The former would ensure that redundancy is maintained at all times without any user interaction.

And what about the bad sector case? Assuming you are using something like btrfs with redundant copies of metadata, and assuming that is enough to keep the metadata consistent, what should be done in the case of a small number of fs errors? Can ceph handle getting an i/o error on one of its files inside the osd and just read from the replica, or should the entire osd just be failed and let ceph rebalance the data itself?

Thanks

James

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2014-05-22  9:00 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-12-27  0:26 SMART monitoring James Harper
2013-12-27  2:17 ` Sage Weil
2013-12-27 16:15   ` Justin Erenkrantz
2013-12-27 17:09     ` Andrey Korolyov
2014-05-22  8:59       ` Andrey Korolyov

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.