* Reliability models
@ 2014-01-13 8:48 Loic Dachary
2014-01-13 17:25 ` Kyle Bader
0 siblings, 1 reply; 3+ messages in thread
From: Loic Dachary @ 2014-01-13 8:48 UTC (permalink / raw)
To: Kyle Bader; +Cc: Ceph Development
[-- Attachment #1: Type: text/plain, Size: 261 bytes --]
Hi Kyle,
IIRC you figured out how to use https://github.com/ceph/ceph-tools/tree/master/models . Do you happen to have some kind of HOWTO or even the list of commands you've used to get me started ?
Cheers
--
Loïc Dachary, Artisan Logiciel Libre
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 263 bytes --]
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Reliability models
2014-01-13 8:48 Reliability models Loic Dachary
@ 2014-01-13 17:25 ` Kyle Bader
2014-01-14 23:44 ` Loic Dachary
0 siblings, 1 reply; 3+ messages in thread
From: Kyle Bader @ 2014-01-13 17:25 UTC (permalink / raw)
To: Loic Dachary; +Cc: Ceph Development
Hi Loic,
> IIRC you figured out how to use https://github.com/ceph/ceph-tools/tree/master/models . Do you happen to have some kind of HOWTO or even the list of commands you've used to get me started ?
git clone ...ceph-tools
cd ceph-tools/models/reliability
./main.py -g
At this point a graphical form should pop up and most of it should be
straight forward. NRE rate is non recoverable read errors, the NRE
model determines how an array/disk responds to a non recoverable read
error event. I tend to be on the conservative end having seen my share
of RAID horror shows so I set the NRE model to "fail". The other
gotcha is most disk manufacturers use MTBF or AFR while the
reliability modeling tool requests FITS. The formula you will need is:
MTBF = 1,000,000,000 x 1/FIT. [1]
Stripe length is the number of RADOS objects that are require to store
a blob of data, ie all RADOS objects that compose a RBD volume. Since
this code was written a while ago there is no code to support modeling
erasure coded objects. If you have any questions, let me know!
[1] http://en.wikipedia.org/wiki/Failure_rate#Units
--
Kyle Bader - Inktank
Senior Solution Architect
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Reliability models
2014-01-13 17:25 ` Kyle Bader
@ 2014-01-14 23:44 ` Loic Dachary
0 siblings, 0 replies; 3+ messages in thread
From: Loic Dachary @ 2014-01-14 23:44 UTC (permalink / raw)
To: Kyle Bader; +Cc: Ceph Development
[-- Attachment #1: Type: text/plain, Size: 1301 bytes --]
Crystal clear, thanks !
On 13/01/2014 18:25, Kyle Bader wrote:
> Hi Loic,
>
>> IIRC you figured out how to use https://github.com/ceph/ceph-tools/tree/master/models . Do you happen to have some kind of HOWTO or even the list of commands you've used to get me started ?
>
> git clone ...ceph-tools
> cd ceph-tools/models/reliability
> ./main.py -g
>
> At this point a graphical form should pop up and most of it should be
> straight forward. NRE rate is non recoverable read errors, the NRE
> model determines how an array/disk responds to a non recoverable read
> error event. I tend to be on the conservative end having seen my share
> of RAID horror shows so I set the NRE model to "fail". The other
> gotcha is most disk manufacturers use MTBF or AFR while the
> reliability modeling tool requests FITS. The formula you will need is:
>
> MTBF = 1,000,000,000 x 1/FIT. [1]
>
> Stripe length is the number of RADOS objects that are require to store
> a blob of data, ie all RADOS objects that compose a RBD volume. Since
> this code was written a while ago there is no code to support modeling
> erasure coded objects. If you have any questions, let me know!
>
> [1] http://en.wikipedia.org/wiki/Failure_rate#Units
>
--
Loïc Dachary, Artisan Logiciel Libre
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 263 bytes --]
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2014-01-14 23:44 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-01-13 8:48 Reliability models Loic Dachary
2014-01-13 17:25 ` Kyle Bader
2014-01-14 23:44 ` Loic Dachary
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.