From: tomjose <tomjose@linux.vnet.ibm.com>
To: OpenBMC Maillist <openbmc@lists.ozlabs.org>
Cc: rosedahl@us.ibm.com, thalerj@us.ibm.com, jkeusema@us.ibm.com
Subject: RFC for Telemetry data collection
Date: Thu, 7 Sep 2017 20:50:16 +0530 [thread overview]
Message-ID: <59B163B0.6060804@linux.vnet.ibm.com> (raw)
Hello,
I am working on the issue
(https://github.com/openbmc/openbmc/issues/1957) to design a telemetry
application for the OpenBMC. I would be explaining a rough idea of how
we plan to go about. Please share your thoughts and feedback on this
proposal. This issue would depend on the design evolving out of
following issues, since this app would utilize the capabilities
provided. (https://github.com/openbmc/openbmc/issues/1856,
https://github.com/openbmc/openbmc/issues/2102).
Summary of the requirements that we came across relevant to this discussion.
1) BMC telemetry data (example VRM rail voltages) where the data is
collected at different rates depending on the data and aggregated by the
BMC app (minimum, maximum
and average). Based on the collection timing request(frequency) the
metrics are logged, so that the user can fetch it for analytics.
2) Users should be able to set thresholds for the temperature limits,
and receive alerts. This would allow user to plan the cooling needs.
3) BMC would act as route for the OCC metrics to be send to the user.
The OCC would send down telemetric data to the BMC and BMC should figure
out a way to
alert the user to consume this data.
We would keep the focus of the discussion on the requirement no 1.
This proposal presupposes that all the resources( example VRM rail
voltages, ambient temperature) that the telemetry app is interested in,
should be populated as dbus objects, which can
be queried to read the instantaneous values. phosphor-hwmon application
exposes many of the interested resources.
The idea is to have a yaml based approach, where the policy of the
telemetry app will be expressed. The application would be able to
consume the yaml and initiate the telemetry
data collection. The yaml would express the following:
a) Dbus Info (object, interface, property) associated with the resource.
b) Units associated with the value (celsius) and the associated scaling
factor).
c) Granularity - the time between two measures.
d) Aggregation methods - min,max,avg..etc.
e) Logging policy - frequency for creating an event and alerting the user.
The application would operate based on the policy and log the telemetry
data. The details of logging would evolve as we progress on the related
issue.
Regards,
Tom
next reply other threads:[~2017-09-07 15:20 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-09-07 15:20 tomjose [this message]
2017-09-07 18:41 ` RFC for Telemetry data collection Rick Altherr
2017-09-07 20:04 ` Todd Rosedahl
2017-09-08 1:18 ` Brad Bishop
2017-09-08 1:16 ` Brad Bishop
2017-09-08 3:29 ` Deepak Kodihalli
2017-09-08 4:06 ` Brad Bishop
2018-03-09 13:43 ` Deepak Kodihalli
2018-03-12 17:57 ` Deepak Kodihalli
2018-03-13 14:23 ` Kurt Taylor
2018-03-13 14:50 ` Deepak Kodihalli
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=59B163B0.6060804@linux.vnet.ibm.com \
--to=tomjose@linux.vnet.ibm.com \
--cc=jkeusema@us.ibm.com \
--cc=openbmc@lists.ozlabs.org \
--cc=rosedahl@us.ibm.com \
--cc=thalerj@us.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.