All of lore.kernel.org
 help / color / mirror / Atom feed
* Preventing a system power on before BMC Ready
@ 2023-05-02 20:48 Andrew Geissler
  2023-05-02 21:50 ` Michael Richardson
  2023-05-03  0:48 ` Ed Tanous
  0 siblings, 2 replies; 4+ messages in thread
From: Andrew Geissler @ 2023-05-02 20:48 UTC (permalink / raw)
  To: OpenBMC List

About once a month a bug arrives internally where someone has powered on the
host without waiting for the BMC to reach its Ready state. Our systems for a
variety of reasons require the BMC to be at Ready before initiating a system
power on.

The defects are usually returned as user error in that users are supposed to
know to wait. Our Redfish clients (including the web UI) know to not allow a
power on operation until Ready. Recently however we had a bug where our external
Redfish client allowed a power on before Ready. That client is event driven once
connected to the BMC and because they never got an event about an unexpected BMC
reboot, they allowed a power on before Ready when the BMC came back up. Granted
there is only about a 30s window where we have a problem here, but as we all
know, when there's a window, someone finds it.

That got us brainstorming about some possible solutions:
- Write some code in bmcweb to send a “bmc state change event” anytime bmcweb
  comes up to ensure listening clients know “something” has happened
- Add an optional compile option to bmcweb (or PSM/x86-power-control) to require
  BMC Ready before issuing chassis or system POST requests (return error if not
  at Ready)
- Queue up the power on request and execute it once we reach BMC Ready (not sure
  what type of response that would be to Redfish clients or what error path
  looks like if we never reach Ready?)
- Find a way in the client to better detect an unexpected bmc reboot (heartbeat
  of some sort)
- Push bmcweb further in the startup to BMC Ready, ensuring clients can't talk
  to the BMC until it's near Ready state

Thoughts?
Andrew

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2023-05-09 20:01 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-05-02 20:48 Preventing a system power on before BMC Ready Andrew Geissler
2023-05-02 21:50 ` Michael Richardson
2023-05-03  0:48 ` Ed Tanous
2023-05-09 20:00   ` Andrew Geissler

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.