public inbox for linux-ia64@vger.kernel.org
 help / color / mirror / Atom feed
* [RFC] Better MCA recovery on IPF
@ 2003-10-27  8:07 Hidetoshi Seto
  2003-10-27 16:58 ` Matthias Fouquet-Lapar
                   ` (26 more replies)
  0 siblings, 27 replies; 28+ messages in thread
From: Hidetoshi Seto @ 2003-10-27  8:07 UTC (permalink / raw)
  To: linux-ia64

I want to make contributions to the development of MCA Error Handling.

According to IPF Error Handling Guide, OS should have capability to recover from
error.

There are three types of error, Corrected, Recoverable, and Fatal. They are
reported to OS by MCA/CPEI/CMCI, and actions required to OS depend on the type
of them. Relations between the type and the action are as follows;

 - Corrected:
     Do nothing.

 - Recoverable:
     Depends on the situation,
     - Fix the error, continue interrupted thread.
     - Terminate suffered threads.
     - Just as Fatal, reboot.

 - Fatal:
     Reboot system immediately.

In all case, Linux should log error information based on SAL record.
So, some programs in user land, like fault prediction logic or
a daemon that reports error to remote site, could use these logs. And
system administrator also could use these logs to keep their system
healthy.


I have strong expectations for Linux to realize such recovery features.
However, Linux is deficient in recovery codes, especially on recoverable MCA,
at this moment. (I know your good job, Tony.)

I want to know what difficulty keep Linux as-is.

What do you think of error recovery on Linux?
What kind of functions, macros, structures should Linux have for recovery?


Best regards,

------

H.Seto <seto.hidetoshi@jp.fujitsu.com>


^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2003-11-10 10:33 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-10-27  8:07 [RFC] Better MCA recovery on IPF Hidetoshi Seto
2003-10-27 16:58 ` Matthias Fouquet-Lapar
2003-10-31  5:09 ` Hidetoshi Seto
2003-10-31 17:14 ` Grant Grundler
2003-11-01  6:39 ` Matthias Fouquet-Lapar
2003-11-01  8:38 ` Keith Owens
2003-11-02 13:33 ` Matthias Fouquet-Lapar
2003-11-03 17:09 ` Russ Anderson
2003-11-03 17:37 ` Matthias Fouquet-Lapar
2003-11-03 17:51 ` Alberto Munoz
2003-11-03 17:53 ` Alberto Munoz
2003-11-03 18:23 ` Jack Steiner
2003-11-03 18:42 ` Alberto Munoz
2003-11-03 19:28 ` Jack Steiner
2003-11-03 23:09 ` Alberto Munoz
2003-11-05  4:11 ` Greg Banks
2003-11-05 17:00 ` Luck, Tony
2003-11-05 17:14 ` Alberto Munoz
2003-11-05 17:30 ` Matthew Wilcox
2003-11-05 17:37 ` Alberto Munoz
2003-11-06 12:03 ` Hidetoshi Seto
2003-11-06 14:23 ` Matthias Fouquet-Lapar
2003-11-06 19:09 ` Luck, Tony
2003-11-07  9:58 ` Hidetoshi Seto
2003-11-07 10:52 ` Matthias Fouquet-Lapar
2003-11-08  1:15 ` Luck, Tony
2003-11-08  7:36 ` Matthias Fouquet-Lapar
2003-11-10 10:33 ` Hidetoshi Seto

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox