public inbox for linux-ia64@vger.kernel.org
 help / color / mirror / Atom feed
* [Linux-ia64] kernel message
@ 2003-06-09  5:40 Dai, Yiyang
  2003-06-09 18:26 ` Alex Williamson
  0 siblings, 1 reply; 2+ messages in thread
From: Dai, Yiyang @ 2003-06-09  5:40 UTC (permalink / raw)
  To: linux-ia64


[-- Attachment #1.1: Type: text/plain, Size: 214 bytes --]

Hi all:

    I got some kernel error messages in my rx2600 box , seems it will
lead a server auto reboot .pls check attachment for detail message.

   any advise ?
 <<error@master.txt>> 

Regards,
Yiyang

[-- Attachment #1.2: Type: text/html, Size: 884 bytes --]

[-- Attachment #2: error@master.txt --]
[-- Type: text/plain, Size: 2034 bytes --]

Jun  6 09:22:34 master kernel: +BEGIN HARDWARE ERROR STATE AT CMC
Jun  6 09:22:34 master kernel: +Err Record ID: 4    SAL Rev:  0.02
Jun  6 09:22:34 master kernel: +Time: 06/06/2003 01:22:08    Severity 2
Jun  6 09:22:34 master kernel: +Processor Device Error Info Section
Jun  6 09:22:34 master kernel:  Processor Error Map: 0x1000000
Jun  6 09:22:34 master kernel:  Processor State Param: 0x0
Jun  6 09:22:34 master kernel:  Processor LID: 0x1000000
Jun  6 09:22:34 master kernel: + BUS Check Info [0]
Jun  6 09:22:34 master kernel: + Status Info: 0 ,Severity: 0 ,Transaction Type:
3 ,Transaction Size: 0 ,Error: External
Jun  6 09:22:34 master kernel: +END HARDWARE ERROR STATE AT CMC
Jun  6 09:22:34 master kernel: +BEGIN HARDWARE ERROR STATE AT CMC
Jun  6 09:22:34 master kernel: +Err Record ID: 5    SAL Rev:  0.02
Jun  6 09:22:34 master kernel: +Time: 06/06/2003 01:22:09    Severity 2
Jun  6 09:22:34 master kernel: +Processor Device Error Info Section
Jun  6 09:22:34 master kernel:  Processor Error Map: 0x1000000
Jun  6 09:22:34 master kernel:  Processor State Param: 0x0
Jun  6 09:22:35 master kernel:  Processor LID: 0x1000000
Jun  6 09:22:35 master kernel: + BUS Check Info [0]
Jun  6 09:22:35 master kernel: + Status Info: 0 ,Severity: 0 ,Transaction Type:
3 ,Transaction Size: 0 ,Error: External
Jun  6 09:22:35 master kernel: +END HARDWARE ERROR STATE AT CMC
Jun  6 09:23:59 master ftpd: 10.10.3.160: intel: QUIT




                                                                  [4175]: FTP se
ssion closed
Jun  6 10:26:06 master atd: atd shutdown succeeded
Jun  6 10:26:06 master Font Server[1351]: terminating
Jun  6 10:26:06 master xfs: xfs shutdown succeeded
Jun  6 10:26:06 master gpm: gpm shutdown succeeded
Jun  6 10:26:07 master iscsi: iscsilun shutdown failed
Jun  6 10:26:07 master kernel: iSCSI: release HBA e0000002fff92fd0, host #2
Jun  6 10:26:07 master kernel: scsi : 2 hosts left.
Jun  6 10:26:07 master rpc.mountd: Caught signal 15, un-registering and exiting.

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: [Linux-ia64] kernel message
  2003-06-09  5:40 [Linux-ia64] kernel message Dai, Yiyang
@ 2003-06-09 18:26 ` Alex Williamson
  0 siblings, 0 replies; 2+ messages in thread
From: Alex Williamson @ 2003-06-09 18:26 UTC (permalink / raw)
  To: linux-ia64

Yiyang,

   You had some Corrected Machine Checks on CPU1.  Likely just single
bit errors that got corrected by ECC.  Since your error record ID is
only at 5, you haven't had too many of them.  Looks to me like the
reboot started over an hour after the CMCs, why do you think they're
related?  CMCs in small numbers are generally harmless.  If you get
them in large bursts, you'll want to use my patch that swithes to
polling to reduce system impact.

	Alex

"Dai, Yiyang" wrote:
> 
> Hi all:
> 
>     I got some kernel error messages in my rx2600 box , seems it will lead a server auto reboot .pls check attachment for detail message.
> 
>    any advise ?
> <<error@master.txt>>
> 
> Regards,
> Yiyang

-- 
Alex Williamson                             HP Linux & Open Source Lab


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2003-06-09 18:26 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2003-06-09  5:40 [Linux-ia64] kernel message Dai, Yiyang
2003-06-09 18:26 ` Alex Williamson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox