* Review: Servercrash with kernel SuSE 2.4.16
@ 2002-05-06 8:57 Oliver.Schersand
2002-05-06 11:33 ` Hans Reiser
2002-05-06 15:17 ` Alan Cox
From: Oliver.Schersand @ 2002-05-06 8:57 UTC
To: chris.mason, alessandro.suardi; +Cc: sbrand, reiser, linux-kernel
Hi,
This is the current result of the analysis of our Oracle server crashes on
Linux servers running SuSE Enterprise Server 7 with kernel 2.4.16.
The crashes were caused by the Compaq Remote Insight and Compaq
health drivers, which are supplied by Compaq. When these agents start up,
they load binary kernel modules that are very kernel-version sensitive.
Under heavy load, these modules corrupt the server's virtual memory
management.
This points to a major problem with Linux in datacenter environments.
Automatic monitoring of locally attached storage and of the hardware is very
important in these environments, where we use expensive high-performance
hardware. This hardware is not well supported by the standard Linux kernel.
The companies that sell it do not make all of its features available to the
Linux community, and their drivers and monitoring agents are not distributed
under the GPL.
Here is the statement from Compaq (HPQ):
The new health driver that can be run on any re-compiled kernel is due
out in the SmartStart 5.50 timeframe (3Q02). The reason for this is
that it also provides support for new servers as well.
The delivery method will be via the Compaq web site and the SmartStart
CD.
The customer expectation is that it will work with any re-compiled
kernel. Most likely that will be true; however, it will be impossible for
us to validate it with every possibility due to the open source nature
of Linux.
Today, customers actually can run the existing health driver on
re-compiled kernels by invoking a "fix-up" script that will be available
on our website. This is not an ideal solution but it will work in a
majority of the situations.
Thank you very much for all the help with this problem
Oliver Schersand
* Re: Review: Servercrash with kernel SuSE 2.4.16
From: Hans Reiser @ 2002-05-06 11:33 UTC
To: Oliver.Schersand; +Cc: chris.mason, alessandro.suardi, sbrand, linux-kernel
Oliver.Schersand@BASF-IT-Services.com wrote:
>Hi,
>
>This is the current result of the analysis of our Oracle server crashes on
>Linux servers running SuSE Enterprise Server 7 with kernel 2.4.16.
>
>The crashes were caused by the Compaq Remote Insight and Compaq
>health drivers, which are supplied by Compaq. When these agents start up,
>they load binary kernel modules that are very kernel-version sensitive.
>Under heavy load, these modules corrupt the server's virtual memory
>management.
>
>This points to a major problem with Linux in datacenter environments.
>Automatic monitoring of locally attached storage and of the hardware is very
>important in these environments, where we use expensive high-performance
>hardware. This hardware is not well supported by the standard Linux kernel.
>The companies that sell it do not make all of its features available to the
>Linux community, and their drivers and monitoring agents are not distributed
>under the GPL.
>
>
>Here is the statement from Compaq (HPQ):
>
> The new health driver that can be run on any re-compiled kernel is due
> out in the SmartStart 5.50 timeframe (3Q02). The reason for this is
> that it also provides support for new servers as well.
> The delivery method will be via the Compaq web site and the SmartStart
> CD.
>
> The customer expectation is that it will work with any re-compiled
> kernel. Most likely that will be true; however, it will be impossible for
> us to validate it with every possibility due to the open source nature
> of Linux.
>
Well, if they supplied the source code then customers could validate it
and fix it. ;-)
>
> Today, customers actually can run the existing health driver on
> re-compiled kernels by invoking a "fix-up" script that will be available
> on our website. This is not an ideal solution but it will work in a
> majority of the situations.
>
>
>Thank you very much for all the help with this problem
>
>Oliver Schersand
>
>
>
>
>
I think that any product which lacks source code is unsuited for use in
a mission-critical Linux server application. I think customers should
tell companies this. When things break, you need to be able to fix
them, and not listen to excuses about how it is somehow the customer's
fault that they used a version of Linux that works except for the
vendor's code. The paradigm has changed.....
You have my sympathies. I will restrain myself from telling you about
buying a Dell laptop, and having customer service tell me that they
would not provide me any support unless I installed the original
(Windows) OS when I clearly had a hardware problem.....
I should have bought IBM and waited 4-6 weeks for their incompetent
(incompetent only in that they take too long to ship) shipping
department to build and send me my laptop....
Hans
* Re: Review: Servercrash with kernel SuSE 2.4.16
From: Alan Cox @ 2002-05-06 15:17 UTC
To: Oliver.Schersand
Cc: chris.mason, alessandro.suardi, sbrand, reiser, linux-kernel
> This points to a major problem with Linux in datacenter environments.
> Automatic monitoring of locally attached storage and of the hardware is very
> important in these environments, where we use expensive high-performance
> hardware. This hardware is not well supported by the standard Linux kernel.
> The companies that sell it do not make all of its features available to the
> Linux community, and their drivers and monitoring agents are not distributed
> under the GPL.
I would suggest you review your vendor and hardware policies. The standard
Linux i2c/SMBus add-ons support extensive power and health monitoring for
most standards-based systems.
Anyone who loads a product onto a critical datacentre system where the
vendor says "well it might work, but we don't know even with the vendor
supplied kernel" is not being terribly professional about it.
Maybe your products are in fact also supported by open source code, maybe
your choice of hardware is poor. Would you buy Win2K setups where the
vendor said "well the monitoring might work, we don't know"?
My system temperatures, power status, disk array temperatures and disk SMART
status are all happily being logged. I have no binary modules on the server.
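As an illustration of the kind of health logging described above (not the actual setup used in this thread), here is a minimal Python sketch. It assumes a current kernel where hardware-monitoring drivers expose readings under /sys/class/hwmon as tempN_input files holding millidegrees Celsius; 2.4-era lm_sensors used a different /proc layout. The function name read_temperatures is hypothetical.

```python
import glob
import os


def read_temperatures(root="/sys/class/hwmon"):
    """Return {(chip, sensor): degrees_celsius} for every readable temp input.

    Assumes the sysfs hwmon layout: <root>/hwmonN/name and
    <root>/hwmonN/tempM_input (value in millidegrees Celsius).
    """
    readings = {}
    for chip_dir in sorted(glob.glob(os.path.join(root, "hwmon*"))):
        # The chip name is optional; fall back to the directory name.
        try:
            with open(os.path.join(chip_dir, "name")) as f:
                chip = f.read().strip()
        except OSError:
            chip = os.path.basename(chip_dir)

        for path in sorted(glob.glob(os.path.join(chip_dir, "temp*_input"))):
            try:
                with open(path) as f:
                    millideg = int(f.read().strip())
            except (OSError, ValueError):
                # Skip sensors that cannot be read or report garbage.
                continue
            sensor = os.path.basename(path)[: -len("_input")]
            readings[(chip, sensor)] = millideg / 1000.0
    return readings


if __name__ == "__main__":
    for (chip, sensor), celsius in read_temperatures().items():
        print(f"{chip} {sensor}: {celsius:.1f} C")
```

Run periodically (for example from cron) and appended to a log file, this gives a temperature log without loading any binary kernel modules; it does not cover disk SMART status or power state.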