From: Fabio M. Di Nitto <fdinitto@redhat.com>
To: cluster-devel.redhat.com
Subject: [Cluster-devel] fencing conditions: what should trigger a fencing operation?
Date: Thu, 19 Nov 2009 19:10:54 +0100 [thread overview]
Message-ID: <4B058A2E.9070600@redhat.com> (raw)
In-Reply-To: <20091119170404.GA23287@redhat.com>
David Teigland wrote:
> On Thu, Nov 19, 2009 at 12:35:05PM +0100, Fabio M. Di Nitto wrote:
>
>> - what are the current fencing policies?
>
> node failure
>
>> - what can we do to improve them?
>
> node failure is a simple, black and white, fact
>
>> - should we monitor for more failures than we do now?
>
> corosync *exists* to to detect node failure
>
>> It is a known issue that node1 will crash at some point (kernel OOPS).
>
> oops is not necessarily node failure; if you *want* it to be, then you
> sysctl -w kernel.panic_on_oops=1
>
> (gfs has also had it's own mount options over the years to force this
> behavior, even if the sysctl isn't set properly; it's a common issue.
> It seems panic_on_oops has had inconsistent default values over various
> releases, sometimes 0, sometimes 1; setting it has historically been part
> of cluster/gfs documentation since most customers want it to be 1.)
So a cluster can hang because our code failed, but we don?t detect that
it did fail.... so what determines a node failure? only when corosync dies?
panic_on_oops is not cluster specific and not all OOPS are panic == not
a clean solution.
Fabio
next prev parent reply other threads:[~2009-11-19 18:10 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-11-19 11:35 [Cluster-devel] fencing conditions: what should trigger a fencing operation? Fabio M. Di Nitto
2009-11-19 17:04 ` David Teigland
2009-11-19 16:15 ` Steven Whitehouse
2009-11-19 17:28 ` David Teigland
2009-11-19 17:16 ` David Teigland
2009-11-19 18:10 ` Fabio M. Di Nitto [this message]
2009-11-19 19:49 ` David Teigland
2009-11-20 7:26 ` Fabio M. Di Nitto
2009-11-20 17:40 ` David Teigland
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4B058A2E.9070600@redhat.com \
--to=fdinitto@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.