public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Lars Marowsky-Bree <lmb@suse.de>
To: David Teigland <teigland@redhat.com>
Cc: Steven Dake <sdake@mvista.com>,
	linux-kernel@vger.kernel.org, akpm@osdl.org
Subject: Re: [PATCH 1b/7] dlm: core locking
Date: Thu, 28 Apr 2005 14:33:15 +0200	[thread overview]
Message-ID: <20050428123315.GP21645@marowsky-bree.de> (raw)
In-Reply-To: <20050427142638.GG16502@redhat.com>

On 2005-04-27T22:26:38, David Teigland <teigland@redhat.com> wrote:

> > So in effect, the delivery of the suspend/membership distribution/resume
> > events are three cluster-wide barriers?
> > 
> > I can see how that simplifies the recovery algorithm.
> Correct.  I actually consider it two external barriers: the first after
> the lockspace has been suspended, the second after lockspace recovery is
> completed.

Hmmm. This is actually slightly different from what I thought you were
doing.

Actually, is that several phase step really necessary? Given that
failures can occur at any given point even then - during each barrier
and in-between - couldn't we just as well deliver the membership event
directly, and proceed with recovery as if that was the "final" state?
We always have to deal with nodes failing, rejoining etc at any given
step and eventually restarting the algorithm if needed.

(Well, a node joining you could serialize and only do that after you
have completed the recovery steps for the event before. But a failing
node during recovery seems to imply the need to restart the algorithm
anyway, and that's just what would happen if a new membership event was
delivered.)

Does it really simplify the recovery, or does it just obscure the
complexity, ie, snake oil?

That said, the model can be mapped as-is quite directly to how the
heartbeat 2.x handles resources which can be active more than once. The
first barrier would be the "we lost a node and are about to fence one of
your incarnations" (or "a node joined and we're about to start one"),
and the second one would be the "we fenced node X" or "we started you on
node X".

However, there's one property here: We assume that those notifications
_can never fail_; they are delivered (and guaranteed to be before we
commence the main operation), and that's it. Can a node in your model
choose to reject the suspend/resume operation?

> > And, I assume that the delivery of a "node down" membership event
> > implies that said node also has been fenced.
> Typically it does if you're combining the dlm with something that requires
> fencing (like a file system).  Fencing isn't relevant to the dlm itself,
> though, since the dlm software isn't touching any storage.

Ack. Good point, I was thinking too much in terms of GFS/OCFS2 here ;-)

> > Just some food for thought how this all fits together rather neatly.
> Interesting, and sounds correct.  I must admit that using the word "lock"
> to describe these CRM-level inter-dependent objects is new to me.

It's locks with dependencies, instead of one "lock" per resource group.
That's been mulling on the back of my mind ever since Stephen gave me
the DLM-centric-clustering-world talking to at Linux Kongress 98, I
think ;-) By now I think the model fits.


Sincerely,
    Lars Marowsky-Brée <lmb@suse.de>

-- 
High Availability & Clustering
SUSE Labs, Research and Development
SUSE LINUX Products GmbH - A Novell Business


  reply	other threads:[~2005-04-28 12:57 UTC|newest]

Thread overview: 48+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-04-25 16:58 [PATCH 1b/7] dlm: core locking David Teigland
2005-04-25 18:34 ` Nikita Danilov
2005-04-25 20:44   ` Daniel Phillips
2005-04-25 22:27     ` Nikita Danilov
2005-04-26  1:34       ` Daniel Phillips
2005-04-25 20:41 ` Jesper Juhl
2005-04-26  5:00   ` Daniel Phillips
2005-04-25 21:54 ` Steven Dake
2005-04-26  5:49   ` David Teigland
2005-04-26 17:40     ` Steven Dake
2005-04-26 22:24       ` Daniel Phillips
2005-04-26 23:04         ` Steven Dake
2005-04-27  0:53           ` Daniel Phillips
2005-04-27  1:50             ` Steven Dake
2005-04-27  4:21               ` Daniel Phillips
2005-04-27  3:02       ` David Teigland
2005-04-27 13:41         ` Lars Marowsky-Bree
2005-04-27 14:26           ` David Teigland
2005-04-28 12:33             ` Lars Marowsky-Bree [this message]
2005-04-28 16:39               ` Daniel McNeil
2005-04-28 16:45                 ` Lars Marowsky-Bree
2005-04-29  8:25                   ` Daniel Phillips
2005-05-02 20:45                     ` Lars Marowsky-Bree
2005-05-02 23:23                       ` Daniel Phillips
2005-04-29  4:01                 ` David Teigland
2005-04-29 22:58                   ` Daniel McNeil
2005-04-30  4:29                     ` David Teigland
2005-04-30  9:09                     ` Daniel Phillips
2005-04-30 10:32                       ` Lars Marowsky-Bree
2005-04-30 11:12                         ` Daniel Phillips
2005-05-02 20:51                           ` Lars Marowsky-Bree
2005-05-02 22:21                             ` Daniel Phillips
2005-05-05 12:25                       ` Stephen C. Tweedie
2005-05-05 12:40                         ` copy_to_user question linux
2005-05-05 13:13                           ` Richard B. Johnson
2005-05-05 19:29                         ` [PATCH 1b/7] dlm: core locking Daniel Phillips
2005-04-28  2:52           ` Daniel Phillips
2005-04-28 12:37             ` Lars Marowsky-Bree
2005-04-28 23:43               ` Daniel Phillips
2005-04-28  6:49           ` Daniel Phillips
2005-04-28 12:55             ` Lars Marowsky-Bree
2005-04-29  0:26               ` Daniel Phillips
2005-04-29  2:52                 ` David Teigland
2005-04-29  3:49                   ` Daniel Phillips
2005-05-02 21:00                     ` Lars Marowsky-Bree
2005-05-03  2:54                       ` David Teigland
2005-04-27 12:33 ` Domen Puncer
2005-04-27 13:30   ` David Teigland

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20050428123315.GP21645@marowsky-bree.de \
    --to=lmb@suse.de \
    --cc=akpm@osdl.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=sdake@mvista.com \
    --cc=teigland@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox