* RE: [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm
@ 2005-07-20 16:55 Walker, Bruce J (HP-Labs)
2005-07-20 18:09 ` [Clusters_sig] " Lars Marowsky-Bree
2005-07-22 3:22 ` Daniel Phillips
0 siblings, 2 replies; 5+ messages in thread
From: Walker, Bruce J (HP-Labs) @ 2005-07-20 16:55 UTC (permalink / raw)
To: linux clustering, David Teigland; +Cc: linux-kernel, ocfs2-devel, clusters_sig
Like Lars, I too was under the wrong impression about this configfs "nodemanager" kernel component. Our discussions in the cluster meeting Monday and Tuesday were assuming it was a general service that other kernel components could/would utilize and possibly also something that could send uevents to non-kernel components wanting a std. way to see membership information/events.
As to kernel components without corresponding user-level "managers", look no farther than OpenSSI. Our hope was that we could adapt to a user-land membership service and this interface thru configfs would drive all our kernel subsystems.
Bruce Walker
OpenSSI Cluster project
-----Original Message-----
From: linux-cluster-bounces@redhat.com [mailto:linux-cluster-bounces@redhat.com] On Behalf Of Lars Marowsky-Bree
Sent: Wednesday, July 20, 2005 9:27 AM
To: David Teigland
Cc: linux-cluster@redhat.com; linux-kernel@vger.kernel.org; ocfs2-devel@oss.oracle.com
Subject: [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm
On 2005-07-20T11:35:46, David Teigland <teigland@redhat.com> wrote:
> > Also, eventually we obviously need to have state for the nodes -
> > up/down et cetera. I think the node manager also ought to track this.
> We don't have a need for that information yet; I'm hoping we won't
> ever need it in the kernel, but we'll see.
Hm, I'm thinking a service might have a good reason to want to know the possible list of nodes as opposed to the currently active membership; though the DLM as the service in question right now does not appear to need such.
But, see below.
> There are at least two ways to handle this:
>
> 1. Pass cluster events and data into the kernel (this sounds like what
> you're talking about above), notify the effected kernel components,
> each kernel component takes the cluster data and does whatever it
> needs to with it (internal adjustments, recovery, etc).
>
> 2. Each kernel component "foo-kernel" has an associated user space
> component "foo-user". Cluster events (from userland clustering
> infrastructure) are passed to foo-user -- not into the kernel.
> foo-user determines what the specific consequences are for foo-kernel.
> foo-user then manipulates foo-kernel accordingly, through user/kernel
> hooks (sysfs, configfs, etc). These control hooks would largely be specific to foo.
>
> We're following option 2 with the dlm and gfs and have been for quite
> a while, which means we don't need 1. I think ocfs2 is moving that
> way, too. Someone could still try 1, of course, but it would be of no
> use or interest to me. I'm not aware of any actual projects pushing
> forward with something like 1, so the persistent reference to it is somewhat baffling.
Right. I thought that the node manager changes for generalizing it where pushing into sort-of direction 1. Thanks for clearing this up.
Sincerely,
Lars Marowsky-Brée <lmb@suse.de>
--
High Availability & Clustering
SUSE Labs, Research and Development
SUSE LINUX Products GmbH - A Novell Business -- Charles Darwin
"Ignorance more frequently begets confidence than does knowledge"
--
Linux-cluster mailing list
Linux-cluster@redhat.com
http://www.redhat.com/mailman/listinfo/linux-cluster
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Clusters_sig] RE: [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm
2005-07-20 16:55 [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm Walker, Bruce J (HP-Labs)
@ 2005-07-20 18:09 ` Lars Marowsky-Bree
2005-07-20 18:39 ` Joel Becker
2005-07-22 3:22 ` Daniel Phillips
1 sibling, 1 reply; 5+ messages in thread
From: Lars Marowsky-Bree @ 2005-07-20 18:09 UTC (permalink / raw)
To: Walker, Bruce J (HP-Labs), linux clustering, David Teigland
Cc: linux-kernel, ocfs2-devel, clusters_sig
On 2005-07-20T09:55:31, "Walker, Bruce J (HP-Labs)" <bruce.walker@hp.com> wrote:
> Like Lars, I too was under the wrong impression about this configfs
> "nodemanager" kernel component. Our discussions in the cluster
> meeting Monday and Tuesday were assuming it was a general service that
> other kernel components could/would utilize and possibly also
> something that could send uevents to non-kernel components wanting a
> std. way to see membership information/events.
Let me clarify that this was something we briefly touched on in
Walldorf: The node manager would (re-)export the current data via sysfs
(which would result in uevents being sent, too), and not something we
dreamed up just Monday ;-)
> As to kernel components without corresponding user-level "managers",
> look no farther than OpenSSI. Our hope was that we could adapt to a
> user-land membership service and this interface thru configfs would
> drive all our kernel subsystems.
Well, node manager still can provide you the input as to which nodes are
configured, which in a way translates to "membership". The thing it
doesn't seem to provide yet is the supsend/modify/resume cycle which for
example the RHAT DLM seems to require.
Sincerely,
Lars Marowsky-Brée <lmb@suse.de>
--
High Availability & Clustering
SUSE Labs, Research and Development
SUSE LINUX Products GmbH - A Novell Business -- Charles Darwin
"Ignorance more frequently begets confidence than does knowledge"
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Clusters_sig] RE: [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm
2005-07-20 18:09 ` [Clusters_sig] " Lars Marowsky-Bree
@ 2005-07-20 18:39 ` Joel Becker
2005-07-21 23:22 ` Lars Marowsky-Bree
0 siblings, 1 reply; 5+ messages in thread
From: Joel Becker @ 2005-07-20 18:39 UTC (permalink / raw)
To: linux clustering
Cc: Walker, Bruce J (HP-Labs), David Teigland, linux-kernel,
ocfs2-devel, clusters_sig
On Wed, Jul 20, 2005 at 08:09:18PM +0200, Lars Marowsky-Bree wrote:
> On 2005-07-20T09:55:31, "Walker, Bruce J (HP-Labs)" <bruce.walker@hp.com> wrote:
>
> > Like Lars, I too was under the wrong impression about this configfs
> > "nodemanager" kernel component. Our discussions in the cluster
> > meeting Monday and Tuesday were assuming it was a general service that
> > other kernel components could/would utilize and possibly also
> > something that could send uevents to non-kernel components wanting a
> > std. way to see membership information/events.
>
> Let me clarify that this was something we briefly touched on in
> Walldorf: The node manager would (re-)export the current data via sysfs
> (which would result in uevents being sent, too), and not something we
> dreamed up just Monday ;-)
In turn, let me clarify a little where configfs fits in to
things. Configfs is merely a convenient and transparent method to
communicate configuration to kernel objects. It's not a place for
uevents, for netlink sockets, or for fancy communication. It allows
userspace to create an in-kernel object and set/get values on that
object. It also allows userspace and kernelspace to share the same
representation of that object and its values.
For more complex interaction, sysfs and procfs are often more
appropriate. While you might "configure" all known nodes in configfs,
the node up/down state might live in sysfs. A netlink socket for
up/down events might live in procfs. And so on.
Joel
--
"But all my words come back to me
In shades of mediocrity.
Like emptiness in harmony
I need someone to comfort me."
Joel Becker
Senior Member of Technical Staff
Oracle
E-mail: joel.becker@oracle.com
Phone: (650) 506-8127
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Clusters_sig] RE: [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm
2005-07-20 18:39 ` Joel Becker
@ 2005-07-21 23:22 ` Lars Marowsky-Bree
0 siblings, 0 replies; 5+ messages in thread
From: Lars Marowsky-Bree @ 2005-07-21 23:22 UTC (permalink / raw)
To: linux clustering, Walker, Bruce J (HP-Labs), David Teigland,
linux-kernel, ocfs2-devel, clusters_sig
On 2005-07-20T11:39:38, Joel Becker <Joel.Becker@oracle.com> wrote:
> In turn, let me clarify a little where configfs fits in to
> things. Configfs is merely a convenient and transparent method to
> communicate configuration to kernel objects. It's not a place for
> uevents, for netlink sockets, or for fancy communication. It allows
> userspace to create an in-kernel object and set/get values on that
> object. It also allows userspace and kernelspace to share the same
> representation of that object and its values.
> For more complex interaction, sysfs and procfs are often more
> appropriate. While you might "configure" all known nodes in configfs,
> the node up/down state might live in sysfs. A netlink socket for
> up/down events might live in procfs. And so on.
Right. Thanks for the clarification and elaboration, for I am sure
not entirely clear as to how all these mechanisms relate in detail and
what is appropriate just where, and when to use something more classic
like ioctl etc... ;-)
FWIW, we didn't mean to get uevents out via configfs of course.
Sincerely,
Lars Marowsky-Brée <lmb@suse.de>
--
High Availability & Clustering
SUSE Labs, Research and Development
SUSE LINUX Products GmbH - A Novell Business -- Charles Darwin
"Ignorance more frequently begets confidence than does knowledge"
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm
2005-07-20 16:55 [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm Walker, Bruce J (HP-Labs)
2005-07-20 18:09 ` [Clusters_sig] " Lars Marowsky-Bree
@ 2005-07-22 3:22 ` Daniel Phillips
1 sibling, 0 replies; 5+ messages in thread
From: Daniel Phillips @ 2005-07-22 3:22 UTC (permalink / raw)
To: linux-cluster
Cc: Walker, Bruce J (HP-Labs), David Teigland, linux-kernel,
ocfs2-devel, clusters_sig
On Thursday 21 July 2005 02:55, Walker, Bruce J (HP-Labs) wrote:
> Like Lars, I too was under the wrong impression about this configfs
> "nodemanager" kernel component. Our discussions in the cluster meeting
> Monday and Tuesday were assuming it was a general service that other
> kernel components could/would utilize and possibly also something that
> could send uevents to non-kernel components wanting a std. way to see
> membership information/events.
>
> As to kernel components without corresponding user-level "managers", look
> no farther than OpenSSI. Our hope was that we could adapt to a user-land
> membership service and this interface thru configfs would drive all our
> kernel subsystems.
Guys, it is absolutely stupid to rely on a virtual filesystem for
userspace/kernel communication for any events that might have to be
transmitted inside the block IO path. This includes, among other things,
memberhips events. Inserting a virtual filesystem into this path does
nothing but add long call chains and new, hard-to-characterize memory
usage.
There are already tried-and-true interfaces that are designed to do this
kind of job efficiently and with quantifiable resource requirements:
sockets (UNIX domain or netlink) and ioctls. If you want to layer a
virtual filesystem on top as a user friendly way to present current cluster
configuration or as a way to provide some administrator knobs, then fine,
virtual filesystems are good for this kind of thing. But please do not try
to insinuate that bloat into the block IO path.
Regards,
Daniel
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2005-07-22 3:22 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2005-07-20 16:55 [Linux-cluster] Re: [Ocfs2-devel] [RFC] nodemanager, ocfs2, dlm Walker, Bruce J (HP-Labs)
2005-07-20 18:09 ` [Clusters_sig] " Lars Marowsky-Bree
2005-07-20 18:39 ` Joel Becker
2005-07-21 23:22 ` Lars Marowsky-Bree
2005-07-22 3:22 ` Daniel Phillips
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox