From: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
To: Paul Jackson <pj@sgi.com>
Cc: rientjes@google.com, clameter@sgi.com, akpm@linux-foundation.org,
ak@suse.de, linux-kernel@vger.kernel.org
Subject: Re: [patch 2/2] cpusets: add interleave_over_allowed option
Date: Mon, 29 Oct 2007 15:01:42 -0400 [thread overview]
Message-ID: <1193684503.5035.142.camel@localhost> (raw)
In-Reply-To: <20071029114109.46285026.pj@sgi.com>
On Mon, 2007-10-29 at 11:41 -0700, Paul Jackson wrote:
> Lee wrote:
> > Maybe it's just me, but I think it's pretty presumptuous to think we can
> > infer the intent of the application from the nodemask w/o additional
> > flags such as Christoph proposed [cpuset relative]--especially for
> > subsets of the cpuset. E.g., the application could intend the nodemask
> > to specify memories within a certain distance of a physical resource,
> > such as where a particular IO adapter or set thereof attach to the
> > platform.
>
> Well, yes, we can't presume to know whether some application can move
> or not.
>
> But our kernel work is not presuming that.
>
> It's providing mechanisms useful for moving apps.
>
> The people using this decide what and when and if to move.
>
> For example, the particular customers (HPC) I focus on for my job don't
> move jobs because they don't want to take the transient performance
> hit that would come from blowing out all their memory caches.
>
> I'm guessing that David's situation involves something closer what you
> see with a shared web hosting service, running jobs that are very
> independent of hardware particulars.
>
> But in any case, we (the kernel) are just providing the mechanisms.
> If they don't fit ones needs, don't use them ;).
>
I'm with you on this last point! I was reacting to the notion that we
can infer intent from a nodemask and that preserving the cpuset relative
numbering after changing cpuset resources or moving tasks preserves that
intent--especially if it involves locality and distance considerations.
I can envision sets of such transformations on HP platforms where
locality and distance would be preserved by preserving cpuset-relative
numbering, and many where they would not. I expect you could do the
same for SGI platforms. I'm not opposed to what you're trying to do,
modulo complexity concerns. And I'm not saying that the complexity is
not worth it to customers. But, given that we just "providing the
mechanism", I think we need to provide very good documentation on the
implications of these mechanism vis a vis whatever
characteristics--locality, distance, bandwidth sharing, ...--the
application intends when it installs a policy.
Like you, no doubt, I'm eyeballs deep in a number of things. At some
point, I'll take a cut at enumerating various "intents" that different
types of applications might have when using mem policies and cpusets.
Others can add to that, or may even beat me to it. We can then
evaluate how well these scenarios are served by the current mechanisms
and by whatever changes are proposed.
I should note that I really like cpusets--i.e., find them useful--and
I'm painfully aware of the awkward interactions with mempolicy. On the
other hand, I don't want to sacrifice mem policy capabilities to shoe
horn them into cpusets. In fact, I want to add additional mechanisms
that may also be awkward in cpusets. As you say, "if they don't fit
your needs, don't use them."
Later,
Lee
next prev parent reply other threads:[~2007-10-29 19:02 UTC|newest]
Thread overview: 98+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-10-25 22:54 [patch 1/2] cpusets: extract mmarray loading from update_nodemask David Rientjes
2007-10-25 22:54 ` [patch 2/2] cpusets: add interleave_over_allowed option David Rientjes
2007-10-25 23:37 ` Christoph Lameter
2007-10-25 23:56 ` David Rientjes
2007-10-26 0:28 ` Christoph Lameter
2007-10-26 1:55 ` Paul Jackson
2007-10-26 2:11 ` David Rientjes
2007-10-26 2:29 ` Paul Jackson
2007-10-26 2:45 ` David Rientjes
2007-10-26 3:14 ` Paul Jackson
2007-10-26 3:58 ` David Rientjes
2007-10-26 4:34 ` Paul Jackson
2007-10-26 15:37 ` Lee Schermerhorn
2007-10-26 17:04 ` Paul Jackson
2007-10-26 17:28 ` Lee Schermerhorn
2007-10-26 20:21 ` Michael Kerrisk
2007-10-26 20:25 ` Paul Jackson
2007-10-26 20:33 ` Michael Kerrisk
2007-10-26 15:30 ` Lee Schermerhorn
2007-10-26 18:46 ` David Rientjes
2007-10-26 19:00 ` Paul Jackson
2007-10-26 20:45 ` David Rientjes
2007-10-26 21:05 ` Christoph Lameter
2007-10-26 21:08 ` David Rientjes
2007-10-26 21:12 ` Christoph Lameter
2007-10-26 21:15 ` David Rientjes
2007-10-26 21:13 ` Lee Schermerhorn
2007-10-26 21:17 ` Christoph Lameter
2007-10-26 21:26 ` Lee Schermerhorn
2007-10-26 21:37 ` Christoph Lameter
2007-10-29 15:00 ` Lee Schermerhorn
2007-10-29 17:33 ` Paul Jackson
2007-10-29 17:46 ` Lee Schermerhorn
2007-10-29 20:35 ` Christoph Lameter
2007-10-26 21:18 ` David Rientjes
2007-10-26 21:31 ` Lee Schermerhorn
2007-10-26 21:39 ` David Rientjes
2007-10-27 1:07 ` Paul Jackson
2007-10-27 1:26 ` Christoph Lameter
2007-10-27 2:41 ` Paul Jackson
2007-10-27 2:50 ` Christoph Lameter
2007-10-27 5:16 ` Paul Jackson
2007-10-27 6:07 ` Christoph Lameter
2007-10-27 8:36 ` Paul Jackson
2007-10-27 17:47 ` Christoph Lameter
2007-10-27 20:59 ` Paul Jackson
2007-10-27 17:50 ` David Rientjes
2007-10-27 23:19 ` Paul Jackson
2007-10-28 18:19 ` David Rientjes
2007-10-28 23:46 ` Paul Jackson
2007-10-29 1:04 ` David Rientjes
2007-10-29 4:27 ` Paul Jackson
2007-10-29 4:47 ` David Rientjes
2007-10-29 5:45 ` Paul Jackson
2007-10-29 7:00 ` David Rientjes
2007-10-29 7:26 ` Paul Jackson
2007-10-30 22:53 ` David Rientjes
2007-10-30 23:17 ` Paul Jackson
2007-10-30 23:25 ` David Rientjes
2007-10-31 0:03 ` Paul Jackson
2007-10-31 0:05 ` Paul Jackson
2007-10-29 7:15 ` Paul Jackson
2007-10-30 23:12 ` David Rientjes
2007-10-30 23:44 ` Paul Jackson
2007-10-30 23:53 ` David Rientjes
2007-10-31 0:29 ` Paul Jackson
2007-10-29 16:54 ` Lee Schermerhorn
2007-10-29 19:40 ` Paul Jackson
2007-10-29 19:45 ` Paul Jackson
2007-10-29 19:57 ` Paul Jackson
2007-10-29 20:02 ` Paul Jackson
2007-10-27 17:45 ` David Rientjes
2007-10-27 21:22 ` Paul Jackson
2007-10-29 15:10 ` Lee Schermerhorn
2007-10-29 18:41 ` Paul Jackson
2007-10-29 19:01 ` Lee Schermerhorn [this message]
2007-10-30 23:17 ` David Rientjes
2007-10-31 0:03 ` Paul Jackson
2007-10-30 22:57 ` David Rientjes
2007-10-30 23:46 ` Paul Jackson
2007-10-26 20:43 ` Lee Schermerhorn
2007-10-26 15:18 ` Lee Schermerhorn
2007-10-26 17:36 ` Christoph Lameter
2007-10-26 18:45 ` David Rientjes
2007-10-26 19:02 ` Paul Jackson
2007-10-27 19:16 ` David Rientjes
2007-10-29 16:23 ` Lee Schermerhorn
2007-10-29 17:35 ` Andi Kleen
2007-10-29 19:35 ` Paul Jackson
2007-10-29 20:36 ` Christoph Lameter
2007-10-29 21:08 ` Andi Kleen
2007-10-29 22:48 ` Paul Jackson
2007-10-30 19:47 ` Paul Jackson
2007-10-30 20:20 ` Lee Schermerhorn
2007-10-30 20:26 ` Paul Jackson
2007-10-30 20:27 ` Andi Kleen
2007-10-26 1:13 ` Paul Jackson
2007-10-26 1:30 ` David Rientjes
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1193684503.5035.142.camel@localhost \
--to=lee.schermerhorn@hp.com \
--cc=ak@suse.de \
--cc=akpm@linux-foundation.org \
--cc=clameter@sgi.com \
--cc=linux-kernel@vger.kernel.org \
--cc=pj@sgi.com \
--cc=rientjes@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox