From: Alistair Popple <apopple@nvidia.com>
To: Wei Xu <weixugc@google.com>
Cc: "Aneesh Kumar K.V" <aneesh.kumar@linux.ibm.com>,
Yang Shi <shy828301@gmail.com>,
Andrew Morton <akpm@linux-foundation.org>,
Dave Hansen <dave.hansen@linux.intel.com>,
Huang Ying <ying.huang@intel.com>,
Dan Williams <dan.j.williams@intel.com>,
Linux MM <linux-mm@kvack.org>, Greg Thelen <gthelen@google.com>,
Jagdish Gediya <jvgediya@linux.ibm.com>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Davidlohr Bueso <dave@stgolabs.net>,
Michal Hocko <mhocko@kernel.org>,
Baolin Wang <baolin.wang@linux.alibaba.com>,
Brice Goglin <brice.goglin@gmail.com>,
Feng Tang <feng.tang@intel.com>,
Jonathan Cameron <Jonathan.Cameron@huawei.com>,
Tim Chen <tim.c.chen@linux.intel.com>
Subject: Re: RFC: Memory Tiering Kernel Interfaces
Date: Wed, 11 May 2022 17:34:58 +1000 [thread overview]
Message-ID: <87k0asqzko.fsf@nvdebian.thelocal> (raw)
In-Reply-To: <CAAPL-u9FvCfgA7xsqStLNZ=W03iyWBmvHrpVzPKyitsGN2v_KQ@mail.gmail.com>
Wei Xu <weixugc@google.com> writes:
> On Tue, May 10, 2022 at 4:38 AM Aneesh Kumar K.V
> <aneesh.kumar@linux.ibm.com> wrote:
>>
>> Alistair Popple <apopple@nvidia.com> writes:
>>
>> > Wei Xu <weixugc@google.com> writes:
>> >
>> >> On Thu, May 5, 2022 at 5:19 PM Alistair Popple <apopple@nvidia.com> wrote:
>> >>>
>> >>> Wei Xu <weixugc@google.com> writes:
>> >>>
>> >>> [...]
>> >>>
>> >>> >> >
>> >>> >> >
>> >>> >> > Tiering Hierarchy Initialization
>> >>> >> > `=============================='
>> >>> >> >
>> >>> >> > By default, all memory nodes are in the top tier (N_TOPTIER_MEMORY).
>> >>> >> >
>> >>> >> > A device driver can remove its memory nodes from the top tier, e.g.
>> >>> >> > a dax driver can remove PMEM nodes from the top tier.
>> >>> >>
>> >>> >> With the topology built by firmware we should not need this.
>> >>>
>> >>> I agree that in an ideal world the hierarchy should be built by firmware based
>> >>> on something like the HMAT. But I also think being able to override this will be
>> >>> useful in getting there. Therefore a way of overriding the generated hierarchy
>> >>> would be good, either via sysfs or kernel boot parameter if we don't want to
>> >>> commit to a particular user interface now.
>> >>>
>> >>> However I'm less sure letting device-drivers override this is a good idea. How
>> >>> for example would a GPU driver make sure it's node is in the top tier? By moving
>> >>> every node that the driver does not know about out of N_TOPTIER_MEMORY? That
>> >>> could get messy if say there were two drivers both of which wanted their node to
>> >>> be in the top tier.
>> >>
>> >> The suggestion is to allow a device driver to opt out its memory
>> >> devices from the top-tier, not the other way around.
>> >
>> > So how would demotion work in the case of accelerators then? In that
>> > case we would want GPU memory to demote to DRAM, but that won't happen
>> > if both DRAM and GPU memory are in N_TOPTIER_MEMORY and it seems the
>> > only override available with this proposal would move GPU memory into a
>> > lower tier, which is the opposite of what's needed there.
>>
>> How about we do 3 tiers now. dax kmem devices can be registered to
>> tier 3. By default all numa nodes can be registered at tier 2 and HBM or
>> GPU can be enabled to register at tier 1. ?
>
> This makes sense. I will send an updated RFC based on the discussions so far.
Thanks! The sense I got from LSF/MM was that we should initially try
keep things simple by limiting it to two tiers. However I don't think
there was strong opposition to adding a third tier to support
GPU+CPU+PMEM, and it does seem like it might even be simpler to just
have three tiers and assign devices as suggested.
>> -aneesh
next prev parent reply other threads:[~2022-05-11 7:49 UTC|newest]
Thread overview: 57+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-04-30 2:10 RFC: Memory Tiering Kernel Interfaces Wei Xu
2022-04-30 3:59 ` Yang Shi
2022-04-30 6:37 ` Wei Xu
2022-05-06 0:01 ` Alistair Popple
2022-05-10 4:32 ` Wei Xu
2022-05-10 5:37 ` Alistair Popple
2022-05-10 11:38 ` Aneesh Kumar K.V
2022-05-11 5:30 ` Wei Xu
2022-05-11 7:34 ` Alistair Popple [this message]
2022-05-11 7:49 ` ying.huang
2022-05-11 17:07 ` Wei Xu
2022-05-12 1:42 ` ying.huang
2022-05-12 2:39 ` Wei Xu
2022-05-12 3:13 ` ying.huang
2022-05-12 3:37 ` Wei Xu
2022-05-12 6:24 ` Wei Xu
2022-05-06 18:56 ` Yang Shi
2022-05-09 14:32 ` Hesham Almatary
2022-05-10 3:24 ` Yang Shi
2022-05-10 9:59 ` Hesham Almatary
2022-05-10 12:10 ` Aneesh Kumar K V
2022-05-11 5:42 ` Wei Xu
2022-05-11 7:12 ` Alistair Popple
2022-05-11 9:05 ` Hesham Almatary
2022-05-12 3:02 ` ying.huang
2022-05-12 4:40 ` Aneesh Kumar K V
2022-05-12 4:49 ` Wei Xu
2022-05-10 4:22 ` Wei Xu
2022-05-10 10:01 ` Hesham Almatary
2022-05-10 11:44 ` Aneesh Kumar K.V
2022-05-01 18:35 ` Dan Williams
2022-05-03 6:36 ` Wei Xu
2022-05-06 19:05 ` Yang Shi
2022-05-07 7:56 ` ying.huang
2022-05-01 17:58 ` Davidlohr Bueso
2022-05-02 1:04 ` David Rientjes
2022-05-02 7:23 ` Aneesh Kumar K.V
2022-05-03 2:07 ` Baolin Wang
2022-05-03 6:06 ` Wei Xu
2022-05-03 17:14 ` Alistair Popple
2022-05-03 17:47 ` Dave Hansen
2022-05-03 22:35 ` Alistair Popple
2022-05-03 23:54 ` Dave Hansen
2022-05-04 1:31 ` Wei Xu
2022-05-04 17:02 ` Dave Hansen
2022-05-05 6:35 ` Wei Xu
2022-05-05 14:24 ` Dave Hansen
2022-05-10 4:43 ` Wei Xu
2022-05-02 6:25 ` Aneesh Kumar K.V
2022-05-03 7:02 ` Wei Xu
2022-05-02 15:20 ` Dave Hansen
2022-05-03 7:19 ` Wei Xu
2022-05-03 19:12 ` Tim Chen
2022-05-05 7:02 ` Wei Xu
2022-05-05 8:57 ` ying.huang
2022-05-05 23:57 ` Alistair Popple
2022-05-06 0:25 ` Alistair Popple
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87k0asqzko.fsf@nvdebian.thelocal \
--to=apopple@nvidia.com \
--cc=Jonathan.Cameron@huawei.com \
--cc=akpm@linux-foundation.org \
--cc=aneesh.kumar@linux.ibm.com \
--cc=baolin.wang@linux.alibaba.com \
--cc=brice.goglin@gmail.com \
--cc=dan.j.williams@intel.com \
--cc=dave.hansen@linux.intel.com \
--cc=dave@stgolabs.net \
--cc=feng.tang@intel.com \
--cc=gthelen@google.com \
--cc=jvgediya@linux.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@kernel.org \
--cc=shy828301@gmail.com \
--cc=tim.c.chen@linux.intel.com \
--cc=weixugc@google.com \
--cc=ying.huang@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.