Linux CXL
 help / color / mirror / Atom feed
From: David Hildenbrand <david@redhat.com>
To: Kyungsan Kim <ks0204.kim@samsung.com>
Cc: lsf-pc@lists.linux-foundation.org, linux-mm@kvack.org,
	linux-fsdevel@vger.kernel.org, linux-cxl@vger.kernel.org,
	a.manzanares@samsung.com, viacheslav.dubeyko@bytedance.com,
	dan.j.williams@intel.com
Subject: Re: FW: [LSF/MM/BPF TOPIC] SMDK inspired MM changes for CXL
Date: Fri, 24 Mar 2023 10:30:32 +0100	[thread overview]
Message-ID: <91d02705-1c3f-5f55-158a-1a68120df2f4@redhat.com> (raw)
In-Reply-To: <20230324092731.148023-1-ks0204.kim@samsung.com>

On 24.03.23 10:27, Kyungsan Kim wrote:
>> On 24.03.23 10:09, Kyungsan Kim wrote:
>>> Thank you David Hinderbrand for your interest on this topic.
>>>
>>>>>
>>>>>> Kyungsan Kim wrote:
>>>>>> [..]
>>>>>>>> In addition to CXL memory, we may have other kind of memory in the
>>>>>>>> system, for example, HBM (High Bandwidth Memory), memory in FPGA card,
>>>>>>>> memory in GPU card, etc.  I guess that we need to consider them
>>>>>>>> together.  Do we need to add one zone type for each kind of memory?
>>>>>>>
>>>>>>> We also don't think a new zone is needed for every single memory
>>>>>>> device.  Our viewpoint is the sole ZONE_NORMAL becomes not enough to
>>>>>>> manage multiple volatile memory devices due to the increased device
>>>>>>> types.  Including CXL DRAM, we think the ZONE_EXMEM can be used to
>>>>>>> represent extended volatile memories that have different HW
>>>>>>> characteristics.
>>>>>>
>>>>>> Some advice for the LSF/MM discussion, the rationale will need to be
>>>>>> more than "we think the ZONE_EXMEM can be used to represent extended
>>>>>> volatile memories that have different HW characteristics". It needs to
>>>>>> be along the lines of "yes, to date Linux has been able to describe DDR
>>>>>> with NUMA effects, PMEM with high write overhead, and HBM with improved
>>>>>> bandwidth not necessarily latency, all without adding a new ZONE, but a
>>>>>> new ZONE is absolutely required now to enable use case FOO, or address
>>>>>> unfixable NUMA problem BAR." Without FOO and BAR to discuss the code
>>>>>> maintainability concern of "fewer degress of freedom in the ZONE
>>>>>> dimension" starts to dominate.
>>>>>
>>>>> One problem we experienced was occured in the combination of hot-remove and kerelspace allocation usecases.
>>>>> ZONE_NORMAL allows kernel context allocation, but it does not allow hot-remove because kernel resides all the time.
>>>>> ZONE_MOVABLE allows hot-remove due to the page migration, but it only allows userspace allocation.
>>>>> Alternatively, we allocated a kernel context out of ZONE_MOVABLE by adding GFP_MOVABLE flag.
>>>
>>>> That sounds like a bad hack :) .
>>> I consent you.
>>>
>>>>> In case, oops and system hang has occasionally occured because ZONE_MOVABLE can be swapped.
>>>>> We resolved the issue using ZONE_EXMEM by allowing seletively choice of the two usecases.
>>>
>>>> I once raised the idea of a ZONE_PREFER_MOVABLE [1], maybe that's
>>>> similar to what you have in mind here. In general, adding new zones is
>>>> frowned upon.
>>>
>>> Actually, we have already studied your idea and thought it is similar with us in 2 aspects.
>>> 1. ZONE_PREFER_MOVABLE allows a kernelspace allocation using a new zone
>>> 2. ZONE_PREFER_MOVABLE helps less fragmentation by splitting zones, and ordering allocation requests from the zones.
>>>
>>> We think ZONE_EXMEM also helps less fragmentation.
>>> Because it is a separated zone and handles a page allocation as movable by default.
>>
>> So how is it different that it would justify a different (more confusing
>> IMHO) name? :) Of course, names don't matter that much, but I'd be
>> interested in which other aspect that zone would be "special".
> 
> FYI for the first time I named it as ZONE_CXLMEM, but we thought it would be needed to cover other extended memory types as well.
> So I changed it as ZONE_EXMEM.
> We also would like to point out a "special" zone aspeact, which is different from ZONE_NORMAL for tranditional DDR DRAM.
> Of course, a symbol naming is important more or less to represent it very nicely, though.
> Do you prefer ZONE_SPECIAL? :)

I called it ZONE_PREFER_MOVABLE. If you studied that approach there must 
be a good reason to name it differently?

-- 
Thanks,

David / dhildenb


  reply	other threads:[~2023-03-24  9:31 UTC|newest]

Thread overview: 67+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <CGME20230221014114epcas2p1687db1d75765a8f9ed0b3495eab1154d@epcas2p1.samsung.com>
2023-02-21  1:41 ` [LSF/MM/BPF TOPIC] SMDK inspired MM changes for CXL Kyungsan Kim
2023-02-27 23:14   ` Dan Williams
2023-02-28  4:35     ` Kyungsan Kim
2023-03-03  6:07   ` Huang, Ying
2023-03-22  4:33     ` FW: " Kyungsan Kim
2023-03-22 22:03       ` Dan Williams
2023-03-23 10:51         ` RE(2): " Kyungsan Kim
2023-03-23 12:25           ` David Hildenbrand
2023-03-24  9:09             ` RE(4): " Kyungsan Kim
2023-03-24  9:12               ` David Hildenbrand
2023-03-24  9:27                 ` RE(2): " Kyungsan Kim
2023-03-24  9:30                   ` David Hildenbrand [this message]
2023-03-24  9:50                     ` RE(3): " Kyungsan Kim
2023-03-24 13:08                       ` Jørgen Hansen
2023-03-24 22:33                         ` David Hildenbrand
2023-03-31 11:42                           ` Kyungsan Kim
2023-03-31 13:42                             ` Matthew Wilcox
2023-03-31 15:56                               ` Frank van der Linden
2023-04-03  8:34                                 ` David Hildenbrand
2023-04-05  2:16                                   ` Kyungsan Kim
2023-04-05  2:06                                 ` Re: " Kyungsan Kim
2023-04-05  5:00                                   ` Dan Williams
2023-04-05  2:01                               ` Kyungsan Kim
2023-04-05  3:11                                 ` Matthew Wilcox
2023-04-03  8:28                             ` David Hildenbrand
2023-04-05  2:09                               ` Kyungsan Kim
2023-03-31 11:31                         ` RE: RE(3): " Kyungsan Kim
2023-03-24  0:41           ` RE(2): " Huang, Ying
2023-03-24  8:48             ` RE(4): " Kyungsan Kim
2023-03-24 13:46               ` Gregory Price
2023-03-31 11:34                 ` Kyungsan Kim
2023-03-31 15:53                   ` Gregory Price
2023-04-05  2:02                     ` Kyungsan Kim
2023-03-24 14:55           ` RE(2): " Matthew Wilcox
2023-03-24 17:49             ` Matthew Wilcox
2023-03-31 11:37               ` Kyungsan Kim
2023-03-31 12:54                 ` Matthew Wilcox
2023-04-05  2:00                   ` Kyungsan Kim
2023-04-05  4:48                     ` Dan Williams
2023-04-05 18:12                       ` Matthew Wilcox
2023-04-05 19:42                         ` Dan Williams
2023-04-06 12:27                           ` David Hildenbrand
2023-04-07  9:30                             ` Kyungsan Kim
2023-03-31 11:38               ` Re: RE(2): " Kyungsan Kim
2023-03-26  7:21           ` Mike Rapoport
2023-03-30 22:03             ` Dragan Stancevic
2023-04-03  8:44               ` Mike Rapoport
2023-04-04  4:27                 ` Dragan Stancevic
2023-04-04  6:47                   ` Huang, Ying
2023-04-06 22:27                     ` Dragan Stancevic
2023-04-07  0:58                       ` Huang, Ying
2023-04-07  9:29                         ` Kyungsan Kim
2023-04-07 14:35                         ` Dragan Stancevic
2023-04-05 10:18                   ` Kyungsan Kim
2023-03-31 11:45             ` RE: RE(2): " Kyungsan Kim
2023-04-04  8:31               ` Mike Rapoport
2023-04-04 17:58                 ` Adam Manzanares
2023-04-01 10:51                   ` Gregory Price
2023-04-04 18:59                     ` [External] " Viacheslav A.Dubeyko
2023-04-01 11:51                       ` Gregory Price
2023-04-04 21:09                         ` Viacheslav A.Dubeyko
     [not found]                         ` <642cb7ec58c71_21a829453@dwillia2-xfh.jf.intel.com.notmuch>
2023-04-05  2:34                           ` Gregory Price
2023-04-05 10:18                         ` Kyungsan Kim
2023-03-30 22:02   ` Dragan Stancevic
2023-03-31 11:46     ` Kyungsan Kim
2023-04-14  8:41   ` FW: " Kyungsan Kim
2023-05-09 18:45     ` MTK

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=91d02705-1c3f-5f55-158a-1a68120df2f4@redhat.com \
    --to=david@redhat.com \
    --cc=a.manzanares@samsung.com \
    --cc=dan.j.williams@intel.com \
    --cc=ks0204.kim@samsung.com \
    --cc=linux-cxl@vger.kernel.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=lsf-pc@lists.linux-foundation.org \
    --cc=viacheslav.dubeyko@bytedance.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox