From: Oleksii Kurochko <oleksii.kurochko@gmail.com>
To: Jan Beulich <jbeulich@suse.com>
Cc: "Alistair Francis" <alistair.francis@wdc.com>,
"Bob Eshleman" <bobbyeshleman@gmail.com>,
"Connor Davis" <connojdavis@gmail.com>,
"Andrew Cooper" <andrew.cooper3@citrix.com>,
"Anthony PERARD" <anthony.perard@vates.tech>,
"Michal Orzel" <michal.orzel@amd.com>,
"Julien Grall" <julien@xen.org>,
"Roger Pau Monné" <roger.pau@citrix.com>,
"Stefano Stabellini" <sstabellini@kernel.org>,
xen-devel@lists.xenproject.org
Subject: Re: [PATCH v2 13/17] xen/riscv: Implement p2m_entry_from_mfn() and support PBMT configuration
Date: Tue, 22 Jul 2025 13:34:17 +0200 [thread overview]
Message-ID: <f110cd42-d34b-44fc-bd76-85cb0bfc2357@gmail.com> (raw)
In-Reply-To: <fb7176a2-740e-4773-b1cb-3cd430d6f838@gmail.com>
[-- Attachment #1: Type: text/plain, Size: 8900 bytes --]
On 7/22/25 12:41 PM, Oleksii Kurochko wrote:
>
>
> On 7/21/25 2:18 PM, Jan Beulich wrote:
>> On 18.07.2025 11:52, Oleksii Kurochko wrote:
>>> On 7/17/25 12:25 PM, Jan Beulich wrote:
>>>> On 17.07.2025 10:56, Oleksii Kurochko wrote:
>>>>> On 7/16/25 6:18 PM, Jan Beulich wrote:
>>>>>> On 16.07.2025 18:07, Oleksii Kurochko wrote:
>>>>>>> On 7/16/25 1:31 PM, Jan Beulich wrote:
>>>>>>>> On 15.07.2025 16:47, Oleksii Kurochko wrote:
>>>>>>>>> On 7/1/25 5:08 PM, Jan Beulich wrote:
>>>>>>>>>> On 10.06.2025 15:05, Oleksii Kurochko wrote:
>>>>>>>>>>> --- a/xen/arch/riscv/p2m.c
>>>>>>>>>>> +++ b/xen/arch/riscv/p2m.c
>>>>>>>>>>> @@ -345,6 +345,26 @@ static pte_t *p2m_get_root_pointer(struct p2m_domain *p2m, gfn_t gfn)
>>>>>>>>>>> return __map_domain_page(p2m->root + root_table_indx);
>>>>>>>>>>> }
>>>>>>>>>>>
>>>>>>>>>>> +static int p2m_type_radix_set(struct p2m_domain *p2m, pte_t pte, p2m_type_t t)
>>>>>>>>>> See comments on the earlier patch regarding naming.
>>>>>>>>>>
>>>>>>>>>>> +{
>>>>>>>>>>> + int rc;
>>>>>>>>>>> + gfn_t gfn = mfn_to_gfn(p2m->domain, mfn_from_pte(pte));
>>>>>>>>>> How does this work, when you record GFNs only for Xenheap pages?
>>>>>>>>> I think I don't understand what is an issue. Could you please provide
>>>>>>>>> some extra details?
>>>>>>>> Counter question: The mfn_to_gfn() you currently have is only a stub. It only
>>>>>>>> works for 1:1 mapped domains. Can you show me the eventual final implementation
>>>>>>>> of the function, making it possible to use it here?
>>>>>>> At the moment, I planned to support only 1:1 mapped domains, so it is final
>>>>>>> implementation.
>>>>>> Isn't that on overly severe limitation?
>>>>> I wouldn't say that it's a severe limitation, as it's just a matter of how
>>>>> |mfn_to_gfn()| is implemented. When non-1:1 mapped domains are supported,
>>>>> |mfn_to_gfn()| can be implemented differently, while the code where it’s called
>>>>> will likely remain unchanged.
>>>>>
>>>>> What I meant in my reply is that, for the current state and current limitations,
>>>>> this is the final implementation of|mfn_to_gfn()|. But that doesn't mean I don't
>>>>> see the value in, or the need for, non-1:1 mapped domains—it's just that this
>>>>> limitation simplifies development at the current stage of the RISC-V port.
>>>> Simplification is fine in some cases, but not supporting the "normal" way of
>>>> domain construction looks like a pretty odd restriction. I'm also curious
>>>> how you envision to implement mfn_to_gfn() then, suitable for generic use like
>>>> the one here. Imo, current limitation or not, you simply want to avoid use of
>>>> that function outside of the special gnttab case.
>>>>
>>>>>>>>>> In this context (not sure if I asked before): With this use of a radix tree,
>>>>>>>>>> how do you intend to bound the amount of memory that a domain can use, by
>>>>>>>>>> making Xen insert very many entries?
>>>>>>>>> I didn’t think about that. I assumed it would be enough to set the amount of
>>>>>>>>> memory a guest domain can use by specifying|xen,domain-p2m-mem-mb| in the DTS,
>>>>>>>>> or using some predefined value if|xen,domain-p2m-mem-mb| isn’t explicitly set.
>>>>>>>> Which would require these allocations to come from that pool.
>>>>>>> Yes, and it is true only for non-hardware domains with the current implementation.
>>>>>> ???
>>>>> I meant that pool is used now only for non-hardware domains at the moment.
>>>> And how does this matter here? The memory required for the radix tree doesn't
>>>> come from that pool anyway.
>>> I thought that is possible to do that somehow, but looking at a code of
>>> radix-tree.c it seems like the only one way to allocate memroy for the radix
>>> tree isradix_tree_node_alloc() -> xzalloc(struct rcu_node).
>>>
>>> Then it is needed to introduce radix_tree_node_allocate(domain)
>> That would be a possibility, but you may have seen that less than half a
>> year ago we got rid of something along these lines. So it would require
>> some pretty good justification to re-introduce.
>>
>>> or radix tree
>>> can't be used at all for mentioned in the previous replies security reason, no?
>> (Very) careful use may still be possible. But the downside of using this
>> (potentially long lookup times) would always remain.
> Could you please clarify what do you mean here by "(Very) careful"?
> I thought about an introduction of an amount of possible keys in radix tree and if this amount
> is 0 then stop domain. And it is also unclear what should be a value for this amount.
> Probably, you have better idea.
>
> But generally your idea below ...
>>>>>>>>> Also, it seems this would just lead to the issue you mentioned earlier: when
>>>>>>>>> the memory runs out,|domain_crash()| will be called or PTE will be zapped.
>>>>>>>> Or one domain exhausting memory would cause another domain to fail. A domain
>>>>>>>> impacting just itself may be tolerable. But a domain affecting other domains
>>>>>>>> isn't.
>>>>>>> But it seems like this issue could happen in any implementation. It won't happen only
>>>>>>> if we will have only pre-populated pool for any domain type (hardware, control, guest
>>>>>>> domain) without ability to extend them or allocate extra pages from domheap in runtime.
>>>>>>> Otherwise, if extra pages allocation is allowed then we can't really do something
>>>>>>> with this issue.
>>>>>> But that's why I brought this up: You simply have to. Or, as indicated, the
>>>>>> moment you mark Xen security-supported on RISC-V, there will be an XSA needed.
>>>>> Why it isn't XSA for other architectures? At least, Arm then should have such
>>>>> XSA.
>>>> Does Arm use a radix tree for storing types? It uses one for mem-access, but
>>>> it's not clear to me whether that's actually a supported feature.
>>>>
>>>>> I don't understand why x86 won't have the same issue. Memory is the limited
>>>>> and shared resource, so if one of the domain will use to much memory then it could
>>>>> happen that other domains won't have enough memory for its purpose...
>>>> The question is whether allocations are bounded. With this use of a radix tree,
>>>> you give domains a way to have Xen allocate pretty much arbitrary amounts of
>>>> memory to populate that tree. That unbounded-ness is the problem, not memory
>>>> allocations in general.
>>> Isn't radix tree key bounded to an amount of GFNs given for a domain? We can't have
>>> more keys then a max GFN number for a domain. So a potential amount of necessary memory
>>> for radix tree is also bounded to an amount of GFNs.
>> To some degree yes, hence why I said "pretty much arbitrary amounts".
>> But recall that "amount of GFNs" is a fuzzy term; I think you mean to
>> use it to describe the amount of memory pages given to the guest. GFNs
>> can be used for other purposes, though. Guests could e.g. grant
>> themselves access to their own memory, then map those grants at
>> otherwise unused GFNs.
>>
>>> Anyway, IIUC I just can't use radix tree for p2m types at all, right?
>>> If yes, does it make sense to borrow 2 bits from struct page_info->type_info as now it
>>> is used 9-bits for count of a frame?
>> struct page_info describes MFNs, when you want to describe GFNs. As you
>> mentioned earlier, multiple GFNs can in principle map to the same MFN.
>> You would force them to all have the same properties, which would be in
>> direct conflict with e.g. the grant P2M types.
>>
>> Just to mention one possible alternative to using radix trees: You could
>> maintain a 2nd set of intermediate "page tables", just that leaf entries
>> would hold meta data for the respective GFN. The memory for those "page
>> tables" could come from the normal P2M pool (and allocation would thus
>> only consume domain-specific resources). Of course in any model like
>> this the question of lookup times (as mentioned above) would remain.
> ...looks like an optimal option.
>
> The only thing I worry about is that it will require some code duplication
> (I will think how to re-use the current one code), as for example, when
> setting/getting metadata, TLB flushing isn’t needed at all as we aren't
> working with with real P2M page tables.
> Agree that lookup won't be the best one, but nothing can be done with
> such models.
Probably, instead of having a second set of intermediate "page tables",
we could just allocate two consecutive pages within the real P2M page
tables for the intermediate page table. The first page would serve as
the actual page table to which the intermediate page table points,
and the second page would store metadata for each entry of the page
table that the intermediate page table references.
As we are supporting only 1gb, 2mb and 4kb mappings we could do a little
optimization and start allocate these consecutive pages only for PT levels
which corresponds to 1gb, 2mb, 4kb mappings.
Does it make sense?
~ Oleksii
[-- Attachment #2: Type: text/html, Size: 13162 bytes --]
next prev parent reply other threads:[~2025-07-22 11:34 UTC|newest]
Thread overview: 161+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-10 13:05 [PATCH v2 00/17] xen/riscv: introduce p2m functionality Oleksii Kurochko
2025-06-10 13:05 ` [PATCH v2 01/17] xen/riscv: implement sbi_remote_hfence_gvma() Oleksii Kurochko
2025-06-18 15:15 ` Jan Beulich
2025-06-23 14:31 ` Oleksii Kurochko
2025-06-23 14:39 ` Jan Beulich
2025-06-23 14:45 ` Oleksii Kurochko
2025-06-24 10:33 ` Oleksii Kurochko
2025-06-24 10:48 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 02/17] xen/riscv: introduce sbi_remote_hfence_gvma_vmid() Oleksii Kurochko
2025-06-18 15:20 ` Jan Beulich
2025-06-23 14:38 ` Oleksii Kurochko
2025-06-10 13:05 ` [PATCH v2 03/17] xen/riscv: introduce guest domain's VMID allocation and manegement Oleksii Kurochko
2025-06-18 15:46 ` Jan Beulich
2025-06-24 9:46 ` Oleksii Kurochko
2025-06-24 10:44 ` Jan Beulich
2025-06-24 13:47 ` Oleksii Kurochko
2025-06-24 14:01 ` Jan Beulich
2025-06-24 15:32 ` Oleksii Kurochko
2025-06-26 10:05 ` Oleksii Kurochko
2025-06-26 10:41 ` Jan Beulich
2025-06-26 11:34 ` Oleksii Kurochko
2025-06-26 11:43 ` Juergen Gross
2025-06-26 12:05 ` Oleksii Kurochko
2025-06-26 12:17 ` Teddy Astie
2025-06-26 12:37 ` Jan Beulich
2025-06-26 12:16 ` Jan Beulich
2025-06-26 12:25 ` Oleksii Kurochko
2025-06-10 13:05 ` [PATCH v2 04/17] xen/riscv: construct the P2M pages pool for guests Oleksii Kurochko
2025-06-18 15:53 ` Jan Beulich
2025-06-25 14:48 ` Oleksii Kurochko
2025-06-25 14:55 ` Jan Beulich
2025-07-01 13:04 ` Jan Beulich
2025-07-02 10:30 ` Oleksii Kurochko
2025-07-02 10:34 ` Jan Beulich
2025-07-02 11:17 ` Oleksii Kurochko
2025-07-02 11:48 ` Oleksii Kurochko
2025-07-02 11:56 ` Jan Beulich
2025-07-02 12:34 ` Oleksii Kurochko
2025-07-02 12:49 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 05/17] xen/riscv: introduce things necessary for p2m initialization Oleksii Kurochko
2025-06-18 16:08 ` Jan Beulich
2025-06-25 15:31 ` Oleksii Kurochko
2025-06-25 15:53 ` Jan Beulich
2025-06-26 8:40 ` Oleksii Kurochko
2025-06-26 11:01 ` Jan Beulich
2025-06-26 11:55 ` Oleksii Kurochko
2025-06-10 13:05 ` [PATCH v2 06/17] xen/riscv: add root page table allocation Oleksii Kurochko
2025-06-30 15:22 ` Jan Beulich
2025-06-30 16:18 ` Oleksii Kurochko
2025-07-01 6:29 ` Jan Beulich
2025-07-01 9:44 ` Oleksii Kurochko
2025-07-01 10:27 ` Jan Beulich
2025-07-01 14:02 ` Oleksii Kurochko
2025-07-01 14:28 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 07/17] xen/riscv: introduce pte_{set,get}_mfn() Oleksii Kurochko
2025-06-26 14:57 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 08/17] xen/riscv: add new p2m types and helper macros for type classification Oleksii Kurochko
2025-06-26 14:59 ` Jan Beulich
2025-06-30 14:33 ` Oleksii Kurochko
2025-06-30 14:38 ` Oleksii Kurochko
2025-06-30 14:45 ` Jan Beulich
2025-06-30 15:27 ` Oleksii Kurochko
2025-06-30 15:50 ` Jan Beulich
2025-07-02 10:13 ` Oleksii Kurochko
2025-07-02 10:36 ` Jan Beulich
2025-06-30 14:42 ` Jan Beulich
2025-06-30 15:13 ` Oleksii Kurochko
2025-06-30 15:27 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 09/17] xen/riscv: introduce page_set_xenheap_gfn() Oleksii Kurochko
2025-06-30 15:48 ` Jan Beulich
2025-07-02 15:59 ` Oleksii Kurochko
2025-07-03 5:59 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 10/17] xen/riscv: implement guest_physmap_add_entry() for mapping GFNs to MFNs Oleksii Kurochko
2025-06-30 15:59 ` Jan Beulich
2025-07-03 11:02 ` Oleksii Kurochko
2025-07-03 11:33 ` Jan Beulich
2025-07-03 11:54 ` Oleksii Kurochko
2025-07-03 13:09 ` Jan Beulich
2025-07-03 13:28 ` Oleksii Kurochko
2025-07-03 13:34 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 11/17] xen/riscv: implement p2m_set_entry() and __p2m_set_entry() Oleksii Kurochko
2025-07-01 13:49 ` Jan Beulich
2025-07-04 15:01 ` Oleksii Kurochko
2025-07-07 7:20 ` Jan Beulich
2025-07-07 11:46 ` Oleksii Kurochko
2025-07-07 12:53 ` Jan Beulich
2025-07-07 15:00 ` Oleksii Kurochko
2025-07-07 15:15 ` Jan Beulich
2025-07-07 16:10 ` Oleksii Kurochko
2025-07-08 7:10 ` Jan Beulich
2025-07-08 9:01 ` Oleksii Kurochko
2025-07-08 10:37 ` Oleksii Kurochko
2025-07-08 12:45 ` Jan Beulich
2025-07-08 15:42 ` Oleksii Kurochko
2025-07-08 16:04 ` Jan Beulich
2025-07-09 8:24 ` Oleksii Kurochko
2025-07-09 8:41 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 12/17] xen/riscv: Implement p2m_free_entry() and related helpers Oleksii Kurochko
2025-07-01 14:23 ` Jan Beulich
2025-07-11 15:56 ` Oleksii Kurochko
2025-07-14 7:15 ` Jan Beulich
2025-07-14 16:01 ` Oleksii Kurochko
2025-07-14 16:17 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 13/17] xen/riscv: Implement p2m_entry_from_mfn() and support PBMT configuration Oleksii Kurochko
2025-07-01 15:08 ` Jan Beulich
2025-07-15 14:47 ` Oleksii Kurochko
2025-07-16 11:31 ` Jan Beulich
2025-07-16 16:07 ` Oleksii Kurochko
2025-07-16 16:18 ` Jan Beulich
2025-07-17 8:56 ` Oleksii Kurochko
2025-07-17 10:25 ` Jan Beulich
2025-07-18 9:52 ` Oleksii Kurochko
2025-07-21 12:18 ` Jan Beulich
2025-07-22 10:41 ` Oleksii Kurochko
2025-07-22 11:34 ` Oleksii Kurochko [this message]
2025-07-22 12:00 ` Jan Beulich
2025-07-22 14:25 ` Oleksii Kurochko
2025-07-22 14:35 ` Jan Beulich
2025-07-22 16:07 ` Oleksii Kurochko
2025-07-23 9:46 ` Jan Beulich
2025-07-28 8:52 ` Oleksii Kurochko
2025-07-28 9:09 ` Jan Beulich
2025-07-28 11:37 ` Oleksii Kurochko
2025-07-28 11:49 ` Jan Beulich
2025-07-22 11:54 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 14/17] xen/riscv: implement p2m_next_level() Oleksii Kurochko
2025-07-02 8:35 ` Jan Beulich
2025-07-16 11:32 ` Oleksii Kurochko
2025-07-16 11:43 ` Jan Beulich
2025-07-16 15:53 ` Oleksii Kurochko
2025-07-16 16:12 ` Jan Beulich
2025-07-17 9:42 ` Oleksii Kurochko
2025-07-17 10:37 ` Jan Beulich
2025-07-18 11:19 ` Oleksii Kurochko
2025-07-21 13:14 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 15/17] xen/riscv: Implement superpage splitting for p2m mappings Oleksii Kurochko
2025-07-02 9:25 ` Jan Beulich
2025-07-17 16:37 ` Oleksii Kurochko
2025-07-21 13:34 ` Jan Beulich
2025-07-22 14:57 ` Oleksii Kurochko
2025-07-22 16:02 ` Jan Beulich
2025-07-23 19:51 ` Oleksii Kurochko
2025-07-24 7:58 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 16/17] xen/riscv: implement mfn_valid() and page reference, ownership handling helpers Oleksii Kurochko
2025-07-02 10:09 ` Jan Beulich
2025-07-02 10:28 ` Jan Beulich
2025-07-18 14:37 ` Oleksii Kurochko
2025-07-21 13:39 ` Jan Beulich
2025-07-22 12:03 ` Oleksii Kurochko
2025-07-22 12:05 ` Jan Beulich
2025-07-29 13:47 ` Oleksii Kurochko
2025-07-29 14:48 ` Jan Beulich
2025-07-02 12:52 ` Orzel, Michal
2025-07-18 14:49 ` Oleksii Kurochko
2025-07-21 13:42 ` Jan Beulich
2025-07-22 13:38 ` Oleksii Kurochko
2025-07-21 13:53 ` Jan Beulich
2025-06-10 13:05 ` [PATCH v2 17/17] xen/riscv: add support of page lookup by GFN Oleksii Kurochko
2025-07-02 11:44 ` Jan Beulich
2025-07-21 9:43 ` Oleksii Kurochko
2025-07-21 14:06 ` Jan Beulich
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f110cd42-d34b-44fc-bd76-85cb0bfc2357@gmail.com \
--to=oleksii.kurochko@gmail.com \
--cc=alistair.francis@wdc.com \
--cc=andrew.cooper3@citrix.com \
--cc=anthony.perard@vates.tech \
--cc=bobbyeshleman@gmail.com \
--cc=connojdavis@gmail.com \
--cc=jbeulich@suse.com \
--cc=julien@xen.org \
--cc=michal.orzel@amd.com \
--cc=roger.pau@citrix.com \
--cc=sstabellini@kernel.org \
--cc=xen-devel@lists.xenproject.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.