From: Anshuman Khandual <khandual@linux.vnet.ibm.com>
To: Michael Ellerman <mpe@ellerman.id.au>, linuxppc-dev@lists.ozlabs.org
Cc: aneesh.kumar@linux.vnet.ibm.com
Subject: Re: [PATCH V2] powerpc/mm: Add validation for platform reserved memory ranges
Date: Thu, 14 Jul 2016 08:59:04 +0530 [thread overview]
Message-ID: <57870700.2010902@linux.vnet.ibm.com> (raw)
In-Reply-To: <87zipwetm4.fsf@@concordia.ellerman.id.au>
On 07/05/2016 07:25 AM, Michael Ellerman wrote:
> Anshuman Khandual <khandual@linux.vnet.ibm.com> writes:
>
>> For partition running on PHYP, there can be a adjunct partition
>> which shares the virtual address range with the operating system.
>> Virtual address ranges which can be used by the adjunct partition
>> are communicated with virtual device node of the device tree with
>> a property known as "ibm,reserved-virtual-addresses". This patch
>> introduces a new function named 'validate_reserved_va_range' which
>> is called during initialization to validate that these reserved
>> virtual address ranges do not overlap with the address ranges used
>> by the kernel for all supported memory contexts. This helps prevent
>> the possibility of getting return codes similar to H_RESOURCE for
>> H_PROTECT hcalls for conflicting HPTE entries.
>
> Have you tested this? The endian conversions look wrong to me.
I had tested this both on LE and BE LPARs on PVM environment.
>
>> diff --git a/arch/powerpc/mm/hash_utils_64.c b/arch/powerpc/mm/hash_utils_64.c
>> index ba59d59..b47f667 100644
>> --- a/arch/powerpc/mm/hash_utils_64.c
>> +++ b/arch/powerpc/mm/hash_utils_64.c
>> @@ -1564,3 +1564,80 @@ void setup_initial_memory_limit(phys_addr_t first_memblock_base,
>> /* Finally limit subsequent allocations */
>> memblock_set_current_limit(ppc64_rma_size);
>> }
>> +
>> +/*
>> + * PAPR says that each reserved virtual address range record
>> + * contains three be32 elements which is of toal 12 bytes.
>> + * First two be32 elements contain the abbreviated virtual
>> + * address (high order 32 bits and low order 32 bits that
>> + * generate the abbreviated virtual address of 64 bits which
>> + * need to be concatenated with 24 bits of 0 at the end) and
>> + * the third be32 element contains the size of the reserved
>> + * virtual address range as number of consecutive 4K pages.
>> + */
>> +struct reserved_va_record {
>> + __be32 high_addr;
>> + __be32 low_addr;
>> + __be32 nr_pages_4K;
>> +};
>
> Here you define those fields as __be32.
Hmm, I believe we had agreed upon this. Will check back.
>
>> +/*
>> + * Linux uses 65 bits (CONTEXT_BITS + ESID_BITS + SID_SHIFT)
>> + * of virtual address. As reserved virtual address comes in
>> + * as an abbreviated form (64 bits) from the device tree, we
>> + * will use a partial address bit mask (65 >> 24) to match it
>> + * for simplicity.
>> + */
>> +#define RVA_LESS_BITS 24
>> +#define LINUX_VA_BITS (CONTEXT_BITS + ESID_BITS + SID_SHIFT)
>> +#define PARTIAL_LINUX_VA_MASK ((1ULL << (LINUX_VA_BITS - RVA_LESS_BITS)) - 1)
>> +
>> +static int __init validate_reserved_va_range(void)
>> +{
>> + struct reserved_va_record rva;
>> + struct device_node *np;
>> + int records, ret, i;
>> + __be64 vaddr;
>> +
>> + np = of_find_node_by_name(NULL, "vdevice");
>> + if (!np)
>> + return -ENODEV;
>> +
>> + records = of_property_count_elems_of_size(np,
>> + "ibm,reserved-virtual-addresses",
>> + sizeof(struct reserved_va_record));
>> + if (records < 0)
>> + return records;
>> +
>> + for (i = 0; i < records; i++) {
>> + ret = of_property_read_u32_index(np,
>> + "ibm,reserved-virtual-addresses",
>> + 3 * i, &rva.high_addr);
>
> But then here you use of_property_read_u32_index(), which does the
> endian conversion (to CPU endian) for you.
Okay.
>
>> + ret = of_property_read_u32_index(np,
>> + "ibm,reserved-virtual-addresses",
>> + 3 * i + 1, &rva.low_addr);
>
>> + ret = of_property_read_u32_index(np,
>> + "ibm,reserved-virtual-addresses",
>> + 3 * i + 2, &rva.nr_pages_4K);
>
> So now all the values in rva are CPU endian.
Okay.
>
>> + vaddr = rva.high_addr;
>> + vaddr = (vaddr << 32) | rva.low_addr;
>> + if (vaddr & cpu_to_be64(~PARTIAL_LINUX_VA_MASK))
>> + continue;
>
> But then here you do the comparison against a __be64 value.
>
> I know I told you to use "properly endian-annotated struct", but you
> stil need to use the right conversions in the right places.
>
> I think the best option is to use of_property_read_u32_array() and just
> read the three 32 values into a CPU endian struct.
Sure. But I have kind of lost context of this patch, will look into these
details and get back.
prev parent reply other threads:[~2016-07-14 3:29 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-03-07 9:55 [PATCH V2] powerpc/mm: Add validation for platform reserved memory ranges Anshuman Khandual
2016-04-11 6:13 ` Anshuman Khandual
2016-04-11 13:31 ` Michael Ellerman
2016-07-05 1:55 ` Michael Ellerman
2016-07-14 3:29 ` Anshuman Khandual [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=57870700.2010902@linux.vnet.ibm.com \
--to=khandual@linux.vnet.ibm.com \
--cc=aneesh.kumar@linux.vnet.ibm.com \
--cc=linuxppc-dev@lists.ozlabs.org \
--cc=mpe@ellerman.id.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).