From: Daniel Henrique Barboza <danielhb413@gmail.com>
To: David Gibson <david@gibson.dropbear.id.au>
Cc: qemu-ppc@nongnu.org, qemu-devel@nongnu.org, groug@kaod.org
Subject: Re: [PATCH v3 7/7] spapr_drc.c: use DRC reconfiguration to cleanup DIMM unplug state
Date: Fri, 19 Feb 2021 18:31:46 -0300 [thread overview]
Message-ID: <5026c1ed-ebbc-99fc-ac7e-146fe6c9d32b@gmail.com> (raw)
In-Reply-To: <YCyAAe4dJzpsgQ0x@yekko.fritz.box>
On 2/16/21 11:31 PM, David Gibson wrote:
> On Thu, Feb 11, 2021 at 07:52:46PM -0300, Daniel Henrique Barboza wrote:
>> Handling errors in memory hotunplug in the pSeries machine is more complex
>> than any other device type, because there are all the complications that other
>> devices has, and more.
[...]
>>
>> diff --git a/hw/ppc/spapr.c b/hw/ppc/spapr.c
>> index ecce8abf14..4bcded4a1a 100644
>> --- a/hw/ppc/spapr.c
>> +++ b/hw/ppc/spapr.c
>> @@ -3575,6 +3575,36 @@ static SpaprDimmState *spapr_recover_pending_dimm_state(SpaprMachineState *ms,
>> return spapr_pending_dimm_unplugs_add(ms, avail_lmbs, dimm);
>> }
>>
>> +void spapr_clear_pending_dimm_unplug_state(SpaprMachineState *spapr,
>> + PCDIMMDevice *dimm)
>> +{
>> + SpaprDimmState *ds = spapr_pending_dimm_unplugs_find(spapr, dimm);
>> + SpaprDrc *drc;
>> + uint32_t nr_lmbs;
>> + uint64_t size, addr_start, addr;
>> + int i;
>> +
>> + if (ds) {
>> + spapr_pending_dimm_unplugs_remove(spapr, ds);
>> + }
>
> Hrm... how would !ds arise? Could this just be an assert?
!ds would appear if we do not assert g_assert(drc->dev) down there, where you
suggested down below that a malicious/buggy code would trigger it, for example.
With that assert in place then this less likely to occcur.
I guess what I can do here is:
- remove the g_assert(drc->dev) from down below, since it's more related to the
logic of this function;
- here, check if drc->dev is NULL. Return doing nothing if that's the case (all the
function relies on drc->dev being valid);
- if drc->dev is not NULL, then we can g_assert(ds) and proceed with the rest of
the function
This way we become a little more tolerant on drc->dev being NULL, but if drc->dev
is valid we will expect a unplug dimm state to always exist and assert it.
Thanks,
DHB
>
>> +
>> + size = memory_device_get_region_size(MEMORY_DEVICE(dimm), &error_abort);
>> + nr_lmbs = size / SPAPR_MEMORY_BLOCK_SIZE;
>> +
>> + addr_start = object_property_get_uint(OBJECT(dimm), PC_DIMM_ADDR_PROP,
>> + &error_abort);
>> +
>> + addr = addr_start;
>> + for (i = 0; i < nr_lmbs; i++) {
>> + drc = spapr_drc_by_id(TYPE_SPAPR_DRC_LMB,
>> + addr / SPAPR_MEMORY_BLOCK_SIZE);
>> + g_assert(drc);
>> +
>> + drc->unplug_requested = false;
>> + addr += SPAPR_MEMORY_BLOCK_SIZE;
>> + }
>> +}
>> +
>> /* Callback to be called during DRC release. */
>> void spapr_lmb_release(DeviceState *dev)
>> {
>> diff --git a/hw/ppc/spapr_drc.c b/hw/ppc/spapr_drc.c
>> index c143bfb6d3..eae941233a 100644
>> --- a/hw/ppc/spapr_drc.c
>> +++ b/hw/ppc/spapr_drc.c
>> @@ -1230,6 +1230,20 @@ static void rtas_ibm_configure_connector(PowerPCCPU *cpu,
>>
>> drck = SPAPR_DR_CONNECTOR_GET_CLASS(drc);
>>
>> + /*
>> + * This indicates that the kernel is reconfiguring a LMB due to
>> + * a failed hotunplug. Clear the pending unplug state for the whole
>> + * DIMM.
>> + */
>> + if (spapr_drc_type(drc) == SPAPR_DR_CONNECTOR_TYPE_LMB &&
>> + drc->unplug_requested) {
>> +
>> + /* This really shouldn't happen in this point, but ... */
>> + g_assert(drc->dev);
>
> I'm a little worried that a buggy or malicious guest could trigger
> this assert.
>
>> +
>> + spapr_clear_pending_dimm_unplug_state(spapr, PC_DIMM(drc->dev));
>> + }
>> +
>> if (!drc->fdt) {
>> void *fdt;
>> int fdt_size;
>> diff --git a/include/hw/ppc/spapr.h b/include/hw/ppc/spapr.h
>> index ccbeeca1de..5bcc8f3bb8 100644
>> --- a/include/hw/ppc/spapr.h
>> +++ b/include/hw/ppc/spapr.h
>> @@ -847,6 +847,8 @@ int spapr_hpt_shift_for_ramsize(uint64_t ramsize);
>> int spapr_reallocate_hpt(SpaprMachineState *spapr, int shift, Error **errp);
>> void spapr_clear_pending_events(SpaprMachineState *spapr);
>> void spapr_clear_pending_hotplug_events(SpaprMachineState *spapr);
>> +void spapr_clear_pending_dimm_unplug_state(SpaprMachineState *spapr,
>> + PCDIMMDevice *dimm);
>> int spapr_max_server_number(SpaprMachineState *spapr);
>> void spapr_store_hpte(PowerPCCPU *cpu, hwaddr ptex,
>> uint64_t pte0, uint64_t pte1);
>
next prev parent reply other threads:[~2021-02-19 21:32 UTC|newest]
Thread overview: 28+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-02-11 22:52 [PATCH v3 0/7] CPU unplug timeout/LMB unplug cleanup in DRC reconfiguration Daniel Henrique Barboza
2021-02-11 22:52 ` [PATCH v3 1/7] spapr_drc.c: do not call spapr_drc_detach() in drc_isolate_logical() Daniel Henrique Barboza
2021-02-15 10:40 ` Greg Kurz
2021-02-17 0:51 ` David Gibson
2021-02-11 22:52 ` [PATCH v3 2/7] spapr_pci.c: simplify spapr_pci_unplug_request() function handling Daniel Henrique Barboza
2021-02-16 15:50 ` Greg Kurz
2021-02-16 16:09 ` Daniel Henrique Barboza
2021-02-16 17:16 ` Greg Kurz
2021-02-16 17:44 ` Daniel Henrique Barboza
2021-02-17 0:54 ` David Gibson
2021-02-11 22:52 ` [PATCH v3 3/7] spapr_drc.c: use spapr_drc_release() in isolate_physical/set_unusable Daniel Henrique Barboza
2021-02-17 0:57 ` David Gibson
2021-02-17 10:58 ` Greg Kurz
2021-02-11 22:52 ` [PATCH v3 4/7] spapr: rename spapr_drc_detach() to spapr_drc_unplug_request() Daniel Henrique Barboza
2021-02-17 0:58 ` David Gibson
2021-02-17 11:01 ` Greg Kurz
2021-02-11 22:52 ` [PATCH v3 5/7] spapr_drc.c: introduce unplug_timeout_timer Daniel Henrique Barboza
2021-02-17 1:14 ` David Gibson
2021-02-17 1:20 ` David Gibson
2021-02-11 22:52 ` [PATCH v3 6/7] spapr_drc.c: add hotunplug timeout for CPUs Daniel Henrique Barboza
2021-02-17 1:23 ` David Gibson
2021-02-11 22:52 ` [PATCH v3 7/7] spapr_drc.c: use DRC reconfiguration to cleanup DIMM unplug state Daniel Henrique Barboza
2021-02-17 2:31 ` David Gibson
2021-02-19 20:04 ` Daniel Henrique Barboza
2021-02-22 5:53 ` David Gibson
2021-02-19 21:31 ` Daniel Henrique Barboza [this message]
2021-02-22 5:54 ` David Gibson
2021-02-17 2:33 ` [PATCH v3 0/7] CPU unplug timeout/LMB unplug cleanup in DRC reconfiguration David Gibson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5026c1ed-ebbc-99fc-ac7e-146fe6c9d32b@gmail.com \
--to=danielhb413@gmail.com \
--cc=david@gibson.dropbear.id.au \
--cc=groug@kaod.org \
--cc=qemu-devel@nongnu.org \
--cc=qemu-ppc@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).