qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Daniel Henrique Barboza <danielhb413@gmail.com>
To: "Cédric Le Goater" <clg@kaod.org>,
	qemu-ppc@nongnu.org, qemu-devel@nongnu.org
Cc: Kardashevskiy <aik@ozlabs.ru>,
	Frederic Barrat <fbarrat@linux.ibm.com>,
	Fabiano Rosas <farosas@linux.ibm.com>, Greg Kurz <groug@kaod.org>,
	David Gibson <david@gibson.dropbear.id.au>
Subject: Re: [PATCH] ppc/spapr: Advertise StoreEOI for POWER10 compat guests
Date: Thu, 17 Feb 2022 08:28:35 -0300	[thread overview]
Message-ID: <b285a2c3-0a63-3242-454b-b58f52c74084@gmail.com> (raw)
In-Reply-To: <523b6988-57ac-7b1e-9791-a26bafc799ad@kaod.org>



On 2/17/22 04:51, Cédric Le Goater wrote:
> [ adding a few people for the comments ]
> 
> On 2/16/22 23:17, Daniel Henrique Barboza wrote:
>>
>>
>> On 2/14/22 11:11, Cédric Le Goater wrote:
>>> When an interrupt has been handled, the OS notifies the interrupt
>>> controller with a EOI sequence. On a POWER9 and POWER10 systems using
>>
>> nit:  s/a EOI sequence/an EOI sequence
>>
>>
>>> the XIVE interrupt controller, this can be done with a load or a store
>>> operation on the ESB interrupt management page of the interrupt. The
>>> StoreEOI operation has less latency and improves interrupt handling
>>> performance but it was deactivated during the POWER9 DD2.0 timeframe
>>> because of ordering issues. POWER9 systems use the LoadEOI instead.
>>> POWER10 compat guests should have fixed the issue with
>>> Load-after-Store ordering and StoreEOI can be activated for them
>>> again.
>>>
>>> To maintain performance, this ordering is only enforced for the
>>> XIVE_ESB_SET_PQ_10 load operation. This operation can be used to
>>> disable temporarily an interrupt source. If StoreEOI is active, a
>>> source could be left enabled if the load and store operations come
>>> out of order.
>>>
>>> Add a check in our XIVE emulation model for Load-after-Store when
>>> StoreEOI is active. It should catch unreliable sequences. Other load
>>> operations should be fine without it.
>>>
>>> Signed-off-by: Cédric Le Goater <clg@kaod.org>
>>> ---
>>>   include/hw/ppc/spapr_xive.h |  1 +
>>>   include/hw/ppc/xive.h       |  8 ++++++++
>>>   hw/intc/spapr_xive.c        | 15 +++++++++++++++
>>>   hw/intc/spapr_xive_kvm.c    | 15 +++++++++++++++
>>>   hw/intc/xive.c              |  6 ++++++
>>>   hw/ppc/spapr_hcall.c        |  7 +++++++
>>>   6 files changed, 52 insertions(+)
>>>
>>> diff --git a/include/hw/ppc/spapr_xive.h b/include/hw/ppc/spapr_xive.h
>>> index b282960ad90d..9c247d8bf57d 100644
>>> --- a/include/hw/ppc/spapr_xive.h
>>> +++ b/include/hw/ppc/spapr_xive.h
>>> @@ -73,6 +73,7 @@ void spapr_xive_map_mmio(SpaprXive *xive);
>>>   int spapr_xive_end_to_target(uint8_t end_blk, uint32_t end_idx,
>>>                                uint32_t *out_server, uint8_t *out_prio);
>>> +void spapr_xive_enable_store_eoi(SpaprXive *xive, bool enable);
>>>   /*
>>>    * KVM XIVE device helpers
>>> diff --git a/include/hw/ppc/xive.h b/include/hw/ppc/xive.h
>>> index 126e4e2c3a17..133f308c2792 100644
>>> --- a/include/hw/ppc/xive.h
>>> +++ b/include/hw/ppc/xive.h
>>> @@ -285,6 +285,14 @@ uint8_t xive_esb_set(uint8_t *pq, uint8_t value);
>>>   #define XIVE_ESB_SET_PQ_10      0xe00 /* Load */
>>>   #define XIVE_ESB_SET_PQ_11      0xf00 /* Load */
>>> +/*
>>> + * Load-after-store ordering
>>> + *
>>> + * Adding this offset to the load address will enforce
>>> + * load-after-store ordering. This is required to use with StoreEOI.
>>> + */
>>> +#define XIVE_ESB_LD_ST_MO       0x40 /* Load-after-store ordering */
>>> +
>>>   uint8_t xive_source_esb_get(XiveSource *xsrc, uint32_t srcno);
>>>   uint8_t xive_source_esb_set(XiveSource *xsrc, uint32_t srcno, uint8_t pq);
>>> diff --git a/hw/intc/spapr_xive.c b/hw/intc/spapr_xive.c
>>> index dc641cc604bf..0b8a246ad594 100644
>>> --- a/hw/intc/spapr_xive.c
>>> +++ b/hw/intc/spapr_xive.c
>>> @@ -25,6 +25,7 @@
>>>   #include "hw/ppc/xive_regs.h"
>>>   #include "hw/qdev-properties.h"
>>>   #include "trace.h"
>>> +#include "cpu-models.h"
>>>   /*
>>>    * XIVE Virtualization Controller BAR and Thread Managment BAR that we
>>> @@ -1854,3 +1855,17 @@ void spapr_xive_hcall_init(SpaprMachineState *spapr)
>>>       spapr_register_hypercall(H_INT_SYNC, h_int_sync);
>>>       spapr_register_hypercall(H_INT_RESET, h_int_reset);
>>>   }
>>> +
>>> +/*
>>> + * Advertise StoreEOI for a P10 compat guest. OS is required to
>>> + * enforce load-after-store ordering.
>>> + */
>>> +void spapr_xive_enable_store_eoi(SpaprXive *xive, bool enable)
>>> +{
>>> +    if (enable) {
>>> +        xive->source.esb_flags |= XIVE_SRC_STORE_EOI;
>>> +    } else {
>>> +        xive->source.esb_flags &= ~XIVE_SRC_STORE_EOI;
>>> +    }
>>> +
>>> +}
>>> diff --git a/hw/intc/spapr_xive_kvm.c b/hw/intc/spapr_xive_kvm.c
>>> index 61fe7bd2d322..bd023407bd7f 100644
>>> --- a/hw/intc/spapr_xive_kvm.c
>>> +++ b/hw/intc/spapr_xive_kvm.c
>>> @@ -296,6 +296,21 @@ static uint64_t xive_esb_rw(XiveSource *xsrc, int srcno, uint32_t offset,
>>>   static uint8_t xive_esb_read(XiveSource *xsrc, int srcno, uint32_t offset)
>>>   {
>>> +    /*
>>> +     * The XIVE_ESB_SET_PQ_10 load operation is used to disable
>>> +     * temporarily an interrupt source. If StoreEOI is active, a
>>> +     * source could be left enabled if the load and store operations
>>> +     * come out of order.
>>> +     *
>>> +     * As we don't know the characteristics of the host source
>>> +     * interrupts (StoreEOI or not), enforce the load-after-store
>>> +     * ordering always. The performance penalty will be very small for
>>> +     * QEMU.
>>> +     */
>>> +    if (offset == XIVE_ESB_SET_PQ_10) {
>>> +        offset |= XIVE_ESB_LD_ST_MO;
>>> +    }
>>> +
>>>       return xive_esb_rw(xsrc, srcno, offset, 0, 0) & 0x3;
>>>   }
>>> diff --git a/hw/intc/xive.c b/hw/intc/xive.c
>>> index b8e4c7294d59..d62881873b1b 100644
>>> --- a/hw/intc/xive.c
>>> +++ b/hw/intc/xive.c
>>> @@ -1024,6 +1024,12 @@ static uint64_t xive_source_esb_read(void *opaque, hwaddr addr, unsigned size)
>>>       case XIVE_ESB_SET_PQ_01 ... XIVE_ESB_SET_PQ_01 + 0x0FF:
>>>       case XIVE_ESB_SET_PQ_10 ... XIVE_ESB_SET_PQ_10 + 0x0FF:
>>>       case XIVE_ESB_SET_PQ_11 ... XIVE_ESB_SET_PQ_11 + 0x0FF:
>>> +        if (offset == XIVE_ESB_SET_PQ_10 &&
>>> +            xsrc->esb_flags & XIVE_SRC_STORE_EOI) {
>>> +            qemu_log_mask(LOG_GUEST_ERROR, "XIVE: load-after-store ordering "
>>> +                          "not enforced with Store EOI active for IRQ %d\n",
>>> +                          srcno);
>>> +        }
>>
>> I'm afraid I have a lack of understanding of what is happening here but I'll try. Up
>> there, in xive_esb_read(), you forced the load-after-store ordering for the Load
>> operation that can left the source enabled if we have an out of order store-after-load
>> situation. Given that it's a function in spapr_xive_kvm.c 
> 
> Yes, everything done in spapr_xive_kvm.c is related to the KVM support.
> 
> This means that the memory load/store operations are done directly on
> the HW MMIO page of the XIVE controller and this from the QEMU user
> space program. And so ...
> 
> 
>> I understand that this is
>> a wrapper for the kvmppc() calls into the host kernel XIVE irqchip, 
> 
> the host kernel XIVE irqchip is not involded. QEMU manipulates HW
> directly bypassing the host kernel. This is important to understand.
> QEMU "pokes and peeks" HW from user space.

Oh, I thought the host XIVE chip was involved in this case. Good to know.

> 
>> so that piece
>> of code is forcing the load-after-store at all times for that Load operation since
>> the performance penalty is not relevant.
> 
> yes. In that part, we don't know if StoreEOI has been enabled, so we
> use the magic offset in any case.
> 
> This would be a no-op on P9 and would enforce load-after-store always
> on P10. on P10, all sources should have StoreEOI, so it is a correct
> thing to do.
> 
> xive_esb_read() is used in many places:
> 
>   1. in the machine change state handler, for migration and to catch
>      inflight interrupts.
>   2. to query state, for migration and for the monitor
>   3. for LSI handling
> 
> We care about 1.
> 
>> Assuming that what I said is mostly correct, I don't understand why we can't do a
>> similar thing in the hw/intc/xive.c case, where I suppose it's the XIVE emulation
>> controller implementation that we're going to use if we don't have an  irqchip to
>> use with KVM.
> 
> yes. This part of the code is always used for emulated sources. When
> the KVM device is in use, it is not involved.
> 
>> Can't we force the load-after-store ordering in the emulated
>> XIVE_ESB_SET_PQ_10 instead of throwing an error?
> 
> There are no load-after-store issues when under emulation. This is a
> HW-only problem on high end systems. I think you need more than 4sockets
> to show up. I have never seen it on 2 sockets.
> 
> But the HW does not tell if something is wrong in the code. At some point,
> the OS freezes and possibly complains because an IRQ is (wrongly) pending.
> The check done in xive_source_esb_read() is here to catch a misbehaving
> OS which would use in some code path the racy XIVE_ESB_SET_PQ_10 without
> adding the load-after-store ordering offset XIVE_ESB_LD_ST_MO. That's
> how I tracked all the possible paths using XIVE_ESB_SET_PQ_10 under Linux.
> 
> Hope this is mostly clear.

Yes, much clear. Thanks!


Daniel

> 
> Thanks,
> 
> C.


  reply	other threads:[~2022-02-17 11:33 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-02-14 14:11 [PATCH] ppc/spapr: Advertise StoreEOI for POWER10 compat guests Cédric Le Goater
2022-02-16 22:17 ` Daniel Henrique Barboza
2022-02-17  7:51   ` Cédric Le Goater
2022-02-17 11:28     ` Daniel Henrique Barboza [this message]
2022-02-17 11:28 ` Daniel Henrique Barboza
2022-02-17 13:23   ` Cédric Le Goater
2022-02-17 13:30     ` Daniel Henrique Barboza
2022-02-17 17:42       ` Cédric Le Goater
2022-02-17 21:03     ` Daniel Henrique Barboza

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=b285a2c3-0a63-3242-454b-b58f52c74084@gmail.com \
    --to=danielhb413@gmail.com \
    --cc=aik@ozlabs.ru \
    --cc=clg@kaod.org \
    --cc=david@gibson.dropbear.id.au \
    --cc=farosas@linux.ibm.com \
    --cc=fbarrat@linux.ibm.com \
    --cc=groug@kaod.org \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-ppc@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).