From: David Gibson <david@gibson.dropbear.id.au>
To: Alexey Kardashevskiy <aik@ozlabs.ru>
Cc: Alex Williamson <alex.williamson@redhat.com>,
qemu-ppc@nongnu.org, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [Qemu-ppc] [PATCH qemu v13 07/16] vfio, memory: Notify IOMMU about starting/stopping being used by VFIO
Date: Thu, 3 Mar 2016 16:28:20 +1100 [thread overview]
Message-ID: <20160303052820.GH1620@voom.redhat.com> (raw)
In-Reply-To: <1456823441-46757-8-git-send-email-aik@ozlabs.ru>
[-- Attachment #1: Type: text/plain, Size: 7280 bytes --]
On Tue, Mar 01, 2016 at 08:10:32PM +1100, Alexey Kardashevskiy wrote:
> This adds a vfio_votify() callback to inform an IOMMU (and then its owner)
> that VFIO started using the IOMMU. This is used by the pseries machine to
> enable/disable in-kernel acceleration of TCE hypercalls.
>
> Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru>
Hmm.. the current approach of having a hook when vfio-pci devices are
attached is pretty ugly, but what exactly the case that it doesn't
handle and this approach does?
This two tiered notify system for a single bit is also kinda ugly.
> ---
> hw/ppc/spapr_iommu.c | 9 +++++++++
> hw/ppc/spapr_pci.c | 14 ++++++++------
> hw/vfio/common.c | 7 +++++++
> include/exec/memory.h | 2 ++
> include/hw/ppc/spapr.h | 4 ++++
> 5 files changed, 30 insertions(+), 6 deletions(-)
>
> diff --git a/hw/ppc/spapr_iommu.c b/hw/ppc/spapr_iommu.c
> index 8a88a74..67a8356 100644
> --- a/hw/ppc/spapr_iommu.c
> +++ b/hw/ppc/spapr_iommu.c
> @@ -136,6 +136,13 @@ static IOMMUTLBEntry spapr_tce_translate_iommu(MemoryRegion *iommu, hwaddr addr,
> return ret;
> }
>
> +static int spapr_tce_vfio_notify(MemoryRegion *iommu, bool attached)
> +{
> + sPAPRTCETable *tcet = container_of(iommu, sPAPRTCETable, iommu);
> +
> + return spapr_tce_vfio_notify_owner(tcet->owner, tcet, attached);
I'm guessing the "owner" is the PHB, but I'm not entirely clear.
Could you use the QOM parent to get the the PHB instead of storing it
explicitly?
> +}
> +
> static int spapr_tce_table_post_load(void *opaque, int version_id)
> {
> sPAPRTCETable *tcet = SPAPR_TCE_TABLE(opaque);
> @@ -167,6 +174,7 @@ static const VMStateDescription vmstate_spapr_tce_table = {
>
> static MemoryRegionIOMMUOps spapr_iommu_ops = {
> .translate = spapr_tce_translate_iommu,
> + .vfio_notify = spapr_tce_vfio_notify,
> };
>
> static int spapr_tce_table_realize(DeviceState *dev)
> @@ -235,6 +243,7 @@ sPAPRTCETable *spapr_tce_new_table(DeviceState *owner, uint32_t liobn)
>
> tcet = SPAPR_TCE_TABLE(object_new(TYPE_SPAPR_TCE_TABLE));
> tcet->liobn = liobn;
> + tcet->owner = owner;
>
> snprintf(tmp, sizeof(tmp), "tce-table-%x", liobn);
> object_property_add_child(OBJECT(owner), tmp, OBJECT(tcet), NULL);
> diff --git a/hw/ppc/spapr_pci.c b/hw/ppc/spapr_pci.c
> index ee0fecf..b0cd148 100644
> --- a/hw/ppc/spapr_pci.c
> +++ b/hw/ppc/spapr_pci.c
> @@ -1084,6 +1084,14 @@ static int spapr_populate_pci_child_dt(PCIDevice *dev, void *fdt, int offset,
> return 0;
> }
>
> +int spapr_tce_vfio_notify_owner(DeviceState *dev, sPAPRTCETable *tcet,
> + bool attached)
> +{
> + spapr_tce_set_need_vfio(tcet, attached);
Hmm.. you go to the trouble of storing owner in dev, then don't
actually use it.
> + return 0;
> +}
> +
> /* create OF node for pci device and required OF DT properties */
> static int spapr_create_pci_child_dt(sPAPRPHBState *phb, PCIDevice *dev,
> void *fdt, int node_offset)
> @@ -1118,12 +1126,6 @@ static void spapr_phb_add_pci_device(sPAPRDRConnector *drc,
> void *fdt = NULL;
> int fdt_start_offset = 0, fdt_size;
>
> - if (object_dynamic_cast(OBJECT(pdev), "vfio-pci")) {
> - sPAPRTCETable *tcet = spapr_tce_find_by_liobn(phb->dma_liobn);
> -
> - spapr_tce_set_need_vfio(tcet, true);
> - }
> -
> if (dev->hotplugged) {
> fdt = create_device_tree(&fdt_size);
> fdt_start_offset = spapr_create_pci_child_dt(phb, pdev, fdt, 0);
> diff --git a/hw/vfio/common.c b/hw/vfio/common.c
> index 9bf4c3b..ca3fd47 100644
> --- a/hw/vfio/common.c
> +++ b/hw/vfio/common.c
> @@ -384,6 +384,7 @@ static void vfio_listener_region_add(MemoryListener *listener,
> QLIST_INSERT_HEAD(&container->giommu_list, giommu, giommu_next);
>
> memory_region_register_iommu_notifier(giommu->iommu, &giommu->n);
> + giommu->iommu->iommu_ops->vfio_notify(section->mr, true);
> memory_region_iommu_replay(giommu->iommu, &giommu->n,
> vfio_container_granularity(container),
> false);
> @@ -430,6 +431,7 @@ static void vfio_listener_region_del(MemoryListener *listener,
> VFIOContainer *container = container_of(listener, VFIOContainer, listener);
> hwaddr iova, end;
> int ret;
> + MemoryRegion *iommu = NULL;
>
> if (vfio_listener_skipped_section(section)) {
> trace_vfio_listener_region_del_skip(
> @@ -451,6 +453,7 @@ static void vfio_listener_region_del(MemoryListener *listener,
> QLIST_FOREACH(giommu, &container->giommu_list, giommu_next) {
> if (giommu->iommu == section->mr) {
> memory_region_unregister_iommu_notifier(&giommu->n);
> + iommu = giommu->iommu;
> QLIST_REMOVE(giommu, giommu_next);
> g_free(giommu);
> break;
> @@ -483,6 +486,10 @@ static void vfio_listener_region_del(MemoryListener *listener,
> "0x%"HWADDR_PRIx") = %d (%m)",
> container, iova, end - iova, ret);
> }
> +
> + if (iommu && iommu->iommu_ops && iommu->iommu_ops->vfio_notify) {
> + iommu->iommu_ops->vfio_notify(section->mr, false);
> + }
So, if an IOMMU is removed from the guest, this will turn off VFIO
enablement. However, IIUC this won't get caled in the more likely
case that the address space stays the same, but the VFIO device is
removed.
> }
>
> static const MemoryListener vfio_memory_listener = {
> diff --git a/include/exec/memory.h b/include/exec/memory.h
> index d5284c2..9f82629 100644
> --- a/include/exec/memory.h
> +++ b/include/exec/memory.h
> @@ -150,6 +150,8 @@ typedef struct MemoryRegionIOMMUOps MemoryRegionIOMMUOps;
> struct MemoryRegionIOMMUOps {
> /* Return a TLB entry that contains a given address. */
> IOMMUTLBEntry (*translate)(MemoryRegion *iommu, hwaddr addr, bool is_write);
> + /* Called when VFIO starts/stops using this */
> + int (*vfio_notify)(MemoryRegion *iommu, bool attached);
> };
>
> typedef struct CoalescedMemoryRange CoalescedMemoryRange;
> diff --git a/include/hw/ppc/spapr.h b/include/hw/ppc/spapr.h
> index 8aa0c45..5d2f8f4 100644
> --- a/include/hw/ppc/spapr.h
> +++ b/include/hw/ppc/spapr.h
> @@ -550,6 +550,7 @@ struct sPAPRTCETable {
> int fd;
> MemoryRegion root, iommu;
> struct VIOsPAPRDevice *vdev; /* for @bypass migration compatibility only */
> + DeviceState *owner;
> QLIST_ENTRY(sPAPRTCETable) list;
> };
>
> @@ -629,4 +630,7 @@ int spapr_rng_populate_dt(void *fdt);
> */
> #define SPAPR_LMB_FLAGS_ASSIGNED 0x00000008
>
> +int spapr_tce_vfio_notify_owner(DeviceState *dev, sPAPRTCETable *tcet,
> + bool attached);
> +
> #endif /* !defined (__HW_SPAPR_H__) */
--
David Gibson | I'll have my music baroque, and my code
david AT gibson.dropbear.id.au | minimalist, thank you. NOT _the_ _other_
| _way_ _around_!
http://www.ozlabs.org/~dgibson
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 819 bytes --]
next prev parent reply other threads:[~2016-03-03 5:34 UTC|newest]
Thread overview: 50+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-03-01 9:10 [Qemu-devel] [PATCH qemu v13 00/16] spapr: vfio: Enable Dynamic DMA windows (DDW) Alexey Kardashevskiy
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 01/16] memory: Fix IOMMU replay base address Alexey Kardashevskiy
2016-03-03 1:34 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 02/16] spapr_pci: Move DMA window enablement to a helper Alexey Kardashevskiy
2016-03-03 1:40 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-10 5:47 ` Alexey Kardashevskiy
2016-03-15 5:30 ` David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 03/16] spapr_iommu: Move table allocation to helpers Alexey Kardashevskiy
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 04/16] spapr_iommu: Introduce "enabled" state for TCE table Alexey Kardashevskiy
2016-03-03 3:00 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-10 7:39 ` Alexey Kardashevskiy
2016-03-15 5:32 ` David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 05/16] spapr_iommu: Add root memory region Alexey Kardashevskiy
2016-03-04 4:08 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 06/16] spapr_pci: Reset DMA config on PHB reset Alexey Kardashevskiy
2016-03-03 3:02 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 07/16] vfio, memory: Notify IOMMU about starting/stopping being used by VFIO Alexey Kardashevskiy
2016-03-03 5:28 ` David Gibson [this message]
2016-03-03 6:01 ` [Qemu-devel] [Qemu-ppc] " Alexey Kardashevskiy
2016-03-04 4:01 ` David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 08/16] memory: Add reporting of supported page sizes Alexey Kardashevskiy
2016-03-03 5:33 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 09/16] vfio: Generalize IOMMU memory listener Alexey Kardashevskiy
2016-03-03 5:36 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-03 6:07 ` Alexey Kardashevskiy
2016-03-04 3:44 ` David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 10/16] vfio: Use different page size for different IOMMU types Alexey Kardashevskiy
2016-03-03 6:08 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 11/16] vfio: spapr: Add SPAPR IOMMU v2 support (DMA memory preregistering) Alexey Kardashevskiy
2016-03-03 6:30 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-15 2:53 ` Alexey Kardashevskiy
2016-03-15 5:42 ` David Gibson
2016-03-17 5:04 ` Alexey Kardashevskiy
2016-03-17 6:10 ` David Gibson
2016-03-17 9:23 ` Alexey Kardashevskiy
2016-03-21 4:53 ` David Gibson
2016-03-21 6:08 ` Alexey Kardashevskiy
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 12/16] vmstate: Define VARRAY with VMS_ALLOC Alexey Kardashevskiy
2016-03-03 6:31 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 13/16] spapr_iommu: Remove need_vfio flag from sPAPRTCETable Alexey Kardashevskiy
2016-03-03 6:38 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 14/16] spapr_pci: Add and export DMA resetting helper Alexey Kardashevskiy
2016-03-03 6:39 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 15/16] vfio: Move iova_pgsizes from container to guest IOMMU Alexey Kardashevskiy
2016-03-03 11:22 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-04 0:02 ` Alexey Kardashevskiy
2016-03-01 9:10 ` [Qemu-devel] [PATCH qemu v13 16/16] spapr_pci/spapr_pci_vfio: Support Dynamic DMA Windows (DDW) Alexey Kardashevskiy
2016-03-04 4:51 ` [Qemu-devel] [Qemu-ppc] " David Gibson
2016-03-11 9:03 ` Alexey Kardashevskiy
2016-03-15 5:53 ` David Gibson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160303052820.GH1620@voom.redhat.com \
--to=david@gibson.dropbear.id.au \
--cc=aik@ozlabs.ru \
--cc=alex.williamson@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=qemu-ppc@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).