* [PATCH v2 0/4] Support dynamic MSI-X allocation
@ 2023-09-18 9:45 Jing Liu
2023-09-18 9:45 ` [PATCH v2 1/4] vfio/pci: detect the support of " Jing Liu
` (4 more replies)
0 siblings, 5 replies; 12+ messages in thread
From: Jing Liu @ 2023-09-18 9:45 UTC (permalink / raw)
To: qemu-devel
Cc: alex.williamson, clg, pbonzini, kevin.tian, reinette.chatre,
jing2.liu, jing2.liu
Changes since v1:
- v1: https://www.mail-archive.com/qemu-devel@nongnu.org/msg982842.html
- Revise Qemu to QEMU. (Cédric)
- Add g_free() on failure to get MSI-X irq info. (Cédric)
- Apply Cédric's Reviewed-by. (Cédric)
- Use g_autofree to automatically release. (Cédric)
- Remove the failure message in vfio_enable_msix_no_vec(). (Cédric)
Changes since RFC v1:
- RFC v1: https://www.mail-archive.com/qemu-devel@nongnu.org/msg978637.html
- Revise the comments. (Alex)
- Report error of getting irq info and remove the trace of failure
case. (Alex, Cédric)
- Only store dynamic allocation flag as a bool type and test
accordingly. (Alex)
- Move dynamic allocation detection to vfio_msix_early_setup(). (Alex)
- Change the condition logic in vfio_msix_vector_do_use() by moving
the defer_kvm_irq_routing test out and creating a common place to update
nr_vectors. (Alex)
- Consolidate the way MSI-X is enabled during device initialization and
interrupt restoring by using the fd = -1 trick. Create a function doing
that. (Alex)
Before kernel v6.5, dynamic allocation of MSI-X interrupts was not
supported. When allocating a new interrupt, QEMU therefore has to first
release all previously allocated interrupts (including disabling MSI-X)
and then re-allocate all interrupts, including the new one.
The kernel series [1] adds support for dynamic MSI-X allocation to
vfio-pci and uses the existing flag VFIO_IRQ_INFO_NORESIZE to guide user
space: when dynamic MSI-X is supported, the flag is cleared.
This series changes the behavior of VFIO PCI devices when dynamic MSI-X
allocation is supported. When the guest unmasks an interrupt, QEMU can
directly allocate an interrupt on the host for it without touching the
previously allocated ones. The host therefore only allocates interrupts
for those that are unmasked (enabled) inside the guest when dynamic MSI-X
allocation is supported by the device.
When a guest enables MSI-X with all of the vectors masked, QEMU needs to
match that state by enabling MSI-X with no vector enabled. During migration
restore, QEMU also needs to enable MSI-X first in dynamic allocation mode,
to avoid allocating vectors on the host that the guest does not use. To
consolidate the two cases, we use vector 0 with an invalid fd to get MSI-X
enabled and create a common function for this. This is cleaner than
setting up userspace triggering and immediately releasing it.
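To make the difference between the two modes concrete, here is a toy model
(not QEMU code) that counts host vector allocations as a guest unmasks
vectors one at a time. The function names and SKETCH_MAX_VECTORS are
hypothetical. The model assumes that in static mode every growth of the
range releases everything and allocates vectors 0..nr again, while in
dynamic mode each newly unmasked vector costs exactly one allocation.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define SKETCH_MAX_VECTORS 2048 /* largest MSI-X table a device can expose */

/*
 * Static mode (toy model): growing the range tears everything down and
 * allocates vectors 0..nr again, so each growth costs nr + 1 allocations.
 */
unsigned host_allocs_static(const unsigned *unmasked, size_t n)
{
    unsigned nr_vectors = 0, allocs = 0;

    for (size_t i = 0; i < n; i++) {
        if (nr_vectors < unmasked[i] + 1) {
            nr_vectors = unmasked[i] + 1;
            allocs += nr_vectors;
        }
    }
    return allocs;
}

/* Dynamic mode (toy model): only vectors not yet allocated cost anything. */
unsigned host_allocs_dynamic(const unsigned *unmasked, size_t n)
{
    bool allocated[SKETCH_MAX_VECTORS] = { false };
    unsigned allocs = 0;

    for (size_t i = 0; i < n; i++) {
        assert(unmasked[i] < SKETCH_MAX_VECTORS);
        if (!allocated[unmasked[i]]) {
            allocated[unmasked[i]] = true;
            allocs++;
        }
    }
    return allocs;
}
```

For a guest that unmasks vectors 0, 1 and 5 in that order, the static model
performs 1 + 2 + 6 allocations, while the dynamic model performs 3.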
Any feedback is appreciated.
Jing
[1] https://lwn.net/Articles/931679/
Jing Liu (4):
vfio/pci: detect the support of dynamic MSI-X allocation
vfio/pci: enable vector on dynamic MSI-X allocation
vfio/pci: use an invalid fd to enable MSI-X
vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation
hw/vfio/pci.c | 121 +++++++++++++++++++++++++++++++++----------
hw/vfio/pci.h | 1 +
hw/vfio/trace-events | 2 +-
3 files changed, 96 insertions(+), 28 deletions(-)
--
2.27.0
* [PATCH v2 1/4] vfio/pci: detect the support of dynamic MSI-X allocation
2023-09-18 9:45 [PATCH v2 0/4] Support dynamic MSI-X allocation Jing Liu
@ 2023-09-18 9:45 ` Jing Liu
2023-09-18 9:45 ` [PATCH v2 2/4] vfio/pci: enable vector on " Jing Liu
` (3 subsequent siblings)
4 siblings, 0 replies; 12+ messages in thread
From: Jing Liu @ 2023-09-18 9:45 UTC (permalink / raw)
To: qemu-devel
Cc: alex.williamson, clg, pbonzini, kevin.tian, reinette.chatre,
jing2.liu, jing2.liu
The kernel signals dynamic MSI-X allocation support for a passthrough
device by clearing the VFIO_IRQ_INFO_NORESIZE flag.
Fetch the flags from the host to determine whether dynamic MSI-X
allocation is supported.
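As a rough illustration of the check, the sketch below reduces the flags
word to a single boolean. It is a standalone model: struct sketch_irq_info
and SKETCH_IRQ_INFO_NORESIZE are hypothetical stand-ins for struct
vfio_irq_info and VFIO_IRQ_INFO_NORESIZE from <linux/vfio.h>, and the bit
value used here is an assumption. Real code would obtain the flags via
VFIO_DEVICE_GET_IRQ_INFO.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for VFIO_IRQ_INFO_NORESIZE; the value is assumed. */
#define SKETCH_IRQ_INFO_NORESIZE (1u << 3)

/* Hypothetical stand-in carrying only the fields this sketch needs. */
struct sketch_irq_info {
    uint32_t flags;
    uint32_t count;
};

/*
 * Returns true when the host cannot resize the MSI-X allocation, i.e.
 * dynamic allocation is NOT supported. A cleared flag means it is.
 */
bool msix_noresize(const struct sketch_irq_info *info)
{
    assert(info);
    return !!(info->flags & SKETCH_IRQ_INFO_NORESIZE);
}
```

Storing the result as a bool, rather than keeping the raw flags, keeps later
tests down to a single `noresize` check.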
Originally-by: Reinette Chatre <reinette.chatre@intel.com>
Signed-off-by: Jing Liu <jing2.liu@intel.com>
Reviewed-by: Cédric Le Goater <clg@redhat.com>
---
Changes since v1:
- Free msix on failure to get MSI-X irq info. (Cédric)
- Apply Cédric's Reviewed-by.
Changes since RFC v1:
- Filter the dynamic MSI-X allocation flag and store as a bool type.
(Alex)
- Move the detection to vfio_msix_early_setup(). (Alex)
- Report error of getting irq info and remove the trace of failure
case. (Alex, Cédric)
---
hw/vfio/pci.c | 16 ++++++++++++++--
hw/vfio/pci.h | 1 +
hw/vfio/trace-events | 2 +-
3 files changed, 16 insertions(+), 3 deletions(-)
diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
index a205c6b1130f..60654ca28ab8 100644
--- a/hw/vfio/pci.c
+++ b/hw/vfio/pci.c
@@ -1493,7 +1493,9 @@ static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp)
uint8_t pos;
uint16_t ctrl;
uint32_t table, pba;
- int fd = vdev->vbasedev.fd;
+ int ret, fd = vdev->vbasedev.fd;
+ struct vfio_irq_info irq_info = { .argsz = sizeof(irq_info),
+ .index = VFIO_PCI_MSIX_IRQ_INDEX };
VFIOMSIXInfo *msix;
pos = pci_find_capability(&vdev->pdev, PCI_CAP_ID_MSIX);
@@ -1530,6 +1532,15 @@ static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp)
msix->pba_offset = pba & ~PCI_MSIX_FLAGS_BIRMASK;
msix->entries = (ctrl & PCI_MSIX_FLAGS_QSIZE) + 1;
+ ret = ioctl(vdev->vbasedev.fd, VFIO_DEVICE_GET_IRQ_INFO, &irq_info);
+ if (ret < 0) {
+ error_setg_errno(errp, -ret, "failed to get MSI-X irq info");
+ g_free(msix);
+ return;
+ }
+
+ msix->noresize = !!(irq_info.flags & VFIO_IRQ_INFO_NORESIZE);
+
/*
* Test the size of the pba_offset variable and catch if it extends outside
* of the specified BAR. If it is the case, we need to apply a hardware
@@ -1562,7 +1573,8 @@ static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp)
}
trace_vfio_msix_early_setup(vdev->vbasedev.name, pos, msix->table_bar,
- msix->table_offset, msix->entries);
+ msix->table_offset, msix->entries,
+ msix->noresize);
vdev->msix = msix;
vfio_pci_fixup_msix_region(vdev);
diff --git a/hw/vfio/pci.h b/hw/vfio/pci.h
index a2771b9ff3cc..0717574d79e9 100644
--- a/hw/vfio/pci.h
+++ b/hw/vfio/pci.h
@@ -113,6 +113,7 @@ typedef struct VFIOMSIXInfo {
uint32_t table_offset;
uint32_t pba_offset;
unsigned long *pending;
+ bool noresize;
} VFIOMSIXInfo;
#define TYPE_VFIO_PCI "vfio-pci"
diff --git a/hw/vfio/trace-events b/hw/vfio/trace-events
index 81ec7c7a958b..cc7c21365c92 100644
--- a/hw/vfio/trace-events
+++ b/hw/vfio/trace-events
@@ -27,7 +27,7 @@ vfio_vga_read(uint64_t addr, int size, uint64_t data) " (0x%"PRIx64", %d) = 0x%"
vfio_pci_read_config(const char *name, int addr, int len, int val) " (%s, @0x%x, len=0x%x) 0x%x"
vfio_pci_write_config(const char *name, int addr, int val, int len) " (%s, @0x%x, 0x%x, len=0x%x)"
vfio_msi_setup(const char *name, int pos) "%s PCI MSI CAP @0x%x"
-vfio_msix_early_setup(const char *name, int pos, int table_bar, int offset, int entries) "%s PCI MSI-X CAP @0x%x, BAR %d, offset 0x%x, entries %d"
+vfio_msix_early_setup(const char *name, int pos, int table_bar, int offset, int entries, bool noresize) "%s PCI MSI-X CAP @0x%x, BAR %d, offset 0x%x, entries %d, noresize %d"
vfio_check_pcie_flr(const char *name) "%s Supports FLR via PCIe cap"
vfio_check_pm_reset(const char *name) "%s Supports PM reset"
vfio_check_af_flr(const char *name) "%s Supports FLR via AF cap"
--
2.27.0
* [PATCH v2 2/4] vfio/pci: enable vector on dynamic MSI-X allocation
2023-09-18 9:45 [PATCH v2 0/4] Support dynamic MSI-X allocation Jing Liu
2023-09-18 9:45 ` [PATCH v2 1/4] vfio/pci: detect the support of " Jing Liu
@ 2023-09-18 9:45 ` Jing Liu
2023-09-22 20:54 ` Alex Williamson
2023-09-18 9:45 ` [PATCH v2 3/4] vfio/pci: use an invalid fd to enable MSI-X Jing Liu
` (2 subsequent siblings)
4 siblings, 1 reply; 12+ messages in thread
From: Jing Liu @ 2023-09-18 9:45 UTC (permalink / raw)
To: qemu-devel
Cc: alex.williamson, clg, pbonzini, kevin.tian, reinette.chatre,
jing2.liu, jing2.liu
The vector_use callback is used to enable a vector that is unmasked in
the guest. The kernel used to support only static MSI-X allocation. When
allocating a new interrupt on "static MSI-X allocation" kernels,
QEMU first disables all previously allocated vectors and then
re-allocates all of them, including the new one. The nr_vectors field of
VFIOPCIDevice indicates that all vectors from 0 to nr_vectors are allocated
(and may be enabled), and is used to loop over all the possibly used
vectors when, e.g., disabling MSI-X interrupts.
Extend the vector_use function to support dynamic MSI-X allocation when
the host supports the capability. QEMU can then individually allocate
and enable a new interrupt without affecting others or losing interrupts
during runtime.
Use nr_vectors as the upper bound of enabled vectors in dynamic MSI-X
allocation mode, since looping over all msix_entries_nr is inefficient
and unnecessary.
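The resulting decision in the vector_use path can be summarized by the
standalone sketch below. It is a model of the control flow only: struct
vec_state, enum vec_action and vector_use_action() are hypothetical names,
and the actions stand for the calls made in the patch (disable plus
vfio_enable_vectors() versus a single vfio_set_irq_signaling()).

```c
#include <assert.h>
#include <stdbool.h>

enum vec_action {
    VEC_ACTION_DEFER,        /* KVM routing is batched; nothing to do yet */
    VEC_ACTION_REENABLE_ALL, /* disable MSI-X, then enable 0..nr_vectors */
    VEC_ACTION_SET_ONE,      /* set up signaling for this vector only */
};

/* Hypothetical subset of the device state that the decision depends on. */
struct vec_state {
    unsigned nr_vectors;
    bool noresize;
    bool defer_kvm_irq_routing;
};

enum vec_action vector_use_action(struct vec_state *s, unsigned nr)
{
    /* Must be computed before nr_vectors is updated. */
    bool resizing = s->nr_vectors < nr + 1;

    assert(s);
    if (resizing) {
        s->nr_vectors = nr + 1;
    }
    if (s->defer_kvm_irq_routing) {
        return VEC_ACTION_DEFER;
    }
    if (s->noresize && resizing) {
        return VEC_ACTION_REENABLE_ALL;
    }
    return VEC_ACTION_SET_ONE;
}
```

Note that nr_vectors grows in both modes; only the static (noresize) case
pays for the full disable and re-enable cycle when the range grows.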
Signed-off-by: Jing Liu <jing2.liu@intel.com>
Tested-by: Reinette Chatre <reinette.chatre@intel.com>
---
Changes since v1:
- Revise Qemu to QEMU.
Changes since RFC v1:
- Test vdev->msix->noresize to identify the allocation mode. (Alex)
- Move defer_kvm_irq_routing test out and update nr_vectors in a
common place before vfio_enable_vectors(). (Alex)
- Revise the comments. (Alex)
---
hw/vfio/pci.c | 44 +++++++++++++++++++++++++++-----------------
1 file changed, 27 insertions(+), 17 deletions(-)
diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
index 60654ca28ab8..84987e46fd7a 100644
--- a/hw/vfio/pci.c
+++ b/hw/vfio/pci.c
@@ -470,6 +470,7 @@ static int vfio_msix_vector_do_use(PCIDevice *pdev, unsigned int nr,
VFIOPCIDevice *vdev = VFIO_PCI(pdev);
VFIOMSIVector *vector;
int ret;
+ int old_nr_vecs = vdev->nr_vectors;
trace_vfio_msix_vector_do_use(vdev->vbasedev.name, nr);
@@ -512,33 +513,42 @@ static int vfio_msix_vector_do_use(PCIDevice *pdev, unsigned int nr,
}
/*
- * We don't want to have the host allocate all possible MSI vectors
- * for a device if they're not in use, so we shutdown and incrementally
- * increase them as needed.
+ * When dynamic allocation is not supported, we don't want to have the
+ * host allocate all possible MSI vectors for a device if they're not
+ * in use, so we shutdown and incrementally increase them as needed.
+ * nr_vectors represents the total number of vectors allocated.
+ *
+ * When dynamic allocation is supported, let the host only allocate
+ * and enable a vector when it is in use in guest. nr_vectors represents
+ * the upper bound of vectors being enabled (but not all of the ranges
+ * is allocated or enabled).
*/
if (vdev->nr_vectors < nr + 1) {
vdev->nr_vectors = nr + 1;
- if (!vdev->defer_kvm_irq_routing) {
+ }
+
+ if (!vdev->defer_kvm_irq_routing) {
+ if (vdev->msix->noresize && (old_nr_vecs < nr + 1)) {
vfio_disable_irqindex(&vdev->vbasedev, VFIO_PCI_MSIX_IRQ_INDEX);
ret = vfio_enable_vectors(vdev, true);
if (ret) {
error_report("vfio: failed to enable vectors, %d", ret);
}
- }
- } else {
- Error *err = NULL;
- int32_t fd;
-
- if (vector->virq >= 0) {
- fd = event_notifier_get_fd(&vector->kvm_interrupt);
} else {
- fd = event_notifier_get_fd(&vector->interrupt);
- }
+ Error *err = NULL;
+ int32_t fd;
- if (vfio_set_irq_signaling(&vdev->vbasedev,
- VFIO_PCI_MSIX_IRQ_INDEX, nr,
- VFIO_IRQ_SET_ACTION_TRIGGER, fd, &err)) {
- error_reportf_err(err, VFIO_MSG_PREFIX, vdev->vbasedev.name);
+ if (vector->virq >= 0) {
+ fd = event_notifier_get_fd(&vector->kvm_interrupt);
+ } else {
+ fd = event_notifier_get_fd(&vector->interrupt);
+ }
+
+ if (vfio_set_irq_signaling(&vdev->vbasedev,
+ VFIO_PCI_MSIX_IRQ_INDEX, nr,
+ VFIO_IRQ_SET_ACTION_TRIGGER, fd, &err)) {
+ error_reportf_err(err, VFIO_MSG_PREFIX, vdev->vbasedev.name);
+ }
}
}
--
2.27.0
* [PATCH v2 3/4] vfio/pci: use an invalid fd to enable MSI-X
2023-09-18 9:45 [PATCH v2 0/4] Support dynamic MSI-X allocation Jing Liu
2023-09-18 9:45 ` [PATCH v2 1/4] vfio/pci: detect the support of " Jing Liu
2023-09-18 9:45 ` [PATCH v2 2/4] vfio/pci: enable vector on " Jing Liu
@ 2023-09-18 9:45 ` Jing Liu
2023-09-19 15:18 ` Cédric Le Goater
2023-09-18 9:45 ` [PATCH v2 4/4] vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation Jing Liu
2023-09-22 20:56 ` [PATCH v2 0/4] Support dynamic MSI-X allocation Alex Williamson
4 siblings, 1 reply; 12+ messages in thread
From: Jing Liu @ 2023-09-18 9:45 UTC (permalink / raw)
To: qemu-devel
Cc: alex.williamson, clg, pbonzini, kevin.tian, reinette.chatre,
jing2.liu, jing2.liu
Guests typically enable MSI-X with all of the vectors masked in the MSI-X
vector table. To match the guest state of the device, QEMU enables MSI-X by
enabling vector 0 with userspace triggering and immediately releasing it.
However, the release function does not actually release it because the
vector is already in userspace mode.
There is no need to enable triggering on the host and rely on the mask bit
to avoid spurious interrupts. Using an invalid fd (i.e. fd = -1) is enough
to get MSI-X enabled.
Once dynamic MSI-X allocation is supported, interrupt restoring also
needs to enable MSI-X this way, so create a function for it.
Suggested-by: Alex Williamson <alex.williamson@redhat.com>
Signed-off-by: Jing Liu <jing2.liu@intel.com>
---
Changes since v1:
- Revise Qemu to QEMU. (Cédric)
- Use g_autofree to automatically release. (Cédric)
- Just return 'ret' and let the caller of vfio_enable_msix_no_vec()
report the error. (Cédric)
Changes since RFC v1:
- A new patch. Use an invalid fd to get MSI-X enabled instead of using
userspace triggering. (Alex)
---
hw/vfio/pci.c | 44 ++++++++++++++++++++++++++++++++++++--------
1 file changed, 36 insertions(+), 8 deletions(-)
diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
index 84987e46fd7a..0117f230e934 100644
--- a/hw/vfio/pci.c
+++ b/hw/vfio/pci.c
@@ -369,6 +369,33 @@ static void vfio_msi_interrupt(void *opaque)
notify(&vdev->pdev, nr);
}
+/*
+ * Get MSI-X enabled, but no vector enabled, by setting vector 0 with an invalid
+ * fd to kernel.
+ */
+static int vfio_enable_msix_no_vec(VFIOPCIDevice *vdev)
+{
+ g_autofree struct vfio_irq_set *irq_set = NULL;
+ int ret = 0, argsz;
+ int32_t *fd;
+
+ argsz = sizeof(*irq_set) + sizeof(*fd);
+
+ irq_set = g_malloc0(argsz);
+ irq_set->argsz = argsz;
+ irq_set->flags = VFIO_IRQ_SET_DATA_EVENTFD |
+ VFIO_IRQ_SET_ACTION_TRIGGER;
+ irq_set->index = VFIO_PCI_MSIX_IRQ_INDEX;
+ irq_set->start = 0;
+ irq_set->count = 1;
+ fd = (int32_t *)&irq_set->data;
+ *fd = -1;
+
+ ret = ioctl(vdev->vbasedev.fd, VFIO_DEVICE_SET_IRQS, irq_set);
+
+ return ret;
+}
+
static int vfio_enable_vectors(VFIOPCIDevice *vdev, bool msix)
{
struct vfio_irq_set *irq_set;
@@ -618,6 +645,8 @@ static void vfio_commit_kvm_msi_virq_batch(VFIOPCIDevice *vdev)
static void vfio_msix_enable(VFIOPCIDevice *vdev)
{
+ int ret;
+
vfio_disable_interrupts(vdev);
vdev->msi_vectors = g_new0(VFIOMSIVector, vdev->msix->entries);
@@ -640,8 +669,6 @@ static void vfio_msix_enable(VFIOPCIDevice *vdev)
vfio_commit_kvm_msi_virq_batch(vdev);
if (vdev->nr_vectors) {
- int ret;
-
ret = vfio_enable_vectors(vdev, true);
if (ret) {
error_report("vfio: failed to enable vectors, %d", ret);
@@ -655,13 +682,14 @@ static void vfio_msix_enable(VFIOPCIDevice *vdev)
* MSI-X capability, but leaves the vector table masked. We therefore
* can't rely on a vector_use callback (from request_irq() in the guest)
* to switch the physical device into MSI-X mode because that may come a
- * long time after pci_enable_msix(). This code enables vector 0 with
- * triggering to userspace, then immediately release the vector, leaving
- * the physical device with no vectors enabled, but MSI-X enabled, just
- * like the guest view.
+ * long time after pci_enable_msix(). This code sets vector 0 with an
+ * invalid fd to make the physical device MSI-X enabled, but with no
+ * vectors enabled, just like the guest view.
*/
- vfio_msix_vector_do_use(&vdev->pdev, 0, NULL, NULL);
- vfio_msix_vector_release(&vdev->pdev, 0);
+ ret = vfio_enable_msix_no_vec(vdev);
+ if (ret) {
+ error_report("vfio: failed to enable MSI-X, %d", ret);
+ }
}
trace_vfio_msix_enable(vdev->vbasedev.name);
--
2.27.0
* [PATCH v2 4/4] vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation
2023-09-18 9:45 [PATCH v2 0/4] Support dynamic MSI-X allocation Jing Liu
` (2 preceding siblings ...)
2023-09-18 9:45 ` [PATCH v2 3/4] vfio/pci: use an invalid fd to enable MSI-X Jing Liu
@ 2023-09-18 9:45 ` Jing Liu
2023-09-19 15:21 ` Cédric Le Goater
2023-09-22 20:56 ` [PATCH v2 0/4] Support dynamic MSI-X allocation Alex Williamson
4 siblings, 1 reply; 12+ messages in thread
From: Jing Liu @ 2023-09-18 9:45 UTC (permalink / raw)
To: qemu-devel
Cc: alex.williamson, clg, pbonzini, kevin.tian, reinette.chatre,
jing2.liu, jing2.liu
During migration restore, vfio_enable_vectors() is called to re-enable
MSI-X interrupts for assigned devices. It passes the range from
0 to nr_vectors to the kernel to enable MSI-X and the vectors unmasked in
the guest. While MSI-X is being enabled, all the vectors within the range
are allocated by the VFIO_DEVICE_SET_IRQS ioctl.
When dynamic MSI-X allocation is supported, we only want the vectors
unmasked by the guest to be allocated and enabled. Use vector 0 with an
invalid fd to get MSI-X enabled; after that, vectors can be allocated
as needed.
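The "sparse" eventfd set mentioned here can be pictured with the
standalone sketch below. struct sketch_vector and fill_sparse_fds() are
hypothetical names, not QEMU code. The sketch assumes, as described above,
that an entry of -1 leaves the corresponding vector without a trigger, so
only vectors in use by the guest end up enabled.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-vector state for this sketch. */
struct sketch_vector {
    bool use;   /* vector is in use (unmasked) in the guest */
    int32_t fd; /* eventfd that would signal this vector */
};

/*
 * Fill fds[0..n) for a range-based request: the eventfd for each vector
 * in use, -1 for every other slot. Returns the number of vectors in use.
 */
size_t fill_sparse_fds(const struct sketch_vector *vecs, size_t n,
                       int32_t *fds)
{
    size_t used = 0;

    assert(n == 0 || (vecs != NULL && fds != NULL));
    for (size_t i = 0; i < n; i++) {
        if (vecs[i].use) {
            fds[i] = vecs[i].fd;
            used++;
        } else {
            fds[i] = -1;
        }
    }
    return used;
}
```

With MSI-X already enabled through vector 0 and an invalid fd, a request
built this way covers the whole 0..nr_vectors range while asking the host
to back only the slots that carry a valid eventfd.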
Signed-off-by: Jing Liu <jing2.liu@intel.com>
---
Changes since v1:
- No change.
Changes since RFC v1:
- Revise the comments. (Alex)
- Call the new helper function in previous patch to enable MSI-X. (Alex)
---
hw/vfio/pci.c | 17 +++++++++++++++++
1 file changed, 17 insertions(+)
diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
index 0117f230e934..f5f891dc0792 100644
--- a/hw/vfio/pci.c
+++ b/hw/vfio/pci.c
@@ -402,6 +402,23 @@ static int vfio_enable_vectors(VFIOPCIDevice *vdev, bool msix)
int ret = 0, i, argsz;
int32_t *fds;
+ /*
+ * If dynamic MSI-X allocation is supported, the vectors to be allocated
+ * and enabled can be scattered. Before kernel enabling MSI-X, setting
+ * nr_vectors causes all these vectors to be allocated on host.
+ *
+ * To keep allocation as needed, use vector 0 with an invalid fd to get
+ * MSI-X enabled first, then set vectors with a potentially sparse set of
+ * eventfds to enable interrupts only when enabled in guest.
+ */
+ if (msix && !vdev->msix->noresize) {
+ ret = vfio_enable_msix_no_vec(vdev);
+
+ if (ret) {
+ return ret;
+ }
+ }
+
argsz = sizeof(*irq_set) + (vdev->nr_vectors * sizeof(*fds));
irq_set = g_malloc0(argsz);
--
2.27.0
* Re: [PATCH v2 3/4] vfio/pci: use an invalid fd to enable MSI-X
2023-09-18 9:45 ` [PATCH v2 3/4] vfio/pci: use an invalid fd to enable MSI-X Jing Liu
@ 2023-09-19 15:18 ` Cédric Le Goater
0 siblings, 0 replies; 12+ messages in thread
From: Cédric Le Goater @ 2023-09-19 15:18 UTC (permalink / raw)
To: Jing Liu, qemu-devel
Cc: alex.williamson, pbonzini, kevin.tian, reinette.chatre, jing2.liu
On 9/18/23 11:45, Jing Liu wrote:
> Guests typically enable MSI-X with all of the vectors masked in the MSI-X
> vector table. To match the guest state of device, QEMU enables MSI-X by
> enabling vector 0 with userspace triggering and immediately release.
> However the release function actually does not release it due to already
> using userspace mode.
>
> It is no need to enable triggering on host and rely on the mask bit to
> avoid spurious interrupts. Use an invalid fd (i.e. fd = -1) is enough
> to get MSI-X enabled.
>
> After dynamic MSI-X allocation is supported, the interrupt restoring
> also need use such way to enable MSI-X, therefore, create a function
> for that.
>
> Suggested-by: Alex Williamson <alex.williamson@redhat.com>
> Signed-off-by: Jing Liu <jing2.liu@intel.com>
Reviewed-by: Cédric Le Goater <clg@redhat.com>
Thanks,
C.
> ---
> Changes since v1:
> - Revise Qemu to QEMU. (Cédric)
> - Use g_autofree to automatically release. (Cédric)
> - Just return 'ret' and let the caller of vfio_enable_msix_no_vec()
> report the error. (Cédric)
>
> Changes since RFC v1:
> - A new patch. Use an invalid fd to get MSI-X enabled instead of using
> userspace triggering. (Alex)
> ---
> hw/vfio/pci.c | 44 ++++++++++++++++++++++++++++++++++++--------
> 1 file changed, 36 insertions(+), 8 deletions(-)
>
> diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
> index 84987e46fd7a..0117f230e934 100644
> --- a/hw/vfio/pci.c
> +++ b/hw/vfio/pci.c
> @@ -369,6 +369,33 @@ static void vfio_msi_interrupt(void *opaque)
> notify(&vdev->pdev, nr);
> }
>
> +/*
> + * Get MSI-X enabled, but no vector enabled, by setting vector 0 with an invalid
> + * fd to kernel.
> + */
> +static int vfio_enable_msix_no_vec(VFIOPCIDevice *vdev)
> +{
> + g_autofree struct vfio_irq_set *irq_set = NULL;
> + int ret = 0, argsz;
> + int32_t *fd;
> +
> + argsz = sizeof(*irq_set) + sizeof(*fd);
> +
> + irq_set = g_malloc0(argsz);
> + irq_set->argsz = argsz;
> + irq_set->flags = VFIO_IRQ_SET_DATA_EVENTFD |
> + VFIO_IRQ_SET_ACTION_TRIGGER;
> + irq_set->index = VFIO_PCI_MSIX_IRQ_INDEX;
> + irq_set->start = 0;
> + irq_set->count = 1;
> + fd = (int32_t *)&irq_set->data;
> + *fd = -1;
> +
> + ret = ioctl(vdev->vbasedev.fd, VFIO_DEVICE_SET_IRQS, irq_set);
> +
> + return ret;
> +}
> +
> static int vfio_enable_vectors(VFIOPCIDevice *vdev, bool msix)
> {
> struct vfio_irq_set *irq_set;
> @@ -618,6 +645,8 @@ static void vfio_commit_kvm_msi_virq_batch(VFIOPCIDevice *vdev)
>
> static void vfio_msix_enable(VFIOPCIDevice *vdev)
> {
> + int ret;
> +
> vfio_disable_interrupts(vdev);
>
> vdev->msi_vectors = g_new0(VFIOMSIVector, vdev->msix->entries);
> @@ -640,8 +669,6 @@ static void vfio_msix_enable(VFIOPCIDevice *vdev)
> vfio_commit_kvm_msi_virq_batch(vdev);
>
> if (vdev->nr_vectors) {
> - int ret;
> -
> ret = vfio_enable_vectors(vdev, true);
> if (ret) {
> error_report("vfio: failed to enable vectors, %d", ret);
> @@ -655,13 +682,14 @@ static void vfio_msix_enable(VFIOPCIDevice *vdev)
> * MSI-X capability, but leaves the vector table masked. We therefore
> * can't rely on a vector_use callback (from request_irq() in the guest)
> * to switch the physical device into MSI-X mode because that may come a
> - * long time after pci_enable_msix(). This code enables vector 0 with
> - * triggering to userspace, then immediately release the vector, leaving
> - * the physical device with no vectors enabled, but MSI-X enabled, just
> - * like the guest view.
> + * long time after pci_enable_msix(). This code sets vector 0 with an
> + * invalid fd to make the physical device MSI-X enabled, but with no
> + * vectors enabled, just like the guest view.
> */
> - vfio_msix_vector_do_use(&vdev->pdev, 0, NULL, NULL);
> - vfio_msix_vector_release(&vdev->pdev, 0);
> + ret = vfio_enable_msix_no_vec(vdev);
> + if (ret) {
> + error_report("vfio: failed to enable MSI-X, %d", ret);
> + }
> }
>
> trace_vfio_msix_enable(vdev->vbasedev.name);
* Re: [PATCH v2 4/4] vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation
2023-09-18 9:45 ` [PATCH v2 4/4] vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation Jing Liu
@ 2023-09-19 15:21 ` Cédric Le Goater
2023-09-21 6:48 ` Liu, Jing2
0 siblings, 1 reply; 12+ messages in thread
From: Cédric Le Goater @ 2023-09-19 15:21 UTC (permalink / raw)
To: Jing Liu, qemu-devel
Cc: alex.williamson, pbonzini, kevin.tian, reinette.chatre, jing2.liu
On 9/18/23 11:45, Jing Liu wrote:
> During migration restoring, vfio_enable_vectors() is called to restore
> enabling MSI-X interrupts for assigned devices. It sets the range from
> 0 to nr_vectors to kernel to enable MSI-X and the vectors unmasked in
> guest. During the MSI-X enabling, all the vectors within the range are
> allocated according to the VFIO_DEVICE_SET_IRQS ioctl.
>
> When dynamic MSI-X allocation is supported, we only want the guest
> unmasked vectors being allocated and enabled. Use vector 0 with an
> invalid fd to get MSI-X enabled, after that, all the vectors can be
> allocated in need.
>
> Signed-off-by: Jing Liu <jing2.liu@intel.com>
Reviewed-by: Cédric Le Goater <clg@redhat.com>
Thanks,
C.
> ---
> Changes since v1:
> - No change.
>
> Changes since RFC v1:
> - Revise the comments. (Alex)
> - Call the new helper function in previous patch to enable MSI-X. (Alex)
> ---
> hw/vfio/pci.c | 17 +++++++++++++++++
> 1 file changed, 17 insertions(+)
>
> diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
> index 0117f230e934..f5f891dc0792 100644
> --- a/hw/vfio/pci.c
> +++ b/hw/vfio/pci.c
> @@ -402,6 +402,23 @@ static int vfio_enable_vectors(VFIOPCIDevice *vdev, bool msix)
> int ret = 0, i, argsz;
> int32_t *fds;
>
> + /*
> + * If dynamic MSI-X allocation is supported, the vectors to be allocated
> + * and enabled can be scattered. Before kernel enabling MSI-X, setting
> + * nr_vectors causes all these vectors to be allocated on host.
> + *
> + * To keep allocation as needed, use vector 0 with an invalid fd to get
> + * MSI-X enabled first, then set vectors with a potentially sparse set of
> + * eventfds to enable interrupts only when enabled in guest.
> + */
> + if (msix && !vdev->msix->noresize) {
> + ret = vfio_enable_msix_no_vec(vdev);
> +
> + if (ret) {
> + return ret;
> + }
> + }
> +
> argsz = sizeof(*irq_set) + (vdev->nr_vectors * sizeof(*fds));
>
> irq_set = g_malloc0(argsz);
* RE: [PATCH v2 4/4] vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation
2023-09-19 15:21 ` Cédric Le Goater
@ 2023-09-21 6:48 ` Liu, Jing2
0 siblings, 0 replies; 12+ messages in thread
From: Liu, Jing2 @ 2023-09-21 6:48 UTC (permalink / raw)
To: Cédric Le Goater, qemu-devel@nongnu.org
Cc: alex.williamson@redhat.com, pbonzini@redhat.com, Tian, Kevin,
Chatre, Reinette, jing2.liu@linux.intel.com
Hi Cédric,
> On 9/19/2023 11:21 PM, Cédric Le Goater wrote:
>
> On 9/18/23 11:45, Jing Liu wrote:
> > During migration restoring, vfio_enable_vectors() is called to restore
> > enabling MSI-X interrupts for assigned devices. It sets the range from
> > 0 to nr_vectors to kernel to enable MSI-X and the vectors unmasked in
> > guest. During the MSI-X enabling, all the vectors within the range are
> > allocated according to the VFIO_DEVICE_SET_IRQS ioctl.
> >
> > When dynamic MSI-X allocation is supported, we only want the guest
> > unmasked vectors being allocated and enabled. Use vector 0 with an
> > invalid fd to get MSI-X enabled, after that, all the vectors can be
> > allocated in need.
> >
> > Signed-off-by: Jing Liu <jing2.liu@intel.com>
>
>
> Reviewed-by: Cédric Le Goater <clg@redhat.com>
Thanks very much for your feedback.
Jing
>
> Thanks,
>
> C.
>
>
> > ---
> > Changes since v1:
> > - No change.
> >
> > Changes since RFC v1:
> > - Revise the comments. (Alex)
> > - Call the new helper function in previous patch to enable MSI-X.
> > (Alex)
> > ---
> > hw/vfio/pci.c | 17 +++++++++++++++++
> > 1 file changed, 17 insertions(+)
> >
> > diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
> > index 0117f230e934..f5f891dc0792 100644
> > --- a/hw/vfio/pci.c
> > +++ b/hw/vfio/pci.c
> > @@ -402,6 +402,23 @@ static int vfio_enable_vectors(VFIOPCIDevice *vdev, bool msix)
> > int ret = 0, i, argsz;
> > int32_t *fds;
> >
> > + /*
> > + * If dynamic MSI-X allocation is supported, the vectors to be allocated
> > + * and enabled can be scattered. Before kernel enabling MSI-X, setting
> > + * nr_vectors causes all these vectors to be allocated on host.
> > + *
> > + * To keep allocation as needed, use vector 0 with an invalid fd to get
> > + * MSI-X enabled first, then set vectors with a potentially sparse set of
> > + * eventfds to enable interrupts only when enabled in guest.
> > + */
> > + if (msix && !vdev->msix->noresize) {
> > + ret = vfio_enable_msix_no_vec(vdev);
> > +
> > + if (ret) {
> > + return ret;
> > + }
> > + }
> > +
> > argsz = sizeof(*irq_set) + (vdev->nr_vectors * sizeof(*fds));
> >
> > irq_set = g_malloc0(argsz);
* Re: [PATCH v2 2/4] vfio/pci: enable vector on dynamic MSI-X allocation
2023-09-18 9:45 ` [PATCH v2 2/4] vfio/pci: enable vector on " Jing Liu
@ 2023-09-22 20:54 ` Alex Williamson
2023-09-25 6:04 ` Liu, Jing2
0 siblings, 1 reply; 12+ messages in thread
From: Alex Williamson @ 2023-09-22 20:54 UTC (permalink / raw)
To: Jing Liu
Cc: qemu-devel, clg, pbonzini, kevin.tian, reinette.chatre, jing2.liu
On Mon, 18 Sep 2023 05:45:05 -0400
Jing Liu <jing2.liu@intel.com> wrote:
> The vector_use callback is used to enable vector that is unmasked in
> guest. The kernel used to only support static MSI-X allocation. When
> allocating a new interrupt using "static MSI-X allocation" kernels,
> QEMU first disables all previously allocated vectors and then
> re-allocates all including the new one. The nr_vectors of VFIOPCIDevice
> indicates that all vectors from 0 to nr_vectors are allocated (and may
> be enabled), which is used to to loop all the possibly used vectors
^^ ^^
s/to to/to/
> When, e.g., disabling MSI-X interrupts.
>
> Extend the vector_use function to support dynamic MSI-X allocation when
> host supports the capability. QEMU therefore can individually allocate
> and enable a new interrupt without affecting others or causing interrupts
> lost during runtime.
>
> Utilize nr_vectors to calculate the upper bound of enabled vectors in
> dynamic MSI-X allocation mode since looping all msix_entries_nr is not
> efficient and unnecessary.
>
> Signed-off-by: Jing Liu <jing2.liu@intel.com>
> Tested-by: Reinette Chatre <reinette.chatre@intel.com>
> ---
> Changes since v1:
> - Revise Qemu to QEMU.
>
> Changes since RFC v1:
> - Test vdev->msix->noresize to identify the allocation mode. (Alex)
> - Move defer_kvm_irq_routing test out and update nr_vectors in a
> common place before vfio_enable_vectors(). (Alex)
> - Revise the comments. (Alex)
> ---
> hw/vfio/pci.c | 44 +++++++++++++++++++++++++++-----------------
> 1 file changed, 27 insertions(+), 17 deletions(-)
>
> diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
> index 60654ca28ab8..84987e46fd7a 100644
> --- a/hw/vfio/pci.c
> +++ b/hw/vfio/pci.c
> @@ -470,6 +470,7 @@ static int vfio_msix_vector_do_use(PCIDevice *pdev, unsigned int nr,
> VFIOPCIDevice *vdev = VFIO_PCI(pdev);
> VFIOMSIVector *vector;
> int ret;
> + int old_nr_vecs = vdev->nr_vectors;
Minor suggestion, it reads slightly better below if this were something
like:
bool resizing = !!(vdev->nr_vectors < nr + 1);
Then use the bool in place of the nr+1 tests below. Thanks,
Alex
>
> trace_vfio_msix_vector_do_use(vdev->vbasedev.name, nr);
>
> @@ -512,33 +513,42 @@ static int vfio_msix_vector_do_use(PCIDevice *pdev, unsigned int nr,
> }
>
> /*
> - * We don't want to have the host allocate all possible MSI vectors
> - * for a device if they're not in use, so we shutdown and incrementally
> - * increase them as needed.
> + * When dynamic allocation is not supported, we don't want to have the
> + * host allocate all possible MSI vectors for a device if they're not
> + * in use, so we shutdown and incrementally increase them as needed.
> + * nr_vectors represents the total number of vectors allocated.
> + *
> + * When dynamic allocation is supported, let the host only allocate
> + * and enable a vector when it is in use in guest. nr_vectors represents
> + * the upper bound of vectors being enabled (but not all of the ranges
> + * is allocated or enabled).
> */
> if (vdev->nr_vectors < nr + 1) {
> vdev->nr_vectors = nr + 1;
> - if (!vdev->defer_kvm_irq_routing) {
> + }
> +
> + if (!vdev->defer_kvm_irq_routing) {
> + if (vdev->msix->noresize && (old_nr_vecs < nr + 1)) {
> vfio_disable_irqindex(&vdev->vbasedev, VFIO_PCI_MSIX_IRQ_INDEX);
> ret = vfio_enable_vectors(vdev, true);
> if (ret) {
> error_report("vfio: failed to enable vectors, %d", ret);
> }
> - }
> - } else {
> - Error *err = NULL;
> - int32_t fd;
> -
> - if (vector->virq >= 0) {
> - fd = event_notifier_get_fd(&vector->kvm_interrupt);
> } else {
> - fd = event_notifier_get_fd(&vector->interrupt);
> - }
> + Error *err = NULL;
> + int32_t fd;
>
> - if (vfio_set_irq_signaling(&vdev->vbasedev,
> - VFIO_PCI_MSIX_IRQ_INDEX, nr,
> - VFIO_IRQ_SET_ACTION_TRIGGER, fd, &err)) {
> - error_reportf_err(err, VFIO_MSG_PREFIX, vdev->vbasedev.name);
> + if (vector->virq >= 0) {
> + fd = event_notifier_get_fd(&vector->kvm_interrupt);
> + } else {
> + fd = event_notifier_get_fd(&vector->interrupt);
> + }
> +
> + if (vfio_set_irq_signaling(&vdev->vbasedev,
> + VFIO_PCI_MSIX_IRQ_INDEX, nr,
> + VFIO_IRQ_SET_ACTION_TRIGGER, fd, &err)) {
> + error_reportf_err(err, VFIO_MSG_PREFIX, vdev->vbasedev.name);
> + }
> }
> }
>
* Re: [PATCH v2 0/4] Support dynamic MSI-X allocation
2023-09-18 9:45 [PATCH v2 0/4] Support dynamic MSI-X allocation Jing Liu
` (3 preceding siblings ...)
2023-09-18 9:45 ` [PATCH v2 4/4] vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation Jing Liu
@ 2023-09-22 20:56 ` Alex Williamson
2023-09-25 6:10 ` Liu, Jing2
4 siblings, 1 reply; 12+ messages in thread
From: Alex Williamson @ 2023-09-22 20:56 UTC (permalink / raw)
To: Jing Liu
Cc: qemu-devel, clg, pbonzini, kevin.tian, reinette.chatre, jing2.liu
On Mon, 18 Sep 2023 05:45:03 -0400
Jing Liu <jing2.liu@intel.com> wrote:
> Changes since v1:
> - v1: https://www.mail-archive.com/qemu-devel@nongnu.org/msg982842.html
> - Revise Qemu to QEMU. (Cédric)
> - Add g_free when failure of getting MSI-X irq info. (Cédric)
> - Apply Cédric's Reviewed-by. (Cédric)
> - Use g_autofree to automatically release. (Cédric)
> - Remove the failure message in vfio_enable_msix_no_vec(). (Cédric)
>
> Changes since RFC v1:
> - RFC v1: https://www.mail-archive.com/qemu-devel@nongnu.org/msg978637.html
> - Revise the comments. (Alex)
> - Report error of getting irq info and remove the trace of failure
> case. (Alex, Cédric)
> - Only store dynamic allocation flag as a bool type and test
> accordingly. (Alex)
> - Move dynamic allocation detection to vfio_msix_early_setup(). (Alex)
> - Change the condition logic in vfio_msix_vector_do_use() that moving
> the defer_kvm_irq_routing test out and create a common place to update
> nr_vectors. (Alex)
> - Consolidate the way of MSI-X enabling during device initialization and
> interrupt restoring that uses fd = -1 trick. Create a function doing
> that. (Alex)
>
> Before kernel v6.5, dynamic allocation of MSI-X interrupts was not
> supported. QEMU therefore when allocating a new interrupt, should first
> release all previously allocated interrupts (including disable of MSI-X)
> and re-allocate all interrupts that includes the new one.
>
> The kernel series [1] adds the support of dynamic MSI-X allocation to
> vfio-pci and uses the existing flag VFIO_IRQ_INFO_NORESIZE to guide user
> space, that when dynamic MSI-X is supported the flag is cleared.
>
> This series makes the behavior for VFIO PCI devices when dynamic MSI-X
> allocation is supported. When guest unmasks an interrupt, QEMU can
> directly allocate an interrupt on host for this and has nothing to do
> with the previously allocated ones. Therefore, host only allocates
> interrupts for those unmasked (enabled) interrupts inside guest when
> dynamic MSI-X allocation is supported by device.
>
> When guests enable MSI-X with all of the vectors masked, QEMU need match
> the state to enable MSI-X with no vector enabled. During migration
> restore, QEMU also need enable MSI-X first in dynamic allocation mode,
> to avoid the guest unused vectors being allocated on host. To
> consolidate them, we use vector 0 with an invalid fd to get MSI-X
> enabled and create a common function for this. This is cleaner than
> setting userspace triggering and immediately release.
>
> Any feedback is appreciated.
>
> Jing
>
> [1] https://lwn.net/Articles/931679/
>
> Jing Liu (4):
> vfio/pci: detect the support of dynamic MSI-X allocation
> vfio/pci: enable vector on dynamic MSI-X allocation
> vfio/pci: use an invalid fd to enable MSI-X
> vfio/pci: enable MSI-X in interrupt restoring on dynamic allocation
>
> hw/vfio/pci.c | 121 +++++++++++++++++++++++++++++++++----------
> hw/vfio/pci.h | 1 +
> hw/vfio/trace-events | 2 +-
> 3 files changed, 96 insertions(+), 28 deletions(-)
>
Some minor comments on 2/ but otherwise:
Reviewed-by: Alex Williamson <alex.williamson@redhat.com>
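For readers following the cover letter: it says the kernel clears VFIO_IRQ_INFO_NORESIZE when dynamic MSI-X allocation is supported. A minimal sketch of that test follows, assuming the flag value from the Linux uapi header linux/vfio.h (bit 3). The helper name is hypothetical and is not taken from the series.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Value as defined in the Linux uapi header <linux/vfio.h>; repeated
 * here only so the sketch builds without the kernel headers installed.
 */
#ifndef VFIO_IRQ_INFO_NORESIZE
#define VFIO_IRQ_INFO_NORESIZE (1u << 3)
#endif

/*
 * Hypothetical helper: irq_info_flags is assumed to be the flags field
 * returned by VFIO_DEVICE_GET_IRQ_INFO for the MSI-X index.
 */
static bool msix_dynamic_alloc_supported(uint32_t irq_info_flags)
{
    /* A cleared NORESIZE flag means vectors can be added one at a time. */
    return !(irq_info_flags & VFIO_IRQ_INFO_NORESIZE);
}
```

Storing the negation of this result as a single bool (the series uses vdev->msix->noresize) keeps later checks to one field test.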
* RE: [PATCH v2 2/4] vfio/pci: enable vector on dynamic MSI-X allocation
2023-09-22 20:54 ` Alex Williamson
@ 2023-09-25 6:04 ` Liu, Jing2
0 siblings, 0 replies; 12+ messages in thread
From: Liu, Jing2 @ 2023-09-25 6:04 UTC (permalink / raw)
To: Alex Williamson
Cc: qemu-devel@nongnu.org, clg@redhat.com, pbonzini@redhat.com,
Tian, Kevin, Chatre, Reinette, jing2.liu@linux.intel.com
Hi Alex,
> On Sat, 9/23/2023 4:55 AM, Alex Williamson <alex.williamson@redhat.com> wrote:
> On Mon, 18 Sep 2023 05:45:05 -0400
> Jing Liu <jing2.liu@intel.com> wrote:
>
> > The vector_use callback is used to enable vector that is unmasked in
> > guest. The kernel used to only support static MSI-X allocation. When
> > allocating a new interrupt using "static MSI-X allocation" kernels,
> > QEMU first disables all previously allocated vectors and then
> > re-allocates all including the new one. The nr_vectors of
> > VFIOPCIDevice indicates that all vectors from 0 to nr_vectors are
> > allocated (and may be enabled), which is used to to loop all the
> > possibly used vectors
> ^^ ^^
>
> s/to to/to/
Will change.
>
> > when, e.g., disabling MSI-X interrupts.
> >
> > Extend the vector_use function to support dynamic MSI-X allocation
> > when host supports the capability. QEMU therefore can individually
> > allocate and enable a new interrupt without affecting others or
> > causing interrupts lost during runtime.
> >
> > Utilize nr_vectors to calculate the upper bound of enabled vectors in
> > dynamic MSI-X allocation mode since looping all msix_entries_nr is not
> > efficient and unnecessary.
> >
> > Signed-off-by: Jing Liu <jing2.liu@intel.com>
> > Tested-by: Reinette Chatre <reinette.chatre@intel.com>
> > ---
> > Changes since v1:
> > - Revise Qemu to QEMU.
> >
> > Changes since RFC v1:
> > - Test vdev->msix->noresize to identify the allocation mode. (Alex)
> > - Move defer_kvm_irq_routing test out and update nr_vectors in a
> > common place before vfio_enable_vectors(). (Alex)
> > - Revise the comments. (Alex)
> > ---
> > hw/vfio/pci.c | 44 +++++++++++++++++++++++++++-----------------
> > 1 file changed, 27 insertions(+), 17 deletions(-)
> >
> > diff --git a/hw/vfio/pci.c b/hw/vfio/pci.c
> > index 60654ca28ab8..84987e46fd7a 100644
> > --- a/hw/vfio/pci.c
> > +++ b/hw/vfio/pci.c
> > @@ -470,6 +470,7 @@ static int vfio_msix_vector_do_use(PCIDevice *pdev, unsigned int nr,
> > VFIOPCIDevice *vdev = VFIO_PCI(pdev);
> > VFIOMSIVector *vector;
> > int ret;
> > + int old_nr_vecs = vdev->nr_vectors;
>
> Minor suggestion, it reads slightly better below if this were something
> like:
>
> bool resizing = !!(vdev->nr_vectors < nr + 1);
>
> Then use the bool in place of the nr+1 tests below. Thanks,
>
Got it. This change makes the code nicer to read. Thanks for the advice; I will send v3 later.
Thanks,
Jing
> Alex
>
* RE: [PATCH v2 0/4] Support dynamic MSI-X allocation
2023-09-22 20:56 ` [PATCH v2 0/4] Support dynamic MSI-X allocation Alex Williamson
@ 2023-09-25 6:10 ` Liu, Jing2
0 siblings, 0 replies; 12+ messages in thread
From: Liu, Jing2 @ 2023-09-25 6:10 UTC (permalink / raw)
To: Alex Williamson
Cc: qemu-devel@nongnu.org, clg@redhat.com, pbonzini@redhat.com,
Tian, Kevin, Chatre, Reinette, jing2.liu@linux.intel.com
Hi Alex,
> On Sat, 9/23/2023 4:57AM, Alex Williamson <alex.williamson@redhat.com> wrote:
>
> Some minor comments on 2/ but otherwise:
>
> Reviewed-by: Alex Williamson <alex.williamson@redhat.com>
Thank you very much for the feedback. I will apply it in v3, together with the fix for 2/4.
Jing