All of lore.kernel.org
 help / color / mirror / Atom feed
From: Alexey Kardashevskiy <aik@au1.ibm.com>
To: Alex Williamson <alex.williamson@redhat.com>
Cc: chrisw@sous-sol.org, pmac@au1.ibm.com, dwg@au1.ibm.com,
	joerg.roedel@amd.com, agraf@suse.de, benve@cisco.com,
	aafabbri@cisco.com, B08248@freescale.com, B07421@freescale.com,
	avi@redhat.com, konrad.wilk@oracle.com, kvm@vger.kernel.org,
	qemu-devel@nongnu.org, iommu@lists.linux-foundation.org,
	linux-pci@vger.kernel.org
Subject: Re: [RFC PATCH] vfio: VFIO Driver core framework
Date: Tue, 29 Nov 2011 13:01:34 +1100	[thread overview]
Message-ID: <4ED43CFE.8040009@au1.ibm.com> (raw)
In-Reply-To: <4ED43AD9.5090509@au1.ibm.com>

Hi all,

Another problem I hit on POWER - MSI interrupts allocation. The existing VFIO does not expect a PBH
to support less interrupts that a device might request. In my case, PHB's limit is 8 interrupts
while my test card (10Gb ethernet CXGB3) wants 9. Below are the patches to demonstrate the idea.


KERNEL patch:

diff --git a/drivers/vfio/pci/vfio_pci_intrs.c b/drivers/vfio/pci/vfio_pci_intrs.c
index 7d45c6b..d44b9bf 100644
--- a/drivers/vfio/pci/vfio_pci_intrs.c
+++ b/drivers/vfio/pci/vfio_pci_intrs.c
@@ -458,17 +458,32 @@ int vfio_pci_setup_msix(struct vfio_pci_device *vdev, int nvec, int __user *inta
 		vdev->msix[i].entry = i;
 		vdev->ev_msix[i] = ctx;
 	}
-	if (!ret)
+	if (!ret) {
 		ret = pci_enable_msix(pdev, vdev->msix, nvec);
+		/*
+		   The kernel is unable to allocate requested number of IRQs
+		   and returned the available number.
+		 */
+		if (0 < ret) {
+			ret = pci_enable_msix(pdev, vdev->msix, ret);
+		}
+	}
 	vdev->msix_nvec = 0;
-	for (i = 0; i < nvec && !ret; i++) {
-		ret = request_irq(vdev->msix[i].vector, msihandler, 0,
-				  "vfio", vdev->ev_msix[i]);
-		if (ret)
-			break;
-		vdev->msix_nvec = i+1;
+	if (0 == ret) {
+		vdev->msix_nvec = 0;
+		ret = 0;
+		for (i = 0; i < nvec && !ret; i++) {
+			ret = request_irq(vdev->msix[i].vector, msihandler, 0,
+					"vfio", vdev->ev_msix[i]);
+			if (ret)
+				break;
+			vdev->msix_nvec = i+1;
+		}
+		if ((0 == vdev->msix_nvec) && (0 != ret))
+			vfio_pci_drop_msix(vdev);
+		else
+			ret = vdev->msix_nvec;
 	}
-	if (ret)
-		vfio_pci_drop_msix(vdev);
+
 	return ret;
 }

=== end ===


QEMU patch:

diff --git a/hw/vfio_pci.c b/hw/vfio_pci.c
index 020961a..980eec7 100644
--- a/hw/vfio_pci.c
+++ b/hw/vfio_pci.c
@@ -341,7 +341,8 @@ static void vfio_enable_msi(VFIODevice *vdev, bool msix)
         }
     }

-    if (ioctl(vdev->fd, VFIO_DEVICE_SET_IRQ_EVENTFDS, fds)) {
+    ret = ioctl(vdev->fd, VFIO_DEVICE_SET_IRQ_EVENTFDS, fds);
+    if (0 > ret) {
         fprintf(stderr, "vfio: Error: Failed to setup MSI/X fds %s\n",
                 strerror(errno));
         for (i = 0; i < vdev->nr_vectors; i++) {
@@ -355,6 +356,8 @@ static void vfio_enable_msi(VFIODevice *vdev, bool msix)
         qemu_free(vdev->msi_vectors);
         vdev->nr_vectors = 0;
         return;
+    } else if (0 < ret) {
+        vdev->nr_vectors = ret;
     }

     vdev->interrupt = msix ? INT_MSIX : INT_MSI;


=== end ===




On 29/11/11 12:52, Alexey Kardashevskiy wrote:
> Hi!
> 
> I tried (successfully) to run it on POWER and while doing that I found some issues. I'll try to
> explain them in separate mails.
> 
> 
> 
> On 04/11/11 07:12, Alex Williamson wrote:
>> VFIO provides a secure, IOMMU based interface for user space
>> drivers, including device assignment to virtual machines.
>> This provides the base management of IOMMU groups, devices,
>> and IOMMU objects.  See Documentation/vfio.txt included in
>> this patch for user and kernel API description.
>>
>> Note, this implements the new API discussed at KVM Forum
>> 2011, as represented by the drvier version 0.2.  It's hoped
>> that this provides a modular enough interface to support PCI
>> and non-PCI userspace drivers across various architectures
>> and IOMMU implementations.
>>
>> Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
>> ---
>>
>> Fingers crossed, this is the last RFC for VFIO, but we need
>> the iommu group support before this can go upstream
>> (http://lkml.indiana.edu/hypermail/linux/kernel/1110.2/02303.html),
>> hoping this helps push that along.
>>
>> Since the last posting, this version completely modularizes
>> the device backends and better defines the APIs between the
>> core VFIO code and the device backends.  I expect that we
>> might also adopt a modular IOMMU interface as iommu_ops learns
>> about different types of hardware.  Also many, many cleanups.
>> Check the complete git history for details:
>>
>> git://github.com/awilliam/linux-vfio.git vfio-ng
>>
>> (matching qemu tree: git://github.com/awilliam/qemu-vfio.git)
>>
>> This version, along with the supporting VFIO PCI backend can
>> be found here:
>>
>> git://github.com/awilliam/linux-vfio.git vfio-next-20111103
>>
>> I've held off on implementing a kernel->user signaling
>> mechanism for now since the previous netlink version produced
>> too many gag reflexes.  It's easy enough to set a bit in the
>> group flags too indicate such support in the future, so I
>> think we can move ahead without it.
>>
>> Appreciate any feedback or suggestions.  Thanks,
>>
>> Alex
>>
> 
> 


-- 
Alexey Kardashevskiy
IBM OzLabs, LTC Team

e-mail: aik@au1.ibm.com
notes: Alexey Kardashevskiy/Australia/IBM


WARNING: multiple messages have this Message-ID (diff)
From: Alexey Kardashevskiy <aik@au1.ibm.com>
To: Alex Williamson <alex.williamson@redhat.com>
Cc: aafabbri@cisco.com, kvm@vger.kernel.org, pmac@au1.ibm.com,
	qemu-devel@nongnu.org, joerg.roedel@amd.com,
	konrad.wilk@oracle.com, agraf@suse.de, dwg@au1.ibm.com,
	chrisw@sous-sol.org, B08248@freescale.com,
	iommu@lists.linux-foundation.org, avi@redhat.com,
	linux-pci@vger.kernel.org, B07421@freescale.com, benve@cisco.com
Subject: Re: [Qemu-devel] [RFC PATCH] vfio: VFIO Driver core framework
Date: Tue, 29 Nov 2011 13:01:34 +1100	[thread overview]
Message-ID: <4ED43CFE.8040009@au1.ibm.com> (raw)
In-Reply-To: <4ED43AD9.5090509@au1.ibm.com>

Hi all,

Another problem I hit on POWER - MSI interrupts allocation. The existing VFIO does not expect a PBH
to support less interrupts that a device might request. In my case, PHB's limit is 8 interrupts
while my test card (10Gb ethernet CXGB3) wants 9. Below are the patches to demonstrate the idea.


KERNEL patch:

diff --git a/drivers/vfio/pci/vfio_pci_intrs.c b/drivers/vfio/pci/vfio_pci_intrs.c
index 7d45c6b..d44b9bf 100644
--- a/drivers/vfio/pci/vfio_pci_intrs.c
+++ b/drivers/vfio/pci/vfio_pci_intrs.c
@@ -458,17 +458,32 @@ int vfio_pci_setup_msix(struct vfio_pci_device *vdev, int nvec, int __user *inta
 		vdev->msix[i].entry = i;
 		vdev->ev_msix[i] = ctx;
 	}
-	if (!ret)
+	if (!ret) {
 		ret = pci_enable_msix(pdev, vdev->msix, nvec);
+		/*
+		   The kernel is unable to allocate requested number of IRQs
+		   and returned the available number.
+		 */
+		if (0 < ret) {
+			ret = pci_enable_msix(pdev, vdev->msix, ret);
+		}
+	}
 	vdev->msix_nvec = 0;
-	for (i = 0; i < nvec && !ret; i++) {
-		ret = request_irq(vdev->msix[i].vector, msihandler, 0,
-				  "vfio", vdev->ev_msix[i]);
-		if (ret)
-			break;
-		vdev->msix_nvec = i+1;
+	if (0 == ret) {
+		vdev->msix_nvec = 0;
+		ret = 0;
+		for (i = 0; i < nvec && !ret; i++) {
+			ret = request_irq(vdev->msix[i].vector, msihandler, 0,
+					"vfio", vdev->ev_msix[i]);
+			if (ret)
+				break;
+			vdev->msix_nvec = i+1;
+		}
+		if ((0 == vdev->msix_nvec) && (0 != ret))
+			vfio_pci_drop_msix(vdev);
+		else
+			ret = vdev->msix_nvec;
 	}
-	if (ret)
-		vfio_pci_drop_msix(vdev);
+
 	return ret;
 }

=== end ===


QEMU patch:

diff --git a/hw/vfio_pci.c b/hw/vfio_pci.c
index 020961a..980eec7 100644
--- a/hw/vfio_pci.c
+++ b/hw/vfio_pci.c
@@ -341,7 +341,8 @@ static void vfio_enable_msi(VFIODevice *vdev, bool msix)
         }
     }

-    if (ioctl(vdev->fd, VFIO_DEVICE_SET_IRQ_EVENTFDS, fds)) {
+    ret = ioctl(vdev->fd, VFIO_DEVICE_SET_IRQ_EVENTFDS, fds);
+    if (0 > ret) {
         fprintf(stderr, "vfio: Error: Failed to setup MSI/X fds %s\n",
                 strerror(errno));
         for (i = 0; i < vdev->nr_vectors; i++) {
@@ -355,6 +356,8 @@ static void vfio_enable_msi(VFIODevice *vdev, bool msix)
         qemu_free(vdev->msi_vectors);
         vdev->nr_vectors = 0;
         return;
+    } else if (0 < ret) {
+        vdev->nr_vectors = ret;
     }

     vdev->interrupt = msix ? INT_MSIX : INT_MSI;


=== end ===




On 29/11/11 12:52, Alexey Kardashevskiy wrote:
> Hi!
> 
> I tried (successfully) to run it on POWER and while doing that I found some issues. I'll try to
> explain them in separate mails.
> 
> 
> 
> On 04/11/11 07:12, Alex Williamson wrote:
>> VFIO provides a secure, IOMMU based interface for user space
>> drivers, including device assignment to virtual machines.
>> This provides the base management of IOMMU groups, devices,
>> and IOMMU objects.  See Documentation/vfio.txt included in
>> this patch for user and kernel API description.
>>
>> Note, this implements the new API discussed at KVM Forum
>> 2011, as represented by the drvier version 0.2.  It's hoped
>> that this provides a modular enough interface to support PCI
>> and non-PCI userspace drivers across various architectures
>> and IOMMU implementations.
>>
>> Signed-off-by: Alex Williamson <alex.williamson@redhat.com>
>> ---
>>
>> Fingers crossed, this is the last RFC for VFIO, but we need
>> the iommu group support before this can go upstream
>> (http://lkml.indiana.edu/hypermail/linux/kernel/1110.2/02303.html),
>> hoping this helps push that along.
>>
>> Since the last posting, this version completely modularizes
>> the device backends and better defines the APIs between the
>> core VFIO code and the device backends.  I expect that we
>> might also adopt a modular IOMMU interface as iommu_ops learns
>> about different types of hardware.  Also many, many cleanups.
>> Check the complete git history for details:
>>
>> git://github.com/awilliam/linux-vfio.git vfio-ng
>>
>> (matching qemu tree: git://github.com/awilliam/qemu-vfio.git)
>>
>> This version, along with the supporting VFIO PCI backend can
>> be found here:
>>
>> git://github.com/awilliam/linux-vfio.git vfio-next-20111103
>>
>> I've held off on implementing a kernel->user signaling
>> mechanism for now since the previous netlink version produced
>> too many gag reflexes.  It's easy enough to set a bit in the
>> group flags too indicate such support in the future, so I
>> think we can move ahead without it.
>>
>> Appreciate any feedback or suggestions.  Thanks,
>>
>> Alex
>>
> 
> 


-- 
Alexey Kardashevskiy
IBM OzLabs, LTC Team

e-mail: aik@au1.ibm.com
notes: Alexey Kardashevskiy/Australia/IBM

  reply	other threads:[~2011-11-29  2:01 UTC|newest]

Thread overview: 156+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-11-03 20:12 [RFC PATCH] vfio: VFIO Driver core framework Alex Williamson
2011-11-03 20:12 ` [Qemu-devel] " Alex Williamson
2011-11-09  4:17 ` Aaron Fabbri
2011-11-09  4:17   ` [Qemu-devel] " Aaron Fabbri
2011-11-09  4:41   ` Alex Williamson
2011-11-09  4:41     ` [Qemu-devel] " Alex Williamson
2011-11-09  4:41     ` Alex Williamson
2011-11-09  4:17 ` Aaron Fabbri
2011-11-09  8:11 ` Christian Benvenuti (benve)
2011-11-09  8:11 ` Christian Benvenuti (benve)
2011-11-09  8:11   ` [Qemu-devel] " Christian Benvenuti (benve)
2011-11-09 18:02   ` Alex Williamson
2011-11-09 18:02     ` [Qemu-devel] " Alex Williamson
2011-11-09 21:08     ` Christian Benvenuti (benve)
2011-11-09 21:08       ` [Qemu-devel] " Christian Benvenuti (benve)
2011-11-09 21:08       ` Christian Benvenuti (benve)
2011-11-09 23:40       ` Alex Williamson
2011-11-09 23:40         ` [Qemu-devel] " Alex Williamson
2011-11-10  0:57 ` Christian Benvenuti (benve)
2011-11-10  0:57 ` Christian Benvenuti (benve)
2011-11-10  0:57   ` [Qemu-devel] " Christian Benvenuti (benve)
2011-11-11 18:04   ` Alex Williamson
2011-11-11 18:04     ` [Qemu-devel] " Alex Williamson
2011-11-11 18:04     ` Alex Williamson
2011-11-11 22:22     ` Christian Benvenuti (benve)
2011-11-11 22:22       ` [Qemu-devel] " Christian Benvenuti (benve)
2011-11-11 22:22       ` Christian Benvenuti (benve)
2011-11-14 22:59       ` Alex Williamson
2011-11-14 22:59         ` [Qemu-devel] " Alex Williamson
2011-11-14 22:59         ` Alex Williamson
2011-11-15  0:05         ` David Gibson
2011-11-15  0:05           ` [Qemu-devel] " David Gibson
2011-11-15  0:49           ` Benjamin Herrenschmidt
2011-11-15  0:49             ` [Qemu-devel] " Benjamin Herrenschmidt
2011-11-15  0:49             ` Benjamin Herrenschmidt
2011-11-11 17:51 ` Konrad Rzeszutek Wilk
2011-11-11 17:51   ` [Qemu-devel] " Konrad Rzeszutek Wilk
2011-11-11 17:51   ` Konrad Rzeszutek Wilk
2011-11-11 22:10   ` Alex Williamson
2011-11-11 22:10     ` [Qemu-devel] " Alex Williamson
2011-11-15  0:00     ` David Gibson
2011-11-15  0:00       ` [Qemu-devel] " David Gibson
2011-11-16 16:52     ` Konrad Rzeszutek Wilk
2011-11-16 16:52       ` [Qemu-devel] " Konrad Rzeszutek Wilk
2011-11-16 16:52       ` Konrad Rzeszutek Wilk
2011-11-17 20:22       ` Alex Williamson
2011-11-17 20:22         ` [Qemu-devel] " Alex Williamson
2011-11-17 20:22         ` Alex Williamson
2011-11-17 20:56         ` Scott Wood
2011-11-17 20:56           ` [Qemu-devel] " Scott Wood
2011-11-16 17:47     ` Scott Wood
2011-11-16 17:47       ` [Qemu-devel] " Scott Wood
2011-11-17 20:52       ` Alex Williamson
2011-11-17 20:52         ` [Qemu-devel] " Alex Williamson
2011-11-17 20:52         ` Alex Williamson
2011-11-12  0:14 ` Scott Wood
2011-11-12  0:14   ` [Qemu-devel] " Scott Wood
2011-11-14 20:54   ` Alex Williamson
2011-11-14 20:54     ` [Qemu-devel] " Alex Williamson
2011-11-14 20:54     ` Alex Williamson
2011-11-14 21:46     ` Alex Williamson
2011-11-14 21:46       ` [Qemu-devel] " Alex Williamson
2011-11-14 22:26     ` Scott Wood
2011-11-14 22:26       ` [Qemu-devel] " Scott Wood
2011-11-14 22:48       ` Alexander Graf
2011-11-14 22:48         ` [Qemu-devel] " Alexander Graf
2011-11-15  2:29     ` Alex Williamson
2011-11-15  2:29       ` [Qemu-devel] " Alex Williamson
2011-11-15  2:29       ` Alex Williamson
2011-11-15  6:34 ` David Gibson
2011-11-15  6:34   ` [Qemu-devel] " David Gibson
2011-11-15 18:01   ` Alex Williamson
2011-11-15 18:01     ` [Qemu-devel] " Alex Williamson
2011-11-15 18:01     ` Alex Williamson
2011-11-17  0:02     ` David Gibson
2011-11-17  0:02       ` [Qemu-devel] " David Gibson
2011-11-18 20:32       ` Alex Williamson
2011-11-18 20:32         ` [Qemu-devel] " Alex Williamson
2011-11-18 20:32         ` Alex Williamson
2011-11-18 21:09         ` Scott Wood
2011-11-18 21:09           ` [Qemu-devel] " Scott Wood
2011-11-18 21:09           ` Scott Wood
2011-11-22 19:16           ` [Qemu-devel] " Alex Williamson
2011-11-22 19:16             ` Alex Williamson
2011-11-22 20:00             ` Scott Wood
2011-11-22 20:00               ` Scott Wood
2011-11-22 21:28               ` Alex Williamson
2011-11-22 21:28                 ` Alex Williamson
2011-11-22 21:28                 ` Alex Williamson
2011-11-21  2:47         ` David Gibson
2011-11-21  2:47           ` [Qemu-devel] " David Gibson
2011-11-22 18:22           ` Alex Williamson
2011-11-22 18:22             ` [Qemu-devel] " Alex Williamson
2011-11-22 18:22             ` Alex Williamson
2011-11-15 20:10   ` Scott Wood
2011-11-15 20:10     ` [Qemu-devel] " Scott Wood
2011-11-15 21:40     ` Aaron Fabbri
2011-11-15 21:40       ` [Qemu-devel] " Aaron Fabbri
2011-11-15 21:40       ` Aaron Fabbri
2011-11-15 22:29       ` Scott Wood
2011-11-15 22:29         ` [Qemu-devel] " Scott Wood
2011-11-16 23:34         ` Alex Williamson
2011-11-16 23:34           ` [Qemu-devel] " Alex Williamson
2011-11-16 23:34           ` Alex Williamson
2011-11-15 20:10   ` Scott Wood
2011-11-29  1:52 ` Alexey Kardashevskiy
2011-11-29  1:52   ` [Qemu-devel] " Alexey Kardashevskiy
2011-11-29  2:01   ` Alexey Kardashevskiy [this message]
2011-11-29  2:01     ` Alexey Kardashevskiy
2011-11-29  2:11     ` Alexey Kardashevskiy
2011-11-29  2:11       ` [Qemu-devel] " Alexey Kardashevskiy
2011-11-29  3:54     ` Alex Williamson
2011-11-29  3:54       ` [Qemu-devel] " Alex Williamson
2011-11-29  3:54       ` Alex Williamson
2011-11-29 19:26       ` Alex Williamson
2011-11-29 19:26         ` [Qemu-devel] " Alex Williamson
2011-11-29 23:20         ` Stuart Yoder
2011-11-29 23:20           ` Stuart Yoder
2011-11-29 23:44           ` Alex Williamson
2011-11-29 23:44             ` Alex Williamson
2011-11-29 23:44             ` Alex Williamson
2011-11-30 15:41             ` [Qemu-devel] " Stuart Yoder
2011-11-30 15:41               ` Stuart Yoder
2011-11-30 16:58               ` Alex Williamson
2011-11-30 16:58                 ` Alex Williamson
2011-11-30 16:58                 ` Alex Williamson
2011-12-01 20:58                 ` [Qemu-devel] " Stuart Yoder
2011-12-01 20:58                   ` Stuart Yoder
2011-12-01 21:25                   ` Alex Williamson
2011-12-01 21:25                     ` Alex Williamson
2011-12-01 21:25                     ` Alex Williamson
2011-12-02 14:40                     ` [Qemu-devel] " Stuart Yoder
2011-12-02 14:40                       ` Stuart Yoder
2011-12-02 18:11                       ` Bhushan Bharat-R65777
2011-12-02 18:11                         ` Bhushan Bharat-R65777
2011-12-02 18:27                         ` Scott Wood
2011-12-02 18:27                           ` Scott Wood
2011-12-02 18:35                           ` Bhushan Bharat-R65777
2011-12-02 18:35                             ` Bhushan Bharat-R65777
2011-12-02 18:45                           ` Bhushan Bharat-R65777
2011-12-02 18:45                             ` Bhushan Bharat-R65777
2011-12-02 18:52                             ` Scott Wood
2011-12-02 18:52                               ` Scott Wood
2011-12-02 18:21                       ` Scott Wood
2011-12-02 18:21                         ` Scott Wood
2011-11-29  3:46   ` Alex Williamson
2011-11-29  3:46     ` [Qemu-devel] " Alex Williamson
2011-11-29  3:46     ` Alex Williamson
2011-11-29  4:34     ` Alexey Kardashevskiy
2011-11-29  4:34       ` [Qemu-devel] " Alexey Kardashevskiy
2011-11-29  5:48       ` Alex Williamson
2011-11-29  5:48         ` [Qemu-devel] " Alex Williamson
2011-11-29  5:48         ` Alex Williamson
2011-12-02  5:06         ` Alexey Kardashevskiy
2011-12-02  5:06           ` [Qemu-devel] " Alexey Kardashevskiy
2011-12-02  5:06           ` Alexey Kardashevskiy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4ED43CFE.8040009@au1.ibm.com \
    --to=aik@au1.ibm.com \
    --cc=B07421@freescale.com \
    --cc=B08248@freescale.com \
    --cc=aafabbri@cisco.com \
    --cc=agraf@suse.de \
    --cc=alex.williamson@redhat.com \
    --cc=avi@redhat.com \
    --cc=benve@cisco.com \
    --cc=chrisw@sous-sol.org \
    --cc=dwg@au1.ibm.com \
    --cc=iommu@lists.linux-foundation.org \
    --cc=joerg.roedel@amd.com \
    --cc=konrad.wilk@oracle.com \
    --cc=kvm@vger.kernel.org \
    --cc=linux-pci@vger.kernel.org \
    --cc=pmac@au1.ibm.com \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.