From: Alex Williamson <alex.williamson@redhat.com>
To: Jason Gunthorpe <jgg@nvidia.com>
Cc: Jacob Pan <jacob.pan@linux.microsoft.com>,
linux-kernel@vger.kernel.org,
"iommu@lists.linux.dev" <iommu@lists.linux.dev>,
"Liu, Yi L" <yi.l.liu@intel.com>,
Zhang Yu <zhangyu1@microsoft.com>,
Easwar Hariharan <eahariha@linux.microsoft.com>,
Saurabh Sengar <ssengar@linux.microsoft.com>
Subject: Re: [PATCH v2 2/2] vfio: Fix unbalanced vfio_df_close call in no-iommu mode
Date: Mon, 16 Jun 2025 13:40:04 -0600 [thread overview]
Message-ID: <20250616134004.60105ad5.alex.williamson@redhat.com> (raw)
In-Reply-To: <20250616153455.GK1174925@nvidia.com>
On Mon, 16 Jun 2025 12:34:55 -0300
Jason Gunthorpe <jgg@nvidia.com> wrote:
> On Mon, Jun 16, 2025 at 08:47:08AM -0600, Alex Williamson wrote:
> > On Fri, 13 Jun 2025 21:15:55 -0300
> > Jason Gunthorpe <jgg@nvidia.com> wrote:
> >
> > > On Fri, Jun 13, 2025 at 04:31:03PM -0600, Alex Williamson wrote:
> > > > On Tue, 3 Jun 2025 08:23:43 -0700
> > > > Jacob Pan <jacob.pan@linux.microsoft.com> wrote:
> > > >
> > > > > From: Jason Gunthorpe <jgg@nvidia.com>
> > > > >
> > > > > For devices with no-iommu enabled in IOMMUFD VFIO compat mode, the group
> > > > > open path skips vfio_df_open(), leaving open_count at 0. This causes a
> > > > > warning in vfio_assert_device_open(device) when vfio_df_close() is called
> > > > > during group close.
> > > > >
> > > > > The correct behavior is to skip only the IOMMUFD bind in the device open
> > > > > path for no-iommu devices. Commit 6086efe73498 omitted vfio_df_open(),
> > > > > which was too broad. This patch restores the previous behavior, ensuring
> > > > > the vfio_df_open is called in the group open path.
> > > > >
> > > > > Fixes: 6086efe73498 ("vfio-iommufd: Move noiommu compat validation out of vfio_iommufd_bind()")
> > > > > Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
> > > > > Tested-by: Jacob Pan <jacob.pan@linux.microsoft.com>
> > > > > Signed-off-by: Jacob Pan <jacob.pan@linux.microsoft.com>
> > > > > ---
> > > > > v2: Use a fix from Jason
> > > > > ---
> > > > > drivers/vfio/group.c | 10 +++++-----
> > > > > drivers/vfio/iommufd.c | 3 ---
> > > > > drivers/vfio/vfio_main.c | 26 ++++++++++++++++----------
> > > > > 3 files changed, 21 insertions(+), 18 deletions(-)
> > > > >
> > > > > diff --git a/drivers/vfio/group.c b/drivers/vfio/group.c
> > > > > index c321d442f0da..8f5fe8a392de 100644
> > > > > --- a/drivers/vfio/group.c
> > > > > +++ b/drivers/vfio/group.c
> > > > > @@ -192,18 +192,18 @@ static int vfio_df_group_open(struct vfio_device_file *df)
> > > > > * implies they expected translation to exist
> > > > > */
> > > > > if (!capable(CAP_SYS_RAWIO) ||
> > > > > - vfio_iommufd_device_has_compat_ioas(device, df->iommufd))
> > > > > + vfio_iommufd_device_has_compat_ioas(device, df->iommufd)) {
> > > > > ret = -EPERM;
> > > > > - else
> > > > > - ret = 0;
> > > > > - goto out_put_kvm;
> > > > > + goto out_put_kvm;
> > > > > + }
> > > > > }
> > > > >
> > > > > ret = vfio_df_open(df);
> > > > > if (ret)
> > > > > goto out_put_kvm;
> > > > >
> > > > > - if (df->iommufd && device->open_count == 1) {
> > > > > + if (df->iommufd && device->open_count == 1 &&
> > > > > + !vfio_device_is_noiommu(device)) {
> > > >
> > > > Why do we need this?
> > >
> > > What I was trying to do is put all the logic about noiommu into only
> > > vfio_df..open/close functions instead of sprikling it into a bunch of
> > > other functions. That seemed to be the right point to make this cut.
> >
> > Alternatively we could be consistent about breaking out of the
> > vfio/iommufd.c functions that aren't relevant to noiommu. The
> > container side handles noiommu internally, why should iommufd push
> > handling up to the device file layer? We're really just missing the
> > bind path.
>
> Broadly what I was going for was to just remove the iommufd stuff
> entirely from the DF layer rather than to half pretend there is an
> iommufd layer below it. This should ideally go as far as not having an
> iommufd_ctx at all. So things start to look really weird calling
> iommufd functions without an iommufd ctx.
>
> > > With this patch we move toward the vfio_df..open/close functions being
> > > symmetrical in their decision making.
> >
> > But is it? We special case all the iommufd paths to filter out noiommu
> > but it's inconsistent with the legacy paths. Thanks,
>
> The container still exists in noiommu mode and internally does things,
> eg it has a container->noiommu indicationm and the vfio-noiommu ops to
> manage this.
>
> The iommufd should not exist and should never be used. They are
> different cases.
>
> If Jacob eventually does what I suggested in another email then we
> would have a noiommu special mode inside iommufd and it would look
> more like the container.
A concise fix would be nice for stable backports though, so even if we
want to move to testing noiommu in the device file layer or create a
special mode in iommufd, the smallest, most consistent initial fix
would be to continue the _group_open:
--- a/drivers/vfio/group.c
+++ b/drivers/vfio/group.c
@@ -192,18 +192,18 @@ static int vfio_df_group_open(struct vfio_device_file *df)
* implies they expected translation to exist
*/
if (!capable(CAP_SYS_RAWIO) ||
- vfio_iommufd_device_has_compat_ioas(device, df->iommufd))
+ vfio_iommufd_device_has_compat_ioas(device, df->iommufd)) {
ret = -EPERM;
- else
- ret = 0;
- goto out_put_kvm;
+ goto out_put_kvm;
+ }
}
And add a noiommu exit branch to _iommufd_bind, symmetric to unbind.
Right? Thanks,
Alex
next prev parent reply other threads:[~2025-06-16 19:40 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-06-03 15:23 [PATCH v2 1/2] vfio: Prevent open_count decrement to negative Jacob Pan
2025-06-03 15:23 ` [PATCH v2 2/2] vfio: Fix unbalanced vfio_df_close call in no-iommu mode Jacob Pan
2025-06-13 22:31 ` Alex Williamson
2025-06-14 0:15 ` Jason Gunthorpe
2025-06-16 14:47 ` Alex Williamson
2025-06-16 15:34 ` Jason Gunthorpe
2025-06-16 19:40 ` Alex Williamson [this message]
2025-06-16 20:05 ` Jason Gunthorpe
2025-06-18 23:11 ` Jacob Pan
2025-06-18 23:25 ` Jason Gunthorpe
2025-06-13 22:31 ` [PATCH v2 1/2] vfio: Prevent open_count decrement to negative Alex Williamson
2025-06-14 0:09 ` Jason Gunthorpe
2025-06-16 14:40 ` Alex Williamson
2025-06-18 23:08 ` Jacob Pan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250616134004.60105ad5.alex.williamson@redhat.com \
--to=alex.williamson@redhat.com \
--cc=eahariha@linux.microsoft.com \
--cc=iommu@lists.linux.dev \
--cc=jacob.pan@linux.microsoft.com \
--cc=jgg@nvidia.com \
--cc=linux-kernel@vger.kernel.org \
--cc=ssengar@linux.microsoft.com \
--cc=yi.l.liu@intel.com \
--cc=zhangyu1@microsoft.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).