From: Matthew Rosato <mjrosato@linux.ibm.com>
To: Pierre Morel <pmorel@linux.ibm.com>, linux-s390@vger.kernel.org
Cc: alex.williamson@redhat.com, cohuck@redhat.com,
schnelle@linux.ibm.com, farman@linux.ibm.com,
borntraeger@linux.ibm.com, hca@linux.ibm.com, gor@linux.ibm.com,
gerald.schaefer@linux.ibm.com, agordeev@linux.ibm.com,
frankja@linux.ibm.com, david@redhat.com, imbrenda@linux.ibm.com,
vneethv@linux.ibm.com, oberpar@linux.ibm.com,
freude@linux.ibm.com, thuth@redhat.com, pasic@linux.ibm.com,
kvm@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH v2 21/30] KVM: s390: pci: handle refresh of PCI translations
Date: Wed, 19 Jan 2022 15:02:36 -0500 [thread overview]
Message-ID: <bbd5a23e-0f83-cc35-5ea1-79ce015d2105@linux.ibm.com> (raw)
In-Reply-To: <cebcc3de-e332-6381-f450-a6a26ef88182@linux.ibm.com>
On 1/19/22 1:25 PM, Pierre Morel wrote:
>
>
> On 1/19/22 17:39, Matthew Rosato wrote:
>> On 1/19/22 4:29 AM, Pierre Morel wrote:
>>>
>>>
>>> On 1/14/22 21:31, Matthew Rosato wrote:
>> ...
>>>> +static int dma_table_shadow(struct kvm_vcpu *vcpu, struct zpci_dev
>>>> *zdev,
>>>> + dma_addr_t dma_addr, size_t size)
>>>> +{
>>>> + unsigned int nr_pages = PAGE_ALIGN(size) >> PAGE_SHIFT;
>>>> + struct kvm_zdev *kzdev = zdev->kzdev;
>>>> + unsigned long *entry, *gentry;
>>>> + int i, rc = 0, rc2;
>>>> +
>>>> + if (!nr_pages || !kzdev)
>>>> + return -EINVAL;
>>>> +
>>>> + mutex_lock(&kzdev->ioat.lock);
>>>> + if (!zdev->dma_table || !kzdev->ioat.head[0]) {
>>>> + rc = -EINVAL;
>>>> + goto out_unlock;
>>>> + }
>>>> +
>>>> + for (i = 0; i < nr_pages; i++) {
>>>> + gentry = dma_walk_guest_cpu_trans(vcpu, &kzdev->ioat,
>>>> dma_addr);
>>>> + if (!gentry)
>>>> + continue;
>>>> + entry = dma_walk_cpu_trans(zdev->dma_table, dma_addr);
>>>> +
>>>> + if (!entry) {
>>>> + rc = -ENOMEM;
>>>> + goto out_unlock;
>>>> + }
>>>> +
>>>> + rc2 = dma_shadow_cpu_trans(vcpu, entry, gentry);
>>>> + if (rc2 < 0) {
>>>> + rc = -EIO;
>>>> + goto out_unlock;
>>>> + }
>>>> + dma_addr += PAGE_SIZE;
>>>> + rc += rc2;
>>>> + }
>>>> +
>>>
>>> In case of error, shouldn't we invalidate the shadow tables entries
>>> we did validate until the error?
>>
>> Hmm, I don't think this is strictly necessary - the status returned
>> should indicate the specified DMA range is now in an indeterminate
>> state (putting the onus on the guest to take corrective action via a
>> global refresh).
>>
>> In fact I think I screwed that up below in kvm_s390_pci_refresh_trans,
>> the fabricated status should always be KVM_S390_RPCIT_INS_RES.
>
> OK
>
>>
>>>
>>>> +out_unlock:
>>>> + mutex_unlock(&kzdev->ioat.lock);
>>>> + return rc;
>>>> +}
>>>> +
>>>> +int kvm_s390_pci_refresh_trans(struct kvm_vcpu *vcpu, unsigned long
>>>> req,
>>>> + unsigned long start, unsigned long size,
>>>> + u8 *status)
>>>> +{
>>>> + struct zpci_dev *zdev;
>>>> + u32 fh = req >> 32;
>>>> + int rc;
>>>> +
>>>> + /* Make sure this is a valid device associated with this guest */
>>>> + zdev = get_zdev_by_fh(fh);
>>>> + if (!zdev || !zdev->kzdev || zdev->kzdev->kvm != vcpu->kvm) {
>>>> + *status = 0;
>>>
>>> Wouldn't it be interesting to add some debug information here.
>>> When would this appear?
>>
>> Yes, I agree -- One of the follow-ons I'd like to add after this
>> series is s390dbf entries; this seems like a good spot for one.
>>
>> As to when this could happen; it should not under normal
>> circumstances, but consider something like arbitrary function handles
>> coming from the intercepted guest instruction. We need to ensure that
>> the specified function 1) exists and 2) is associated with the guest
>> issuing the refresh.
>>
>>>
>>> Also if we have this error this looks like we have a VM problem,
>>> shouldn't we treat this in QEMU and return -EOPNOTSUPP ?
>>>
>>
>> Well, I'm not sure if we can really tell where the problem is (it
>> could for example indicate a misbehaving guest, or a bug in our KVM
>> tracking of hostdevs).
>>
>> The guest chose the function handle, and if we got here then that
>> means it doesn't indicate that it's an emulated device, which means
>> either we are using the assist and KVM should handle the intercept or
>> we are not and userspace should handle it. But in both of those
>> cases, there should be a host device and it should be associated with
>> the guest.
>
> That is right if we can not find an associated zdev = F(fh)
> but the two other errors are KVM or QEMU errors AFAIU.
I don't think we know for sure for any of the cases... For a
well-behaved guest I agree with your assessment. However, the guest
decides what fh to put into its refresh instruction and so a misbehaving
guest could just pick arbitrary numbers for fh and circumstantially
match some other host device. What if the guest just decided to try
every single possible fh number in a loop with a refresh instruction?
That's neither KVM nor QEMU's fault but can trip each of these cases.
Consider the different cases:
!zdev - Either the guest provided a bogus fh, KVM provided a bad fh via
the VFIO ioctl which then QEMU fed into CLP or KVM provided the right fh
via ioctl but QEMU clobbered it when providing it to the guest via CLP.
!zdev->kzdev - Either the guest provided a bogus fh that just so
happened to match a host fh that has no KVM association, or KVM or QEMU
screwed up somewhere (as above or because we failed to make the KVM
assocation somehow)
kzdev->kvm != vcpu->kvm - Pretty much the same as above, but the
matching device is actually in use by some other guest. Again it's
possible the a misbehaving guest 'got lucky' with an arbitrary fh that
happened to match a host fh with an existing KVM association -- or more
likely that KVM or QEMU screwed up somewhere.
>
>>
>> I think if we decide to throw this to userspace in this event, QEMU
>> needs some extra code to handle it (basically, if QEMU receives the
>> intercept and the device is neither emulated nor using intercept mode
>> then we must treat as an invalid handle as this intercept should have
>> been handled by KVM)
>
> I do not want to start a discussion on this, I think we can let it like
> this at first and come back to it when we have a good idea on how to
> handle this.
> May be just add a /* TODO */
OK, sure. In any of the above cases, we are certainly done in KVM
anyway. Whether there's value in passing it onto userspace vs
immediately giving an error, let's think about it.
next prev parent reply other threads:[~2022-01-19 20:02 UTC|newest]
Thread overview: 97+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-01-14 20:31 [PATCH v2 00/30] KVM: s390: enable zPCI for interpretive execution Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 01/30] s390/sclp: detect the zPCI load/store interpretation facility Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 02/30] s390/sclp: detect the AISII facility Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 03/30] s390/sclp: detect the AENI facility Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 04/30] s390/sclp: detect the AISI facility Matthew Rosato
2022-01-17 7:57 ` Thomas Huth
2022-01-14 20:31 ` [PATCH v2 05/30] s390/airq: pass more TPI info to airq handlers Matthew Rosato
2022-01-17 8:27 ` Thomas Huth
2022-01-14 20:31 ` [PATCH v2 06/30] s390/airq: allow for airq structure that uses an input vector Matthew Rosato
2022-01-17 12:29 ` Claudio Imbrenda
2022-01-18 18:52 ` Matthew Rosato
2022-01-18 9:50 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 07/30] s390/pci: externalize the SIC operation controls and routine Matthew Rosato
2022-01-17 16:19 ` Niklas Schnelle
2022-01-26 10:07 ` Claudio Imbrenda
2022-01-27 9:57 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 08/30] s390/pci: stash associated GISA designation Matthew Rosato
2022-01-24 14:08 ` Pierre Morel
2022-01-24 15:12 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 09/30] s390/pci: export some routines related to RPCIT processing Matthew Rosato
2022-01-18 9:51 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 10/30] s390/pci: stash dtsm and maxstbl Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 11/30] s390/pci: add helper function to find device by handle Matthew Rosato
2022-01-18 9:53 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 12/30] s390/pci: get SHM information from list pci Matthew Rosato
2022-01-18 10:36 ` Pierre Morel
2022-01-26 10:13 ` Claudio Imbrenda
2022-01-27 13:41 ` Pierre Morel
2022-01-27 15:14 ` Matthew Rosato
2022-01-27 10:29 ` Niklas Schnelle
2022-01-14 20:31 ` [PATCH v2 13/30] s390/pci: return status from zpci_refresh_trans Matthew Rosato
2022-01-19 18:13 ` Pierre Morel
2022-01-26 10:45 ` Claudio Imbrenda
2022-01-27 10:30 ` Niklas Schnelle
2022-01-14 20:31 ` [PATCH v2 14/30] KVM: s390: pci: add basic kvm_zdev structure Matthew Rosato
2022-01-17 16:25 ` Pierre Morel
2022-01-18 17:32 ` Pierre Morel
2022-01-18 18:39 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 15/30] KVM: s390: pci: do initial setup for AEN interpretation Matthew Rosato
2022-01-19 18:06 ` Pierre Morel
2022-01-19 20:19 ` Matthew Rosato
2022-01-25 12:23 ` Pierre Morel
2022-01-25 14:57 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 16/30] KVM: s390: pci: enable host forwarding of Adapter Event Notifications Matthew Rosato
2022-01-17 17:38 ` Pierre Morel
2022-01-18 17:25 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 17/30] KVM: s390: mechanism to enable guest zPCI Interpretation Matthew Rosato
2022-01-24 14:24 ` Pierre Morel
2022-01-24 15:28 ` Matthew Rosato
2022-01-24 17:15 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 18/30] KVM: s390: pci: provide routines for enabling/disabling interpretation Matthew Rosato
2022-01-24 14:36 ` Pierre Morel
2022-01-24 15:14 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 19/30] KVM: s390: pci: provide routines for enabling/disabling interrupt forwarding Matthew Rosato
2022-01-25 12:41 ` Pierre Morel
2022-01-25 15:44 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 20/30] KVM: s390: pci: provide routines for enabling/disabling IOAT assist Matthew Rosato
2022-01-25 13:29 ` Pierre Morel
2022-01-25 14:47 ` Matthew Rosato
2022-01-26 8:30 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 21/30] KVM: s390: pci: handle refresh of PCI translations Matthew Rosato
2022-01-19 9:29 ` Pierre Morel
2022-01-19 16:39 ` Matthew Rosato
2022-01-19 18:25 ` Pierre Morel
2022-01-19 20:02 ` Matthew Rosato [this message]
2022-01-20 9:47 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 22/30] KVM: s390: intercept the rpcit instruction Matthew Rosato
2022-01-18 11:05 ` Pierre Morel
2022-01-18 17:27 ` Matthew Rosato
2022-01-18 17:54 ` Pierre Morel
2022-01-19 14:06 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 23/30] vfio/pci: re-introduce CONFIG_VFIO_PCI_ZDEV Matthew Rosato
2022-01-18 17:20 ` Pierre Morel
2022-01-18 17:32 ` Matthew Rosato
2022-01-18 17:45 ` Pierre Morel
2022-01-18 18:05 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 24/30] vfio-pci/zdev: wire up group notifier Matthew Rosato
2022-01-18 17:34 ` Pierre Morel
2022-01-18 18:37 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 25/30] vfio-pci/zdev: wire up zPCI interpretive execution support Matthew Rosato
2022-01-25 13:01 ` Pierre Morel
2022-01-25 14:21 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 26/30] vfio-pci/zdev: wire up zPCI adapter interrupt forwarding support Matthew Rosato
2022-01-19 17:10 ` Pierre Morel
2022-01-19 17:20 ` Matthew Rosato
2022-01-25 12:36 ` Pierre Morel
2022-01-25 14:16 ` Matthew Rosato
2022-01-26 8:24 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 27/30] vfio-pci/zdev: wire up zPCI IOAT assist support Matthew Rosato
2022-01-19 14:03 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 28/30] vfio-pci/zdev: add DTSM to clp group capability Matthew Rosato
2022-01-19 13:48 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 29/30] KVM: s390: introduce CPU feature for zPCI Interpretation Matthew Rosato
2022-01-19 13:39 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 30/30] MAINTAINERS: additional files related kvm s390 pci passthrough Matthew Rosato
2022-01-14 20:49 ` [PATCH v2 00/30] KVM: s390: enable zPCI for interpretive execution Matthew Rosato
2022-01-19 18:10 ` Pierre Morel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=bbd5a23e-0f83-cc35-5ea1-79ce015d2105@linux.ibm.com \
--to=mjrosato@linux.ibm.com \
--cc=agordeev@linux.ibm.com \
--cc=alex.williamson@redhat.com \
--cc=borntraeger@linux.ibm.com \
--cc=cohuck@redhat.com \
--cc=david@redhat.com \
--cc=farman@linux.ibm.com \
--cc=frankja@linux.ibm.com \
--cc=freude@linux.ibm.com \
--cc=gerald.schaefer@linux.ibm.com \
--cc=gor@linux.ibm.com \
--cc=hca@linux.ibm.com \
--cc=imbrenda@linux.ibm.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-s390@vger.kernel.org \
--cc=oberpar@linux.ibm.com \
--cc=pasic@linux.ibm.com \
--cc=pmorel@linux.ibm.com \
--cc=schnelle@linux.ibm.com \
--cc=thuth@redhat.com \
--cc=vneethv@linux.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox