From: Pierre Morel <pmorel@linux.ibm.com>
To: Matthew Rosato <mjrosato@linux.ibm.com>, linux-s390@vger.kernel.org
Cc: alex.williamson@redhat.com, cohuck@redhat.com,
schnelle@linux.ibm.com, farman@linux.ibm.com,
borntraeger@linux.ibm.com, hca@linux.ibm.com, gor@linux.ibm.com,
gerald.schaefer@linux.ibm.com, agordeev@linux.ibm.com,
frankja@linux.ibm.com, david@redhat.com, imbrenda@linux.ibm.com,
vneethv@linux.ibm.com, oberpar@linux.ibm.com,
freude@linux.ibm.com, thuth@redhat.com, pasic@linux.ibm.com,
kvm@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH v2 21/30] KVM: s390: pci: handle refresh of PCI translations
Date: Thu, 20 Jan 2022 10:47:30 +0100 [thread overview]
Message-ID: <dcfe1661-0779-6c50-6126-e9d0609e3911@linux.ibm.com> (raw)
In-Reply-To: <bbd5a23e-0f83-cc35-5ea1-79ce015d2105@linux.ibm.com>
On 1/19/22 21:02, Matthew Rosato wrote:
> On 1/19/22 1:25 PM, Pierre Morel wrote:
>>
>>
>> On 1/19/22 17:39, Matthew Rosato wrote:
>>> On 1/19/22 4:29 AM, Pierre Morel wrote:
>>>>
>>>>
>>>> On 1/14/22 21:31, Matthew Rosato wrote:
>>> ...
>>>>> +static int dma_table_shadow(struct kvm_vcpu *vcpu, struct zpci_dev
>>>>> *zdev,
>>>>> + dma_addr_t dma_addr, size_t size)
>>>>> +{
>>>>> + unsigned int nr_pages = PAGE_ALIGN(size) >> PAGE_SHIFT;
>>>>> + struct kvm_zdev *kzdev = zdev->kzdev;
>>>>> + unsigned long *entry, *gentry;
>>>>> + int i, rc = 0, rc2;
>>>>> +
>>>>> + if (!nr_pages || !kzdev)
>>>>> + return -EINVAL;
>>>>> +
>>>>> + mutex_lock(&kzdev->ioat.lock);
>>>>> + if (!zdev->dma_table || !kzdev->ioat.head[0]) {
>>>>> + rc = -EINVAL;
>>>>> + goto out_unlock;
>>>>> + }
>>>>> +
>>>>> + for (i = 0; i < nr_pages; i++) {
>>>>> + gentry = dma_walk_guest_cpu_trans(vcpu, &kzdev->ioat,
>>>>> dma_addr);
>>>>> + if (!gentry)
>>>>> + continue;
>>>>> + entry = dma_walk_cpu_trans(zdev->dma_table, dma_addr);
>>>>> +
>>>>> + if (!entry) {
>>>>> + rc = -ENOMEM;
>>>>> + goto out_unlock;
>>>>> + }
>>>>> +
>>>>> + rc2 = dma_shadow_cpu_trans(vcpu, entry, gentry);
>>>>> + if (rc2 < 0) {
>>>>> + rc = -EIO;
>>>>> + goto out_unlock;
>>>>> + }
>>>>> + dma_addr += PAGE_SIZE;
>>>>> + rc += rc2;
>>>>> + }
>>>>> +
>>>>
>>>> In case of error, shouldn't we invalidate the shadow tables entries
>>>> we did validate until the error?
>>>
>>> Hmm, I don't think this is strictly necessary - the status returned
>>> should indicate the specified DMA range is now in an indeterminate
>>> state (putting the onus on the guest to take corrective action via a
>>> global refresh).
>>>
>>> In fact I think I screwed that up below in
>>> kvm_s390_pci_refresh_trans, the fabricated status should always be
>>> KVM_S390_RPCIT_INS_RES.
>>
>> OK
>>
>>>
>>>>
>>>>> +out_unlock:
>>>>> + mutex_unlock(&kzdev->ioat.lock);
>>>>> + return rc;
>>>>> +}
>>>>> +
>>>>> +int kvm_s390_pci_refresh_trans(struct kvm_vcpu *vcpu, unsigned
>>>>> long req,
>>>>> + unsigned long start, unsigned long size,
>>>>> + u8 *status)
>>>>> +{
>>>>> + struct zpci_dev *zdev;
>>>>> + u32 fh = req >> 32;
>>>>> + int rc;
>>>>> +
>>>>> + /* Make sure this is a valid device associated with this guest */
>>>>> + zdev = get_zdev_by_fh(fh);
>>>>> + if (!zdev || !zdev->kzdev || zdev->kzdev->kvm != vcpu->kvm) {
>>>>> + *status = 0;
>>>>
>>>> Wouldn't it be interesting to add some debug information here.
>>>> When would this appear?
>>>
>>> Yes, I agree -- One of the follow-ons I'd like to add after this
>>> series is s390dbf entries; this seems like a good spot for one.
>>>
>>> As to when this could happen; it should not under normal
>>> circumstances, but consider something like arbitrary function handles
>>> coming from the intercepted guest instruction. We need to ensure
>>> that the specified function 1) exists and 2) is associated with the
>>> guest issuing the refresh.
>>>
>>>>
>>>> Also if we have this error this looks like we have a VM problem,
>>>> shouldn't we treat this in QEMU and return -EOPNOTSUPP ?
>>>>
>>>
>>> Well, I'm not sure if we can really tell where the problem is (it
>>> could for example indicate a misbehaving guest, or a bug in our KVM
>>> tracking of hostdevs).
>>>
>>> The guest chose the function handle, and if we got here then that
>>> means it doesn't indicate that it's an emulated device, which means
>>> either we are using the assist and KVM should handle the intercept or
>>> we are not and userspace should handle it. But in both of those
>>> cases, there should be a host device and it should be associated with
>>> the guest.
>>
>> That is right if we can not find an associated zdev = F(fh)
>> but the two other errors are KVM or QEMU errors AFAIU.
>
> I don't think we know for sure for any of the cases... For a
> well-behaved guest I agree with your assessment. However, the guest
> decides what fh to put into its refresh instruction and so a misbehaving
> guest could just pick arbitrary numbers for fh and circumstantially
> match some other host device. What if the guest just decided to try
> every single possible fh number in a loop with a refresh instruction?
> That's neither KVM nor QEMU's fault but can trip each of these cases.
>
> Consider the different cases:
>
> !zdev - Either the guest provided a bogus fh, KVM provided a bad fh via
> the VFIO ioctl which then QEMU fed into CLP or KVM provided the right fh
> via ioctl but QEMU clobbered it when providing it to the guest via CLP.
>
> !zdev->kzdev - Either the guest provided a bogus fh that just so
> happened to match a host fh that has no KVM association, or KVM or QEMU
> screwed up somewhere (as above or because we failed to make the KVM
> assocation somehow)
>
> kzdev->kvm != vcpu->kvm - Pretty much the same as above, but the
> matching device is actually in use by some other guest. Again it's
> possible the a misbehaving guest 'got lucky' with an arbitrary fh that
> happened to match a host fh with an existing KVM association -- or more
> likely that KVM or QEMU screwed up somewhere.
OK, I understand and you are right, my error was to consider that
get_zdev_by_fh() returns a zdev associated with a valid FH for the guest
while it returns a zdev associated with a valid FH for the host.
If the comment would have been after the get_zdev_by_fh() and before the
test I may be wouldn't have done this mistake.
>
>>
>>>
>>> I think if we decide to throw this to userspace in this event, QEMU
>>> needs some extra code to handle it (basically, if QEMU receives the
>>> intercept and the device is neither emulated nor using intercept mode
>>> then we must treat as an invalid handle as this intercept should have
>>> been handled by KVM)
>>
>> I do not want to start a discussion on this, I think we can let it
>> like this at first and come back to it when we have a good idea on how
>> to handle this.
>> May be just add a /* TODO */
>
> OK, sure. In any of the above cases, we are certainly done in KVM
> anyway. Whether there's value in passing it onto userspace vs
> immediately giving an error, let's think about it.
No, I do not think we should anymore.
Sorry for this wrong idea.
--
Pierre Morel
IBM Lab Boeblingen
next prev parent reply other threads:[~2022-01-20 9:45 UTC|newest]
Thread overview: 97+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-01-14 20:31 [PATCH v2 00/30] KVM: s390: enable zPCI for interpretive execution Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 01/30] s390/sclp: detect the zPCI load/store interpretation facility Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 02/30] s390/sclp: detect the AISII facility Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 03/30] s390/sclp: detect the AENI facility Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 04/30] s390/sclp: detect the AISI facility Matthew Rosato
2022-01-17 7:57 ` Thomas Huth
2022-01-14 20:31 ` [PATCH v2 05/30] s390/airq: pass more TPI info to airq handlers Matthew Rosato
2022-01-17 8:27 ` Thomas Huth
2022-01-14 20:31 ` [PATCH v2 06/30] s390/airq: allow for airq structure that uses an input vector Matthew Rosato
2022-01-17 12:29 ` Claudio Imbrenda
2022-01-18 18:52 ` Matthew Rosato
2022-01-18 9:50 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 07/30] s390/pci: externalize the SIC operation controls and routine Matthew Rosato
2022-01-17 16:19 ` Niklas Schnelle
2022-01-26 10:07 ` Claudio Imbrenda
2022-01-27 9:57 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 08/30] s390/pci: stash associated GISA designation Matthew Rosato
2022-01-24 14:08 ` Pierre Morel
2022-01-24 15:12 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 09/30] s390/pci: export some routines related to RPCIT processing Matthew Rosato
2022-01-18 9:51 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 10/30] s390/pci: stash dtsm and maxstbl Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 11/30] s390/pci: add helper function to find device by handle Matthew Rosato
2022-01-18 9:53 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 12/30] s390/pci: get SHM information from list pci Matthew Rosato
2022-01-18 10:36 ` Pierre Morel
2022-01-26 10:13 ` Claudio Imbrenda
2022-01-27 13:41 ` Pierre Morel
2022-01-27 15:14 ` Matthew Rosato
2022-01-27 10:29 ` Niklas Schnelle
2022-01-14 20:31 ` [PATCH v2 13/30] s390/pci: return status from zpci_refresh_trans Matthew Rosato
2022-01-19 18:13 ` Pierre Morel
2022-01-26 10:45 ` Claudio Imbrenda
2022-01-27 10:30 ` Niklas Schnelle
2022-01-14 20:31 ` [PATCH v2 14/30] KVM: s390: pci: add basic kvm_zdev structure Matthew Rosato
2022-01-17 16:25 ` Pierre Morel
2022-01-18 17:32 ` Pierre Morel
2022-01-18 18:39 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 15/30] KVM: s390: pci: do initial setup for AEN interpretation Matthew Rosato
2022-01-19 18:06 ` Pierre Morel
2022-01-19 20:19 ` Matthew Rosato
2022-01-25 12:23 ` Pierre Morel
2022-01-25 14:57 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 16/30] KVM: s390: pci: enable host forwarding of Adapter Event Notifications Matthew Rosato
2022-01-17 17:38 ` Pierre Morel
2022-01-18 17:25 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 17/30] KVM: s390: mechanism to enable guest zPCI Interpretation Matthew Rosato
2022-01-24 14:24 ` Pierre Morel
2022-01-24 15:28 ` Matthew Rosato
2022-01-24 17:15 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 18/30] KVM: s390: pci: provide routines for enabling/disabling interpretation Matthew Rosato
2022-01-24 14:36 ` Pierre Morel
2022-01-24 15:14 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 19/30] KVM: s390: pci: provide routines for enabling/disabling interrupt forwarding Matthew Rosato
2022-01-25 12:41 ` Pierre Morel
2022-01-25 15:44 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 20/30] KVM: s390: pci: provide routines for enabling/disabling IOAT assist Matthew Rosato
2022-01-25 13:29 ` Pierre Morel
2022-01-25 14:47 ` Matthew Rosato
2022-01-26 8:30 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 21/30] KVM: s390: pci: handle refresh of PCI translations Matthew Rosato
2022-01-19 9:29 ` Pierre Morel
2022-01-19 16:39 ` Matthew Rosato
2022-01-19 18:25 ` Pierre Morel
2022-01-19 20:02 ` Matthew Rosato
2022-01-20 9:47 ` Pierre Morel [this message]
2022-01-14 20:31 ` [PATCH v2 22/30] KVM: s390: intercept the rpcit instruction Matthew Rosato
2022-01-18 11:05 ` Pierre Morel
2022-01-18 17:27 ` Matthew Rosato
2022-01-18 17:54 ` Pierre Morel
2022-01-19 14:06 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 23/30] vfio/pci: re-introduce CONFIG_VFIO_PCI_ZDEV Matthew Rosato
2022-01-18 17:20 ` Pierre Morel
2022-01-18 17:32 ` Matthew Rosato
2022-01-18 17:45 ` Pierre Morel
2022-01-18 18:05 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 24/30] vfio-pci/zdev: wire up group notifier Matthew Rosato
2022-01-18 17:34 ` Pierre Morel
2022-01-18 18:37 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 25/30] vfio-pci/zdev: wire up zPCI interpretive execution support Matthew Rosato
2022-01-25 13:01 ` Pierre Morel
2022-01-25 14:21 ` Matthew Rosato
2022-01-14 20:31 ` [PATCH v2 26/30] vfio-pci/zdev: wire up zPCI adapter interrupt forwarding support Matthew Rosato
2022-01-19 17:10 ` Pierre Morel
2022-01-19 17:20 ` Matthew Rosato
2022-01-25 12:36 ` Pierre Morel
2022-01-25 14:16 ` Matthew Rosato
2022-01-26 8:24 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 27/30] vfio-pci/zdev: wire up zPCI IOAT assist support Matthew Rosato
2022-01-19 14:03 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 28/30] vfio-pci/zdev: add DTSM to clp group capability Matthew Rosato
2022-01-19 13:48 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 29/30] KVM: s390: introduce CPU feature for zPCI Interpretation Matthew Rosato
2022-01-19 13:39 ` Pierre Morel
2022-01-14 20:31 ` [PATCH v2 30/30] MAINTAINERS: additional files related kvm s390 pci passthrough Matthew Rosato
2022-01-14 20:49 ` [PATCH v2 00/30] KVM: s390: enable zPCI for interpretive execution Matthew Rosato
2022-01-19 18:10 ` Pierre Morel
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=dcfe1661-0779-6c50-6126-e9d0609e3911@linux.ibm.com \
--to=pmorel@linux.ibm.com \
--cc=agordeev@linux.ibm.com \
--cc=alex.williamson@redhat.com \
--cc=borntraeger@linux.ibm.com \
--cc=cohuck@redhat.com \
--cc=david@redhat.com \
--cc=farman@linux.ibm.com \
--cc=frankja@linux.ibm.com \
--cc=freude@linux.ibm.com \
--cc=gerald.schaefer@linux.ibm.com \
--cc=gor@linux.ibm.com \
--cc=hca@linux.ibm.com \
--cc=imbrenda@linux.ibm.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-s390@vger.kernel.org \
--cc=mjrosato@linux.ibm.com \
--cc=oberpar@linux.ibm.com \
--cc=pasic@linux.ibm.com \
--cc=schnelle@linux.ibm.com \
--cc=thuth@redhat.com \
--cc=vneethv@linux.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox