From: "Michael S. Tsirkin" <mst@redhat.com>
To: Marcel Apfelbaum <marcel.apfelbaum@gmail.com>
Cc: David Gibson <dgibson@redhat.com>,
Julia Suvorova <jusual@redhat.com>,
qemu devel list <qemu-devel@nongnu.org>
Subject: Re: [PATCH] pci: Refuse to hotplug PCI Devices when the Guest OS is not ready
Date: Thu, 22 Oct 2020 10:32:55 -0400 [thread overview]
Message-ID: <20201022102857-mutt-send-email-mst@kernel.org> (raw)
In-Reply-To: <CAC_L=vWctLK0Yjod_Vz=+xzFKFp4UoUdjSVa4jWeDm+g8en6wQ@mail.gmail.com>
On Thu, Oct 22, 2020 at 05:10:43PM +0300, Marcel Apfelbaum wrote:
>
>
> On Thu, Oct 22, 2020 at 5:01 PM Michael S. Tsirkin <mst@redhat.com> wrote:
>
> On Thu, Oct 22, 2020 at 04:55:10PM +0300, Marcel Apfelbaum wrote:
> > Hi David, Michael,
> >
> > On Thu, Oct 22, 2020 at 3:56 PM David Gibson <dgibson@redhat.com> wrote:
> >
> >Â Â Â On Thu, 22 Oct 2020 08:06:55 -0400
> >Â Â Â "Michael S. Tsirkin" <mst@redhat.com> wrote:
> >
> >Â Â Â > On Thu, Oct 22, 2020 at 02:40:26PM +0300, Marcel Apfelbaum wrote:
> >Â Â Â > > From: Marcel Apfelbaum <marcel@redhat.com>
> >Â Â Â > >
> >Â Â Â > > During PCIe Root Port's transition from Power-Off to Power-ON (or
> >Â Â Â vice-versa)
> >Â Â Â > > the "Slot Control Register" has the "Power Indicator Control"
> >Â Â Â > > set to "Blinking" expressing a "power transition" mode.
> >Â Â Â > >
> >Â Â Â > > Any hotplug operation during the "power transition" mode is not
> >Â Â Â permitted
> >Â Â Â > > or at least not expected by the Guest OS leading to strange
> failures.
> >Â Â Â > >
> >Â Â Â > > Detect and refuse hotplug operations in such case.
> >Â Â Â > >
> >Â Â Â > > Signed-off-by: Marcel Apfelbaum <marcel.apfelbaum@gmail.com>
> >Â Â Â > > ---
> >Â Â Â > >Â hw/pci/pcie.c | 7 +++++++
> >Â Â Â > >Â 1 file changed, 7 insertions(+)
> >Â Â Â > >
> >Â Â Â > > diff --git a/hw/pci/pcie.c b/hw/pci/pcie.c
> >Â Â Â > > index 5b48bae0f6..2fe5c1473f 100644
> >Â Â Â > > --- a/hw/pci/pcie.c
> >Â Â Â > > +++ b/hw/pci/pcie.c
> >Â Â Â > > @@ -410,6 +410,7 @@ void pcie_cap_slot_pre_plug_cb(HotplugHandler
> >Â Â Â *hotplug_dev, DeviceState *dev,
> >Â Â Â > >Â Â Â PCIDevice *hotplug_pdev = PCI_DEVICE(hotplug_dev);
> >Â Â Â > >Â Â Â uint8_t *exp_cap = hotplug_pdev->config + hotplug_pdev->
> >Â Â Â exp.exp_cap;
> >Â Â Â > >Â Â Â uint32_t sltcap = pci_get_word(exp_cap + PCI_EXP_SLTCAP);
> >Â Â Â > > +Â Â uint32_t sltctl = pci_get_word(exp_cap + PCI_EXP_SLTCTL);
> >Â Â Â > >Â
> >Â Â Â > >Â Â Â /* Check if hot-plug is disabled on the slot */
> >Â Â Â > >Â Â Â if (dev->hotplugged && (sltcap & PCI_EXP_SLTCAP_HPC) == 0) {
> >Â Â Â > > @@ -418,6 +419,12 @@ void pcie_cap_slot_pre_plug_cb
> (HotplugHandler
> >Â Â Â *hotplug_dev, DeviceState *dev,
> >Â Â Â > >Â Â Â Â Â return;
> >Â Â Â > >Â Â Â }
> >Â Â Â > >Â
> >Â Â Â > > +Â Â if ((sltctl & PCI_EXP_SLTCTL_PIC) ==
> PCI_EXP_SLTCTL_PWR_IND_BLINK)
> >Â Â Â {
> >Â Â Â > > +Â Â Â Â error_setg(errp, "Hot-plug failed: %s is in Power
> Transition",
> >Â Â Â > > +Â Â Â Â Â Â Â Â Â Â DEVICE(hotplug_pdev)->id);
> >Â Â Â > > +Â Â Â Â return;
> >Â Â Â > > +Â Â }
> >Â Â Â > > +
> >Â Â Â > >Â Â Â pcie_cap_slot_plug_common(PCI_DEVICE(hotplug_dev), dev,
> errp);
> >Â Â Â > >Â }Â
> >Â Â Â >
> >Â Â Â > Probably the only way to handle for existing machine types.
> >
> >
> > I agree
> > Â
> >
> >Â Â Â > For new ones, can't we queue it in host memory somewhere?
> >
> >
> >
> > I am not sure I understand what will be the flow.
> > Â - The user asks for a hotplug operation.
> > Â -Â QEMU deferred operation.
> > After that the operation may still fail, how would the user know if the
> > operation
> > succeeded or not?
>
>
> How can it fail? It's just a button press ...
>
>
>
> Currently we have "Hotplug unsupported."
> With this change we have "Guest/System not ready"
Hotplug unsupported is not an error that can trigger with
a well behaved management such as libvirt.
> Â
>
> > Â
> >
> >Â Â Â I'm not actually convinced we can't do that even for existing machine
> >Â Â Â types.Â
> >
> >
> > Is a Guest visible change, I don't think we can do it.
> > Â
> >
> >Â Â Â So I'm a bit hesitant to suggest going ahead with this without
> >Â Â Â looking a bit closer at whether we can implement a wait-for-ready in
> >Â Â Â qemu, rather than forcing every user of qemu (human or machine) to do
> >Â Â Â so.
> >
> >
> > While I agree it is a pain from the usability point of view, hotplug
> operations
> > are allowed to fail. This is not more than a corner case, ensuring the
> right
> > response (gracefully erroring out) may be enough.
> >
> > Thanks,
> > Marcel
> >
>
>
> I don't think they ever failed in the past so management is unlikely
> to handle the failure by retrying ...
>
>
> That would require some management handling, yes.
> But even without a "retry", failing is better than strange OS behavior.
>
> Trying a better alternative like deferring the operation for new machines
> would make sense, however is out of the scope of this patch
Expand the scope please. The scope should be "solve a problem xx" not
"solve a problem xx by doing abc".
> that simply
> detects the error leaving us in a slightly better state than today.
>
> Thanks,
> Marcel
Not applying a patch is the only tool we maintainers have to influence
people to solve the problem fully. That's why I'm not inclined to apply
"slightly better" patches generally.
>
>
> >
> >
> >
> >Â Â Â --
> >Â Â Â David Gibson <dgibson@redhat.com>
> >Â Â Â Principal Software Engineer, Virtualization, Red Hat
> >
>
>
next prev parent reply other threads:[~2020-10-22 14:34 UTC|newest]
Thread overview: 33+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-22 11:40 [PATCH] pci: Refuse to hotplug PCI Devices when the Guest OS is not ready Marcel Apfelbaum
2020-10-22 12:06 ` Michael S. Tsirkin
2020-10-22 12:56 ` David Gibson
2020-10-22 13:15 ` Michael S. Tsirkin
2020-10-23 3:30 ` David Gibson
2020-10-22 13:55 ` Marcel Apfelbaum
2020-10-22 14:01 ` Michael S. Tsirkin
2020-10-22 14:10 ` Marcel Apfelbaum
2020-10-22 14:32 ` Michael S. Tsirkin [this message]
2020-10-22 14:50 ` Marcel Apfelbaum
2020-10-22 15:01 ` Michael S. Tsirkin
2020-10-23 3:49 ` David Gibson
2020-10-23 6:47 ` Marcel Apfelbaum
2020-10-23 15:54 ` Michael S. Tsirkin
2020-10-23 17:27 ` Igor Mammedov
2020-10-26 6:38 ` David Gibson
2020-10-26 9:17 ` Peter Krempa
2020-10-26 6:35 ` David Gibson
2020-10-23 6:26 ` Marcel Apfelbaum
2020-10-26 6:45 ` David Gibson
2020-10-27 11:26 ` Michael S. Tsirkin
2020-10-27 12:54 ` Igor Mammedov
2020-10-27 13:02 ` Michael S. Tsirkin
2020-10-28 3:34 ` David Gibson
2020-10-28 3:31 ` David Gibson
2020-10-28 15:39 ` Igor Mammedov
2020-10-28 17:49 ` Michael S. Tsirkin
2020-10-27 11:30 ` Michael S. Tsirkin
2020-10-23 3:31 ` David Gibson
2020-11-11 12:35 ` Michael S. Tsirkin
2020-11-15 16:48 ` Marcel Apfelbaum
2020-11-11 16:09 ` Roman Kagan
2020-11-15 16:43 ` Marcel Apfelbaum
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20201022102857-mutt-send-email-mst@kernel.org \
--to=mst@redhat.com \
--cc=dgibson@redhat.com \
--cc=jusual@redhat.com \
--cc=marcel.apfelbaum@gmail.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).