From: Anthony Liguori <anthony@codemonkey.ws>
To: Avi Kivity <avi@redhat.com>
Cc: Libvirt <libvir-list@redhat.com>,
Jiri Denemark <jdenemar@redhat.com>,
Chris Lalancette <clalance@redhat.com>,
qemu-devel@nongnu.org, Luiz Capitulino <lcapitulino@redhat.com>
Subject: Re: [Qemu-devel] Re: [libvirt] Libvirt debug API
Date: Fri, 23 Apr 2010 13:29:55 -0500 [thread overview]
Message-ID: <4BD1E723.6070005@codemonkey.ws> (raw)
In-Reply-To: <4BD1ADA2.2050605@redhat.com>
On 04/23/2010 09:24 AM, Avi Kivity wrote:
> On 04/23/2010 04:48 PM, Anthony Liguori wrote:
>> On 04/23/2010 07:48 AM, Avi Kivity wrote:
>>> On 04/22/2010 09:49 PM, Anthony Liguori wrote:
>>>>> real API. Say, adding a device libvirt doesn't know about or
>>>>> stopping the VM
>>>>> while libvirt thinks it's still running or anything like that.
>>>> Another problem is issuing Monitor commands that could confuse
>>>> libvirt's
>>>>
>>>> We need to make libvirt and qemu smarter.
>>>>
>>>> We already face this problem today with multiple libvirt users.
>>>> This is why sophisticated management mechanisms (like LDAP) have
>>>> mechanisms to do transactions or at least a series of atomic
>>>> operations.
>>>
>>> And people said qmp/json was overengineered...
>>>
>>> But seriously, transactions won't help anything. qemu maintains
>>> state, and when you have two updaters touching a shared variable not
>>> excepting each other to, things break, no matter how much locking
>>> there is.
>>
>> Let's consider some concrete examples. I'm using libvirt and QMP and
>> in QMP, I want to hot unplug a device.
>>
>> Today, I do this by listing the pci devices, and issuing a pci_del
>> that takes a PCI address. This is intrinsically racy though because
>> in the worst case scenario, in between when I enumerate pci devices
>> and do the pci_del in QMP, in libvirt, I've done a pci_del and then a
>> pci_add within libvirt of a completely different device.
>
> Obviously you should do the pci_del through libvirt. Once libvirt
> supports an API, use it.
It was just an example...
>>
>> There are a few ways to solve this, the simplest being that we give
>> devices unique ids that are never reused and instead of pci_del
>> taking a pci bus address, it takes a device id. That would address
>> this race.
>>
>> You can get very far by just being clever about unique ids and
>> notifications. There are some cases where a true RMW may be required
>> but I can't really think of one off hand. The way LDAP addresses
>> this is that it has a batched operation and a simple set of boolean
>> comparison operations. This lets you execute a batched operation
>> that will do a RMW.
>
> I'm sure we can be very clever, but I'd rather direct this cleverness
> to qemu core issues, not to the QMP (which in turn requires that users
> be clever to use it correctly). QMP is a low bandwidth protocol, so
> races will never show up in testing. We're laying mines here for
> users to step on that we will never encounter ourselves.
>
>>
>>> The only way that separate monitors could work is if they touch
>>> completely separate state, which is difficult to ensure if you
>>> upgrade your libvirt.
>>>
>>
>> I don't think this is as difficult of a problem as you think it is.
>> If you look at Active Directory and the whole set of management tools
>> based on it, they certainly allow concurrent management
>> applications. You can certainly get into trouble still but with just
>> some careful considerations, you can make two management applications
>> work together 90% of the time without much fuss on the applications
>> part.
>
> Maybe. We'll still have issues. For example, sVirt: if a QMP command
> names a labeled resource, the non-libvirt user will have no way of
> knowing how to label it.
This is orthogonal to QMP and has to do strictly with how libvirt
prepares a resource for qemu.
> Much better to exact a commitment from libvirt to track all QMP (and
> command line) capabilities. Instead of adding cleverness to QMP, add
> APIs to libvirt.
>
Let's step back for a minute because I think we're missing the forest
through the trees.
We're trying to address a few distinct problems:
1) Allow libvirt users to access features of qemu that are not exposed
through libvirt
2) Provide a means for non-libvirt users to interact with qemu
3) Provide a unified and interoperable view of the world for non-libvirt
and libvirt users
For (1), we all agree that the best case scenario would be for libvirt
to support every qemu feature. I think we can also all agree though
that this is not really practical and certainly not practical for
developers since there is a development cost associated with libvirt
support (to model an API appropriately).
The new API proposed addresses (1) by allowing a user to drill down to
the QMP context. It's a good solution IMHO and I think we all agree
that there's an inherent risk to this that users will have to evaluate
on a case-by-case basis. It's a good stop-gap though.
(2) is largely addressed by QMP and a config file. I'd like to see a
nice C library, but I think a lot of other folks are happy with JSON
support in higher level languages.
(3) is the place where there are still potential challenges. I think at
the very least, our goal should be to enable conversion from (2) and (1)
to be as easy as possible. That's why I have proposed implementing a C
library for the JSON transport because we could plumb that through the
new libvirt API. This would allow a user to very quickly port an
application from QMP to libvirt. In order to do this, we need the
libvirt API to expose a dedicated monitor because we'll need to be able
to manipulate events and negotiate features.
Beyond simple porting, there's a secondary question of having
non-libvirt apps co-exist with libvirt apps. I think it's a good long
term goal, but I don't think we should worry too much about it now.
Regards,
Anthony Liguori
next prev parent reply other threads:[~2010-04-23 18:30 UTC|newest]
Thread overview: 66+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-04-09 13:41 [Qemu-devel] Libvirt debug API Chris Lalancette
2010-04-09 14:27 ` [Qemu-devel] Re: [libvirt] " Daniel P. Berrange
2010-04-09 18:16 ` Chris Lalancette
2010-04-12 12:41 ` Daniel P. Berrange
2010-04-12 13:56 ` Chris Lalancette
2010-04-12 14:18 ` Daniel P. Berrange
2010-04-09 21:06 ` Jamie Lokier
2010-04-09 21:30 ` [libvirt] [Qemu-devel] " Eric Blake
2010-04-10 12:05 ` Paolo Bonzini
2010-04-11 20:28 ` [Qemu-devel] Re: [libvirt] " Richard W.M. Jones
2010-04-11 22:17 ` Jamie Lokier
[not found] ` <20100412085621.GN26162@redhat.com>
2010-04-12 12:23 ` [libvirt] [Qemu-devel] " Jamie Lokier
2010-04-12 13:05 ` Daniel P. Berrange
2010-04-22 18:47 ` Anthony Liguori
2010-04-23 6:36 ` Jes Sorensen
2010-04-23 10:30 ` Daniel P. Berrange
2010-04-12 12:53 ` [Qemu-devel] Re: [libvirt] " Daniel P. Berrange
2010-04-12 15:20 ` Luiz Capitulino
2010-04-22 18:49 ` Anthony Liguori
2010-04-23 12:48 ` Avi Kivity
2010-04-23 13:48 ` Anthony Liguori
2010-04-23 14:24 ` Avi Kivity
2010-04-23 14:36 ` [libvirt] [Qemu-devel] " Daniel P. Berrange
2010-04-26 12:54 ` Jamie Lokier
2010-04-26 14:25 ` Chris Lalancette
2010-04-26 14:34 ` Avi Kivity
2010-04-26 14:54 ` Daniel P. Berrange
2010-04-26 15:08 ` Anthony Liguori
2010-04-26 15:20 ` Daniel P. Berrange
2010-04-26 15:55 ` Anthony Liguori
2010-04-23 18:29 ` Anthony Liguori [this message]
2010-04-24 9:46 ` [Qemu-devel] Re: [libvirt] " Avi Kivity
2010-04-25 3:39 ` Anthony Liguori
2010-04-25 11:51 ` Avi Kivity
2010-04-26 1:53 ` Anthony Liguori
2010-04-26 5:56 ` Avi Kivity
2010-04-26 9:56 ` [libvirt] [Qemu-devel] " Matthias Bolte
2010-04-26 13:14 ` [Qemu-devel] Re: [libvirt] " Anthony Liguori
2010-04-26 13:41 ` Avi Kivity
2010-04-26 13:46 ` Anthony Liguori
2010-04-26 13:53 ` Avi Kivity
2010-04-26 13:58 ` Daniel P. Berrange
2010-04-26 14:26 ` Anthony Liguori
2010-04-26 14:32 ` Daniel P. Berrange
2010-04-26 9:59 ` Daniel P. Berrange
2010-04-26 13:13 ` Anthony Liguori
2010-04-26 13:31 ` Daniel P. Berrange
2010-04-26 13:43 ` Anthony Liguori
2010-04-26 14:01 ` Avi Kivity
2010-04-26 14:19 ` Anthony Liguori
2010-04-26 14:25 ` Avi Kivity
2010-04-26 14:28 ` Anthony Liguori
2010-04-26 14:38 ` Avi Kivity
2010-04-26 14:48 ` Anthony Liguori
2010-04-26 14:51 ` Avi Kivity
2010-04-23 14:34 ` Daniel P. Berrange
2010-04-23 15:43 ` Markus Armbruster
2010-04-22 18:45 ` Anthony Liguori
2010-04-22 19:10 ` Anthony Liguori
2010-04-23 10:28 ` Daniel P. Berrange
2010-04-23 13:40 ` Anthony Liguori
2010-04-23 14:21 ` Daniel P. Berrange
2010-04-23 18:33 ` Anthony Liguori
2010-04-25 14:50 ` Avi Kivity
2010-04-26 13:14 ` Anthony Liguori
2010-04-09 20:07 ` Eric Blake
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4BD1E723.6070005@codemonkey.ws \
--to=anthony@codemonkey.ws \
--cc=avi@redhat.com \
--cc=clalance@redhat.com \
--cc=jdenemar@redhat.com \
--cc=lcapitulino@redhat.com \
--cc=libvir-list@redhat.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).