From: "Michael S. Tsirkin" <mst@redhat.com>
To: Ryan Harper <ryanh@us.ibm.com>
Cc: Kevin Wolf <kwolf@redhat.com>,
yamahata@valinux.co.jp, Markus Armbruster <armbru@redhat.com>,
qemu-devel@nongnu.org,
Anthony Liguori <aliguori@linux.vnet.ibm.com>,
Stefan Hajnoczi <stefan.hajnoczi@uk.ibm.com>
Subject: Re: [Qemu-devel] [PATCH 0/3] v4 Decouple block device removal from device removal
Date: Mon, 8 Nov 2010 18:56:02 +0200 [thread overview]
Message-ID: <20101108165602.GF7962@redhat.com> (raw)
In-Reply-To: <20101108140250.GW22381@us.ibm.com>
On Mon, Nov 08, 2010 at 08:02:50AM -0600, Ryan Harper wrote:
> * Markus Armbruster <armbru@redhat.com> [2010-11-08 06:04]:
> > "Michael S. Tsirkin" <mst@redhat.com> writes:
> >
> > > On Mon, Nov 08, 2010 at 11:32:01AM +0100, Markus Armbruster wrote:
> > >> Ryan Harper <ryanh@us.ibm.com> writes:
> > >>
> > >> > * Markus Armbruster <armbru@redhat.com> [2010-11-06 04:19]:
> > >> >> Ryan Harper <ryanh@us.ibm.com> writes:
> > >> >>
> > >> >> > * Markus Armbruster <armbru@redhat.com> [2010-11-05 11:11]:
> > >> >> >> Ryan Harper <ryanh@us.ibm.com> writes:
> > >> >> >>
> > >> >> >> > * Markus Armbruster <armbru@redhat.com> [2010-11-05 08:28]:
> > >> >> >> >> I'd be fine with any of these:
> > >> >> >> >>
> > >> >> >> >> 1. A new command "device_disconnet ID" (or similar name) to disconnect
> > >> >> >> >> device ID from any host parts. Nice touch: you don't have to know
> > >> >> >> >> about the device's host part(s) to disconnect it. But it might be
> > >> >> >> >> more work than the other two.
> > >> >> >> >
> > >> >> >> > This is sort of what netdev_del() and drive_unplug() are today; we're
> > >> >> >> > just saying sever the connection of this device id.
> > >> >> >>
> > >> >> >> No, I have netdev_del as (3).
> > >> >> >>
> > >> >> >> All three options are "sort of" the same, just different commands with
> > >> >> >> a common purpose.
> > >> >> >>
> > >> >> >> > I'd like to rename drive_unplug() to blockdev_del() and call it done. I
> > >> >> >> > was looking at libvirt and the right call to netdev_del is already
> > >> >> >> > in-place; I'd just need to re-spin my block patch to call blockdev_del()
> > >> >> >> > after invoking device_del() to match what is done for net.
> > >> >> >>
> > >> >> >> Unless I'm missing something, you can't just rename: your unplug does
> > >> >> >> not delete the host part.
> > >> >> >>
> > >> >> >> >> 2. New commands netdev_disconnect, drive_disconnect (or similar names)
> > >> >> >> >> to disconnect a host part from a guest device. Like (1), except you
> > >> >> >> >> have to point to the other end of the connection to cut it.
> > >> >> >> >
> > >> >> >> > What's the advantage here? We need an additional piece of info (host
> > >> >> >> > part) in addition to the device id?
> > >> >> >>
> > >> >> >> That's a disadvantage.
> > >> >> >>
> > >> >> >> Possible advantage: implementation could be slightly easier than (1),
> > >> >> >> because you don't have to find the host parts.
> > >> >> >>
> > >> >> >> >> 3. A new command "drive_del ID" similar to existing netdev_del. This is
> > >> >> >> >> (2) fused with delete. Conceptual wart: you can't disconnect and
> > >> >> >> >> keep the host part around. Moreover, delete is slightly dangerous,
> > >> >> >> >> because it renders any guest device still using the host part
> > >> >> >> >> useless.
> > >> >> >> >
> > >> >> >> > Hrm, I thought that's what (1) is.
> > >> >> >>
> > >> >> >> No.
> > >> >> >>
> > >> >> >> With (1), the argument is a *device* ID, and we disconnect *all* host
> > >> >> >> parts connected to this device (typically just one).
> > >> >> >>
> > >> >> >> With (3), the argument is a netdev/drive ID, and disconnect *this* host
> > >> >> >> part from the peer device.
> > >> >> >>
> > >> >> >> > Well, either (1) or (3); I'd like to
> > >> >> >> > rename drive_unplug() to blockdev_del() since they're similar function
> > >> >> >> > w.r.t removing access to the host resource. And we can invoke them in
> > >> >> >> > the same way from libvirt (after doing guest notification, remove
> > >> >> >> > access).
> > >> >> >>
> > >> >> >> I'd call it drive_del for now, to match drive_add.
> > >> >> >
> > >> >> > OK, drive_del() and as you mentioned, drive_unplug will take out the
> > >> >> > block driver, but doesn't remove the dinfo object; that ends up dying
> > >> >> > when we call the device destructor. I think for symmetry we'll want
> > >> >> > drive_del to remove the dinfo object as well.
> > >> >>
> > >> >> Exactly.
> > >> >>
> > >> >> a. bdrv_detach() to zap the pointer from bdrv to qdev
> > >> >> b. zap the pointer from qdev to bdrv
> > >> >> c. drive_uninit() to dispose of the host part
> > >> >
> > >> > a-c need to be done to match netdev_del symmetry? How hard of a req is
> > >> > this?
> > >>
> > >> Without (c), it's not a delete. And (c) without (b) leaves a dangling
> > >> pointer. (c) without (a) fails an assertion in bdrv_delete().
> > >>
> > >> Aside: (b) should probably be folded into bdrv_detach().
> > >>
> > >> >> Step b could be awkward with (3), because you don't know device details.
> > >> >> I guess you have to search device properties for a drive property
> > >> >> pointing to bdrv. I like (1) because it puts that loop in the one place
> > >> >> where it belongs: qdev core. (3) duplicates it in every HOSTDEV_del.
> > >> >> Except for netdev_del, which is special because of VLANs.
> > >> >>
> > >> >> To avoid step b, you could try to keep the bdrv around in a special
> > >> >> zombie state. Still have to free the dinfo, but can't use
> > >> >> drive_uninit() for that then.
> > >> >>
> > >> >> If you think I'm overcomplicating this, feel free to prove me wrong with
> > >> >> working code :)
> > >> >
> > >> > drive_unplug() works as-is today; so it does feel very combursome at
> > >> > this point. Other than the name change and agreement on how mgmt should
> > >> > invoke the command, it's been a long ride to get here.
> > >>
> > >> Sometimes it takes a tough man to make a tender chicken.
> > >
> > >> > I'll take my best shot at trying to clean up the other
> > >> > pointers and objects; though on one of my attempts when I took out the
> > >> > dinfo() object that didn't go so well; going to have to audit who uses
> > >> > dinfo and where and what they check before calling it to have a proper
> > >> > cleanup that doesn't remove the whole device altogether.
> > >>
> > >> Steps a, b, c are the result of my (admittedly quick) audit.
> > >>
> > >> Here's how the various objects are connected to each other:
> > >>
> > >> contains
> > >> drivelist -----------> DriveInfo
> > >> |
> > >> | .bdrv
> > >> | .id == .bdrv->device_name
> > >> |
> > >> contains V
> > >> bdrv_states -----------> BlockDriverState
> > >> | ^
> > >> .peer | |
> > >> | | host part
> > >> -----------------------------|---|-----------------------------------
> > >> | | guest part
> > >> | | property "drive"
> > >> v |
> > >> DeviceState
> > >>
> > >> To disconnect host from guest part, you need to cut both pointers. To
> > >> delete the host part, you need to delete both objects, BlockDriverState
> > >> and DriveInfo.
> > >
> > >
> > > If we remove DriveInfo, how can management later detect that guest part
> > > was deleted?
> >
> > Directly: check whether the qdev is gone.
> >
> > I don't know how to check that indirectly, via DriveInfo.
> >
> > > If you want symmetry with netdev, it's possible to keep a
> > > shell of BlockDriverState/DriveInfo around (solving dangling pointer
> > > problems).
> >
> > netdev_del deletes the host network part:
> >
> > (qemu) info network
> > Devices not on any VLAN:
> > net.0: net=10.0.2.0, restricted=n peer=nic.0
> > nic.0: model=virtio-net-pci,macaddr=52:54:00:12:34:56 peer=net.0
> > (qemu) netdev_del net.0
> > (qemu) info network
> > Devices not on any VLAN:
> > nic.0: model=virtio-net-pci,macaddr=52:54:00:12:34:56 peer=net.0
> >
> > It leaves around the VLAN object. Since qdev property points to that,
> > it doesn't dangle.
> >
> > In my opinion, drive_del should make the drive vanish from "info block",
>
> Yeah; that's the right thing to do here. Let me respin the patch with
> the name change and the additional work to fix up the pointers and
> ensure that we don't see the drive in info block.
Daniel, I'd like your input here: can you live with
device diappearing from info block and parsing
qdev tree info to figure out whether device is really gone?
> > just like netdev_del makes the netdev vanish from "info network". And
> > that means deleting it from bdrv_states. Whether we delete it
> > alltogether (which is what I sketched), or turn it into a zombie is a
> > separate question. Both work for me.
>
>
> --
> Ryan Harper
> Software Engineer; Linux Technology Center
> IBM Corp., Austin, Tx
> ryanh@us.ibm.com
next prev parent reply other threads:[~2010-11-08 16:56 UTC|newest]
Thread overview: 60+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-10-25 18:22 [Qemu-devel] [PATCH 0/3] v4 Decouple block device removal from device removal Ryan Harper
2010-10-25 18:22 ` [Qemu-devel] [PATCH 1/3] v2 Add drive_get_by_id Ryan Harper
2010-10-29 13:18 ` Markus Armbruster
2010-10-25 18:22 ` [Qemu-devel] [PATCH 2/3] v2 Fix Block Hotplug race with drive_unplug() Ryan Harper
2010-10-29 14:01 ` Markus Armbruster
2010-10-29 14:15 ` Anthony Liguori
2010-10-29 14:29 ` Kevin Wolf
2010-10-29 14:40 ` Anthony Liguori
2010-10-29 14:57 ` Kevin Wolf
2010-10-29 15:28 ` Anthony Liguori
2010-10-29 16:08 ` Kevin Wolf
2010-10-30 13:25 ` Christoph Hellwig
2010-10-29 15:28 ` Markus Armbruster
2010-11-01 21:06 ` Ryan Harper
2010-10-25 18:22 ` [Qemu-devel] [PATCH 3/3] Add qmp version of drive_unplug Ryan Harper
2010-10-29 14:12 ` [Qemu-devel] [PATCH 0/3] v4 Decouple block device removal from device removal Markus Armbruster
2010-10-29 15:03 ` Ryan Harper
2010-10-29 16:10 ` Markus Armbruster
2010-10-29 16:50 ` Ryan Harper
2010-11-02 9:40 ` Markus Armbruster
2010-11-02 13:22 ` Michael S. Tsirkin
2010-11-02 13:41 ` Kevin Wolf
2010-11-02 13:46 ` Ryan Harper
2010-11-02 13:58 ` Michael S. Tsirkin
2010-11-02 14:22 ` Ryan Harper
2010-11-02 15:46 ` Michael S. Tsirkin
2010-11-02 16:53 ` Ryan Harper
2010-11-02 17:59 ` Michael S. Tsirkin
2010-11-02 19:01 ` Ryan Harper
2010-11-02 19:17 ` Michael S. Tsirkin
2010-11-02 20:23 ` Ryan Harper
2010-11-03 7:21 ` Michael S. Tsirkin
2010-11-03 12:04 ` Ryan Harper
2010-11-03 16:41 ` Markus Armbruster
2010-11-03 17:29 ` Ryan Harper
2010-11-03 18:02 ` Michael S. Tsirkin
2010-11-03 20:59 ` Ryan Harper
2010-11-03 21:26 ` Michael S. Tsirkin
2010-11-04 16:45 ` Ryan Harper
2010-11-04 17:04 ` Michael S. Tsirkin
2010-11-05 13:27 ` Markus Armbruster
2010-11-05 14:17 ` Michael S. Tsirkin
2010-11-05 14:29 ` Ryan Harper
2010-11-05 16:01 ` Markus Armbruster
2010-11-08 21:02 ` Michael S. Tsirkin
2010-11-05 14:25 ` Ryan Harper
2010-11-05 16:10 ` Markus Armbruster
2010-11-05 16:22 ` Ryan Harper
2010-11-06 8:18 ` Markus Armbruster
2010-11-08 2:19 ` Ryan Harper
2010-11-08 10:32 ` Markus Armbruster
2010-11-08 10:49 ` Michael S. Tsirkin
2010-11-08 12:03 ` Markus Armbruster
2010-11-08 14:02 ` Ryan Harper
2010-11-08 16:56 ` Michael S. Tsirkin [this message]
2010-11-08 17:04 ` Daniel P. Berrange
2010-11-08 18:41 ` Ryan Harper
2010-11-08 18:39 ` Ryan Harper
2010-11-08 19:06 ` Daniel P. Berrange
2010-11-08 16:34 ` Michael S. Tsirkin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20101108165602.GF7962@redhat.com \
--to=mst@redhat.com \
--cc=aliguori@linux.vnet.ibm.com \
--cc=armbru@redhat.com \
--cc=kwolf@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=ryanh@us.ibm.com \
--cc=stefan.hajnoczi@uk.ibm.com \
--cc=yamahata@valinux.co.jp \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).