qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
To: "Daniel P. Berrange" <berrange@redhat.com>
Cc: Chen Fan <chen.fan.fnst@cn.fujitsu.com>,
	libvir-list@redhat.com, qemu-devel@nongnu.org,
	izumi.taku@jp.fujitsu.com
Subject: Re: [Qemu-devel] [RFC 0/7] Live Migration with Pass-through Devices proposal
Date: Wed, 22 Apr 2015 18:20:42 +0100	[thread overview]
Message-ID: <20150422172041.GG2386@work-vm> (raw)
In-Reply-To: <20150422171530.GQ32086@redhat.com>

* Daniel P. Berrange (berrange@redhat.com) wrote:
> On Wed, Apr 22, 2015 at 06:12:25PM +0100, Dr. David Alan Gilbert wrote:
> > * Daniel P. Berrange (berrange@redhat.com) wrote:
> > > On Wed, Apr 22, 2015 at 06:01:56PM +0100, Dr. David Alan Gilbert wrote:
> > > > * Daniel P. Berrange (berrange@redhat.com) wrote:
> > > > > On Fri, Apr 17, 2015 at 04:53:02PM +0800, Chen Fan wrote:
> > > > > > backgrond:
> > > > > > Live migration is one of the most important features of virtualization technology.
> > > > > > With regard to recent virtualization techniques, performance of network I/O is critical.
> > > > > > Current network I/O virtualization (e.g. Para-virtualized I/O, VMDq) has a significant
> > > > > > performance gap with native network I/O. Pass-through network devices have near
> > > > > > native performance, however, they have thus far prevented live migration. No existing
> > > > > > methods solve the problem of live migration with pass-through devices perfectly.
> > > > > > 
> > > > > > There was an idea to solve the problem in website:
> > > > > > https://www.kernel.org/doc/ols/2008/ols2008v2-pages-261-267.pdf
> > > > > > Please refer to above document for detailed information.
> > > > > > 
> > > > > > So I think this problem maybe could be solved by using the combination of existing
> > > > > > technology. and the following steps are we considering to implement:
> > > > > > 
> > > > > > -  before boot VM, we anticipate to specify two NICs for creating bonding device
> > > > > >    (one plugged and one virtual NIC) in XML. here we can specify the NIC's mac addresses
> > > > > >    in XML, which could facilitate qemu-guest-agent to find the network interfaces in guest.
> > > > > > 
> > > > > > -  when qemu-guest-agent startup in guest it would send a notification to libvirt,
> > > > > >    then libvirt will call the previous registered initialize callbacks. so through
> > > > > >    the callback functions, we can create the bonding device according to the XML
> > > > > >    configuration. and here we use netcf tool which can facilitate to create bonding device
> > > > > >    easily.
> > > > > 
> > > > > I'm not really clear on why libvirt/guest agent needs to be involved in this.
> > > > > I think configuration of networking is really something that must be left to
> > > > > the guest OS admin to control. I don't think the guest agent should be trying
> > > > > to reconfigure guest networking itself, as that is inevitably going to conflict
> > > > > with configuration attempted by things in the guest like NetworkManager or
> > > > > systemd-networkd.
> > > > > 
> > > > > IOW, if you want to do this setup where the guest is given multiple NICs connected
> > > > > to the same host LAN, then I think we should just let the gues admin configure
> > > > > bonding in whatever manner they decide is best for their OS install.
> > > > 
> > > > I disagree; there should be a way for the admin not to have to do this manually;
> > > > however it should interact well with existing management stuff.
> > > > 
> > > > At the simplest, something that marks the two NICs in a discoverable way
> > > > so that they can be seen that they're part of a set;  with just that ID system
> > > > then an installer or setup tool can notice them and offer to put them into
> > > > a bond automatically; I'd assume it would be possible to add a rule somewhere
> > > > that said anything with the same ID would automatically be added to the bond.
> > > 
> > > I didn't mean the admin would literally configure stuff manually. I really
> > > just meant that the guest OS itself should decide how it is done, whether
> > > NetworkManager magically does the right thing, or the person building the
> > > cloud disk image provides a magic udev rule, or $something else. I just
> > > don't think that the QEMU guest agent should be involved, as that will
> > > definitely trample all over other things that manage networking in the
> > > guest.
> > 
> > OK, good, that's about the same level I was at.
> > 
> > > I could see this being solved in the cloud disk images by using
> > > cloud-init metadata to mark the NICs as being in a set, or perhaps there
> > > is some magic you could define in SMBIOS tables, or something else again.
> > > A cloud-init based solution wouldn't need any QEMU work, but an SMBIOS
> > > solution might.
> > 
> > Would either of these work with hotplug though?   I guess as the VM starts
> > off with the pair of NICs, then when you remove one and add it back after
> > migration then you don't need any more information added; so yes
> > cloud-init or SMBIOS would do it.  (I was thinking SMBIOS stuff
> > in the way that you get device/slot numbering that NIC naming is sometimes based
> > off).
> >
> > What about if we hot-add a new NIC later on (not during migration);
> > a normal hot-add of a NIC now turns into a hot-add of two new NICs; how
> > do we pass the information at hot-add time to provide that?
> 
> Hmm, yes, actually hotplug would be a problem with that.
> 
> A even simpler idea would be to just keep things real dumb and simply
> use the same MAC address for both NICs. Once you put them in a bond
> device, the kernel will be copying the MAC address of the first NIC
> into the second NIC anyway, so unless I'm missing something, we might
> as well just use the same MAC address for both right away. That makes
> it easy for guest to discover NICs in the same set and works with
> hotplug trivially.

I bet you need to distinguish the two NICs though; you'd want the bond
to send all the traffic through the real NIC during normal use;
and how does the guest know when it sees the hotplug of the 1st NIC in the pair
that this is a special NIC that it's about to see it's sibbling arrive.

Dave

> 
> Regards,
> Daniel
> -- 
> |: http://berrange.com      -o-    http://www.flickr.com/photos/dberrange/ :|
> |: http://libvirt.org              -o-             http://virt-manager.org :|
> |: http://autobuild.org       -o-         http://search.cpan.org/~danberr/ :|
> |: http://entangle-photo.org       -o-       http://live.gnome.org/gtk-vnc :|
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK

  reply	other threads:[~2015-04-22 17:20 UTC|newest]

Thread overview: 45+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-04-17  8:53 [Qemu-devel] [RFC 0/7] Live Migration with Pass-through Devices proposal Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 1/7] qemu-agent: add agent init callback when detecting guest setup Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 2/7] qemu: add guest init event callback to do the initialize work for guest Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 3/7] hostdev: add a 'bond' type element in <hostdev> element Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 4/7] qemu-agent: add qemuAgentCreateBond interface Chen Fan
2015-05-19  9:13   ` Michael S. Tsirkin
2015-05-29  7:37   ` Michal Privoznik
2015-04-17  8:53 ` [Qemu-devel] [RFC 5/7] hostdev: add parse ip and route for bond configure Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 6/7] migrate: hot remove hostdev at perform phase for bond device Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 7/7] migrate: add hostdev migrate status to support hostdev migration Chen Fan
2015-04-17  8:53 ` [Qemu-devel] [RFC 0/3] add support migration with passthrough device Chen Fan
2015-04-17  8:53   ` [Qemu-devel] [RFC 1/3] qemu-agent: add guest-network-set-interface command Chen Fan
2015-05-21 13:52     ` Olga Krishtal
2015-05-21 14:43       ` [Qemu-devel] [libvirt] " Eric Blake
2015-04-17  8:53   ` [Qemu-devel] [RFC 2/3] qemu-agent: add guest-network-delete-interface command Chen Fan
2015-04-17  8:53   ` [Qemu-devel] [RFC 3/3] qemu-agent: add notify for qemu-ga boot Chen Fan
2015-04-21 23:38     ` Eric Blake
2015-04-19 22:29 ` [Qemu-devel] [libvirt] [RFC 0/7] Live Migration with Pass-through Devices proposal Laine Stump
2015-04-22  4:22   ` Chen Fan
2015-04-23 14:14     ` Laine Stump
2015-04-23  8:34   ` Chen Fan
2015-04-23 15:01     ` Laine Stump
2015-05-19  9:10       ` Michael S. Tsirkin
2015-04-22  9:23 ` [Qemu-devel] " Daniel P. Berrange
2015-04-22 13:05   ` Daniel P. Berrange
2015-04-22 17:01   ` Dr. David Alan Gilbert
2015-04-22 17:06     ` Daniel P. Berrange
2015-04-22 17:12       ` Dr. David Alan Gilbert
2015-04-22 17:15         ` Daniel P. Berrange
2015-04-22 17:20           ` Dr. David Alan Gilbert [this message]
2015-04-23 16:35             ` [Qemu-devel] [libvirt] " Laine Stump
2015-05-19  9:04               ` Michael S. Tsirkin
2015-05-19  9:07   ` [Qemu-devel] " Michael S. Tsirkin
2015-05-19 14:15     ` [Qemu-devel] [libvirt] " Laine Stump
2015-05-19 14:21       ` Daniel P. Berrange
2015-05-19 15:03         ` Dr. David Alan Gilbert
2015-05-19 15:18           ` Michael S. Tsirkin
2015-05-19 15:35           ` Daniel P. Berrange
2015-05-19 15:39             ` Michael S. Tsirkin
2015-05-19 15:45               ` Daniel P. Berrange
2015-05-19 16:08                 ` Michael S. Tsirkin
2015-05-19 16:13                   ` Daniel P. Berrange
2015-05-19 16:27                   ` Dr. David Alan Gilbert
2015-05-19 15:21         ` Michael S. Tsirkin
2015-05-19 15:14       ` Michael S. Tsirkin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20150422172041.GG2386@work-vm \
    --to=dgilbert@redhat.com \
    --cc=berrange@redhat.com \
    --cc=chen.fan.fnst@cn.fujitsu.com \
    --cc=izumi.taku@jp.fujitsu.com \
    --cc=libvir-list@redhat.com \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).