Re: Fix migration issue when the destination is loaded

public inbox for kvm@vger.kernel.org
 help / color / mirror / Atom feed

From: Mark McLoughlin <markmc@redhat.com>
To: dlaor@redhat.com
Cc: kvm-devel <kvm@vger.kernel.org>
Subject: Re: Fix migration issue when the destination is loaded
Date: Thu, 16 Jul 2009 16:20:01 +0100	[thread overview]
Message-ID: <1247757601.3038.51.camel@blaa> (raw)
In-Reply-To: <4A5F166F.2000802@redhat.com>

On Thu, 2009-07-16 at 15:00 +0300, Dor Laor wrote:
> On 07/16/2009 12:39 PM, Mark McLoughlin wrote:
> > Hi Dor,
> >
> > On Wed, 2009-07-15 at 18:03 +0300, Dor Laor wrote:
> >> If the migration socket is full, we get EAGAIN for the write.
> >> The set_fd_handler2 defers the write for later on. The function
> >> tries to wake up the iothread by qemu_kvm_notify_work.
> >> Since this happens in a loop, multiple times, the pipe that emulates
> >> eventfd becomes full and we get a deadlock.
> >
> > I'm not sure I follow:
> >
> >    - You're seeing qemu_kvm_notify_work() being called many times
> >
> >    - The call chain is migrate_fd_put_buffer(), qemu_set_fd_handler2(),
> >      main_loop_break()
> >
> >    - This happens when write() in migrate_fd_put_buffer() returns EAGAIN
> >      because the socket buffer has filled up
> >
> > Correct?
> >
> > That sounds like migrate_fd_put_buffer() is being called repeatedly
> > while we know the socket isn't writable?
> >
> > Shouldn't the buffered file could stop attempting to call put_buffer()
> > until it has been notified that the underlying fd is writable?
> 
> There are two fds here:
> The one returning EAGAIN, is the migration socket. That's why 
> migrate_fd_put_buffer is called and a call back is rescheduled by 
> qemu_set_fd_handler2.
> 
> In the procedure of qemu_set_fd_handler2, the main_loop_break is called. 
> It needs to notify the iothread. It does is by qemu_eventfd, since it is 
> being emulated on older kernels, we use a pipe.
> 
> The pipe fd is the one that blocks.
> I though of setting it to non-blocking but we might get partial writes 
> with a non blocking fd (in theory) too.

Yes, but if the pipe fd is blocking, that means we're writing a lot of
events to it, which means the write to the migration socket is failing
with EAGAIN a lot.

As soon as we get EAGAIN on the migration socket, we should stop trying
to write to socket until the we get notification that its writable.

If we did that, we'd only be writing a very small number of events to
the pipe and we wouldn't block.

Cheers,
Mark.

     prev parent reply	other threads:[~2009-07-16 15:20 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-07-15 15:03 Fix migration issue when the destination is loaded Dor Laor
2009-07-16  9:39 ` Mark McLoughlin
2009-07-16 12:00   ` Dor Laor
2009-07-16 15:20     ` Mark McLoughlin [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1247757601.3038.51.camel@blaa \
    --to=markmc@redhat.com \
    --cc=dlaor@redhat.com \
    --cc=kvm@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox