From: Igor Mammedov <imammedo@redhat.com>
To: Andrey Korolyov <andrey@xdel.ru>
Cc: "qemu-devel@nongnu.org" <qemu-devel@nongnu.org>, ehabkost@redhat.com
Subject: Re: [Qemu-devel] Runtime-modified DIMMs and live migration issue
Date: Wed, 17 Jun 2015 16:52:26 +0200 [thread overview]
Message-ID: <20150617165226.166e7750@nial.brq.redhat.com> (raw)
In-Reply-To: <CABYiri_WHiiFM8rpxtkxYD4DLfv79xAsmOkOJya21zD8sWj7mw@mail.gmail.com>
On Tue, 16 Jun 2015 17:41:03 +0300
Andrey Korolyov <andrey@xdel.ru> wrote:
> > Answering back to myself - I made a wrong statement before, the
> > physical mapping *are* different with different cases, of course!
> > Therefore, the issue looks much simpler and I`d have a patch over a
> > couple of days if nobody fix this earlier.
> >
>
> ... and another (possibly last) update. This is not a memory
> misplacement but a quirky race - if no internal workload applied to
> the virtual machine during migration, no harm is done - after its
> completion the VM passes all imaginable tests as well. If no device
> I/O is involved (perf bench numa for 1000s), guest is not falling with
> guts out as with disk-test case, just crashing a process instead:
>
> [ 784.613032] thread 0/0[2750]: segfault at 0 ip (null) sp
> 00007fda4ea6a940 error 14 in perf_3.16[400000+144000]
>
> I think we are facing a very interesting memory access race during a
> live migration but there are no visible reasons for it to be bound
> only for a runtime-plugged memory case. All possible cases where
> either userspace or kernel driver are involved showing null as an
> instruction pointer for trace, may be this can be a hint for someone.
I've checked logs, so far I don't see anything suspicious there
except of "acpi PNP0C80:00: Already enumerated" lines,
probably rising log level might show more info
+ upload full logs
+ enable ACPI debug info to so that dimm device's _CRS would show up
+ QEMU's CLI that was used to produce such log
wrt migration:
could you provide exact CLI args on source and destination along with
used intermediate mem hotplug commands or even better if it's just
reproduced with migration of cold-plugged dimm-s for simplification
+ steps to reproduce (and guest kernel versions).
next prev parent reply other threads:[~2015-06-17 14:52 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-06-11 17:14 [Qemu-devel] Runtime-modified DIMMs and live migration issue Andrey Korolyov
2015-06-14 13:16 ` Andrey Korolyov
2015-06-15 11:11 ` Andrey Korolyov
2015-06-16 14:41 ` Andrey Korolyov
2015-06-17 14:52 ` Igor Mammedov [this message]
2015-06-17 15:32 ` Andrey Korolyov
2015-06-19 16:57 ` Andrey Korolyov
2015-06-19 20:02 ` Andrey Korolyov
2015-08-18 14:51 ` Andrey Korolyov
2015-09-03 19:22 ` Andrey Korolyov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150617165226.166e7750@nial.brq.redhat.com \
--to=imammedo@redhat.com \
--cc=andrey@xdel.ru \
--cc=ehabkost@redhat.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).