qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: "Alex Bennée" <alex.bennee@linaro.org>
To: "Paolo Bonzini" <pbonzini@redhat.com>,
	"Richard Henderson" <rth@twiddle.net>,
	"Peter Maydell" <peter.maydell@linaro.org>,
	"Philippe Mathieu-Daudé" <f4bug@amsat.org>
Cc: qemu-devel <qemu-devel@nongnu.org>, maciej.borzecki@open-rnd.pl
Subject: [Qemu-devel] qemu_system_reset_request() broken w.r.t BQL locking regime
Date: Wed, 05 Jul 2017 17:01:38 +0100	[thread overview]
Message-ID: <8737aaeuu5.fsf@linaro.org> (raw)


Hi,

An interesting bug was reported on #qemu today. It was bisected to
8d04fb55 (drop global lock for TCG) and only occurred when QEMU was run
with taskset -c 0. Originally the fingers where pointed at mttcg but it
occurs in both single and multi-threaded modes.

I think the problem is qemu_system_reset_request() is certainly racy
when resetting a running CPU. AFAICT:

  - Guest resets board, writing to some hw address (e.g.
    arm_sysctl_write)
  - This triggers qemu_system_reset_request(SHUTDOWN_CAUSE_GUEST_RESET)
  - We exit iowrite and drop the BQL
  - vl.c schedules qemu_system_reset->qemu_devices_reset...arm_cpu_reset
  - we start writing new values to CPU env while still in TCG code
  - CHAOS!

The general solution for this is to ensure these sort of tasks are done
with safe work in the CPUs context when we know nothing else is running.
It seems this is probably best done by modifying
qemu_system_reset_request to queue work up on current_cpu and execute it
as safe work - I don't think the vl.c thread should ever be messing
about with calling cpu_reset directly.

Looking at the calls most of these are made by device code but I see KVM
also does it. I just wanted to check this was a reasonable approach and
wouldn't upset anything else.

Any thoughts?

--
Alex Bennée

             reply	other threads:[~2017-07-05 16:00 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-07-05 16:01 Alex Bennée [this message]
2017-07-05 16:14 ` [Qemu-devel] qemu_system_reset_request() broken w.r.t BQL locking regime Peter Maydell
2017-07-05 16:21   ` Paolo Bonzini
2017-07-05 19:31     ` Alex Bennée
2017-07-06  8:37     ` Alex Bennée
2017-07-05 19:30   ` Alex Bennée
2017-07-05 19:42     ` Peter Maydell
2017-07-05 20:10       ` Alex Bennée
2017-07-05 21:46       ` Alex Bennée
     [not found] <mailman.82700.1499272965.22738.qemu-devel@nongnu.org>
2017-07-05 16:54 ` G 3

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=8737aaeuu5.fsf@linaro.org \
    --to=alex.bennee@linaro.org \
    --cc=f4bug@amsat.org \
    --cc=maciej.borzecki@open-rnd.pl \
    --cc=pbonzini@redhat.com \
    --cc=peter.maydell@linaro.org \
    --cc=qemu-devel@nongnu.org \
    --cc=rth@twiddle.net \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).