From: John Snow <jsnow@redhat.com>
To: "Dr. David Alan Gilbert" <dgilbert@redhat.com>,
Stefan Hajnoczi <stefanha@gmail.com>
Cc: qemu-devel@nongnu.org, pbonzini@redhat.com,
Kevin Wolf <kwolf@redhat.com>, Prasad Pandit <ppandit@redhat.com>
Subject: Re: [Qemu-devel] [PATCH] vl.c/exit: pause cpus before closing block devices
Date: Mon, 17 Jul 2017 12:43:53 -0400 [thread overview]
Message-ID: <c6161e7a-9b03-16bf-fb20-86df6f3fd9bc@redhat.com> (raw)
In-Reply-To: <20170717102642.GG2106@work-vm>
On 07/17/2017 06:26 AM, Dr. David Alan Gilbert wrote:
> * Stefan Hajnoczi (stefanha@gmail.com) wrote:
>> On Thu, Jul 13, 2017 at 08:01:16PM +0100, Dr. David Alan Gilbert (git) wrote:
>>> From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
>>>
>>> There's a rare exit seg if the guest is accessing
>>> IO during exit.
>>> It's always hitting the atomic_inc(&bs->in_flight) with a NULL
>>> bs. This was added recently in 99723548 but I don't see it
>>> as the cause.
>>>
>>> Flip vl.c around so we pause the cpus before closing the block devices,
>>> that way we shouldn't have anything trying to access them when
>>> they're gone.
>>>
>>> This was originally Red Hat bz https://bugzilla.redhat.com/show_bug.cgi?id=1451015
>>>
>>> Signed-off-by: Dr. David Alan Gilbert <dgilbert@redhat.com>
>>> Reported-by: Cong Li <coli@redhat.com>
>>>
>>> --
>>> This is a very rare race, I'll leave it running in a loop to see if
>>> we hit anything else and to check this really fixes it.
>>>
>>> I do worry if there are other cases that can trigger this - e.g.
>>> hot-unplug or ejecting a CD.
>>>
>>> ---
>>> vl.c | 2 +-
>>> 1 file changed, 1 insertion(+), 1 deletion(-)
>>
>> Reviewed-by: Stefan Hajnoczi <stefanha@redhat.com>
>
> Thanks; and the test I left running seems solid - ~12k runs
> over the weekend with no seg.
>
> Dave
>
> --
> Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
>
the root cause of this bug is related to this as well:
https://lists.gnu.org/archive/html/qemu-devel/2017-07/msg02945.html
>From commit 99723548 we started assuming (incorrectly?) that blk_
functions always WILL have an attached BDS, but this is not always true,
for instance, flushing the cache from an empty CDROM.
Paolo, can we move the flight counter increment outside of the
block-backend layer, is that safe?
--js
next prev parent reply other threads:[~2017-07-17 16:44 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-07-13 19:01 [Qemu-devel] [PATCH] vl.c/exit: pause cpus before closing block devices Dr. David Alan Gilbert (git)
2017-07-17 10:17 ` Stefan Hajnoczi
2017-07-17 10:26 ` Dr. David Alan Gilbert
2017-07-17 16:43 ` John Snow [this message]
2017-08-04 9:58 ` Stefan Hajnoczi
2017-08-04 11:46 ` Paolo Bonzini
2017-08-08 10:02 ` Kevin Wolf
2017-08-08 11:04 ` Paolo Bonzini
2017-08-08 11:56 ` Kevin Wolf
2017-08-08 12:47 ` Paolo Bonzini
2017-08-08 12:53 ` Stefan Hajnoczi
2017-08-08 13:03 ` Kevin Wolf
2017-08-08 13:07 ` Paolo Bonzini
2017-08-02 14:42 ` Alberto Garcia
2017-08-03 16:45 ` Dr. David Alan Gilbert
2017-08-03 22:36 ` Paolo Bonzini
2017-08-04 9:56 ` Stefan Hajnoczi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c6161e7a-9b03-16bf-fb20-86df6f3fd9bc@redhat.com \
--to=jsnow@redhat.com \
--cc=dgilbert@redhat.com \
--cc=kwolf@redhat.com \
--cc=pbonzini@redhat.com \
--cc=ppandit@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).