From: Fam Zheng <famz@redhat.com>
To: Max Reitz <mreitz@redhat.com>
Cc: Kevin Wolf <kwolf@redhat.com>, Alberto Garcia <berto@igalia.com>,
qemu-block@nongnu.org, Markus Armbruster <armbru@redhat.com>,
qemu-devel@nongnu.org, Stefan Hajnoczi <stefanha@redhat.com>,
Paolo Bonzini <pbonzini@redhat.com>, John Snow <jsnow@redhat.com>
Subject: Re: [Qemu-devel] [PATCH v7 22/24] block: Rewrite bdrv_close_all()
Date: Wed, 18 Nov 2015 10:52:59 +0800
Message-ID: <20151118025259.GB9000@ad.usersys.redhat.com>
In-Reply-To: <564A0119.7080704@redhat.com>

On Mon, 11/16 17:15, Max Reitz wrote:
> >> @@ -1971,13 +1969,44 @@ static void bdrv_close(BlockDriverState *bs)
> >>  void bdrv_close_all(void)
> >>  {
> >>      BlockDriverState *bs;
> >> +    AioContext *aio_context;
> >> +    int original_refcount = 0;
> >>
> >> -    QTAILQ_FOREACH(bs, &bdrv_states, device_list) {
> >> -        AioContext *aio_context = bdrv_get_aio_context(bs);
> >> +    /* Drop references from requests still in flight, such as canceled block
> >> +     * jobs whose AIO context has not been polled yet */
> >> +    bdrv_drain_all();
> >>
> >> -        aio_context_acquire(aio_context);
> >> -        bdrv_close(bs);
> >> -        aio_context_release(aio_context);
> >> +    blockdev_close_all_bdrv_states();
> >> +    blk_remove_all_bs();
> >> +
> >> +    /* Cancel all block jobs */
> >> +    while (!QTAILQ_EMPTY(&all_bdrv_states)) {
> >> +        QTAILQ_FOREACH(bs, &all_bdrv_states, bs_list) {
> >> +            aio_context = bdrv_get_aio_context(bs);
> >> +
> >> +            aio_context_acquire(aio_context);
> >> +            if (bs->job) {
> >> +                /* So we can safely query the current refcount */
> >> +                bdrv_ref(bs);
> >> +                original_refcount = bs->refcnt;
> >> +
> >> +                block_job_cancel_sync(bs->job);
> >> +                aio_context_release(aio_context);
> >> +                break;
> >> +            }
> >> +            aio_context_release(aio_context);
> >> +        }
> >> +
> >> +        /* All the remaining BlockDriverStates are referenced directly or
> >> +         * indirectly from block jobs, so there needs to be at least one BDS
> >> +         * directly used by a block job */
> >> +        assert(bs);
> >> +
> >> +        /* Wait for the block job to release its reference */
> >> +        while (bs->refcnt >= original_refcount) {
> >> +            aio_poll(aio_context, true);
> >> +        }
> >> +        bdrv_unref(bs);
> >
> > If at this point bs->refcnt is greater than 1, why don't we care where are the
> > remaining references from?
>
> We do care. A BDS will not be removed from all_bdrv_states until it is
> deleted (i.e. its refcount becomes 0). Therefore, this loop will
> continue until all BDSs have been deleted.
>
> So where might additional references come from? Since this loop only
> cares about direct or indirect references from block jobs, that's
> exactly it:
>
> (1) You might have multiple block jobs running on a BDS in the future.
> Then, you'll cancel them one by one, and after having canceled the
> first one, the refcount will still be greater than one before the
> bdrv_unref().
>
> (2) Imagine a BDS A with a parent BDS B. There are block jobs running on
> both of them. Now, B is referenced by both its block job and by A
> (indirectly by the block job referencing A). If we cancel the job on
> B before the one on A, then the refcount on B will still be greater
> than 1 before bdrv_unref() because it is still referenced by its
> parent (until the block job on A is canceled, too).
>
> The first cannot happen right now, but the second one may, I'm not sure
> (depending on whether op blockers allow it).

OK, that makes sense. all_bdrv_states is the central place for making sure that
all refcnts reach 0.
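
To check this reading of case (2), here is a toy model of the loop. It is only
a sketch and not QEMU code: ToyNode, all_nodes and the toy_* helpers are made
up for illustration, job cancellation is synchronous (there is no aio_poll()),
and the list is a plain singly linked list rather than a QTAILQ. It assumes the
monitor and BB references are already gone, as they would be after
blockdev_close_all_bdrv_states() and blk_remove_all_bs().

```c
#include <assert.h>
#include <stdlib.h>

/* Toy stand-in for a BlockDriverState. Hypothetical, not QEMU's struct. */
typedef struct ToyNode {
    int refcnt;
    int has_job;            /* 1 while a toy "block job" holds a reference */
    struct ToyNode *child;  /* node this one holds a reference to, or NULL */
    struct ToyNode *next;   /* link in the global list */
} ToyNode;

static ToyNode *all_nodes;  /* plays the role of all_bdrv_states */

/* Create a node with one caller-owned reference and append it to the list. */
static ToyNode *toy_new(ToyNode *child)
{
    ToyNode *n = calloc(1, sizeof(*n));
    ToyNode **p = &all_nodes;

    assert(n);
    n->refcnt = 1;
    n->child = child;
    if (child) {
        child->refcnt++;
    }
    while (*p) {
        p = &(*p)->next;
    }
    *p = n;
    return n;
}

/* Drop one reference; at zero, unlink the node, free it, unref its child. */
static void toy_unref(ToyNode *n)
{
    ToyNode **p = &all_nodes;
    ToyNode *child;

    assert(n->refcnt > 0);
    if (--n->refcnt > 0) {
        return;
    }
    while (*p != n) {
        p = &(*p)->next;
    }
    *p = n->next;
    child = n->child;
    free(n);
    if (child) {
        toy_unref(child);
    }
}

static void toy_job_start(ToyNode *n)
{
    n->has_job = 1;
    n->refcnt++;
}

/* Same shape as the loop in the patch: cancel one job per iteration until
 * the list is empty. Returns the number of jobs cancelled. */
static int toy_close_all(void)
{
    int cancelled = 0;

    while (all_nodes) {
        ToyNode *n = all_nodes;

        while (n && !n->has_job) {
            n = n->next;
        }
        assert(n);          /* corresponds to assert(bs) in the patch */
        n->refcnt++;        /* temporary reference, like bdrv_ref(bs) */
        n->has_job = 0;
        toy_unref(n);       /* the job releases its reference */
        cancelled++;
        toy_unref(n);       /* drop the temporary reference */
    }
    return cancelled;
}

/* Case (2): A references B, jobs run on both, and B's job is found first. */
static int toy_case2(void)
{
    ToyNode *b = toy_new(NULL);
    ToyNode *a = toy_new(b);

    toy_job_start(a);
    toy_job_start(b);
    toy_unref(a);           /* drop the creator references */
    toy_unref(b);
    return toy_close_all();
}
```

In toy_case2(), B is still referenced by A after its own job is cancelled, so
it stays on the list; it only goes away in the next iteration, when cancelling
the job on A deletes A and, through it, B.
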
>
>
> And we do make sure that there are no additional references besides:
> - directly from the monitor (monitor-owned BDSs)
> - from a BB
> - from block jobs
> - (+ everything transitively through the respective BDS tree)
>
> If there were additional references, the inner loop would at some point
> no longer find a BDS with a block job while all_bdrv_states is still not
> empty. That's what the "assert(bs)" after the inner loop is for.
>
> If you can imagine another way where a reference to a BDS may come from,
> that would be a bug in this patch and we have to make sure to respect
> that case, too.

The only reference I'm not sure about is the one in xen_disk. blk_unref() is
called in the .free and .disconnect callbacks, but I have no idea whether they
are called before bdrv_close_all().

Otherwise this patch looks good to me.
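
For reference, the failure mode that assert(bs) guards against can be sketched
as below. This is a toy under stated assumptions, not QEMU or Xen code: ToyBDS
and the toy_* functions are hypothetical, and the "external holder" is just a
stand-in for any reference that is neither monitor-owned, from a BB, nor from a
block job. Whether xen_disk can actually end up in this state is exactly the
part I don't know.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical toy type; refcnt > 0 means "still on all_bdrv_states". */
typedef struct ToyBDS {
    int refcnt;
    bool has_job;
} ToyBDS;

/* Mirrors the inner QTAILQ_FOREACH: first live node with a job, or NULL. */
static ToyBDS *toy_find_job_node(ToyBDS *nodes, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        if (nodes[i].refcnt > 0 && nodes[i].has_job) {
            return &nodes[i];
        }
    }
    return NULL;
}

static bool toy_any_live(const ToyBDS *nodes, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        if (nodes[i].refcnt > 0) {
            return true;
        }
    }
    return false;
}

/* Returns true if every node was released, false at the point where the
 * patch would hit assert(bs): nodes remain, but none of them has a job. */
static bool toy_close_all(ToyBDS *nodes, size_t n)
{
    while (toy_any_live(nodes, n)) {
        ToyBDS *bs = toy_find_job_node(nodes, n);

        if (!bs) {
            return false;
        }
        bs->has_job = false;
        bs->refcnt--;       /* the cancelled job drops its reference */
    }
    return true;
}

static bool toy_shutdown(bool release_before_close_all)
{
    ToyBDS nodes[2] = {
        { .refcnt = 1, .has_job = true },   /* referenced only by its job */
        { .refcnt = 1, .has_job = false },  /* referenced by an external holder */
    };

    if (release_before_close_all) {
        nodes[1].refcnt--;  /* the holder lets go in time */
    }
    return toy_close_all(nodes, 2);
}
```

Here toy_shutdown(true) drains cleanly, while toy_shutdown(false) stops with a
node still alive and no job left to cancel.
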
Thanks,
Fam