From: Vladimir Sementsov-Ogievskiy <vsementsov@yandex-team.ru>
To: Kevin Wolf <kwolf@redhat.com>
Cc: qemu-block@nongnu.org, hreitz@redhat.com, f.ebner@proxmox.com,
jsnow@redhat.com, jean-louis@dupond.be,
dionbosschieter@gmail.com, qemu-devel@nongnu.org,
qemu-stable@nongnu.org, pbonzini@redhat.com
Subject: Re: [PATCH] mirror: Fix missed dirty bitmap writes during startup
Date: Tue, 24 Mar 2026 17:44:27 +0300 [thread overview]
Message-ID: <af3bec19-96ee-4de2-b9bf-3875d59aaed1@yandex-team.ru> (raw)
In-Reply-To: <aaqflohdSVTFpkRZ@redhat.com>
On 06.03.26 12:34, Kevin Wolf wrote:
> Am 05.03.2026 um 19:34 hat Vladimir Sementsov-Ogievskiy geschrieben:
>> On 19.02.26 23:24, Kevin Wolf wrote:
>>> Currently, mirror disables the block layer's dirty bitmap before its own
>>> replacement is working. This means that during startup, there is a
>>> window in which the allocation status of blocks in the source has
>>> already been checked, but new writes coming in aren't tracked yet,
>>> resulting in a corrupted copy:
>>>
>>> 1. Dirty bitmap is disabled in mirror_start_job()
>>> 2. Some request are started in mirror_top_bs while s->job == NULL
>>> 3. mirror_dirty_init() -> bdrv_co_is_allocated_above() runs and because
>>> the request hasn't completed yet, the block isn't allocated
>>> 4. The request completes, still sees s->job == NULL and skips the
>>> bitmap, and nothing else will mark it dirty either
>>>
>>> One ingredient is that mirror_top_opaque->job is only set after the
>>> job is fully initialized. For the rationale, see commit 32125b1460
>>> ("mirror: Fix access of uninitialised fields during start").
>>>
>>> Fix this by giving mirror_top_bs access to dirty_bitmap and enabling it
>>> to track writes from the beginning. Disabling the block layer's tracking
>>> and enabling the mirror_top_bs one happens in a drained section, so
>>> there is no danger of races with in-flight requests any more. All of
>>> this happens well before the block allocation status is checked, so we
>>> can be sure that no writes will be missed.
>>>
>>> Cc: qemu-stable@nongnu.org
>>> Closes: https://gitlab.com/qemu-project/qemu/-/issues/3273
>>> Fixes: 32125b14606a ('mirror: Fix access of uninitialised fields during start')
>>> Signed-off-by: Kevin Wolf <kwolf@redhat.com>
>>> ---
>>> Supersedes: <20260212120411.369498-1-f.ebner@proxmox.com>
>>> ---
>>> block/mirror.c | 48 +++++++++++++++++++++++++++++-------------------
>>> 1 file changed, 29 insertions(+), 19 deletions(-)
>>>
>>> diff --git a/block/mirror.c b/block/mirror.c
>>> index b344182c747..f38636e7457 100644
>>> --- a/block/mirror.c
>>> +++ b/block/mirror.c
>>> @@ -99,6 +99,7 @@ typedef struct MirrorBlockJob {
>>> typedef struct MirrorBDSOpaque {
>>> MirrorBlockJob *job;
>>> + BdrvDirtyBitmap *dirty_bitmap;
>>> bool stop;
>>> bool is_commit;
>>> } MirrorBDSOpaque;
>>> @@ -1672,9 +1673,9 @@ bdrv_mirror_top_do_write(BlockDriverState *bs, MirrorMethod method,
>>> abort();
>>> }
>>> - if (!copy_to_target && s->job && s->job->dirty_bitmap) {
>>> + if (!copy_to_target) {
>>> qatomic_set(&s->job->actively_synced, false);
>>> - bdrv_set_dirty_bitmap(s->job->dirty_bitmap, offset, bytes);
>>> + bdrv_set_dirty_bitmap(s->dirty_bitmap, offset, bytes);
>>> }
>>> if (ret < 0) {
>>> @@ -1901,13 +1902,35 @@ static BlockJob *mirror_start_job(
>>> bdrv_drained_begin(bs);
>>> ret = bdrv_append(mirror_top_bs, bs, errp);
>>> - bdrv_drained_end(bs);
>>> -
>>> if (ret < 0) {
>>> + bdrv_drained_end(bs);
>>> + bdrv_unref(mirror_top_bs);
>>> + return NULL;
>>> + }
>>> +
>>> + bs_opaque->dirty_bitmap = bdrv_create_dirty_bitmap(mirror_top_bs,
>>> + granularity,
>>> + NULL, errp);
>>> + if (!bs_opaque->dirty_bitmap) {
>>> + bdrv_drained_end(bs);
>>> bdrv_unref(mirror_top_bs);
>>> return NULL;
>>> }
>>> + /*
>>> + * The mirror job doesn't use the block layer's dirty tracking because it
>>> + * needs to be able to switch seemlessly between background copy mode (which
>>> + * does need dirty tracking) and write blocking mode (which doesn't) and
>>> + * doing that would require draining the node. Instead, mirror_top_bs takes
>>> + * care of updating the dirty bitmap as appropriate.
>>> + *
>>> + * Note that write blocking mode only becomes effective after mirror_run()
>>> + * sets mirror_top_opaque->job (see should_copy_to_target()). Until then,
>>> + * we're still in background copy mode irrespective of @copy_mode.
>>> + */
>>> + bdrv_disable_dirty_bitmap(bs_opaque->dirty_bitmap);
>>> + bdrv_drained_end(bs);
>>> +
>>> /* Make sure that the source is not resized while the job is running */
>>> s = block_job_create(job_id, driver, NULL, mirror_top_bs,
>>> BLK_PERM_CONSISTENT_READ,
>>> @@ -2002,24 +2025,13 @@ static BlockJob *mirror_start_job(
>>> s->base_overlay = bdrv_find_overlay(bs, base);
>>> s->granularity = granularity;
>>> s->buf_size = ROUND_UP(buf_size, granularity);
>>> + s->dirty_bitmap = bs_opaque->dirty_bitmap;
>>> s->unmap = unmap;
>>> if (auto_complete) {
>>> s->should_complete = true;
>>> }
>>> bdrv_graph_rdunlock_main_loop();
>>> - s->dirty_bitmap = bdrv_create_dirty_bitmap(s->mirror_top_bs, granularity,
>>> - NULL, errp);
>>> - if (!s->dirty_bitmap) {
>>> - goto fail;
>>> - }
>>> -
>>> - /*
>>> - * The dirty bitmap is set by bdrv_mirror_top_do_write() when not in active
>>> - * mode.
>>> - */
>>> - bdrv_disable_dirty_bitmap(s->dirty_bitmap);
>>> -
>>> bdrv_graph_wrlock_drained();
>>> ret = block_job_add_bdrv(&s->common, "source", bs, 0,
>>> BLK_PERM_WRITE_UNCHANGED | BLK_PERM_WRITE |
>>> @@ -2099,9 +2111,6 @@ fail:
>>> g_free(s->replaces);
>>> blk_unref(s->target);
>>> bs_opaque->job = NULL;
>>> - if (s->dirty_bitmap) {
>>> - bdrv_release_dirty_bitmap(s->dirty_bitmap);
>>> - }
>>> job_early_fail(&s->common.job);
>>> }
>>> @@ -2115,6 +2124,7 @@ fail:
>>> bdrv_graph_wrunlock();
>>> bdrv_drained_end(bs);
>>> + bdrv_release_dirty_bitmap(bs_opaque->dirty_bitmap);
>>
>>
>> Hmm. Shouldn't we change position of _release_ in mirror_exit_common() too?
>>
>> Now the sequence is:
>>
>> bdrv_release_dirty_bitmap(s->dirty_bitmap);
>>
>>
>> < could mirror_top_bs access dirty_bitmap here, before drained begin? >
>>
>> ...
>>
>> drained begin
>>
>> .. a lot of logic, including actual removing of the mirror_top_bs from the chain ..
>>
>> drained end
>>
>> bdrv_unref(mirror_top_bs)
>
> I think you're right, but isn't this already a preexisting bug in
> master? After releasing, we don't set s->dirty_bitmap = NULL, which
> could have prevented the access in the code before this patch. So this
> should probably be a separate patch.
>
> mirror_exit_common() runs in the main loop, so I assume you can
> hit this when using an iothread.
>
> It seems that initially the release was later, but commit 2119882 moved
> it earlier, without saying why it did that. Paolo, do you remember?
>
> Kevin
>
On the other hand, in mirror_exit_common(), we are already in
bdrv_drained_begin(bs); section, started in mirror_run()...
Does bdrv_drained_begin(mirror_top_bs) add something to previous
bdrv_drained_begin(bs) ?
--
Best regards,
Vladimir
next prev parent reply other threads:[~2026-03-24 14:45 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-02-19 20:24 [PATCH] mirror: Fix missed dirty bitmap writes during startup Kevin Wolf
2026-02-20 14:00 ` Fiona Ebner
2026-02-24 13:58 ` Kevin Wolf
2026-02-24 14:06 ` Fiona Ebner
2026-02-25 12:32 ` Jean-Louis Dupond
2026-03-02 9:49 ` Jean-Louis Dupond
2026-03-02 13:13 ` Kevin Wolf
2026-03-05 18:34 ` Vladimir Sementsov-Ogievskiy
2026-03-06 9:34 ` Kevin Wolf
2026-03-24 14:44 ` Vladimir Sementsov-Ogievskiy [this message]
2026-03-25 10:13 ` Fiona Ebner
2026-03-08 8:25 ` Michael Tokarev
2026-03-10 16:22 ` Fiona Ebner
2026-03-10 18:35 ` Michael Tokarev
2026-03-11 11:10 ` Fiona Ebner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=af3bec19-96ee-4de2-b9bf-3875d59aaed1@yandex-team.ru \
--to=vsementsov@yandex-team.ru \
--cc=dionbosschieter@gmail.com \
--cc=f.ebner@proxmox.com \
--cc=hreitz@redhat.com \
--cc=jean-louis@dupond.be \
--cc=jsnow@redhat.com \
--cc=kwolf@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=qemu-stable@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox