public inbox for qemu-devel@nongnu.org
 help / color / mirror / Atom feed
From: Vladimir Sementsov-Ogievskiy <vsementsov@yandex-team.ru>
To: Kevin Wolf <kwolf@redhat.com>
Cc: qemu-block@nongnu.org, hreitz@redhat.com, f.ebner@proxmox.com,
	jsnow@redhat.com, jean-louis@dupond.be,
	dionbosschieter@gmail.com, qemu-devel@nongnu.org,
	qemu-stable@nongnu.org, pbonzini@redhat.com
Subject: Re: [PATCH] mirror: Fix missed dirty bitmap writes during startup
Date: Tue, 24 Mar 2026 17:44:27 +0300	[thread overview]
Message-ID: <af3bec19-96ee-4de2-b9bf-3875d59aaed1@yandex-team.ru> (raw)
In-Reply-To: <aaqflohdSVTFpkRZ@redhat.com>

On 06.03.26 12:34, Kevin Wolf wrote:
> Am 05.03.2026 um 19:34 hat Vladimir Sementsov-Ogievskiy geschrieben:
>> On 19.02.26 23:24, Kevin Wolf wrote:
>>> Currently, mirror disables the block layer's dirty bitmap before its own
>>> replacement is working. This means that during startup, there is a
>>> window in which the allocation status of blocks in the source has
>>> already been checked, but new writes coming in aren't tracked yet,
>>> resulting in a corrupted copy:
>>>
>>> 1. Dirty bitmap is disabled in mirror_start_job()
>>> 2. Some request are started in mirror_top_bs while s->job == NULL
>>> 3. mirror_dirty_init() -> bdrv_co_is_allocated_above() runs and because
>>>      the request hasn't completed yet, the block isn't allocated
>>> 4. The request completes, still sees s->job == NULL and skips the
>>>      bitmap, and nothing else will mark it dirty either
>>>
>>> One ingredient is that mirror_top_opaque->job is only set after the
>>> job is fully initialized. For the rationale, see commit 32125b1460
>>> ("mirror: Fix access of uninitialised fields during start").
>>>
>>> Fix this by giving mirror_top_bs access to dirty_bitmap and enabling it
>>> to track writes from the beginning. Disabling the block layer's tracking
>>> and enabling the mirror_top_bs one happens in a drained section, so
>>> there is no danger of races with in-flight requests any more. All of
>>> this happens well before the block allocation status is checked, so we
>>> can be sure that no writes will be missed.
>>>
>>> Cc: qemu-stable@nongnu.org
>>> Closes: https://gitlab.com/qemu-project/qemu/-/issues/3273
>>> Fixes: 32125b14606a ('mirror: Fix access of uninitialised fields during start')
>>> Signed-off-by: Kevin Wolf <kwolf@redhat.com>
>>> ---
>>> Supersedes: <20260212120411.369498-1-f.ebner@proxmox.com>
>>> ---
>>>    block/mirror.c | 48 +++++++++++++++++++++++++++++-------------------
>>>    1 file changed, 29 insertions(+), 19 deletions(-)
>>>
>>> diff --git a/block/mirror.c b/block/mirror.c
>>> index b344182c747..f38636e7457 100644
>>> --- a/block/mirror.c
>>> +++ b/block/mirror.c
>>> @@ -99,6 +99,7 @@ typedef struct MirrorBlockJob {
>>>    typedef struct MirrorBDSOpaque {
>>>        MirrorBlockJob *job;
>>> +    BdrvDirtyBitmap *dirty_bitmap;
>>>        bool stop;
>>>        bool is_commit;
>>>    } MirrorBDSOpaque;
>>> @@ -1672,9 +1673,9 @@ bdrv_mirror_top_do_write(BlockDriverState *bs, MirrorMethod method,
>>>            abort();
>>>        }
>>> -    if (!copy_to_target && s->job && s->job->dirty_bitmap) {
>>> +    if (!copy_to_target) {
>>>            qatomic_set(&s->job->actively_synced, false);
>>> -        bdrv_set_dirty_bitmap(s->job->dirty_bitmap, offset, bytes);
>>> +        bdrv_set_dirty_bitmap(s->dirty_bitmap, offset, bytes);
>>>        }
>>>        if (ret < 0) {
>>> @@ -1901,13 +1902,35 @@ static BlockJob *mirror_start_job(
>>>        bdrv_drained_begin(bs);
>>>        ret = bdrv_append(mirror_top_bs, bs, errp);
>>> -    bdrv_drained_end(bs);
>>> -
>>>        if (ret < 0) {
>>> +        bdrv_drained_end(bs);
>>> +        bdrv_unref(mirror_top_bs);
>>> +        return NULL;
>>> +    }
>>> +
>>> +    bs_opaque->dirty_bitmap = bdrv_create_dirty_bitmap(mirror_top_bs,
>>> +                                                       granularity,
>>> +                                                       NULL, errp);
>>> +    if (!bs_opaque->dirty_bitmap) {
>>> +        bdrv_drained_end(bs);
>>>            bdrv_unref(mirror_top_bs);
>>>            return NULL;
>>>        }
>>> +    /*
>>> +     * The mirror job doesn't use the block layer's dirty tracking because it
>>> +     * needs to be able to switch seemlessly between background copy mode (which
>>> +     * does need dirty tracking) and write blocking mode (which doesn't) and
>>> +     * doing that would require draining the node. Instead, mirror_top_bs takes
>>> +     * care of updating the dirty bitmap as appropriate.
>>> +     *
>>> +     * Note that write blocking mode only becomes effective after mirror_run()
>>> +     * sets mirror_top_opaque->job (see should_copy_to_target()). Until then,
>>> +     * we're still in background copy mode irrespective of @copy_mode.
>>> +     */
>>> +    bdrv_disable_dirty_bitmap(bs_opaque->dirty_bitmap);
>>> +    bdrv_drained_end(bs);
>>> +
>>>        /* Make sure that the source is not resized while the job is running */
>>>        s = block_job_create(job_id, driver, NULL, mirror_top_bs,
>>>                             BLK_PERM_CONSISTENT_READ,
>>> @@ -2002,24 +2025,13 @@ static BlockJob *mirror_start_job(
>>>        s->base_overlay = bdrv_find_overlay(bs, base);
>>>        s->granularity = granularity;
>>>        s->buf_size = ROUND_UP(buf_size, granularity);
>>> +    s->dirty_bitmap = bs_opaque->dirty_bitmap;
>>>        s->unmap = unmap;
>>>        if (auto_complete) {
>>>            s->should_complete = true;
>>>        }
>>>        bdrv_graph_rdunlock_main_loop();
>>> -    s->dirty_bitmap = bdrv_create_dirty_bitmap(s->mirror_top_bs, granularity,
>>> -                                               NULL, errp);
>>> -    if (!s->dirty_bitmap) {
>>> -        goto fail;
>>> -    }
>>> -
>>> -    /*
>>> -     * The dirty bitmap is set by bdrv_mirror_top_do_write() when not in active
>>> -     * mode.
>>> -     */
>>> -    bdrv_disable_dirty_bitmap(s->dirty_bitmap);
>>> -
>>>        bdrv_graph_wrlock_drained();
>>>        ret = block_job_add_bdrv(&s->common, "source", bs, 0,
>>>                                 BLK_PERM_WRITE_UNCHANGED | BLK_PERM_WRITE |
>>> @@ -2099,9 +2111,6 @@ fail:
>>>            g_free(s->replaces);
>>>            blk_unref(s->target);
>>>            bs_opaque->job = NULL;
>>> -        if (s->dirty_bitmap) {
>>> -            bdrv_release_dirty_bitmap(s->dirty_bitmap);
>>> -        }
>>>            job_early_fail(&s->common.job);
>>>        }
>>> @@ -2115,6 +2124,7 @@ fail:
>>>        bdrv_graph_wrunlock();
>>>        bdrv_drained_end(bs);
>>> +    bdrv_release_dirty_bitmap(bs_opaque->dirty_bitmap);
>>
>>
>> Hmm. Shouldn't we change position of _release_ in mirror_exit_common() too?
>>
>> Now the sequence is:
>>
>> bdrv_release_dirty_bitmap(s->dirty_bitmap);
>>
>>
>> < could mirror_top_bs access dirty_bitmap here, before drained begin? >
>>
>> ...
>>
>> drained begin
>>
>> .. a lot of logic, including actual removing of the mirror_top_bs from the chain ..
>>
>> drained end
>>
>> bdrv_unref(mirror_top_bs)
> 
> I think you're right, but isn't this already a preexisting bug in
> master? After releasing, we don't set s->dirty_bitmap = NULL, which
> could have prevented the access in the code before this patch. So this
> should probably be a separate patch.
> 
> mirror_exit_common() runs in the main loop, so I assume you can
> hit this when using an iothread.
> 
> It seems that initially the release was later, but commit 2119882 moved
> it earlier, without saying why it did that. Paolo, do you remember?
> 
> Kevin
> 

On the other hand, in mirror_exit_common(), we are already in

bdrv_drained_begin(bs); section, started in mirror_run()...

Does bdrv_drained_begin(mirror_top_bs) add something to previous
bdrv_drained_begin(bs) ?

-- 
Best regards,
Vladimir


  reply	other threads:[~2026-03-24 14:45 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-02-19 20:24 [PATCH] mirror: Fix missed dirty bitmap writes during startup Kevin Wolf
2026-02-20 14:00 ` Fiona Ebner
2026-02-24 13:58   ` Kevin Wolf
2026-02-24 14:06     ` Fiona Ebner
2026-02-25 12:32     ` Jean-Louis Dupond
2026-03-02  9:49       ` Jean-Louis Dupond
2026-03-02 13:13         ` Kevin Wolf
2026-03-05 18:34 ` Vladimir Sementsov-Ogievskiy
2026-03-06  9:34   ` Kevin Wolf
2026-03-24 14:44     ` Vladimir Sementsov-Ogievskiy [this message]
2026-03-25 10:13       ` Fiona Ebner
2026-03-08  8:25 ` Michael Tokarev
2026-03-10 16:22   ` Fiona Ebner
2026-03-10 18:35     ` Michael Tokarev
2026-03-11 11:10       ` Fiona Ebner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=af3bec19-96ee-4de2-b9bf-3875d59aaed1@yandex-team.ru \
    --to=vsementsov@yandex-team.ru \
    --cc=dionbosschieter@gmail.com \
    --cc=f.ebner@proxmox.com \
    --cc=hreitz@redhat.com \
    --cc=jean-louis@dupond.be \
    --cc=jsnow@redhat.com \
    --cc=kwolf@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=qemu-block@nongnu.org \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-stable@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox