From: Pavel Butsykin <pbutsykin@virtuozzo.com>
To: Stefan Hajnoczi <stefanha@gmail.com>, "Denis V. Lunev" <den@openvz.org>
Cc: qemu-devel@nongnu.org, Kevin Wolf <kwolf@redhat.com>,
Jeff Cody <jcody@redhat.com>,
Markus Armbruster <armbru@redhat.com>,
Stefan Hajnoczi <stefanha@redhat.com>,
John Snow <jsnow@redhat.com>
Subject: Re: [Qemu-devel] [PATCH 02/10] qcow2: add qcow2_co_write_compressed
Date: Mon, 30 May 2016 12:12:36 +0300 [thread overview]
Message-ID: <574C0404.9040507@virtuozzo.com> (raw)
In-Reply-To: <20160527173335.GF27946@stefanha-x1.localdomain>
On 27.05.2016 20:33, Stefan Hajnoczi wrote:
> On Sat, May 14, 2016 at 03:45:50PM +0300, Denis V. Lunev wrote:
>> + qemu_co_mutex_lock(&s->lock);
>> + cluster_offset = \
>> + qcow2_alloc_compressed_cluster_offset(bs, sector_num << 9, out_len);
>
> The backslash isn't necessary for wrapping lines in C. This kind of
> thing is only necessary in languages like Python where the grammar is
> whitespace sensistive.
>
> The C compiler is happy with an arbitrary amount of whitespace
> (newlines) in the middle of a statement. The backslash in C is handled
> by the preprocessor: it joins the line. That's useful for macro
> definitions where you need to tell the preprocessor that several lines
> belong to one macro definition. But it's not needed for normal C code.
>
Thanks for the explanation, but the backslash is used more for the
person as a marker a line break. The current coding style misses this
point, but I can remove the backslash, because I don't think it's
something important :)
>> + if (!cluster_offset) {
>> + qemu_co_mutex_unlock(&s->lock);
>> + ret = -EIO;
>> + goto fail;
>> + }
>> + cluster_offset &= s->cluster_offset_mask;
>>
>> - BLKDBG_EVENT(bs->file, BLKDBG_WRITE_COMPRESSED);
>> - ret = bdrv_pwrite(bs->file->bs, cluster_offset, out_buf, out_len);
>> - if (ret < 0) {
>> - goto fail;
>> - }
>> + ret = qcow2_pre_write_overlap_check(bs, 0, cluster_offset, out_len);
>> + qemu_co_mutex_unlock(&s->lock);
>> + if (ret < 0) {
>> + goto fail;
>> }
>>
>> + iov = (struct iovec) {
>> + .iov_base = out_buf,
>> + .iov_len = out_len,
>> + };
>> + qemu_iovec_init_external(&hd_qiov, &iov, 1);
>> +
>> + BLKDBG_EVENT(bs->file, BLKDBG_WRITE_COMPRESSED);
>> + ret = bdrv_co_pwritev(bs->file->bs, cluster_offset, out_len, &hd_qiov, 0);
>
> There is a race condition here:
>
> If the newly allocated cluster is only partially filled by compressed
> data then qcow2_alloc_compressed_cluster_offset() remembers that more
> bytes are still available in the cluster. The
> qcow2_alloc_compressed_cluster_offset() caller will continue filling the
> same cluster.
>
> Imagine two compressed writes running at the same time. Write A
> allocates just a few bytes so write B shares a sector with the first
> write:
>
> Sector 1
> |AAABBBBBBBBB|
>
> The race condition is that bdrv_co_pwritev() uses read-modify-write (a
> bounce buffer). If both requests call bdrv_co_pwritev() around the same
> time then the following could happen:
>
> Sector 1
> |000BBBBBBBBB|
>
> or:
>
> Sector 1
> |AAA000000000|
>
> It's necessary to hold s->lock around the compressed data write to avoid
> this race condition.
>
I agree, there is really a race.. Thank you, this is a very good point!
next prev parent reply other threads:[~2016-05-30 9:12 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-05-14 12:45 [Qemu-devel] [PATCH v3 00/10] backup compression Denis V. Lunev
2016-05-14 12:45 ` [Qemu-devel] [PATCH 01/10] block/io: add bdrv_co_write_compressed Denis V. Lunev
2016-05-16 16:52 ` Eric Blake
2016-05-17 15:01 ` Pavel Butsykin
2016-05-19 21:25 ` Stefan Hajnoczi
2016-05-19 21:39 ` Denis V. Lunev
2016-05-14 12:45 ` [Qemu-devel] [PATCH 02/10] qcow2: add qcow2_co_write_compressed Denis V. Lunev
2016-05-27 17:33 ` Stefan Hajnoczi
2016-05-30 9:12 ` Pavel Butsykin [this message]
2016-05-30 12:58 ` Pavel Butsykin
2016-05-31 18:42 ` Eric Blake
2016-05-31 21:00 ` Denis V. Lunev
2016-05-31 21:13 ` Eric Blake
2016-06-01 9:53 ` Pavel Butsykin
2016-06-01 9:31 ` Kevin Wolf
2016-06-01 9:25 ` Kevin Wolf
2016-06-01 20:06 ` Stefan Hajnoczi
2016-05-14 12:45 ` [Qemu-devel] [PATCH 03/10] vmdk: add vmdk_co_write_compressed Denis V. Lunev
2016-05-27 17:38 ` Stefan Hajnoczi
2016-05-14 12:45 ` [Qemu-devel] [PATCH 04/10] qcow: add qcow_co_write_compressed Denis V. Lunev
2016-05-27 17:45 ` Stefan Hajnoczi
2016-05-30 14:27 ` Pavel Butsykin
2016-05-14 12:45 ` [Qemu-devel] [PATCH 05/10] block: remove BlockDriver.bdrv_write_compressed Denis V. Lunev
2016-05-16 16:57 ` Eric Blake
2016-05-17 12:22 ` Pavel Butsykin
2016-05-14 12:45 ` [Qemu-devel] [PATCH 06/10] drive-backup: added support for data compression Denis V. Lunev
2016-05-16 16:59 ` Eric Blake
2016-05-27 17:56 ` Stefan Hajnoczi
2016-05-14 12:45 ` [Qemu-devel] [PATCH 07/10] blockdev-backup: " Denis V. Lunev
2016-05-16 17:00 ` Eric Blake
2016-05-27 17:57 ` Stefan Hajnoczi
2016-05-14 12:45 ` [Qemu-devel] [PATCH 08/10] qemu-iotests: test backup compression in 055 Denis V. Lunev
2016-05-14 12:45 ` [Qemu-devel] [PATCH 09/10] block: fix backup in vmdk format image Denis V. Lunev
2016-05-27 18:01 ` Stefan Hajnoczi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=574C0404.9040507@virtuozzo.com \
--to=pbutsykin@virtuozzo.com \
--cc=armbru@redhat.com \
--cc=den@openvz.org \
--cc=jcody@redhat.com \
--cc=jsnow@redhat.com \
--cc=kwolf@redhat.com \
--cc=qemu-devel@nongnu.org \
--cc=stefanha@gmail.com \
--cc=stefanha@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).