From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from eggs.gnu.org ([2001:4830:134:3::10]:43653)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <mreitz@redhat.com>) id 1YDZNW-0005vk-Ea
	for qemu-devel@nongnu.org; Tue, 20 Jan 2015 09:02:03 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <mreitz@redhat.com>) id 1YDZNR-0003JU-Fl
	for qemu-devel@nongnu.org; Tue, 20 Jan 2015 09:01:58 -0500
Received: from mx1.redhat.com ([209.132.183.28]:46160)
	by eggs.gnu.org with esmtp (Exim 4.71)
	(envelope-from <mreitz@redhat.com>) id 1YDZNR-0003JB-8n
	for qemu-devel@nongnu.org; Tue, 20 Jan 2015 09:01:53 -0500
Received: from int-mx14.intmail.prod.int.phx2.redhat.com
	(int-mx14.intmail.prod.int.phx2.redhat.com [10.5.11.27])
	by mx1.redhat.com (8.14.4/8.14.4) with ESMTP id t0KE1q34001024
	(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256
	verify=FAIL)
	for <qemu-devel@nongnu.org>; Tue, 20 Jan 2015 09:01:52 -0500
Message-ID: <54BE5FCF.1020801@redhat.com>
Date: Tue, 20 Jan 2015 09:01:51 -0500
From: Max Reitz <mreitz@redhat.com>
MIME-Version: 1.0
References: <1421700544-24635-1-git-send-email-mreitz@redhat.com>
	<54BD7173.7060901@redhat.com> <54BD727D.5010608@redhat.com>
	<20150120100903.GA4318@noname.redhat.com>
	<54BE5CE0.4080207@redhat.com>
	<20150120140023.GB25265@noname.redhat.com>
In-Reply-To: <20150120140023.GB25265@noname.redhat.com>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: quoted-printable
Subject: Re: [Qemu-devel] [PATCH 0/2] qcow2: Add two more unalignment checks
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
To: Kevin Wolf <kwolf@redhat.com>
Cc: qemu-devel@nongnu.org, Stefan Hajnoczi <stefanha@redhat.com>

On 2015-01-20 at 09:00, Kevin Wolf wrote:
> Am 20.01.2015 um 14:49 hat Max Reitz geschrieben:
>> On 2015-01-20 at 05:09, Kevin Wolf wrote:
>>> Am 19.01.2015 um 22:09 hat Max Reitz geschrieben:
>>>> On 2015-01-19 at 16:04, Eric Blake wrote:
>>>>> On 01/19/2015 01:49 PM, Max Reitz wrote:
>>>>>> With the series adding unalignment checks and the series reworking=
 the
>>>>>> zero cluster expansion code overlapping, the unalignment checks ha=
ve not
>>>>>> been implemented in the latter code.
>>>>>>
>>>>>> This series fixes it.
>>>>>>
>>>>>> There are other places which would require unalignment checks, lik=
e the
>>>>>> offsets of L1 tables, especially for snapshots; but because it wou=
ld be
>>>>>> best to add these checks in the function which reads the snapshot =
table,
>>>>>> this would make images with broken snapshots completely unusable, =
which
>>>>>> is something I opted to avoid for now.
>>> That's something I noticed, too: For qemu-img check, our qcow2_open()
>>> checks may be to strict. With a corrupted snapshot table, it won't ev=
en
>>> run. Perhaps we should be laxer with some checks with BDRV_O_CHECK, a=
nd
>>> for example just set s->nb_snapshots =3D 0 if loading the table fails.
>>>
>>>>>> Ideally, we need to make the qcow2 repair function repair such cas=
es,
>>>>>> but until that is done there is not much we can do about them.
>>>>> What's the best repair?
>>>> That's what I was asking myself...
>>>>
>>>>> Read the data from the unaligned location, and
>>>>> write a fresh copy into a new aligned allocation?
>>>> Maybe. Maybe there is no way of repairing them and the only way is
>>>> asking the user whether it's fine to delete the snapshot (qemu-img
>>>> check -r make_consistent_whatever_it_takes).
>>>>
>>>> Also, the question remains for every unaligned data structure: L2
>>>> tables, data clusters=E2=80=A6 The refcount structures are the only =
ones
>>>> that can be easily recovered. For data clusters, reading from the
>>>> unaligned location would probably be best; for L2 tables=E2=80=A6 ma=
ybe the
>>>> same, then run a consistency check on the data and if it seems off=E2=
=84=A2,
>>>> throw the whole L2 table away.
>>> Reading from the unaligned location doesn't help. I have never seen a=
n
>>> image whose L2 entries contained valid entries, except for the least
>>> significant few bits. If your table is corrupted, it's usually garbag=
e
>> >from start to end.
>>
>> Probably so.
>>
>>> I think the only sane option is to drop the entries. The big question
>>> is what should be used to replace them.
>> /dev/random of course. There is a chance we are correct=E2=80=A6
>>
>> Seriously speaking, well, unmap them? Zero clusters don't feel right t=
o me.
> Actually, I have a feeling that zero clusters are better because they
> are obviously wrong when you debug a problem in the guest. Falling back
> to the backing file may result in patterns that don't look obviously
> corrupted, or even expose information to users that shouldn't have
> permission to read it.

That's a good point. Very well then, we only need to remember this=20
thread when we get to the implementation of all this...

Max