From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <SRS0=nSo8=TI=nongnu.org=qemu-devel-bounces+qemu-devel=archiver.kernel.org@kernel.org>
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
	aws-us-west-2-korg-lkml-1.web.codeaurora.org
X-Spam-Level: 
X-Spam-Status: No, score=-2.4 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS,
	MAILING_LIST_MULTI,SPF_PASS,T_HK_NAME_DR,USER_AGENT_MUTT autolearn=ham
	autolearn_force=no version=3.4.0
Received: from mail.kernel.org (mail.kernel.org [198.145.29.99])
	by smtp.lore.kernel.org (Postfix) with ESMTP id 181A5C04A6B
	for <qemu-devel@archiver.kernel.org>; Wed,  8 May 2019 12:45:28 +0000 (UTC)
Received: from lists.gnu.org (lists.gnu.org [209.51.188.17])
	(using TLSv1 with cipher AES256-SHA (256/256 bits))
	(No client certificate requested)
	by mail.kernel.org (Postfix) with ESMTPS id D170C20644
	for <qemu-devel@archiver.kernel.org>; Wed,  8 May 2019 12:45:27 +0000 (UTC)
DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org D170C20644
Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=redhat.com
Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Received: from localhost ([127.0.0.1]:36503 helo=lists.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org>)
	id 1hOLwt-0007XN-3r
	for qemu-devel@archiver.kernel.org; Wed, 08 May 2019 08:45:27 -0400
Received: from eggs.gnu.org ([209.51.188.92]:54345)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <dgilbert@redhat.com>) id 1hOLvh-0006jh-TU
	for qemu-devel@nongnu.org; Wed, 08 May 2019 08:44:15 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <dgilbert@redhat.com>) id 1hOLvg-0008Na-6d
	for qemu-devel@nongnu.org; Wed, 08 May 2019 08:44:13 -0400
Received: from mx1.redhat.com ([209.132.183.28]:52900)
	by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32)
	(Exim 4.71) (envelope-from <dgilbert@redhat.com>) id 1hOLvf-0008N4-UJ
	for qemu-devel@nongnu.org; Wed, 08 May 2019 08:44:12 -0400
Received: from smtp.corp.redhat.com (int-mx01.intmail.prod.int.phx2.redhat.com
	[10.5.11.11])
	(using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits))
	(No client certificate requested)
	by mx1.redhat.com (Postfix) with ESMTPS id 4A0503DD99
	for <qemu-devel@nongnu.org>; Wed,  8 May 2019 12:44:11 +0000 (UTC)
Received: from work-vm (ovpn-117-175.ams2.redhat.com [10.36.117.175])
	by smtp.corp.redhat.com (Postfix) with ESMTPS id 62FFA600D4;
	Wed,  8 May 2019 12:44:07 +0000 (UTC)
Date: Wed, 8 May 2019 13:44:04 +0100
From: "Dr. David Alan Gilbert" <dgilbert@redhat.com>
To: Markus Armbruster <armbru@redhat.com>
Message-ID: <20190508124404.GH2718@work-vm>
References: <20190430131919.GN6818@redhat.com> <20190430144546.GA3065@work-vm>
	<20190430150556.GA2423@redhat.com>
	<87sgtqejn9.fsf@dusky.pond.sub.org>
	<20190507093954.GG27205@redhat.com>
	<f30e9f99-5d9a-5d78-754e-a2acaa799edc@redhat.com>
	<87imul3ywc.fsf@dusky.pond.sub.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
In-Reply-To: <87imul3ywc.fsf@dusky.pond.sub.org>
User-Agent: Mutt/1.11.4 (2019-03-13)
X-Scanned-By: MIMEDefang 2.79 on 10.5.11.11
X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16
	(mx1.redhat.com [10.5.110.29]);
	Wed, 08 May 2019 12:44:11 +0000 (UTC)
Content-Transfer-Encoding: quoted-printable
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic]
X-Received-From: 209.132.183.28
Subject: Re: [Qemu-devel] QMP; unsigned 64-bit ints;
 JSON standards compliance
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.21
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.nongnu.org/archive/html/qemu-devel/>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Cc: libvir-list@redhat.com, =?iso-8859-1?B?SuFu?= Tomko <jtomko@redhat.com>,
	qemu-devel@nongnu.org
Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Sender: "Qemu-devel"
	<qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org>

* Markus Armbruster (armbru@redhat.com) wrote:
> Eric Blake <eblake@redhat.com> writes:
>=20
> > On 5/7/19 4:39 AM, Daniel P. Berrang=E9 wrote:
> >
> >>> JSON is terrible at interoperability, so good luck with that.
> >>>
> >>> If you reduce your order to "the commonly used JSON libraries we kn=
ow",
> >>> we can talk.
> >>=20
> >> I don't particularly want us to rely on semantics of small known set
> >> of JSON libs. I really do want us to do something that is capable of
> >> working with any JSON impl that exists in any programming language.
> >>=20
> >> My suggested option 2 & 3 at least would manage that I believe, as
> >> any credible JSON impl will be able to represent 32-bit integers
> >> or strings without loosing data.
> >>=20
> >> Option 1 would not cope as some impls can't even cope with
> >> signed 64-bit ints.
> >>=20
> >>>>>> I can think of some options:
> >>>>>>
> >>>>>>   1. Encode unsigned 64-bit integers as signed 64-bit integers.
> >>>>>>
> >>>>>>      This follows the example that most C libraries map JSON int=
s
> >>>>>>      to 'long long int'. This is still relying on undefined
> >>>>>>      behaviour as apps don't need to support > 2^53-1.
> >>>>>>
> >>>>>>      Apps would need to cast back to 'unsigned long long' for
> >>>>>>      those QMP fields they know are supposed to be unsigned.
> >>>
> >>> Ugly.  It's also what we did until v2.10, August 2017.  QMP's input
> >>> direction still does it, for backward compatibility.
> >
> > Having qemu accept signed ints in place of large unsigned values is e=
asy
> > enough. But you are right that it loses precision when doubles are
> > involved on the receiving end, and we cross the 2^53 barrier.
> >
> >>>
> >>>>>>
> >>>>>>
> >>>>>>   2. Encode all 64-bit integers as a pair of 32-bit integers.
> >>>>>>    =20
> >>>>>>      This is fully compliant with the JSON spec as each half
> >>>>>>      is fully within the declared limits. App has to split or
> >>>>>>      assemble the 2 pieces from/to a signed/unsigned 64-bit
> >>>>>>      int as needed.
> >>>
> >>> Differently ugly.
> >
> > Particularly ugly as we turn 1<<55 from:
> >
> > "value":36028797018963968
> >
> > into
> >
> > "value":[8388608,0]
> >
> > and now both qemu and the client end have to agree that an array of t=
wo
> > integers is a valid replacement for any larger 64-bit quantity
> > (presumably, we'd always accept the array form even for small integer
> > values, but only produce the array form for large values).  And while=
 it
> > manages just fine for uint64_t values, what rules would you place on
> > int64_t values? That the resulting 2-integer array is combined with t=
he
> > first number as a 2's-complement signed value, and the second being a
> > 32-bit unsigned value?
>=20
> There's more than one way to encode integers as a list of 53 bit signed
> integers.  Any of them will do, we just have to specify one.
>=20
> >>>>>>
> >>>>>>
> >>>>>>   3. Encode all 64-bit integers as strings
> >>>>>>
> >>>>>>      The application has todo all parsing/formatting client
> >>>>>>      side.
> >>>
> >>> Yet another ugly.
> >
> > But less so than option 2.
> >
> > "value":36028797018963968
> >
> > vs.
> >
> > "value":"36028797018963968"
> >
> > is at least tolerable.
>=20
> Yes.
>=20
> >>>>>> None of these changes are backwards compatible, so I doubt we co=
uld make
> >>>>>> the change transparently in QMP.  Instead we would have to have =
a
> >>>>>> QMP greeting message capability where the client can request ena=
blement
> >>>>>> of the enhanced integer handling.
> >>>
> >>> We might be able to do option 1 without capability negotiation.  v2=
.10's
> >>> change from option 1 to what we have now produced zero complaints.
> >>>
> >>> On the other hand, we made that change for a reason, so we may want=
 a
> >>> "send large integers as negative integers" capability regardless.
> >>>
> >>>>>>
> >>>>>> Any of the three options above would likely work for libvirt, bu=
t I
> >>>>>> would have a slight preference for either 2 or 3, so that we bec=
ome
> >>>>>> 100% standards compliant.
> >
> > If we're going to negotiate something, I'd lean towards option 3
> > (anywhere the introspection states that we accept 'int64' or similar,=
 it
> > is also appropriate to send a string value in its place). We'd also h=
ave
> > to decide if we want to allow "0xabcd", or strictly insist on 43981,
> > when stringizing an integer.  And while qemu should accept a string o=
r a
> > number on input, we'd still have to decide/document whether it's
> > response to the client capability negotiation is to output a string
> > always, or only for values larger than the 2^53 threshold.
>=20
> Picking option 3 is no excuse for complicating matters further.  QMP is
> primarily for machines.  So my first choice would be to keep everything
> decimal.  I could be persuaded to have QEMU parse integers from strings
> with base 0, i.e. leading 0x gets you hex, leading 0 gets you octal.

as I said in an earlier reply, my preference would also be to keep it
very strict and simple; although my preference is to stick to hex.
My suggestion was 0x%x format, always with the leading 0x, always lower
case, allowing but not requiring 0 padding, and never more than 18
characters (including the 0x). Always using the string format.

Dave

> >>>
> >>> There's no such thing.  You mean "we maximize interoperability with
> >>> common implementations of JSON".
> >>=20
> >> s/common/any/
> >>=20
> >>> Let's talk implementation for a bit.
> >>>
> >>> Encoding and decoding integers in funny ways should be fairly easy =
in
> >>> the QObject visitors.  The generated QMP marshallers all use them.
> >>> Trouble is a few commands still bypass the generated marshallers, a=
nd
> >>> mess with the QObject themselves:
> >>>
> >>> * query-qmp-schema: minor hack explained in qmp_query_qmp_schema()'=
s
> >>>   comment.  Should be harmless.
> >>>
> >>> * netdev_add: not QAPIfied.  Eric's patches to QAPIfy it got stuck
> >>>   because they reject some abuses like passing numbers and bools as
> >>>   strings.
> >>>
> >>> * device_add: not QAPIfied.  We're not sure QAPIfication is feasibl=
e.
> >>>
> >>> netdev_add and device_add both use qemu_opts_from_qdict().  Perhaps=
 we
> >>> could hack that to mirror what the QObject visitor do.
> >>>
> >>> Else, we might have to do it in the JSON parser.  Should be possibl=
e,
> >>> but I'd rather not.
> >>>
> >>>>> My preference would be 3 with the strings defined as being
> >>>>> %x lower case hex formated with a 0x prefix and no longer than 18=
 characters
> >>>>> ("0x" + 16 nybbles). Zero padding allowed but not required.
> >>>>> It's readable and unambiguous when dealing with addresses; I don'=
t want
> >>>>> to have to start decoding (2) by hand when debugging.
> >>>>
> >>>> Yep, that's a good point about readability.
> >>>
> >>> QMP sending all integers in decimal is inconvenient for some values=
,
> >>> such as addresses.  QMP sending all (large) integers in hexadecimal
> >>> would be inconvenient for other values.
> >>>
> >>> Let's keep it simple & stupid.  If you want sophistication, JSON is=
 the
> >>> wrong choice.
> >
> > JSON requires decimal-only, but I'm okay if we state that when
> > negotiating the alternative representation, that we output hex-only.
> > (JSON5 adds hex support among other things, but it is not an RFC
> > standard, and even fewer libraries exist that parse JSON5 in addition=
 to
> > straight JSON).
> >
> >>>
> >>>
> >>> Option 1 feels simplest.
> >>=20
> >> But will still fail with any JSON impl that uses double precision fl=
oating
> >> point for integers as it will loose precision.
> >>=20
> >>> Option 2 feels ugliest.  Less simple, more interoperable than optio=
n 1.
> >>=20
> >> If we assume any JSON impl can do 32-bit integers without loss of
> >> precision, then I think we can say it is guaranteed portable, but
> >> it is certainly horrible / ugly.
> >>=20
> >>> Option 3 is like option 2, just not quite as ugly.
> >>=20
> >> I think option 3 can be guaranteed to be loss-less with /any/ JSON i=
mpl
> >> that exists, since you're delegating all string -> int conversion to
> >> the application code taking the JSON parser/formatter out of the equ=
ation.
> >>=20
> >> This is close to the approach libvirt takes with YAJL parser today. =
YAJL
> >> parses as a int64 and we then ignore its result, and re-parse the st=
ring
> >> again in libvirt as uint64. When generating json we format as uint64
> >> in libvirt and ignore YAJLs formatting for int64.
> >>=20
> >>> Can we agree to eliminate option 2 from the race?
> >>=20
> >> I'm fine with eliminating option 2.
> >
> > Same here.
>=20
> Noted.
>=20
> >> I guess I'd have a preference for option 3 given that it has better
> >> interoperability
> >
> > Likewise - if we're going to bother with a capability that changes
> > output and allows the input validators to accept more forms, I'd pref=
er
> > a string form with correct sign over a negative integer that depends =
on
> > 64-bit 2's-complement arithmetic to intepret correctly.
>=20
> Noted.
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK