From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43)
	id 1FU9Ap-0000ob-6b
	for qemu-devel@nongnu.org; Thu, 13 Apr 2006 17:16:19 -0400
Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43)
	id 1FU9Ao-0000oG-5U
	for qemu-devel@nongnu.org; Thu, 13 Apr 2006 17:16:18 -0400
Received: from [199.232.76.173] (helo=monty-python.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.43) id 1FU9Ao-0000oD-2l
	for qemu-devel@nongnu.org; Thu, 13 Apr 2006 17:16:18 -0400
Received: from [212.247.155.44] (helo=swip.net)
	by monty-python.gnu.org with esmtp (Exim 4.52) id 1FU9GC-0002e5-Cv
	for qemu-devel@nongnu.org; Thu, 13 Apr 2006 17:21:52 -0400
From: Even Rouault <even.rouault@mines-paris.org>
Subject: Re: [Qemu-devel] [PATCH] SPARC target : Fix carry flagupdate inaddxcc
	and subxc
Date: Thu, 13 Apr 2006 23:14:43 +0200
References: <BAY104-F179318B08734CD6B904AFDFFC30@phx.gbl>
In-Reply-To: <BAY104-F179318B08734CD6B904AFDFFC30@phx.gbl>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="Boundary-00=_E9rPEt2zgrfIGks"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline
Message-Id: <200604132314.44622.even.rouault@mines-paris.org>
Reply-To: qemu-devel@nongnu.org
List-Id: qemu-devel.nongnu.org
List-Unsubscribe: <http://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.gnu.org/pipermail/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <http://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
To: Blue Swirl <blueswir1@hotmail.com>, qemu-devel@nongnu.org

--Boundary-00=_E9rPEt2zgrfIGks
Content-Type: text/plain;
  charset="iso-8859-15"
Content-Transfer-Encoding: quoted-printable

Hello,=20

As far as the V flag is concerned, I've taken a look at the Sparc V8 refere=
nce manual (www.sparc.org/standards/V8.pdf)

We can read at page 170 for the update of the V flag for "addcc" and "addxc=
c":
Vtheory =3D (r[rs1]<31> & operand2<31> & !result<31>) | (!r[rs1]<31> & !ope=
rand2<31> && result<31>)

Let's transform this with the name of the variables in the qemu code :
Vtheory =3D (src1<31> & T1<31> & !T0<31>) | (!src1<31> & !T1<31> & T0<31>)
Vtheory =3D ((src1 & T1 & ~T0) | (~src1 & ~T1 & T0)<31>

And we have in qemu code :
Vqemu =3D ((src1 ^ T1 ^ -1) & (src1 ^ T0))<31>

Now, let's transform Vqemu :
Vqemu =3D ((src1 ^ (T1 ^ -1)) & (src1 ^ T0))<31>
Vqemu =3D ((src1 ^ ~T1) & (src1 ^ T0))<31>
Vqemu =3D (((src1 & ~(~T1)) | (~src1 & ~T1)) & (src1 ^ T0))<31>
Vqemu =3D (((src1 & T1) | (~src1 & ~T1)) & (src1 ^ T0))<31>
Vqemu =3D ((src1 & T1 & (src1 ^ T0)) | (~src1 & ~T1 & (src1 ^ T0)))<31>
Vqemu =3D ((src1 & T1 & ((src1 & ~T0) | (~src1 & T0))) |
                (~src1 & ~T1 & ((src1 & ~T0) | (~src1 & T0))))<31>
Vqemu =3D ((src1 & T1 & src1 & ~T0) | (src1 & T1 & ~src1 & T0) |
                (~src1 & ~T1 & src1 & ~T0) | (~src1 & ~T1 & ~src1 & T0))<31>
Vqemu =3D ((src1 & T1 & ~T0) | (~src1 & ~T1 & T0))<31>
Vqemu =3D Vtheroy !

After theory, a bit of practice! I just wrote a small piece of code that en=
umerates the 2*2*2=3D8 combinations and proves experimentally that Vqemu =
=3D Vtheroy.

int main(int argc, char* argv[])
{
  int src1, T1, T0;
  for(src1=3D0;src1<=3D1;src1++)
  {
    for(T1=3D0;T1<=3D1;T1++)
    {
      for(T0=3D0;T0<=3D1;T0++)
      {
        int V1 =3D (src1 & T1 & ~T0) | (~src1 & ~T1 & T0);
        int V2 =3D (src1 ^ T1 ^ 1) & (src1 ^ T0);
        printf("src1=3D%d T1=3D%d T0=3D%d, V=3D%d=3D%d\n", src1, T1, T0,  V=
1, V2);
      }
    }
  }
}

The output is :
src1=3D0 T1=3D0 T0=3D0, V=3D0=3D0
src1=3D0 T1=3D0 T0=3D1, V=3D1=3D1
src1=3D0 T1=3D1 T0=3D0, V=3D0=3D0
src1=3D0 T1=3D1 T0=3D1, V=3D0=3D0
src1=3D1 T1=3D0 T0=3D0, V=3D0=3D0
src1=3D1 T1=3D0 T0=3D1, V=3D0=3D0
src1=3D1 T1=3D1 T0=3D0, V=3D1=3D1
src1=3D1 T1=3D1 T0=3D1, V=3D0=3D0

In other words, the V flag is set when :
the most significant bit of src1=3Dsrc2=3D0 and dst=3D1 : the result of the=
 addition of two signed positive words is not a signed positive word
the most significant bit of src1=3Dsrc2=3D1 and dst=3D0 : the result of the=
 addition of two signed negative words is not a signed negative word (or th=
e result of the addition of two unsigned words is a lower unsigned word)
Conclusion : the computation of the V flag in qemu is correct, and their is=
 no special case to consider if the C flag is set or not  :-)
=46or tomorrow, the formal proof of the correctness of the whole qemu code =
;-)

Le Jeudi 13 Avril 2006 20:39, vous avez =E9crit=A0:
> >As far as the V flag is concerned, mmm, I'm not really sure whether we
> >should
> >change something in the sparc code. If we compare to the arm code, we
> > don't take into account the fact that the carry flag is set before.
> >
> >We'd probably need some extensive tests and their associated expected
> >results.
>
> I made a small test program (attached) to test the addx instruction. The
> program calculates the sum of two 64-bit values, given on the command line
> as 32-bit lower and upper parts.  Native system produces following:
> $ ./addx -1 -1 0x80000000 -1
> ffffffffffffffff + ffffffff80000000 =3D ffffffff7fffffff, NZVC: 9
> while unpatched Qemu the following:
> $ qemu-sparc ./addx -1 -1 0x80000000 -1
> ffffffffffffffff + ffffffff80000000 =3D ffffffff7fffffff, NZVC: 8
>
> So the carry flag not set. When your patch is applied, the output is
> identical:
> ffffffffffffffff + ffffffff80000000 =3D ffffffff7fffffff, NZVC: 9
>
> I couldn't think of a combination of values that would set the V flag when
> there is also a carry from the 32-bit addition, any suggestions?
>
> _________________________________________________________________
> FREE pop-up blocking with the new MSN Toolbar - get it now!
> http://toolbar.msn.click-url.com/go/onm00200415ave/direct/01/

--Boundary-00=_E9rPEt2zgrfIGks
Content-Type: text/html;
  charset="iso-8859-15"
Content-Transfer-Encoding: quoted-printable

<html><head><meta name=3D"qrichtext" content=3D"1" /></head><body style=3D"=
font-size:11pt;font-family:DejaVu Sans">
<p>Hello, </p>
<p></p>
<p>As far as the V flag is concerned, I've taken a look at the Sparc V8 ref=
erence manual (www.sparc.org/standards/V8.pdf)</p>
<p></p>
<p>We can read at page 170 for the update of the V flag for &quot;addcc&quo=
t; and &quot;addxcc&quot;:</p>
<p>Vtheory =3D (r[rs1]&lt;31&gt; &amp; operand2&lt;31&gt; &amp; !result&lt;=
31&gt;) | (!r[rs1]&lt;31&gt; &amp; !operand2&lt;31&gt; &amp;&amp; result&lt=
;31&gt;)</p>
<p></p>
<p>Let's transform this with the name of the variables in the qemu code :</=
p>
<p>Vtheory =3D (src1&lt;31&gt; &amp; T1&lt;31&gt; &amp; !T0&lt;31&gt;) | (!=
src1&lt;31&gt; &amp; !T1&lt;31&gt; &amp; T0&lt;31&gt;)</p>
<p>Vtheory =3D ((src1 &amp; T1 &amp; ~T0) | (~src1 &amp; ~T1 &amp; T0)&lt;3=
1&gt;</p>
<p></p>
<p>And we have in qemu code :</p>
<p>Vqemu =3D ((src1 ^ T1 ^ -1) &amp; (src1 ^ T0))&lt;31&gt;</p>
<p></p>
<p>Now, let's transform Vqemu :</p>
<p>Vqemu =3D ((src1 ^ (T1 ^ -1)) &amp; (src1 ^ T0))&lt;31&gt;</p>
<p>Vqemu =3D ((src1 ^ ~T1) &amp; (src1 ^ T0))&lt;31&gt;</p>
<p>Vqemu =3D (((src1 &amp; ~(~T1)) | (~src1 &amp; ~T1)) &amp; (src1 ^ T0))&=
lt;31&gt;</p>
<p>Vqemu =3D (((src1 &amp; T1) | (~src1 &amp; ~T1)) &amp; (src1 ^ T0))&lt;3=
1&gt;</p>
<p>Vqemu =3D ((src1 &amp; T1 &amp; (src1 ^ T0)) | (~src1 &amp; ~T1 &amp; (s=
rc1 ^ T0)))&lt;31&gt;</p>
<p>Vqemu =3D ((src1 &amp; T1 &amp; ((src1 &amp; ~T0) | (~src1 &amp; T0))) |=
</p>
<p>                (~src1 &amp; ~T1 &amp; ((src1 &amp; ~T0) | (~src1 &amp; =
T0))))&lt;31&gt;</p>
<p>Vqemu =3D ((src1 &amp; T1 &amp; src1 &amp; ~T0) | (src1 &amp; T1 &amp; ~=
src1 &amp; T0) |</p>
<p>                (~src1 &amp; ~T1 &amp; src1 &amp; ~T0) | (~src1 &amp; ~T=
1 &amp; ~src1 &amp; T0))&lt;31&gt;</p>
<p>Vqemu =3D ((src1 &amp; T1 &amp; ~T0) | (~src1 &amp; ~T1 &amp; T0))&lt;31=
&gt;</p>
<p>Vqemu =3D Vtheroy !</p>
<p></p>
<p>After theory, a bit of practice! I just wrote a small piece of code that=
 enumerates the 2*2*2=3D8 combinations and proves experimentally that Vqemu=
 =3D Vtheroy.</p>
<p></p>
<p>int main(int argc, char* argv[])</p>
<p>{</p>
<p>  int src1, T1, T0;</p>
<p>  for(src1=3D0;src1&lt;=3D1;src1++)</p>
<p>  {</p>
<p>    for(T1=3D0;T1&lt;=3D1;T1++)</p>
<p>    {</p>
<p>      for(T0=3D0;T0&lt;=3D1;T0++)</p>
<p>      {</p>
<p>        int V1 =3D (src1 &amp; T1 &amp; ~T0) | (~src1 &amp; ~T1 &amp; T0=
);</p>
<p>        int V2 =3D (src1 ^ T1 ^ 1) &amp; (src1 ^ T0);</p>
<p>        printf(&quot;src1=3D%d T1=3D%d T0=3D%d, V=3D%d=3D%d\n&quot;, src=
1, T1, T0,  V1, V2);</p>
<p>      }</p>
<p>    }</p>
<p>  }</p>
<p>}</p>
<p></p>
<p>The output is :</p>
<p>src1=3D0 T1=3D0 T0=3D0, V=3D0=3D0</p>
<p><span style=3D"font-weight:600">src1=3D0 T1=3D0 T0=3D1, V=3D1=3D1</span>=
</p>
<p>src1=3D0 T1=3D1 T0=3D0, V=3D0=3D0</p>
<p>src1=3D0 T1=3D1 T0=3D1, V=3D0=3D0</p>
<p>src1=3D1 T1=3D0 T0=3D0, V=3D0=3D0</p>
<p>src1=3D1 T1=3D0 T0=3D1, V=3D0=3D0</p>
<p><span style=3D"font-weight:600">src1=3D1 T1=3D1 T0=3D0, V=3D1=3D1</span>=
</p>
<p>src1=3D1 T1=3D1 T0=3D1, V=3D0=3D0</p>
<p></p>
<p>In other words, the V flag is set when :</p>
<ul type=3D"disc"><li>the most significant bit of src1=3Dsrc2=3D0 and dst=
=3D1 : the result of the addition of two signed positive words is not a sig=
ned positive word</li>
<li>the most significant bit of src1=3Dsrc2=3D1 and dst=3D0 : the result of=
 the addition of two signed negative words is not a signed negative word (o=
r the result of the addition of two unsigned words is a lower unsigned word=
)</li></ul>
<p>Conclusion : the computation of the V flag in qemu is correct, and their=
 is no special case to consider if the C flag is set or not  :-)</p>
<p>For tomorrow, the formal proof of the correctness of the whole qemu code=
 ;-)</p>
<p></p>
<p>Le Jeudi 13 Avril 2006 20:39, vous avez =E9crit=A0:</p>
<p>&gt; &gt;As far as the V flag is concerned, mmm, I'm not really sure whe=
ther we</p>
<p>&gt; &gt;should</p>
<p>&gt; &gt;change something in the sparc code. If we compare to the arm co=
de, we</p>
<p>&gt; &gt; don't take into account the fact that the carry flag is set be=
fore.</p>
<p>&gt; &gt;</p>
<p>&gt; &gt;We'd probably need some extensive tests and their associated ex=
pected</p>
<p>&gt; &gt;results.</p>
<p>&gt;</p>
<p>&gt; I made a small test program (attached) to test the addx instruction=
=2E The</p>
<p>&gt; program calculates the sum of two 64-bit values, given on the comma=
nd line</p>
<p>&gt; as 32-bit lower and upper parts.  Native system produces following:=
</p>
<p>&gt; $ ./addx -1 -1 0x80000000 -1</p>
<p>&gt; ffffffffffffffff + ffffffff80000000 =3D ffffffff7fffffff, NZVC: 9</=
p>
<p>&gt; while unpatched Qemu the following:</p>
<p>&gt; $ qemu-sparc ./addx -1 -1 0x80000000 -1</p>
<p>&gt; ffffffffffffffff + ffffffff80000000 =3D ffffffff7fffffff, NZVC: 8</=
p>
<p>&gt;</p>
<p>&gt; So the carry flag not set. When your patch is applied, the output i=
s</p>
<p>&gt; identical:</p>
<p>&gt; ffffffffffffffff + ffffffff80000000 =3D ffffffff7fffffff, NZVC: 9</=
p>
<p>&gt;</p>
<p>&gt; I couldn't think of a combination of values that would set the V fl=
ag when</p>
<p>&gt; there is also a carry from the 32-bit addition, any suggestions?</p>
<p>&gt;</p>
<p>&gt; _________________________________________________________________</=
p>
<p>&gt; FREE pop-up blocking with the new MSN Toolbar - get it now!</p>
<p>&gt; http://toolbar.msn.click-url.com/go/onm00200415ave/direct/01/</p>
<p></p>
</body></html>
--Boundary-00=_E9rPEt2zgrfIGks--