public inbox for linux-8086@vger.kernel.org
 help / color / mirror / Atom feed
* Compilers
@ 2004-05-28 10:42 David Given
  2004-05-28 14:44 ` Compilers sandeep
  2004-05-28 14:47 ` Compilers Eduardo Pereira Habkost
  0 siblings, 2 replies; 6+ messages in thread
From: David Given @ 2004-05-28 10:42 UTC (permalink / raw)
  To: linux-8086

I've been playing with Minix, and in particular looking at its C compiler. 
Minix uses a cut-down version of the Amsterdam Compiler Kit, which used to 
cost serious money but is now open source.

It's nice.

It seems to be a full ANSI C compiler generating pretty good code. ELKS' 
compiler is bcc, which is a K&R C compiler with a nasty preprocessor that 
turns ANSI C into K&R C; this means that it will compile ANSI C, but you 
don't get the proper type checking. Minix' cc is designed to run self-hosted 
(there are up to *eight* seperate programs involved); on my 1MB machine 
there's a fair amount of disk churn because it has to keep reloading bits of 
code, which makes it slow. It would be way faster on a 2MB machine.

I've tried to compare the code produced by cc and bcc. I'm not entirely sure 
this is a fair comparison because I don't think I managed to invoke bcc's 
optimiser correctly, but here you go anyway. Fixed-pitch font recommended.

The source:

---snip---
extern int printf(char* format, ...);

int fnord(int i)
{
        int count = 0;

        while (i)
                count += i--;
        return count;
}

int main(int argc, char* argv[])
{
        printf("Hello, world!\n");
        printf("%d\n", fnord(argc));
        return 0;
}
---snip---

The output:

BCC				CC

_fnord:				_fnord:
push bp				push bp
mov bp,sp			mov bp,sp
dec sp				push si
dec sp				xor si,si
xor ax,ax			.4:
mov -2[bp],ax			cmp 4(bp),#0
jmp .2				je .3
.3:				mov dx,4(bp)
mov ax,4[bp]			mov cx,dx
dec ax				dec cx
mov 4[bp],ax			mov 4(bp),cx
inc ax
add ax,-2[bp]
mov -2[bp],ax			add si,dx
.2:				
mov ax,4[bp]			
test ax,ax
jne .3
.4:
.1:				.3:
mov ax,-2[bp]			mov ax,si
mov sp,bp
pop bp
ret				jmp .sret

The main() function produced by both compilers is identical apart from minor 
differences: cc likes using pop to do a stack retraction, where bcc does a 
mov sp,bp instead. This means that cc produces smaller but slower code, but 
bcc produces faster code. *shrug*

cc seems to have better register allocation and avoids using stack slots when 
it doesn't need to. bcc has put the loop conditional at the bottom instead of 
the top; not sure why, it makes the code larger.

Given that currently the ELKS kernel is about 70kB of code, a little 
problematic given the 64kB limit, anything that will make the code smaller is 
good! Plus, cc is a real ANSI compiler. When I get some spare time I'll see 
if I can have a look at building the ELKS kernel on Minix. Since the two 
compilers use the same calling conventions I don't think it would be terribly 
hard and I'd be interested in seeing what sort of differences there are in 
real code.

Incidentally, if you're interested in what ELKS might end up being, I strongly 
suggest you install a copy of Minix on something. It's fascinating just how 
usable it is, on such limited hardware.

-- 
+- David Given --McQ-+ 
|  dg@cowlark.com    | "All power corrupts, but we need electricity." ---
| (dg@tao-group.com) | Diana Wynne Jones, _Archer's Goon_
+- www.cowlark.com --+ 

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Compilers
  2004-05-28 10:42 Compilers David Given
@ 2004-05-28 14:44 ` sandeep
  2004-05-28 14:47 ` Compilers Eduardo Pereira Habkost
  1 sibling, 0 replies; 6+ messages in thread
From: sandeep @ 2004-05-28 14:44 UTC (permalink / raw)
  To: linux-8086

> Given that currently the ELKS kernel is about 70kB of code, a little 
> problematic given the 64kB limit, anything that will make the code smaller is 
Apart from using ways/tools to get smaller code to fit it 64KB, I guess couple 
of other options should also be possible. If you see elks features wish-list, 
with more wishes fulfilled it might not be possible to have it fit in 64KB, may 
be unless entire OS is handcoded in assembly and tweaks/hacks applied here and 
there as needed to produce mainly smaller (faster) code.

1. something like loadable modules / overlays that can help here. these overlays 
  needn't be overlays in dos .OVL file ways but part of executable, and 
executable header gives information about these. yeah it will require sllightly 
different/ modified executable format to be devised.

2. even if code and data for elks kernel is more than 64 KB, that will not be a 
problem if this code and data is not moved around b/w memory and disk, i.e. once 
loaded it stays there till reboot. within kernel code you can even have far 
pointers stored even as local variables.

-- 
regards
sandeep
--------------------------------------------------------------------------
Loose bits sink chips.
--------------------------------------------------------------------------


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Compilers
  2004-05-28 10:42 Compilers David Given
  2004-05-28 14:44 ` Compilers sandeep
@ 2004-05-28 14:47 ` Eduardo Pereira Habkost
  2004-05-29 11:42   ` Compilers Mark Junker
  1 sibling, 1 reply; 6+ messages in thread
From: Eduardo Pereira Habkost @ 2004-05-28 14:47 UTC (permalink / raw)
  To: linux-8086

[-- Attachment #1: Type: text/plain, Size: 3920 bytes --]

Hi, all,

On Fri, May 28, 2004 at 11:42:38AM +0100, David Given wrote:
<snip>
> 
> I've tried to compare the code produced by cc and bcc. I'm not entirely sure 
> this is a fair comparison because I don't think I managed to invoke bcc's 
> optimiser correctly, but here you go anyway. Fixed-pitch font recommended.
> 
> The source:
> 
> ---snip---
> extern int printf(char* format, ...);
> 
> int fnord(int i)
> {
>         int count = 0;
> 
>         while (i)
>                 count += i--;
>         return count;
> }
> 
> int main(int argc, char* argv[])
> {
>         printf("Hello, world!\n");
>         printf("%d\n", fnord(argc));
>         return 0;
> }
> ---snip---
> 
> The output:
> 

Nice, I would like to compare with the code that gcc-8086 is producing
currently, too. Please note that there could be hiddend bugs that I've
seen yet.  :)

I've used -O3 for compilation:
(GNU as AT&T syntax used)

_fnord:
        push    %bp
        mov     %sp,%bp

        movw    2(%bp),%dx
        movw    $0x0,%ax
        cmpw    $0x0,%dx
        je      .L15
.L13:
        addw    %dx,%ax
        addw    $0xffffffff,%dx
        cmpw    $0x0,%dx
        jne     .L13
.L15:

        leave
        ret

Some things that can be noted: There is not 'dec' instruction, yet. It
can be added. It is using 'leave' for function exit. IIRC, it is a
80186 instruction, so it is better have an option to use is better have
an option for the user choosing to use it or not.

One bug appeared: it is using 2(%bp) for getting the function
parameter. It doesn't look correct.  :)

A -fno-frame-pointer option would be nice, too. I guess that most people
wouldn't want them on ELKS, and it will make code smaller and use less
stack space.

> BCC                             CC
> 
> _fnord:                         _fnord:
> push bp                         push bp
> mov bp,sp                       mov bp,sp
> dec sp                          push si
> dec sp                          xor si,si
> xor ax,ax                       .4:
> mov -2[bp],ax                   cmp 4(bp),#0
> jmp .2                          je .3
> .3:                             mov dx,4(bp)
> mov ax,4[bp]                    mov cx,dx
> dec ax                          dec cx
> mov 4[bp],ax                    mov 4(bp),cx
> inc ax
> add ax,-2[bp]
> mov -2[bp],ax                   add si,dx
> .2:                             
> mov ax,4[bp]                    
> test ax,ax
> jne .3
> .4:
> .1:                             .3:
> mov ax,-2[bp]                   mov ax,si
> mov sp,bp
> pop bp
> ret                             jmp .sret
> 
> The main() function produced by both compilers is identical apart from minor 
> differences: cc likes using pop to do a stack retraction, where bcc does a 
> mov sp,bp instead. This means that cc produces smaller but slower code, but 
> bcc produces faster code. *shrug*
> 
> cc seems to have better register allocation and avoids using stack slots when 
> it doesn't need to. bcc has put the loop conditional at the bottom instead of 
> the top; not sure why, it makes the code larger.
> 
> Given that currently the ELKS kernel is about 70kB of code, a little 
> problematic given the 64kB limit, anything that will make the code smaller is 
> good! Plus, cc is a real ANSI compiler. When I get some spare time I'll see 
> if I can have a look at building the ELKS kernel on Minix. Since the two 
> compilers use the same calling conventions I don't think it would be terribly 
> hard and I'd be interested in seeing what sort of differences there are in 
> real code.
> 
> Incidentally, if you're interested in what ELKS might end up being, I strongly 
> suggest you install a copy of Minix on something. It's fascinating just how 
> usable it is, on such limited hardware.
> 

-- 
Eduardo

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Compilers
  2004-05-28 14:47 ` Compilers Eduardo Pereira Habkost
@ 2004-05-29 11:42   ` Mark Junker
  2004-05-29 11:58     ` Compilers David Given
  0 siblings, 1 reply; 6+ messages in thread
From: Mark Junker @ 2004-05-29 11:42 UTC (permalink / raw)
  To: linux-8086

Eduardo Pereira Habkost schrieb:

> 
> _fnord:
>         push    %bp
>         mov     %sp,%bp
> 
>         movw    2(%bp),%dx
should be "movw    4(%bp),%dx"

>         movw    $0x0,%ax
>         cmpw    $0x0,%dx
>         je      .L15
> .L13:
>         addw    %dx,%ax
>         addw    $0xffffffff,%dx
should be:
addw    $0xffff,%dx

$0xffffffff is a 32 bit constant

>         cmpw    $0x0,%dx
>         jne     .L13
> .L15:
> 
>         leave
"leave" is only available on 186+

>         ret
> 
> Some things that can be noted: There is not 'dec' instruction, yet. It
> can be added. It is using 'leave' for function exit. IIRC, it is a
> 80186 instruction, so it is better have an option to use is better have
> an option for the user choosing to use it or not.
You're right.

> One bug appeared: it is using 2(%bp) for getting the function
> parameter. It doesn't look correct.  :)
Again, you're right. It's not correct

> A -fno-frame-pointer option would be nice, too. I guess that most people
> wouldn't want them on ELKS, and it will make code smaller and use less
> stack space.
This can (sometimes) produce smaller code.

Regards,
Mark Junker


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Compilers
  2004-05-29 11:42   ` Compilers Mark Junker
@ 2004-05-29 11:58     ` David Given
  2004-05-29 12:32       ` Compilers Mark Junker
  0 siblings, 1 reply; 6+ messages in thread
From: David Given @ 2004-05-29 11:58 UTC (permalink / raw)
  To: linux-8086

Mark Junker wrote:
[...]
>> A -fno-frame-pointer option would be nice, too. I guess that most people
>> wouldn't want them on ELKS, and it will make code smaller and use less
>> stack space.
> 
> This can (sometimes) produce smaller code.

The main advantage is that you free up a register that can be used for 
other things. On the register-starved x86 family this is a huge benefit.

I really don't know why people use a frame pointer anyway. Given that 
the i86 can do arbitrary relative stack addressing there's no real need 
for it. It makes debugging marginally easier, but then the debug 
information should provide all the necessary data to find the stack 
frame from any point in the code...

(I work for a company that produces embedded operating systems, with our 
own cross-platform compiler architecture. The PowerPC has 32 
general-purpose registers. Guess how many the 386 has? *Five*. That 
leads to interesting scalability issues in our register allocation code...)

-- 
dg@cowlark.com --- http://www.cowlark.com
My other account has a real signature.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: Compilers
  2004-05-29 11:58     ` Compilers David Given
@ 2004-05-29 12:32       ` Mark Junker
  0 siblings, 0 replies; 6+ messages in thread
From: Mark Junker @ 2004-05-29 12:32 UTC (permalink / raw)
  To: linux-8086

David Given schrieb:

> I really don't know why people use a frame pointer anyway. Given that 
> the i86 can do arbitrary relative stack addressing there's no real need 
> for it. It makes debugging marginally easier, but then the debug 
> information should provide all the necessary data to find the stack 
> frame from any point in the code...

The main problem is that the value of the stack pointer changes during 
the execution of a function and sometimes it's not easy to realize that

mov.l @(r15,12),r1
.
.
.
mov.l @(r15,8),r2

may refer to the same argument ...

> (I work for a company that produces embedded operating systems, with our 
> own cross-platform compiler architecture. The PowerPC has 32 
> general-purpose registers. Guess how many the 386 has? *Five*. That 
> leads to interesting scalability issues in our register allocation code...)

I work on embedded systems using the Renesas SH4 (it has 16 (+ 8) int 
and 16 fp GP registers)  so I think I know what you mean.

Regards,
Mark Junker


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2004-05-29 12:32 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-05-28 10:42 Compilers David Given
2004-05-28 14:44 ` Compilers sandeep
2004-05-28 14:47 ` Compilers Eduardo Pereira Habkost
2004-05-29 11:42   ` Compilers Mark Junker
2004-05-29 11:58     ` Compilers David Given
2004-05-29 12:32       ` Compilers Mark Junker

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox