Resources on deeper understanding of Translation blocks

qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed

* Resources on deeper understanding of Translation blocks
@ 2023-05-14 17:54 Gautam Bhat
  2023-05-15 13:45 ` Alex Bennée
  0 siblings, 1 reply; 5+ messages in thread
From: Gautam Bhat @ 2023-05-14 17:54 UTC (permalink / raw)
  To: QEMU Developers

Hi,

I am going through some translation code for existing targets.

I would like to know if there are any good resources on deeper
understanding of translation blocks? Also some advice on the best way
to read code related to translation in Qemu and trying it out maybe
using the debugger, printing etc? I am getting lost trying to make
sense of the translation code.

Thanks,
Gautam.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Resources on deeper understanding of Translation blocks
  2023-05-14 17:54 Resources on deeper understanding of Translation blocks Gautam Bhat
@ 2023-05-15 13:45 ` Alex Bennée
  2023-05-16 13:43   ` Gautam Bhat
  0 siblings, 1 reply; 5+ messages in thread
From: Alex Bennée @ 2023-05-15 13:45 UTC (permalink / raw)
  To: Gautam Bhat; +Cc: qemu-devel

Gautam Bhat <mindentropy@gmail.com> writes:

> Hi,
>
> I am going through some translation code for existing targets.
>
> I would like to know if there are any good resources on deeper
> understanding of translation blocks? Also some advice on the best way
> to read code related to translation in Qemu and trying it out maybe
> using the debugger, printing etc? I am getting lost trying to make
> sense of the translation code.

We have a section in the developers manual that gives an overview of the
translator and how it goes together:

  https://qemu.readthedocs.io/en/latest/devel/index-tcg.html

Generally for following code the best option is to use the debug flags,
the usual is:

 -d in_asm,op,op_opt,out_asm

which will dump in order:

  - in_asm, the guest instructions
  - op, the TCG ops they break down into
  - op_opt, the TCG ops after optimiation
  - out_asm, the host instructions for the block

as this is a lot of data you can limit to certain address ranges using
the -dfilter option.

Remember the above options will only dump the code as it is translated,
not each time the guest runs it.

You can attach to the guest using the gdbstub:

  https://qemu.readthedocs.io/en/latest/system/gdb.html

and stick a breakpoint at the code in question. If you have the HMP
monitor enabled you can then enable the debug options and single step
through the code and see each instruction translated and executed in
turn.

>
> Thanks,
> Gautam.

-- 
Alex Bennée
Virtualisation Tech Lead @ Linaro

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Resources on deeper understanding of Translation blocks
  2023-05-15 13:45 ` Alex Bennée
@ 2023-05-16 13:43   ` Gautam Bhat
  2023-05-16 14:56     ` Alex Bennée
  0 siblings, 1 reply; 5+ messages in thread
From: Gautam Bhat @ 2023-05-16 13:43 UTC (permalink / raw)
  To: Alex Bennée; +Cc: qemu-devel

On Mon, May 15, 2023 at 7:22 PM Alex Bennée <alex.bennee@linaro.org> wrote:
>
>
> Gautam Bhat <mindentropy@gmail.com> writes:
>
> > Hi,
> >
> > I am going through some translation code for existing targets.
> >
> > I would like to know if there are any good resources on deeper
> > understanding of translation blocks? Also some advice on the best way
> > to read code related to translation in Qemu and trying it out maybe
> > using the debugger, printing etc? I am getting lost trying to make
> > sense of the translation code.
>
> We have a section in the developers manual that gives an overview of the
> translator and how it goes together:
>
>   https://qemu.readthedocs.io/en/latest/devel/index-tcg.html
>
> Generally for following code the best option is to use the debug flags,
> the usual is:
>
>  -d in_asm,op,op_opt,out_asm
>
> which will dump in order:
>
>   - in_asm, the guest instructions
>   - op, the TCG ops they break down into
>   - op_opt, the TCG ops after optimiation
>   - out_asm, the host instructions for the block
>
> as this is a lot of data you can limit to certain address ranges using
> the -dfilter option.
>
> Remember the above options will only dump the code as it is translated,
> not each time the guest runs it.
>
> You can attach to the guest using the gdbstub:
>
>   https://qemu.readthedocs.io/en/latest/system/gdb.html
>
> and stick a breakpoint at the code in question. If you have the HMP
> monitor enabled you can then enable the debug options and single step
> through the code and see each instruction translated and executed in
> turn.
>
>
> >
> > Thanks,
> > Gautam.
>
>
> --
> Alex Bennée
> Virtualisation Tech Lead @ Linaro

Hi Alex,

Thanks for the resources. I have been going through the documentation
and also running it using a debugger to analyze the code. I am still
not there with the op code emulation.
I am more or less stuck understanding how to handle translator_ops
callbacks and disassembly

I will be specific.

1. Could you please explain what the state machine should be for the
tb_stop(...) callback? I am looking at code from AVR
and Microblaze architecture and I see the following cases:
DISAS_NORETURN, DISAS_NEXT, DISAS_TOO_MANY, DISAS_LOOKUP and
DISAS_EXIT.  Is there some documentation on how to handle these cases
and how the disassembly goes through these states?
2. How should the callback for translate_insn be handled?
3. Do you have more information or a visual diagram of sorts for
TARGET_PAGE_* and PAGE_* and the relation between translation blocks
and pages? Also how
should one handle where architectures don't have any paging?

Thanks,
Gautam.

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Resources on deeper understanding of Translation blocks
  2023-05-16 13:43   ` Gautam Bhat
@ 2023-05-16 14:56     ` Alex Bennée
  2023-05-16 15:34       ` Peter Maydell
  0 siblings, 1 reply; 5+ messages in thread
From: Alex Bennée @ 2023-05-16 14:56 UTC (permalink / raw)
  To: Gautam Bhat; +Cc: qemu-devel

Gautam Bhat <mindentropy@gmail.com> writes:

> On Mon, May 15, 2023 at 7:22 PM Alex Bennée <alex.bennee@linaro.org> wrote:
>>
>>
>> Gautam Bhat <mindentropy@gmail.com> writes:
>>
>> > Hi,
>> >
>> > I am going through some translation code for existing targets.
>> >
>> > I would like to know if there are any good resources on deeper
>> > understanding of translation blocks? Also some advice on the best way
>> > to read code related to translation in Qemu and trying it out maybe
>> > using the debugger, printing etc? I am getting lost trying to make
>> > sense of the translation code.
>>
>> We have a section in the developers manual that gives an overview of the
>> translator and how it goes together:
>>
>>   https://qemu.readthedocs.io/en/latest/devel/index-tcg.html
>>
<snip>
>
> Thanks for the resources. I have been going through the documentation
> and also running it using a debugger to analyze the code. I am still
> not there with the op code emulation.
> I am more or less stuck understanding how to handle translator_ops
> callbacks and disassembly
>
> I will be specific.
>
> 1. Could you please explain what the state machine should be for the
> tb_stop(...) callback? I am looking at code from AVR
> and Microblaze architecture and I see the following cases:
> DISAS_NORETURN, DISAS_NEXT, DISAS_TOO_MANY, DISAS_LOOKUP and
> DISAS_EXIT.  Is there some documentation on how to handle these cases
> and how the disassembly goes through these states?

The core documentation is in translator.h

  /**
   * DisasJumpType:
   * @DISAS_NEXT: Next instruction in program order.
   * @DISAS_TOO_MANY: Too many instructions translated.
   * @DISAS_NORETURN: Following code is dead.
   * @DISAS_TARGET_*: Start of target-specific conditions.
   *
   * What instruction to disassemble next.
   */

but as you can see the various frontends define there own target
specific end cases. Currently you have to look at target specific code
to see what it is handling.

Broadly there are two types of exit:

  - goto_tb + exit_tb

  We create an exit block with two exit cases, typically the two legs of
  a conditional branch. Initially these slots are empty which forces an
  exit to the main loop. Then the translated block matching the next PC
  is looked up and patched into the source TB so next time we jump
  directly.

  - lookup_and_goto_ptr

  We don't know what PC we might reach at the end of the block so we
  need to dynamically find the next TB based on what the PC is at
  execution time. If we can't find a TB we trigger an exit to the main
  loop so one can be translated.

The other cases are various special cases. For example we might not be
executing a branch and just falling through to the next instruction
(DISAS_NEXT). We might know the helper will never return, for example
because and exception is triggered, in which case we return to the main
loop and process it there.

> 2. How should the callback for translate_insn be handled?

It is called for each instruction in a block.

> 3. Do you have more information or a visual diagram of sorts for
> TARGET_PAGE_* and PAGE_* and the relation between translation blocks
> and pages? Also how
> should one handle where architectures don't have any paging?

All system emulation is handled by page size because that is the
fundamental granularity of the softmmu TLB which looks up a translation
from guest address to offset into the memory region. It is not directly
related to if the guest is using paging to implement virtual memory.

>
> Thanks,
> Gautam.

-- 
Alex Bennée
Virtualisation Tech Lead @ Linaro

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Resources on deeper understanding of Translation blocks
  2023-05-16 14:56     ` Alex Bennée
@ 2023-05-16 15:34       ` Peter Maydell
  0 siblings, 0 replies; 5+ messages in thread
From: Peter Maydell @ 2023-05-16 15:34 UTC (permalink / raw)
  To: Alex Bennée; +Cc: Gautam Bhat, qemu-devel

On Tue, 16 May 2023 at 16:27, Alex Bennée <alex.bennee@linaro.org> wrote:
> Gautam Bhat <mindentropy@gmail.com> writes:
> > 3. Do you have more information or a visual diagram of sorts for
> > TARGET_PAGE_* and PAGE_* and the relation between translation blocks
> > and pages? Also how
> > should one handle where architectures don't have any paging?
>
> All system emulation is handled by page size because that is the
> fundamental granularity of the softmmu TLB which looks up a translation
> from guest address to offset into the memory region. It is not directly
> related to if the guest is using paging to implement virtual memory.

Also, each target defines a TARGET_PAGE_BITS even if the
target architecture doesn't have the concept of paging.
This specifies the minimum granularity at which you can
change things like r/w protection on addresses, so if your
target architecture has an MPU that might affect what
you want to set it to. If yo uhave neither an MPU nor an MMU,
you could look at what granularity/spacing typical systems
have between devices and particularly between devices and RAM.
If you have no pressing reason to set it to something as
a result of thinking about all that, then 4K pages is
probably as good a value as any.

-- PMM

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2023-05-16 15:35 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-05-14 17:54 Resources on deeper understanding of Translation blocks Gautam Bhat
2023-05-15 13:45 ` Alex Bennée
2023-05-16 13:43   ` Gautam Bhat
2023-05-16 14:56     ` Alex Bennée
2023-05-16 15:34       ` Peter Maydell

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).