qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* Re: [Qemu-devel] Expansion Ratio Issue
       [not found] ` <CAFEAcA8g5GLj3tjV8tAKC2ASMdGEg9gvKHSU17cExUrwdp1n6g@mail.gmail.com>
@ 2014-06-03 17:13   ` Sergey Fedorov
  2014-06-05  1:04     ` [Qemu-devel] 答复: " Chaos Shu
  0 siblings, 1 reply; 6+ messages in thread
From: Sergey Fedorov @ 2014-06-03 17:13 UTC (permalink / raw)
  To: Peter Maydell, Chaos Shu; +Cc: QEMU Developers

On 29.05.2014 13:04, Peter Maydell wrote:
> No, we don't in general have any benchmarking of TCG
> codegen. I think if we did do benchmarking we'd be interested
> in performance benchmarking -- code expansion ratio doesn't
> seem like a very interesting thing to measure to me.

Hi,

I have a plan to play with TCG performance benchmarking. And then try to
implement some optimizations. So maybe there would be some suggestions
on how to perform such benchmarking? What tests seems to be appropriate
for this task? I think the benchmarking should reflect real TCG use
cases. So what the most typical use cases for TCG are there? Seems that
system and user modes may be different from this point.

Appreciate any help.

Thanks,
Sergey.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* [Qemu-devel] 答复:  Expansion Ratio Issue
  2014-06-03 17:13   ` [Qemu-devel] Expansion Ratio Issue Sergey Fedorov
@ 2014-06-05  1:04     ` Chaos Shu
  2014-06-05  8:02       ` Alex Bennée
  0 siblings, 1 reply; 6+ messages in thread
From: Chaos Shu @ 2014-06-05  1:04 UTC (permalink / raw)
  To: 'Sergey Fedorov', 'Peter Maydell'
  Cc: 'QEMU Developers'

Hi

I'm running SPEC CPU2006 on three kinds of situation, native aarch64 binary and emulator x86_64 system running SPEC CPU2006 and linux user mode level running x86_64 SPEC CPU2006 binary.

To find where the performance lose, translator ? or execution of instruction after TCG? Or something else

I guess most of  time, up to 90% should be spent on exec the instruction of TCG, does that mean the quality of translating lead to the performance lost directly ? 

Thanks
Chaos

On 29.05.2014 13:04, Peter Maydell wrote:
> No, we don't in general have any benchmarking of TCG codegen. I think 
> if we did do benchmarking we'd be interested in performance 
> benchmarking -- code expansion ratio doesn't seem like a very 
> interesting thing to measure to me.

Hi,

I have a plan to play with TCG performance benchmarking. And then try to implement some optimizations. So maybe there would be some suggestions on how to perform such benchmarking? What tests seems to be appropriate for this task? I think the benchmarking should reflect real TCG use cases. So what the most typical use cases for TCG are there? Seems that system and user modes may be different from this point.

Appreciate any help.

Thanks,
Sergey.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Qemu-devel] 答复: Expansion Ratio Issue
  2014-06-05  1:04     ` [Qemu-devel] 答复: " Chaos Shu
@ 2014-06-05  8:02       ` Alex Bennée
  2014-06-05 13:00         ` Sergey Fedorov
  0 siblings, 1 reply; 6+ messages in thread
From: Alex Bennée @ 2014-06-05  8:02 UTC (permalink / raw)
  To: Chaos Shu
  Cc: 'Sergey Fedorov', 'QEMU Developers',
	'Peter Maydell'


Chaos Shu writes:

> Hi
>
> I'm running SPEC CPU2006 on three kinds of situation, native aarch64 binary and emulator x86_64 system running SPEC CPU2006 and linux user mode level running x86_64 SPEC CPU2006 binary.
>
> To find where the performance lose, translator ? or execution of instruction after TCG? Or something else
>
> I guess most of  time, up to 90% should be spent on exec the
> instruction of TCG, does that mean the quality of translating lead to
> the performance lost directly ?

It really depends on the type of code you are executing but yes most of
the time should be spent in TCG generated code. However if you are
running a lot of FP heavy code you'll find it spends a lot of time in
helper routines calling the internal softfloat code.

I posted some patches a few months ago that enabled output to help the
Linux "perf" tool track this. I haven't got time to re-work at the
moment but it might give you a head start to instrumentation:

https://patches.linaro.org/27229/

>
> Thanks
> Chaos
>
> On 29.05.2014 13:04, Peter Maydell wrote:
>> No, we don't in general have any benchmarking of TCG codegen. I think 
>> if we did do benchmarking we'd be interested in performance 
>> benchmarking -- code expansion ratio doesn't seem like a very 
>> interesting thing to measure to me.
>
> Hi,
>
> I have a plan to play with TCG performance benchmarking. And then try to implement some optimizations. So maybe there would be some suggestions on how to perform such benchmarking? What tests seems to be appropriate for this task? I think the benchmarking should reflect real TCG use cases. So what the most typical use cases for TCG are there? Seems that system and user modes may be different from this point.
>
> Appreciate any help.
>
> Thanks,
> Sergey.

-- 
Alex Bennée

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Qemu-devel] 答复:  Expansion Ratio Issue
  2014-06-05  8:02       ` Alex Bennée
@ 2014-06-05 13:00         ` Sergey Fedorov
  2014-06-05 13:07           ` Peter Maydell
  0 siblings, 1 reply; 6+ messages in thread
From: Sergey Fedorov @ 2014-06-05 13:00 UTC (permalink / raw)
  To: Alex Bennée, Chaos Shu
  Cc: 'Peter Maydell', 'QEMU Developers'

On 05.06.2014 12:02, Alex Bennée wrote:
> Chaos Shu writes:
>
>> Hi
>>
>> I'm running SPEC CPU2006 on three kinds of situation, native aarch64 binary and emulator x86_64 system running SPEC CPU2006 and linux user mode level running x86_64 SPEC CPU2006 binary.
>>
>> To find where the performance lose, translator ? or execution of instruction after TCG? Or something else
>>
>> I guess most of  time, up to 90% should be spent on exec the
>> instruction of TCG, does that mean the quality of translating lead to
>> the performance lost directly ?
> It really depends on the type of code you are executing but yes most of
> the time should be spent in TCG generated code. However if you are
> running a lot of FP heavy code you'll find it spends a lot of time in
> helper routines calling the internal softfloat code.
>
> I posted some patches a few months ago that enabled output to help the
> Linux "perf" tool track this. I haven't got time to re-work at the
> moment but it might give you a head start to instrumentation:
>
> https://patches.linaro.org/27229/

Thanks for replying! I used to think about Drystone, gzim, gcc in user
mode. In system mode, Linux boot up and, again, Drystone, gzim, gcc.
Regarding SPEC test, that is not available for free, isn't it?

Thanks,
Sergey

>
>> Thanks
>> Chaos
>>
>> On 29.05.2014 13:04, Peter Maydell wrote:
>>> No, we don't in general have any benchmarking of TCG codegen. I think 
>>> if we did do benchmarking we'd be interested in performance 
>>> benchmarking -- code expansion ratio doesn't seem like a very 
>>> interesting thing to measure to me.
>> Hi,
>>
>> I have a plan to play with TCG performance benchmarking. And then try to implement some optimizations. So maybe there would be some suggestions on how to perform such benchmarking? What tests seems to be appropriate for this task? I think the benchmarking should reflect real TCG use cases. So what the most typical use cases for TCG are there? Seems that system and user modes may be different from this point.
>>
>> Appreciate any help.
>>
>> Thanks,
>> Sergey.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Qemu-devel] 答复: Expansion Ratio Issue
  2014-06-05 13:00         ` Sergey Fedorov
@ 2014-06-05 13:07           ` Peter Maydell
  2014-06-05 15:14             ` Sergey Fedorov
  0 siblings, 1 reply; 6+ messages in thread
From: Peter Maydell @ 2014-06-05 13:07 UTC (permalink / raw)
  To: Sergey Fedorov; +Cc: Chaos Shu, Alex Bennée, QEMU Developers

On 5 June 2014 14:00, Sergey Fedorov <serge.fdrv@gmail.com> wrote:
> Thanks for replying! I used to think about Drystone, gzim, gcc in user
> mode. In system mode, Linux boot up and, again, Drystone, gzim, gcc.

Probably worth making sure you also test workloads that do different
things in multiple processes (to catch performance issues from over
frequent TB/TLB flushes, and so on).

thanks
-- PMM

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [Qemu-devel] 答复: Expansion Ratio Issue
  2014-06-05 13:07           ` Peter Maydell
@ 2014-06-05 15:14             ` Sergey Fedorov
  0 siblings, 0 replies; 6+ messages in thread
From: Sergey Fedorov @ 2014-06-05 15:14 UTC (permalink / raw)
  To: Peter Maydell; +Cc: Chaos Shu, Alex Bennée, QEMU Developers

On 05.06.2014 17:07, Peter Maydell wrote:
> Probably worth making sure you also test workloads that do different
> things in multiple processes (to catch performance issues from over
> frequent TB/TLB flushes, and so on).

Maybe "make -j<N>"?

Thanks,
Sergey

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2014-06-05 15:14 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <SNT149-DS20972BC8406328B03556C8FB240@phx.gbl>
     [not found] ` <CAFEAcA8g5GLj3tjV8tAKC2ASMdGEg9gvKHSU17cExUrwdp1n6g@mail.gmail.com>
2014-06-03 17:13   ` [Qemu-devel] Expansion Ratio Issue Sergey Fedorov
2014-06-05  1:04     ` [Qemu-devel] 答复: " Chaos Shu
2014-06-05  8:02       ` Alex Bennée
2014-06-05 13:00         ` Sergey Fedorov
2014-06-05 13:07           ` Peter Maydell
2014-06-05 15:14             ` Sergey Fedorov

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).