* Re: [Qemu-devel] Expansion Ratio Issue
From: Sergey Fedorov @ 2014-06-03 17:13 UTC
To: Peter Maydell, Chaos Shu; +Cc: QEMU Developers

On 29.05.2014 13:04, Peter Maydell wrote:
> No, we don't in general have any benchmarking of TCG
> codegen. I think if we did do benchmarking we'd be interested
> in performance benchmarking -- code expansion ratio doesn't
> seem like a very interesting thing to measure to me.

Hi,

I have a plan to play with TCG performance benchmarking and then try to
implement some optimizations. Are there any suggestions on how to
perform such benchmarking? Which tests seem appropriate for this task?
I think the benchmarking should reflect real TCG use cases. So what are
the most typical use cases for TCG? It seems that system and user modes
may differ in this respect.

I'd appreciate any help.

Thanks,
Sergey.
* [Qemu-devel] Reply: Expansion Ratio Issue
From: Chaos Shu @ 2014-06-05 1:04 UTC
To: 'Sergey Fedorov', 'Peter Maydell'; +Cc: 'QEMU Developers'

Hi,

I'm running SPEC CPU2006 in three configurations: as a native aarch64
binary, inside an emulated x86_64 system, and as an x86_64 binary under
Linux user-mode emulation.

I want to find where the performance is lost: in the translator, in the
execution of the instructions that TCG generates, or somewhere else?

I guess most of the time, up to 90%, should be spent executing the
TCG-generated instructions. Does that mean the quality of the
translation directly causes the performance loss?

Thanks,
Chaos

On 29.05.2014 13:04, Peter Maydell wrote:
> No, we don't in general have any benchmarking of TCG codegen. I think
> if we did do benchmarking we'd be interested in performance
> benchmarking -- code expansion ratio doesn't seem like a very
> interesting thing to measure to me.

Hi,

I have a plan to play with TCG performance benchmarking and then try to
implement some optimizations. Are there any suggestions on how to
perform such benchmarking? Which tests seem appropriate for this task?
I think the benchmarking should reflect real TCG use cases. So what are
the most typical use cases for TCG? It seems that system and user modes
may differ in this respect.

I'd appreciate any help.

Thanks,
Sergey.
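One possible way to compare configurations like the three described above is to time the same workload in each and report the slowdown relative to the native run. The following is only a minimal sketch, not something posted in the thread: `time_command` and `slowdown` are hypothetical helper names, and the command lines are stand-ins that would have to be replaced with the real benchmark invocations (for example a native binary and the same benchmark run under the `qemu-x86_64` user-mode emulator).

```python
import statistics
import subprocess
import sys
import time


def time_command(argv, runs=3):
    """Return the median wall-clock time in seconds over `runs` runs of argv."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        # check=True makes a failing benchmark run raise instead of being timed.
        subprocess.run(argv, check=True, stdout=subprocess.DEVNULL)
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def slowdown(native_seconds, emulated_seconds):
    """Return how many times slower the emulated run is than the native run."""
    if native_seconds <= 0:
        raise ValueError("native_seconds must be positive")
    return emulated_seconds / native_seconds


if __name__ == "__main__":
    # Stand-in workload so the sketch runs anywhere. Both command lines are
    # placeholders (assumptions), not commands taken from the thread.
    native_cmd = [sys.executable, "-c", "sum(range(10**6))"]
    emulated_cmd = native_cmd  # substitute the emulated command line here
    ratio = slowdown(time_command(native_cmd), time_command(emulated_cmd))
    print(f"slowdown: {ratio:.2f}x")
```

Wall-clock ratios only say how much slower the emulated run is. They do not say whether the time goes to translation, generated code, or helpers, which is what a profiler would be needed for.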
* Re: [Qemu-devel] Reply: Expansion Ratio Issue
From: Alex Bennée @ 2014-06-05 8:02 UTC
To: Chaos Shu; +Cc: 'Sergey Fedorov', 'QEMU Developers', 'Peter Maydell'

Chaos Shu writes:

> Hi,
>
> I'm running SPEC CPU2006 in three configurations: as a native aarch64
> binary, inside an emulated x86_64 system, and as an x86_64 binary under
> Linux user-mode emulation.
>
> I want to find where the performance is lost: in the translator, in the
> execution of the instructions that TCG generates, or somewhere else?
>
> I guess most of the time, up to 90%, should be spent executing the
> TCG-generated instructions. Does that mean the quality of the
> translation directly causes the performance loss?

It really depends on the type of code you are executing, but yes, most
of the time should be spent in TCG-generated code. However, if you are
running a lot of FP-heavy code you'll find it spends a lot of time in
helper routines calling the internal softfloat code.

I posted some patches a few months ago that enabled output to help the
Linux "perf" tool track this. I haven't got time to rework them at the
moment, but they might give you a head start on instrumentation:

https://patches.linaro.org/27229/

>
> Thanks,
> Chaos
>
> On 29.05.2014 13:04, Peter Maydell wrote:
>> No, we don't in general have any benchmarking of TCG codegen. I think
>> if we did do benchmarking we'd be interested in performance
>> benchmarking -- code expansion ratio doesn't seem like a very
>> interesting thing to measure to me.
>
> Hi,
>
> I have a plan to play with TCG performance benchmarking and then try to
> implement some optimizations. Are there any suggestions on how to
> perform such benchmarking? Which tests seem appropriate for this task?
> I think the benchmarking should reflect real TCG use cases. So what are
> the most typical use cases for TCG? It seems that system and user modes
> may differ in this respect.
>
> I'd appreciate any help.
>
> Thanks,
> Sergey.

--
Alex Bennée
* Re: [Qemu-devel] Reply: Expansion Ratio Issue
From: Sergey Fedorov @ 2014-06-05 13:00 UTC
To: Alex Bennée, Chaos Shu; +Cc: 'Peter Maydell', 'QEMU Developers'

On 05.06.2014 12:02, Alex Bennée wrote:
> Chaos Shu writes:
>
>> Hi,
>>
>> I'm running SPEC CPU2006 in three configurations: as a native aarch64
>> binary, inside an emulated x86_64 system, and as an x86_64 binary under
>> Linux user-mode emulation.
>>
>> I want to find where the performance is lost: in the translator, in the
>> execution of the instructions that TCG generates, or somewhere else?
>>
>> I guess most of the time, up to 90%, should be spent executing the
>> TCG-generated instructions. Does that mean the quality of the
>> translation directly causes the performance loss?
> It really depends on the type of code you are executing, but yes, most
> of the time should be spent in TCG-generated code. However, if you are
> running a lot of FP-heavy code you'll find it spends a lot of time in
> helper routines calling the internal softfloat code.
>
> I posted some patches a few months ago that enabled output to help the
> Linux "perf" tool track this. I haven't got time to rework them at the
> moment, but they might give you a head start on instrumentation:
>
> https://patches.linaro.org/27229/

Thanks for replying! I was thinking of Dhrystone, gzip and gcc in user
mode, and of Linux boot-up plus, again, Dhrystone, gzip and gcc in
system mode. As for the SPEC tests, they are not available for free, are
they?

Thanks,
Sergey

>
>> Thanks,
>> Chaos
>>
>> On 29.05.2014 13:04, Peter Maydell wrote:
>>> No, we don't in general have any benchmarking of TCG codegen. I think
>>> if we did do benchmarking we'd be interested in performance
>>> benchmarking -- code expansion ratio doesn't seem like a very
>>> interesting thing to measure to me.
>> Hi,
>>
>> I have a plan to play with TCG performance benchmarking and then try to
>> implement some optimizations. Are there any suggestions on how to
>> perform such benchmarking? Which tests seem appropriate for this task?
>> I think the benchmarking should reflect real TCG use cases. So what are
>> the most typical use cases for TCG? It seems that system and user modes
>> may differ in this respect.
>>
>> I'd appreciate any help.
>>
>> Thanks,
>> Sergey.
* Re: [Qemu-devel] Reply: Expansion Ratio Issue
From: Peter Maydell @ 2014-06-05 13:07 UTC
To: Sergey Fedorov; +Cc: Chaos Shu, Alex Bennée, QEMU Developers

On 5 June 2014 14:00, Sergey Fedorov <serge.fdrv@gmail.com> wrote:
> Thanks for replying! I was thinking of Dhrystone, gzip and gcc in user
> mode, and of Linux boot-up plus, again, Dhrystone, gzip and gcc in
> system mode.

Probably worth making sure you also test workloads that do different
things in multiple processes (to catch performance issues from
over-frequent TB/TLB flushes, and so on).

thanks
-- PMM
* Re: [Qemu-devel] Reply: Expansion Ratio Issue
From: Sergey Fedorov @ 2014-06-05 15:14 UTC
To: Peter Maydell; +Cc: Chaos Shu, Alex Bennée, QEMU Developers

On 05.06.2014 17:07, Peter Maydell wrote:
> Probably worth making sure you also test workloads that do different
> things in multiple processes (to catch performance issues from over
> frequent TB/TLB flushes, and so on).

Maybe "make -j<N>"?

Thanks,
Sergey
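Besides "make -j<N>", a multi-process workload of the kind suggested above could be approximated by starting several different commands at once and timing the whole batch. The sketch below is an illustration under assumptions, not code from the thread: `run_mixed` is a hypothetical helper, the commands are stand-ins, and it is meant to be run inside the emulated guest (system mode) so that the guest scheduler switches between the processes.

```python
import subprocess
import sys
import time


def run_mixed(commands):
    """Start every command at once, wait for all of them, and return
    (elapsed wall-clock seconds, list of return codes in input order)."""
    start = time.perf_counter()
    procs = [subprocess.Popen(argv, stdout=subprocess.DEVNULL) for argv in commands]
    codes = [proc.wait() for proc in procs]
    return time.perf_counter() - start, codes


if __name__ == "__main__":
    # Stand-in workloads that do different things; replace them with real
    # programs (compression, compilation, and so on) when benchmarking.
    workloads = [
        [sys.executable, "-c", "sum(i * i for i in range(10**6))"],
        [sys.executable, "-c", "sorted(range(10**6), reverse=True)"],
        [sys.executable, "-c", "print('x' * 10**6)"],
    ]
    elapsed, codes = run_mixed(workloads)
    print(f"{len(workloads)} processes finished in {elapsed:.2f}s, exit codes {codes}")
```

Comparing the batch time against the sum of the individual run times gives a rough idea of how much the concurrent mix costs, though it cannot attribute that cost to TB or TLB flushes on its own.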
Thread overview: 6+ messages
     [not found] <SNT149-DS20972BC8406328B03556C8FB240@phx.gbl>
     [not found] ` <CAFEAcA8g5GLj3tjV8tAKC2ASMdGEg9gvKHSU17cExUrwdp1n6g@mail.gmail.com>
2014-06-03 17:13   ` [Qemu-devel] Expansion Ratio Issue Sergey Fedorov
2014-06-05  1:04     ` [Qemu-devel] Reply: " Chaos Shu
2014-06-05  8:02       ` Alex Bennée
2014-06-05 13:00         ` Sergey Fedorov
2014-06-05 13:07           ` Peter Maydell
2014-06-05 15:14             ` Sergey Fedorov