* volanoMark 24% regression in 2.6.24-rc6: why a simple patch makes it
@ 2007-12-25 5:14 Zhang, Yanmin
2007-12-25 8:25 ` Ingo Molnar
From: Zhang, Yanmin @ 2007-12-25 5:14 UTC
To: travis, LKML; +Cc: Ingo Molnar, akpm, suresh.b.siddha, venkatesh.pallipadi
With kernel 2.6.24-rc6, volanoMark shows a large regression:
1) On 8-core stoakley: 17%;
2) On 16-core tigerton: 24%.
I bisected it down to patch fbdcf18df73758b2e187ab94678b30cd5f6ff9f9, which is
meant to fix the bad cpu number in /proc/cpuinfo. As a matter of fact, that issue
was already fixed by 2 other patches:
699d934d5f958d7944d195c03c334f28cc0b3669
and
c0c52d28e05e8bdaa2126570c02ecb1a7358cecc.
At first glance the patch looks good; at least it does not conflict with the other
2 patches. After double-checking it, I found the following call chain:
smp_store_cpu_info => identify_cpu => init_intel => init_intel_cacheinfo.
When CONFIG_X86_HT=y, init_intel_cacheinfo uses cpuinfo_x86->cpu_index, which
is initialized by smp_store_cpu_info. With patch fbdcf18df73758b2e187ab94678b30cd5f6ff9f9,
cpuinfo_x86->cpu_index is initialized only after identify_cpu is called, so
init_intel_cacheinfo always sets per_cpu(cpu_llc_id, 0) = l2_id or l3_id. Then
set_cpu_sibling_map builds a bad llc_shared_map, so the core domain is not built.
The domain info in dmesg confirms this conclusion.
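The ordering problem described above can be sketched as a small user-space model. This is not the kernel source: the function names mirror the ones in the call chain, but their bodies, the toy topology (NR_CPUS, CPUS_PER_PACKAGE), the LLC_UNSET sentinel, the assumption that cpu_index reads as 0 until it is assigned, and the helpers llc_siblings and boot_and_count are all hypothetical simplifications for illustration.

```c
#define NR_CPUS          4    /* hypothetical toy machine */
#define CPUS_PER_PACKAGE 2    /* assumed: 2 CPUs share each last-level cache */
#define LLC_UNSET        (-1) /* assumed sentinel for "never written" */

/* Toy stand-in for the kernel's struct cpuinfo_x86. */
struct cpuinfo_x86 {
    int cpu_index;
    int apicid;
};

static struct cpuinfo_x86 cpu_data[NR_CPUS];
static int cpu_llc_id[NR_CPUS];   /* models per_cpu(cpu_llc_id, cpu) */

/* Models init_intel_cacheinfo(): the slot written depends on c->cpu_index. */
static void init_intel_cacheinfo(struct cpuinfo_x86 *c)
{
    cpu_llc_id[c->cpu_index] = c->apicid / CPUS_PER_PACKAGE;
}

/* Models identify_cpu() => init_intel() => init_intel_cacheinfo(). */
static void identify_cpu(struct cpuinfo_x86 *c)
{
    init_intel_cacheinfo(c);
}

/* index_first != 0: cpu_index assigned before identify_cpu (working order).
 * index_first == 0: cpu_index assigned after identify_cpu (regressed order). */
static void smp_store_cpu_info(int id, int index_first)
{
    struct cpuinfo_x86 *c = &cpu_data[id];

    c->cpu_index = 0;         /* assumed stale value before assignment */
    c->apicid = id;
    if (index_first)
        c->cpu_index = id;
    identify_cpu(c);
    if (!index_first)
        c->cpu_index = id;
}

/* Models the llc_shared_map test: count other CPUs with the same valid LLC id. */
static int llc_siblings(int cpu)
{
    int i, n = 0;

    if (cpu_llc_id[cpu] == LLC_UNSET)
        return 0;
    for (i = 0; i < NR_CPUS; i++)
        if (i != cpu && cpu_llc_id[i] == cpu_llc_id[cpu])
            n++;
    return n;
}

/* Bring up all toy CPUs in the given order, then report LLC siblings of cpu. */
static int boot_and_count(int index_first, int cpu)
{
    int id;

    for (id = 0; id < NR_CPUS; id++)
        cpu_llc_id[id] = LLC_UNSET;
    for (id = 0; id < NR_CPUS; id++)
        smp_store_cpu_info(id, index_first);
    return llc_siblings(cpu);
}
```

In this model, boot_and_count(1, cpu) finds one LLC sibling for every CPU, while boot_and_count(0, cpu) finds none, because every write lands in slot 0 and the other slots are never set. That is the toy equivalent of a bad llc_shared_map.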
From this case I learned that the core domain really can improve performance, at least when
testing with volanoMark. :)
The solution is simply to revert patch fbdcf18df73758b2e187ab94678b30cd5f6ff9f9,
because the other 2 patches that fix the same issue are already in 2.6.24-rc5.
-yanmin
* Re: volanoMark 24% regression in 2.6.24-rc6: why a simple patch makes it
From: Ingo Molnar @ 2007-12-25 8:25 UTC
To: Zhang, Yanmin, Linus Torvalds
Cc: travis, LKML, akpm, suresh.b.siddha, venkatesh.pallipadi,
Yinghai.Lu
* Zhang, Yanmin <yanmin_zhang@linux.intel.com> wrote:
> The solution is simply to revert patch
> fbdcf18df73758b2e187ab94678b30cd5f6ff9f9, because the other 2 patches
> that fix the same issue are already in 2.6.24-rc5.
Linus, please revert fbdcf18df73758, as requested by Yanmin. It was
noticed before (by Yinghai Lu) that this commit was not needed, but it
looked like a harmless duplication: we incorrectly thought it was only a NOP
and wanted to fix it in v2.6.25. As Yanmin has now analyzed, it
creates a sub-optimal sched-domains hierarchy (not setting up the domain
belonging to the core) when CONFIG_X86_HT=y.
Acked-by: Ingo Molnar <mingo@elte.hu>
nice work Yanmin!
Ingo