* hackbench regression with 2.6.26-rc2 on tulsa machine
@ 2008-05-20 8:09 Zhang, Yanmin
2008-05-20 9:22 ` Mike Galbraith
0 siblings, 1 reply; 4+ messages in thread
From: Zhang, Yanmin @ 2008-05-20 8:09 UTC (permalink / raw)
To: Mike Galbraith; +Cc: Peter Zijlstra, LKML
Comparing with 2.6.26-rc1, hackbench has about 30% regression with 2.6.26-rc2 on my tulsa machine
which is a netburst architecure hyper-threading x86_64.
Command line to test: #hackbench 100 process 2000
With 2.6.26-rc1, it takes 30 seconds. But with 2.6.26-rc2/rc3, it takes 40 seconds.
Bisect located below patch:
46151122e0a2e80e5a6b2889f595e371fe2b600d is first bad commit
commit 46151122e0a2e80e5a6b2889f595e371fe2b600d
Author: Mike Galbraith <efault@gmx.de>
Date: Thu May 8 17:00:42 2008 +0200
sched: fix weight calculations
The conversion between virtual and real time is as follows:
dvt = rw/w * dt <=> dt = w/rw * dvt
Since we want the fair sleeper granularity to be in real time, we actually
need to do:
dvt = - rw/w * l
The bisect steps look stable.
On my core architecure machines(stoakley and tigerton), I do see improvement instead of regression,
like result from 31 seconds down to 28 seconds.
I'm not sure if hyper-threading need more cares in this situation.
-yanmin
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: hackbench regression with 2.6.26-rc2 on tulsa machine
2008-05-20 8:09 hackbench regression with 2.6.26-rc2 on tulsa machine Zhang, Yanmin
@ 2008-05-20 9:22 ` Mike Galbraith
2008-05-21 4:54 ` Zhang, Yanmin
0 siblings, 1 reply; 4+ messages in thread
From: Mike Galbraith @ 2008-05-20 9:22 UTC (permalink / raw)
To: Zhang, Yanmin; +Cc: Peter Zijlstra, LKML
On Tue, 2008-05-20 at 16:09 +0800, Zhang, Yanmin wrote:
> Comparing with 2.6.26-rc1, hackbench has about 30% regression with 2.6.26-rc2 on my tulsa machine
> which is a netburst architecure hyper-threading x86_64.
>
> Command line to test: #hackbench 100 process 2000
>
> With 2.6.26-rc1, it takes 30 seconds. But with 2.6.26-rc2/rc3, it takes 40 seconds.
>
> Bisect located below patch:
> 46151122e0a2e80e5a6b2889f595e371fe2b600d is first bad commit
> commit 46151122e0a2e80e5a6b2889f595e371fe2b600d
> Author: Mike Galbraith <efault@gmx.de>
> Date: Thu May 8 17:00:42 2008 +0200
>
> sched: fix weight calculations
>
> The conversion between virtual and real time is as follows:
>
> dvt = rw/w * dt <=> dt = w/rw * dvt
>
> Since we want the fair sleeper granularity to be in real time, we actually
> need to do:
>
> dvt = - rw/w * l
>
>
>
> The bisect steps look stable.
>
> On my core architecure machines(stoakley and tigerton), I do see improvement instead of regression,
> like result from 31 seconds down to 28 seconds.
>
> I'm not sure if hyper-threading need more cares in this situation.
Oh joy. I'll update my poor old P4 and see what I can duplicate this.
Do you still have group scheduling enabled? If so, can you turn it off
and try again? (when in doubt, grasp at any straw within reach;)
-Mike
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: hackbench regression with 2.6.26-rc2 on tulsa machine
2008-05-20 9:22 ` Mike Galbraith
@ 2008-05-21 4:54 ` Zhang, Yanmin
2008-05-21 5:19 ` Mike Galbraith
0 siblings, 1 reply; 4+ messages in thread
From: Zhang, Yanmin @ 2008-05-21 4:54 UTC (permalink / raw)
To: Mike Galbraith; +Cc: Peter Zijlstra, LKML
On Tue, 2008-05-20 at 11:22 +0200, Mike Galbraith wrote:
> On Tue, 2008-05-20 at 16:09 +0800, Zhang, Yanmin wrote:
> > Comparing with 2.6.26-rc1, hackbench has about 30% regression with 2.6.26-rc2 on my tulsa machine
> > which is a netburst architecure hyper-threading x86_64.
> >
> > Command line to test: #hackbench 100 process 2000
> >
> > With 2.6.26-rc1, it takes 30 seconds. But with 2.6.26-rc2/rc3, it takes 40 seconds.
> >
> > Bisect located below patch:
> > 46151122e0a2e80e5a6b2889f595e371fe2b600d is first bad commit
> > commit 46151122e0a2e80e5a6b2889f595e371fe2b600d
> > Author: Mike Galbraith <efault@gmx.de>
> > Date: Thu May 8 17:00:42 2008 +0200
> >
> > sched: fix weight calculations
> >
> > The conversion between virtual and real time is as follows:
> >
> > dvt = rw/w * dt <=> dt = w/rw * dvt
> >
> > Since we want the fair sleeper granularity to be in real time, we actually
> > need to do:
> >
> > dvt = - rw/w * l
> >
> >
> >
> > The bisect steps look stable.
> >
> > On my core architecure machines(stoakley and tigerton), I do see improvement instead of regression,
> > like result from 31 seconds down to 28 seconds.
> >
> > I'm not sure if hyper-threading need more cares in this situation.
>
> Oh joy. I'll update my poor old P4 and see what I can duplicate this.
>
> Do you still have group scheduling enabled?
Yes.
CONFIG_GROUP_SCHED=y
CONFIG_FAIR_GROUP_SCHED=y
# CONFIG_RT_GROUP_SCHED is not set
CONFIG_USER_SCHED=y
# CONFIG_CGROUP_SCHED is not set
> If so, can you turn it off
> and try again? (when in doubt, grasp at any straw within reach;)
I reran the test for dozeons of times.
1) Background processes have impact on the result and cause result to fluctuate with 8~9 seconds;
2) After turning off most services (background processes), the result looks stable;
3) I tested both 2.6.26-rc1 and 2.6.26-rc2 with CONFIG_GROUP_SCHED=n. the first one's result
is about 30 seconds and the second one's result is about 31 seconds. So CONFIG_GROUP_SCHED is the key.
The previous results wree got with turning off most background processes.
I'm busy in other issues.
-yanmin
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: hackbench regression with 2.6.26-rc2 on tulsa machine
2008-05-21 4:54 ` Zhang, Yanmin
@ 2008-05-21 5:19 ` Mike Galbraith
0 siblings, 0 replies; 4+ messages in thread
From: Mike Galbraith @ 2008-05-21 5:19 UTC (permalink / raw)
To: Zhang, Yanmin; +Cc: Peter Zijlstra, LKML
On Wed, 2008-05-21 at 12:54 +0800, Zhang, Yanmin wrote:
> On Tue, 2008-05-20 at 11:22 +0200, Mike Galbraith wrote:
> > Do you still have group scheduling enabled?
> Yes.
> CONFIG_GROUP_SCHED=y
> CONFIG_FAIR_GROUP_SCHED=y
> # CONFIG_RT_GROUP_SCHED is not set
> CONFIG_USER_SCHED=y
> # CONFIG_CGROUP_SCHED is not set
>
> > If so, can you turn it off
> > and try again? (when in doubt, grasp at any straw within reach;)
> I reran the test for dozeons of times.
> 1) Background processes have impact on the result and cause result to fluctuate with 8~9 seconds;
> 2) After turning off most services (background processes), the result looks stable;
> 3) I tested both 2.6.26-rc1 and 2.6.26-rc2 with CONFIG_GROUP_SCHED=n. the first one's result
> is about 30 seconds and the second one's result is about 31 seconds. So CONFIG_GROUP_SCHED is the key.
Thanks a bunch for re-testing.
-Mike
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2008-05-21 5:20 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-05-20 8:09 hackbench regression with 2.6.26-rc2 on tulsa machine Zhang, Yanmin
2008-05-20 9:22 ` Mike Galbraith
2008-05-21 4:54 ` Zhang, Yanmin
2008-05-21 5:19 ` Mike Galbraith
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox