public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [benchmark] Interbench 2.6.16-ck/mm
@ 2006-03-25  2:51 Con Kolivas
  2006-03-25  3:19 ` [ck] " André Goddard Rosa
  0 siblings, 1 reply; 7+ messages in thread
From: Con Kolivas @ 2006-03-25  2:51 UTC (permalink / raw)
  To: linux list; +Cc: ck list

Here are interbench numbers for a pentium4 HT with 2.6.16-ck1 and 2.6.16-mm1

The rules I use for flagging diffferences are:
1. Deadlines met is the primary endpoint.
2. When latency is out of bounds, what is the worst max latency as that will 
be the most noticeable jerkiness, stutter etc, but I only flag them if at 
least one value is >=7ms since that is required to be human perceptible. 

Using 2036365 loops per ms, running every load for 30 seconds
Benchmarking kernel 2.6.16-ck1 at datestamp 200603250244

--- Benchmarking simulated cpu of Audio in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
None	  0.015 +/- 0.0159     0.023 		 100	        100
Video	  0.757 +/- 0.881          2		 100	        100
X	  0.051 +/- 0.189       1.02 		 100	        100
Burn	  0.019 +/- 0.0201     0.056 *		 100 	        100 *
Write	  0.055 +/- 0.344       4.83		 100	        100
Read	  0.024 +/- 0.0251     0.064		 100	        100
Compile	   0.03 +/- 0.0888      1.69		 100	        100
Memload	  0.321 +/- 4.94         112		99.8	       99.8

--- Benchmarking simulated cpu of Video in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
None	  0.016 +/- 0.0162     0.028		 100	        100
X	  0.043 +/- 0.17        2.01 *		 100	        100 *
Burn	  0.188 +/- 2.33        45.3 *		99.6	       99.2 *
Write	  0.043 +/- 0.558       16.7 *		 100	       99.9 *
Read	  0.021 +/- 0.0217     0.078 *		 100	        100 
Compile	  0.093 +/- 1.08        18.4 *		 100	       99.6 *
Memload	  0.385 +/- 4.83         138		98.9	       98.3 *

--- Benchmarking simulated cpu of X in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
None	  0.026 +/- 0.356          6 *		 100	         99 *
Video	     14 +/- 26.5          72 *		35.6	       26.5 *
Burn	   51.4 +/- 98.4         456 *		15.1	       7.91 *
Write	   2.19 +/- 6.65          36 *		78.6	       71.1
Read	  0.756 +/- 3.33          24 *		90.6	         86
Compile	   34.5 +/- 53.8         165 *		20.7	       11.6 *
Memload	   4.05 +/- 12.6          80		73.3	       66.8

--- Benchmarking simulated cpu of Gaming in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU
None	   1.49 +/- 6.03        34.8 *		98.5 *
Video	   68.7 +/- 68.9          74 *		59.3
X	   60.7 +/- 70.6         143		62.2
Burn	    514 +/- 534          602 *		16.3
Write	   20.6 +/- 31.4         109 *		82.9
Read	   7.77 +/- 9.04        31.3		92.8
Compile	    436 +/- 459          697 *		18.7
Memload	   27.1 +/- 58.3         755		78.7 *



Using 2036365 loops per ms, running every load for 30 seconds
Benchmarking kernel 2.6.16-mm1 at datestamp 200603250939

--- Benchmarking simulated cpu of Audio in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
None	  0.012 +/- 0.0121     0.023		 100	        100
Video	   1.16 +/- 1.24        2.01		 100	        100
X	  0.302 +/- 0.681       2.01		 100	        100
Burn	  0.365 +/- 8.63         211		99.7	       99.7
Write	  0.017 +/- 0.0193     0.072		 100	        100
Read	   0.02 +/- 0.0209     0.056		 100	        100
Compile	  0.019 +/- 0.0196     0.046		 100	        100
Memload	  0.092 +/- 0.623         11 *	 	 100 *	        100 *

--- Benchmarking simulated cpu of Video in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
None	  0.011 +/- 0.0114      0.04		 100	        100
X	  0.885 +/- 3.73        18.7		 100	       95.8
Burn	  0.977 +/- 22.6         934		98.4	       95.9
Write	  0.047 +/- 0.789       23.7		 100	       99.8
Read	  0.017 +/- 0.0181     0.058		 100	        100
Compile	    1.6 +/- 17.6         370		96.5	       94.5
Memload	  0.501 +/- 3.48        44.4 *		98.9	       97.4

--- Benchmarking simulated cpu of X in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
None	  0.255 +/- 2.25          27		94.1	       93.3
Video	   14.8 +/- 27.7          76		34.8	       25.5
Burn	   39.4 +/- 75.6         505		13.1	       7.07
Write	    1.6 +/- 6.17          48		78.7	       74.9 *
Read	  0.709 +/- 3.26          30		91.2	         88 *
Compile	   42.9 +/- 80.5         667		11.3	       5.61
Memload	   2.23 +/- 7.24          42 *		71.4	       66.8

--- Benchmarking simulated cpu of Gaming in the presence of simulated ---
Load	Latency +/- SD (ms)  Max Latency   % Desired CPU
None	   5.31 +/- 19           123		  95
Video	   69.8 +/- 70.1        80.1		58.9 *
X	   60.3 +/- 70.3         120 *		62.4
Burn	    295 +/- 310          710		25.3 *
Write	   17.3 +/- 30.8         110		85.2 *
Read	   5.64 +/- 6.45        20.3 *		94.7 *
Compile	    392 +/- 420          713		20.3
Memload	   27.8 +/- 47.8         430 *		78.3


Summary:
	Deadlines	Max Latencies
	ck	mm	ck	mm
Audio 	1	1	1	1
Video	5	0	5	1
X	4	2	6	1
Game	2	4	5	3

Note that most of the differences between -ck and -mm on this benchmark are 
due to the staircase cpu scheduler in -ck. It has staircase v14.2 as recently 
posted to the mailing list.

Cheers,
Con

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [ck] [benchmark] Interbench 2.6.16-ck/mm
  2006-03-25  2:51 [benchmark] Interbench 2.6.16-ck/mm Con Kolivas
@ 2006-03-25  3:19 ` André Goddard Rosa
  2006-03-25  4:01   ` Con Kolivas
  0 siblings, 1 reply; 7+ messages in thread
From: André Goddard Rosa @ 2006-03-25  3:19 UTC (permalink / raw)
  To: Con Kolivas; +Cc: linux list, ck list

On 3/24/06, Con Kolivas <kernel@kolivas.org> wrote:
> Here are interbench numbers for a pentium4 HT with 2.6.16-ck1 and 2.6.16-mm1
>
> The rules I use for flagging diffferences are:
> 1. Deadlines met is the primary endpoint.
> 2. When latency is out of bounds, what is the worst max latency as that will
> be the most noticeable jerkiness, stutter etc, but I only flag them if at
> least one value is >=7ms since that is required to be human perceptible.
>
> Using 2036365 loops per ms, running every load for 30 seconds
> Benchmarking kernel 2.6.16-ck1 at datestamp 200603250244
>
> --- Benchmarking simulated cpu of Audio in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
> None      0.015 +/- 0.0159     0.023             100            100
> Video     0.757 +/- 0.881          2             100            100
> X         0.051 +/- 0.189       1.02             100            100
> Burn      0.019 +/- 0.0201     0.056 *           100            100 *
> Write     0.055 +/- 0.344       4.83             100            100
> Read      0.024 +/- 0.0251     0.064             100            100
> Compile    0.03 +/- 0.0888      1.69             100            100
> Memload   0.321 +/- 4.94         112            99.8           99.8
>
> --- Benchmarking simulated cpu of Video in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
> None      0.016 +/- 0.0162     0.028             100            100
> X         0.043 +/- 0.17        2.01 *           100            100 *
> Burn      0.188 +/- 2.33        45.3 *          99.6           99.2 *
> Write     0.043 +/- 0.558       16.7 *           100           99.9 *
> Read      0.021 +/- 0.0217     0.078 *           100            100
> Compile   0.093 +/- 1.08        18.4 *           100           99.6 *
> Memload   0.385 +/- 4.83         138            98.9           98.3 *
>
> --- Benchmarking simulated cpu of X in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
> None      0.026 +/- 0.356          6 *           100             99 *
> Video        14 +/- 26.5          72 *          35.6           26.5 *
> Burn       51.4 +/- 98.4         456 *          15.1           7.91 *
> Write      2.19 +/- 6.65          36 *          78.6           71.1
> Read      0.756 +/- 3.33          24 *          90.6             86
> Compile    34.5 +/- 53.8         165 *          20.7           11.6 *
> Memload    4.05 +/- 12.6          80            73.3           66.8
>
> --- Benchmarking simulated cpu of Gaming in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU
> None       1.49 +/- 6.03        34.8 *          98.5 *
> Video      68.7 +/- 68.9          74 *          59.3
> X          60.7 +/- 70.6         143            62.2
> Burn        514 +/- 534          602 *          16.3
> Write      20.6 +/- 31.4         109 *          82.9
> Read       7.77 +/- 9.04        31.3            92.8
> Compile     436 +/- 459          697 *          18.7
> Memload    27.1 +/- 58.3         755            78.7 *
>
>
>
> Using 2036365 loops per ms, running every load for 30 seconds
> Benchmarking kernel 2.6.16-mm1 at datestamp 200603250939
>
> --- Benchmarking simulated cpu of Audio in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
> None      0.012 +/- 0.0121     0.023             100            100
> Video      1.16 +/- 1.24        2.01             100            100
> X         0.302 +/- 0.681       2.01             100            100
> Burn      0.365 +/- 8.63         211            99.7           99.7
> Write     0.017 +/- 0.0193     0.072             100            100
> Read       0.02 +/- 0.0209     0.056             100            100
> Compile   0.019 +/- 0.0196     0.046             100            100
> Memload   0.092 +/- 0.623         11 *           100 *          100 *
>
> --- Benchmarking simulated cpu of Video in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
> None      0.011 +/- 0.0114      0.04             100            100
> X         0.885 +/- 3.73        18.7             100           95.8
> Burn      0.977 +/- 22.6         934            98.4           95.9
> Write     0.047 +/- 0.789       23.7             100           99.8
> Read      0.017 +/- 0.0181     0.058             100            100
> Compile     1.6 +/- 17.6         370            96.5           94.5
> Memload   0.501 +/- 3.48        44.4 *          98.9           97.4
>
> --- Benchmarking simulated cpu of X in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU  % Deadlines Met
> None      0.255 +/- 2.25          27            94.1           93.3
> Video      14.8 +/- 27.7          76            34.8           25.5
> Burn       39.4 +/- 75.6         505            13.1           7.07
> Write       1.6 +/- 6.17          48            78.7           74.9 *
> Read      0.709 +/- 3.26          30            91.2             88 *
> Compile    42.9 +/- 80.5         667            11.3           5.61
> Memload    2.23 +/- 7.24          42 *          71.4           66.8
>
> --- Benchmarking simulated cpu of Gaming in the presence of simulated ---
> Load    Latency +/- SD (ms)  Max Latency   % Desired CPU
> None       5.31 +/- 19           123              95
> Video      69.8 +/- 70.1        80.1            58.9 *
> X          60.3 +/- 70.3         120 *          62.4
> Burn        295 +/- 310          710            25.3 *
> Write      17.3 +/- 30.8         110            85.2 *
> Read       5.64 +/- 6.45        20.3 *          94.7 *
> Compile     392 +/- 420          713            20.3
> Memload    27.8 +/- 47.8         430 *          78.3
>
>
> Summary:
>         Deadlines       Max Latencies
>         ck      mm      ck      mm
> Audio   1       1       1       1
> Video   5       0       5       1
> X       4       2       6       1
> Game    2       4       5       3
>
> Note that most of the differences between -ck and -mm on this benchmark are
> due to the staircase cpu scheduler in -ck. It has staircase v14.2 as recently
> posted to the mailing list.
>
> Cheers,
> Con

I just tried  2.6.16-mm1 with Mike Galbraith patches and it feels
better in some situations (like opening a new tab inside konqueror or
running a configure script ) but it gave some
stops when playing amarok/gstreamer and doing regular desktop work.

Best regards,
--
[]s,

André Goddard

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [ck] [benchmark] Interbench 2.6.16-ck/mm
  2006-03-25  3:19 ` [ck] " André Goddard Rosa
@ 2006-03-25  4:01   ` Con Kolivas
  2006-03-25  8:21     ` Radoslaw Szkodzinski
  0 siblings, 1 reply; 7+ messages in thread
From: Con Kolivas @ 2006-03-25  4:01 UTC (permalink / raw)
  To: André Goddard Rosa; +Cc: linux list, ck list

On Saturday 25 March 2006 14:19, André Goddard Rosa wrote:
> On 3/24/06, Con Kolivas <kernel@kolivas.org> wrote:
> > Note that most of the differences between -ck and -mm on this benchmark
> > are due to the staircase cpu scheduler in -ck. It has staircase v14.2 as
> > recently posted to the mailing list.

> I just tried  2.6.16-mm1 with Mike Galbraith patches and it feels
> better in some situations (like opening a new tab inside konqueror or
> running a configure script ) but it gave some
> stops when playing amarok/gstreamer and doing regular desktop work.

Thanks.

I don't expect that staircase will be better in every single situation. 
However it will be better more often, especially when it counts (like audio 
or video skipping) and far more predictable. All that in 300 lines less 
code :)

Cheers,
Con

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [ck] [benchmark] Interbench 2.6.16-ck/mm
  2006-03-25  4:01   ` Con Kolivas
@ 2006-03-25  8:21     ` Radoslaw Szkodzinski
  2006-03-25  8:28       ` Con Kolivas
  0 siblings, 1 reply; 7+ messages in thread
From: Radoslaw Szkodzinski @ 2006-03-25  8:21 UTC (permalink / raw)
  To: ck; +Cc: Con Kolivas, André Goddard Rosa, linux list

[-- Attachment #1: Type: text/plain, Size: 586 bytes --]

On Saturday 25 March 2006 05:01, Con Kolivas wrote yet:
>
> I don't expect that staircase will be better in every single situation.
> However it will be better more often, especially when it counts (like audio
> or video skipping) and far more predictable. All that in 300 lines less
> code :)
>

I thinks the main difference is those other scheduler improvements.
Some of them are compatible with staircase.
Could you also try a mixed and matched 2.6.16-ck1+mm?

-- 
GPG Key id:  0xD1F10BA2
Fingerprint: 96E2 304A B9C4 949A 10A0  9105 9543 0453 D1F1 0BA2

AstralStorm

[-- Attachment #2: Type: application/pgp-signature, Size: 191 bytes --]

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [ck] [benchmark] Interbench 2.6.16-ck/mm
  2006-03-25  8:21     ` Radoslaw Szkodzinski
@ 2006-03-25  8:28       ` Con Kolivas
  2006-03-25  8:46         ` Radoslaw Szkodzinski
  0 siblings, 1 reply; 7+ messages in thread
From: Con Kolivas @ 2006-03-25  8:28 UTC (permalink / raw)
  To: Radoslaw Szkodzinski; +Cc: ck, André Goddard Rosa, linux list

On Saturday 25 March 2006 19:21, Radoslaw Szkodzinski wrote:
> On Saturday 25 March 2006 05:01, Con Kolivas wrote yet:
> > I don't expect that staircase will be better in every single situation.
> > However it will be better more often, especially when it counts (like
> > audio or video skipping) and far more predictable. All that in 300 lines
> > less code :)
>
> I thinks the main difference is those other scheduler improvements.
> Some of them are compatible with staircase.
> Could you also try a mixed and matched 2.6.16-ck1+mm?

You're kidding, right? Check the code.

Cheers,
Con

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [ck] [benchmark] Interbench 2.6.16-ck/mm
  2006-03-25  8:28       ` Con Kolivas
@ 2006-03-25  8:46         ` Radoslaw Szkodzinski
  2006-03-25  9:15           ` Con Kolivas
  0 siblings, 1 reply; 7+ messages in thread
From: Radoslaw Szkodzinski @ 2006-03-25  8:46 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, André Goddard Rosa, linux list

[-- Attachment #1: Type: text/plain, Size: 1147 bytes --]

On Saturday 25 March 2006 09:28, Con Kolivas wrote yet:
> On Saturday 25 March 2006 19:21, Radoslaw Szkodzinski wrote:
> > On Saturday 25 March 2006 05:01, Con Kolivas wrote yet:
> > > I don't expect that staircase will be better in every single situation.
> > > However it will be better more often, especially when it counts (like
> > > audio or video skipping) and far more predictable. All that in 300
> > > lines less code :)
> >
> > I thinks the main difference is those other scheduler improvements.
> > Some of them are compatible with staircase.
> > Could you also try a mixed and matched 2.6.16-ck1+mm?
>
> You're kidding, right? Check the code.

Yes and no. I was kidding about "scheduler improvements" part.
(they're mostly NUMA-only)

But of course memload, read and write latencies aren't necessarily caused by 
scheduler itself. 
(burn also reads a file)

The easiest thing to do would be to add staircase to -mm and see what happens.
It shouldn't be hard to port. (in fact, it may apply cleanly)

-- 
GPG Key id:  0xD1F10BA2
Fingerprint: 96E2 304A B9C4 949A 10A0  9105 9543 0453 D1F1 0BA2

AstralStorm

[-- Attachment #2: Type: application/pgp-signature, Size: 191 bytes --]

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [ck] [benchmark] Interbench 2.6.16-ck/mm
  2006-03-25  8:46         ` Radoslaw Szkodzinski
@ 2006-03-25  9:15           ` Con Kolivas
  0 siblings, 0 replies; 7+ messages in thread
From: Con Kolivas @ 2006-03-25  9:15 UTC (permalink / raw)
  To: Radoslaw Szkodzinski; +Cc: ck, André Goddard Rosa, linux list

On Saturday 25 March 2006 19:46, Radoslaw Szkodzinski wrote:
> On Saturday 25 March 2006 09:28, Con Kolivas wrote yet:
> > On Saturday 25 March 2006 19:21, Radoslaw Szkodzinski wrote:
> > > On Saturday 25 March 2006 05:01, Con Kolivas wrote yet:
> > > > I don't expect that staircase will be better in every single
> > > > situation. However it will be better more often, especially when it
> > > > counts (like audio or video skipping) and far more predictable. All
> > > > that in 300 lines less code :)
> > >
> > > I thinks the main difference is those other scheduler improvements.
> > > Some of them are compatible with staircase.
> > > Could you also try a mixed and matched 2.6.16-ck1+mm?
> >
> > You're kidding, right? Check the code.
>
> Yes and no. I was kidding about "scheduler improvements" part.
> (they're mostly NUMA-only)
>
> But of course memload, read and write latencies aren't necessarily caused
> by scheduler itself.
> (burn also reads a file)
>
> The easiest thing to do would be to add staircase to -mm and see what
> happens. It shouldn't be hard to port. (in fact, it may apply cleanly)

Yes it would be nice to believe that taking some -mm components into -ck would 
actually make -ck better than everything across the board. However the policy 
changes that make up staircase are not compatible with most other changes in 
-mm, and most of the differences are due to the difference between the 
mainline scheduler and staircase. There are some vm tweaks and the like in 
-ck which only show up when you nice things though and that would improve the 
results if we were comparing different nice values on interbench. 
Furthermore,  Mike's changes that Andre tested only worsen interbench results 
so they would make staircase/ck look even better than -mm than it currently 
does. Some of the things Andre found better on -mm with Mike's changes may 
well be fairness related because staircase is more effective at maintaining 
fairness than mainline without any special throttling.

Cheers,
Con

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2006-03-25  9:16 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2006-03-25  2:51 [benchmark] Interbench 2.6.16-ck/mm Con Kolivas
2006-03-25  3:19 ` [ck] " André Goddard Rosa
2006-03-25  4:01   ` Con Kolivas
2006-03-25  8:21     ` Radoslaw Szkodzinski
2006-03-25  8:28       ` Con Kolivas
2006-03-25  8:46         ` Radoslaw Szkodzinski
2006-03-25  9:15           ` Con Kolivas

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox