All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: Chuck Ebbert <76306.1226@compuserve.com>
Cc: Andi Kleen <ak@suse.de>, Linus Torvalds <torvalds@osdl.org>,
	Andrew Morton <akpm@osdl.org>,
	linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: [patch 2.6.14-rc4] i386: spinlock optimization
Date: Sat, 15 Oct 2005 17:09:48 +0200	[thread overview]
Message-ID: <20051015150948.GA10763@elte.hu> (raw)
In-Reply-To: <200510142128_MC3-1-ACAD-8CD3@compuserve.com>


* Chuck Ebbert <76306.1226@compuserve.com> wrote:

> Parent CPU 1, child CPU 0, using old code for lock
> CPU clocks: 2066947815
> Parent CPU 1, child CPU 0, using new code for lock
> CPU clocks: 2818166922

> Parent CPU 1, child CPU 0, using old code for lock
> CPU clocks: 5635093038
> Parent CPU 1, child CPU 0, using new code for lock
> CPU clocks: 5250078921

your numbers show that for the first box, there's a 36% net slowdown 
resulting from the new code. On the other (older) box, there's a 7% 
speedup from the new code.

i ran the code on two newer boxes, 2.4 and 3.4 GHz Xeons, on two sibling 
CPUs sharing the same physical CPU and on two different CPUs as well:

HT non-siblings [Xeon 2.40GHz], 1% slowdown:

 Parent CPU 2, child CPU 0, using old code for lock
 CPU clocks: 6712771070
 Parent CPU 2, child CPU 0, using new code for lock
 CPU clocks: 6787556068

HT siblings [Xeon 2.40GHz], 14% speedup:

 Parent CPU 1, child CPU 0, using old code for lock
 CPU clocks: 3587124593
 Parent CPU 1, child CPU 0, using new code for lock
 CPU clocks: 3079647206

HT non-siblings [Xeon 3.40GHz], 3% speedup:

 Parent CPU 2, child CPU 0, using old code for lock
 CPU clocks: 8486900988
 Parent CPU 2, child CPU 0, using new code for lock
 CPU clocks: 8255818784

HT siblings [Xeon 3.40GHz], 1% slowdown:

 Parent CPU 3, child CPU 0, using old code for lock
 CPU clocks: 3684195488
 Parent CPU 3, child CPU 0, using new code for lock
 CPU clocks: 3739797320

but given that the code is a drastic slowdown on older boxes (and 
results in higher memory bus traffic, which may slow down other CPUs 
too, which effect isnt measured here), i dont think we should apply the 
patch, just yet - up until the point it becomes a clear winner on new 
CPUs.

(in fact the ping-pong effect should be worse if more than 2 CPUs 
contend for the spinlock.)

could someone run the tests on a dual-core box too?

	Ingo

  reply	other threads:[~2005-10-15 15:09 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-10-15  1:24 [patch 2.6.14-rc4] i386: spinlock optimization Chuck Ebbert
2005-10-15 15:09 ` Ingo Molnar [this message]
  -- strict thread matches above, loose matches on Subject: below --
2005-10-14 17:47 Chuck Ebbert
2005-10-14 18:04 ` Andi Kleen
2005-10-14 18:24 ` linux-os (Dick Johnson)
2005-10-14 18:32   ` Andrew Morton
2005-10-14 18:55     ` linux-os (Dick Johnson)
2005-10-14 18:37   ` Linus Torvalds
2005-10-14 18:56     ` linux-os (Dick Johnson)

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20051015150948.GA10763@elte.hu \
    --to=mingo@elte.hu \
    --cc=76306.1226@compuserve.com \
    --cc=ak@suse.de \
    --cc=akpm@osdl.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=torvalds@osdl.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.