From: Mathieu Desnoyers <mathieu.desnoyers@polymtl.ca>
To: Christoph Lameter <clameter@sgi.com>
Cc: akpm@linux-foundation.org, linux-kernel@vger.kernel.org,
	mingo@redhat.com
Subject: Re: [PATCH] SLUB use cmpxchg_local
Date: Tue, 21 Aug 2007 19:39:38 -0400
Message-ID: <20070821233938.GD29691@Krystal>
In-Reply-To: <Pine.LNX.4.64.0708211614370.6062@schroedinger.engr.sgi.com>

* Christoph Lameter (clameter@sgi.com) wrote:
> On Tue, 21 Aug 2007, Mathieu Desnoyers wrote:
> 
> > SLUB Use cmpxchg() everywhere.
> > 
> > It applies to "SLUB: Single atomic instruction alloc/free using
> > cmpxchg".
> 
> > +++ slab/mm/slub.c	2007-08-20 18:42:28.000000000 -0400
> > @@ -1682,7 +1682,7 @@ redo:
> >  
> >  	object[c->offset] = freelist;
> >  
> > -	if (unlikely(cmpxchg_local(&c->freelist, freelist, object) != freelist))
> > +	if (unlikely(cmpxchg(&c->freelist, freelist, object) != freelist))
> >  		goto redo;
> >  	return;
> >  slow:
> 
> Ok so regular cmpxchg, no cmpxchg_local. cmpxchg_local does not bring 
> anything more? My measurements did not show any difference. I measured on 
> Athlon64. What processor is being used?
> 

This patch only cleans up the tree before proposing my cmpxchg_local
changes. There was an inconsistent use of cmpxchg/cmpxchg_local there.

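Using cmpxchg_local instead of cmpxchg has a clear impact on the fast
paths, as shown below: it saves roughly 50 to 70 cycles per kmalloc
(test 1, sizes 16 to 2048) and about 200 cycles per kmalloc/kfree pair
(test 2, sizes up to 2048). From 4096 bytes up, the two variants perform
about the same.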

Pros:
- we can use barrier() (a compiler-only barrier) instead of rmb()
- cmpxchg_local is faster

Con:
- we must disable preemption around the cmpxchg_local sequence, so the
  task stays on the same CPU
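
To make the retry loop concrete, here is a minimal userspace sketch of
the freelist update from the quoted hunk, written with C11 atomics. The
names cpu_cache, object, cache_free and cache_alloc are made up for
illustration; this is not the SLUB code. C11 compare-exchange is always
atomic across CPUs, so the sketch corresponds to the cmpxchg() variant.
cmpxchg_local has no portable userspace equivalent: in the kernel it is
only safe on per-CPU data while the task cannot migrate, which is why
preemption has to be disabled.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-ins for a free object and a per-CPU cache. */
struct object {
	struct object *next;
};

struct cpu_cache {
	_Atomic(struct object *) freelist;
};

/* Push obj onto the freelist; retry if the head changed under us
 * (the same idea as the "goto redo" in the quoted hunk). */
static void cache_free(struct cpu_cache *c, struct object *obj)
{
	struct object *old =
		atomic_load_explicit(&c->freelist, memory_order_relaxed);

	do {
		obj->next = old;	/* link to the head we last saw */
	} while (!atomic_compare_exchange_weak_explicit(
			&c->freelist, &old, obj,
			memory_order_release, memory_order_relaxed));
}

/* Pop the head of the freelist, or return NULL if it is empty.
 * Illustration only: a concurrent pop like this is exposed to the
 * ABA problem and needs extra protection in real code. */
static struct object *cache_alloc(struct cpu_cache *c)
{
	struct object *old =
		atomic_load_explicit(&c->freelist, memory_order_acquire);

	while (old && !atomic_compare_exchange_weak_explicit(
			&c->freelist, &old, old->next,
			memory_order_acquire, memory_order_acquire))
		;	/* old was reloaded by the failed exchange */
	return old;
}

/* Single-threaded check of LIFO order. */
static void freelist_selftest(void)
{
	struct cpu_cache c;
	struct object a, b;

	atomic_init(&c.freelist, NULL);
	cache_free(&c, &a);
	cache_free(&c, &b);
	assert(cache_alloc(&c) == &b);
	assert(cache_alloc(&c) == &a);
	assert(cache_alloc(&c) == NULL);
}
```

On x86 the practical difference between the two kernel primitives is the
LOCK prefix: cmpxchg() issues it on SMP, cmpxchg_local does not.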

I use a 3GHz Pentium 4 for my tests.

Results (cmpxchg compared to cmpxchg_local):

SLUB Performance testing
========================
1. Kmalloc: Repeatedly allocate then free test
(kfree takes the slow path here)

* cmpxchg
kmalloc(8) = 271 cycles     kfree = 645 cycles
kmalloc(16) = 158 cycles    kfree = 428 cycles
kmalloc(32) = 153 cycles    kfree = 446 cycles
kmalloc(64) = 178 cycles    kfree = 459 cycles
kmalloc(128) = 247 cycles   kfree = 481 cycles
kmalloc(256) = 363 cycles   kfree = 605 cycles
kmalloc(512) = 449 cycles   kfree = 677 cycles
kmalloc(1024) = 626 cycles  kfree = 810 cycles
kmalloc(2048) = 681 cycles  kfree = 869 cycles
kmalloc(4096) = 471 cycles  kfree = 575 cycles
kmalloc(8192) = 666 cycles  kfree = 747 cycles
kmalloc(16384) = 736 cycles kfree = 853 cycles

* cmpxchg_local
kmalloc(8) = 83 cycles      kfree = 363 cycles
kmalloc(16) = 85 cycles     kfree = 372 cycles
kmalloc(32) = 92 cycles     kfree = 377 cycles
kmalloc(64) = 115 cycles    kfree = 397 cycles
kmalloc(128) = 179 cycles   kfree = 438 cycles
kmalloc(256) = 314 cycles   kfree = 564 cycles
kmalloc(512) = 398 cycles   kfree = 615 cycles
kmalloc(1024) = 573 cycles  kfree = 745 cycles
kmalloc(2048) = 629 cycles  kfree = 816 cycles
kmalloc(4096) = 473 cycles  kfree = 548 cycles
kmalloc(8192) = 659 cycles  kfree = 745 cycles
kmalloc(16384) = 724 cycles kfree = 843 cycles


2. Kmalloc: alloc/free test

* cmpxchg
kmalloc(8)/kfree = 321 cycles
kmalloc(16)/kfree = 308 cycles
kmalloc(32)/kfree = 311 cycles
kmalloc(64)/kfree = 310 cycles
kmalloc(128)/kfree = 306 cycles
kmalloc(256)/kfree = 325 cycles
kmalloc(512)/kfree = 324 cycles
kmalloc(1024)/kfree = 322 cycles
kmalloc(2048)/kfree = 309 cycles
kmalloc(4096)/kfree = 678 cycles
kmalloc(8192)/kfree = 1027 cycles
kmalloc(16384)/kfree = 1204 cycles

* cmpxchg_local
kmalloc(8)/kfree = 112 cycles
kmalloc(16)/kfree = 103 cycles
kmalloc(32)/kfree = 103 cycles
kmalloc(64)/kfree = 103 cycles
kmalloc(128)/kfree = 112 cycles
kmalloc(256)/kfree = 111 cycles
kmalloc(512)/kfree = 111 cycles
kmalloc(1024)/kfree = 111 cycles
kmalloc(2048)/kfree = 121 cycles
kmalloc(4096)/kfree = 650 cycles
kmalloc(8192)/kfree = 1042 cycles
kmalloc(16384)/kfree = 1149 cycles
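
As a quick cross-check of the "about 200 cycles" figure, the small
program fragment below averages the per-size difference in test 2 for
sizes 8 to 2048. The arrays are copied from the test 2 numbers above;
the helper name mean_saving is made up for this sketch.

```c
#include <assert.h>
#include <stddef.h>

/* Test 2 cycle counts for kmalloc(8) .. kmalloc(2048), copied from the
 * results in this message. */
static const int cmpxchg_cycles[] = {
	321, 308, 311, 310, 306, 325, 324, 322, 309
};
static const int cmpxchg_local_cycles[] = {
	112, 103, 103, 103, 112, 111, 111, 111, 121
};

/* Mean of a[i] - b[i] over n entries, rounded down. */
static int mean_saving(const int *a, const int *b, size_t n)
{
	int sum = 0;

	assert(n > 0);
	for (size_t i = 0; i < n; i++)
		sum += a[i] - b[i];
	return sum / (int)n;
}
```

Calling mean_saving(cmpxchg_cycles, cmpxchg_local_cycles, 9) gives the
average saving per kmalloc/kfree pair over those nine sizes.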

-- 
Mathieu Desnoyers
Computer Engineering Ph.D. Student, Ecole Polytechnique de Montreal
OpenPGP key fingerprint: 8CD5 52C3 8E3C 4140 715F  BA06 3F25 A8FE 3BAE 9A68
