public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [PATCH v5 0/8] Reduce cross CPU IPI interference
@ 2012-01-02 10:24 Gilad Ben-Yossef
  2012-01-02 10:24 ` [PATCH v5 1/8] smp: Introduce a generic on_each_cpu_mask function Gilad Ben-Yossef
                   ` (7 more replies)
  0 siblings, 8 replies; 43+ messages in thread
From: Gilad Ben-Yossef @ 2012-01-02 10:24 UTC (permalink / raw)
  To: linux-kernel; +Cc: Gilad Ben-Yossef

We have lots of infrastructure in place to partition a multi-core systems
such that we have a group of CPUs that are dedicated to specific task: 
cgroups, scheduler and interrupt affinity and cpuisol boot parameter. 
Still, kernel code will some time interrupt all CPUs in the system via IPIs 
for various needs. These IPIs are useful and cannot be avoided altogether, 
but in certain cases it is possible to interrupt only specific CPUs that 
have useful work to do and not the entire system.

This patch set, inspired by discussions with Peter Zijlstra and Frederic 
Weisbecker when testing the nohz task patch set, is a first stab at trying 
to explore doing this by locating the places where such global IPI calls 
are being made and turning a global IPI into an IPI for a specific group 
of CPUs.  The purpose of the patch set is to get feedback if this is the 
right way to go for dealing with this issue and indeed, if the issue is 
even worth dealing with at all. Based on the feedback from this patch set 
I plan to offer further patches that address similar issue in other code 
paths.

The patch creates an on_each_cpu_mask infrastructure API (derived from 
existing arch specific versions in Tile and Arm) and service wrappers 
and uses them to turn several  global IPI invocation to per CPU group 
invocations.

This 5th iteration includes the following changes:

- Abstract away the common boilerplate as on_each_cpu_cond wrapper 
  function and make all the places use it.
- Move the page_alloc.c/drain_all_pages to use a static global
  cpumask to avoid adding an allocation in the direct reclaim
  path, based on feedback and suggestion by Mel Gorman and Chris
  Metcalf.
- Add an optional patch to add vmstat counters to per-cpu page
  drain request and the upside to using this patch, based on 
  Mel Gorman idea.
- Provide the same treatment to yet another call site - this time
  the local LRU BH invalidation.

The patch was compiled for arm and boot tested on x86 in UP, SMP, with and without
CONFIG_CPUMASK_OFFSTACK and was further tested by running hackbench on x86 in
SMP mode in a 4 CPUs VM with no obvious regressions.

I also artificially exercised SLUB flush_all via the debug interface and observed
the difference in IPI count across processors with and without the patch - from
an IPI on all processors but one without the patch to a subset (and often no IPI
at all) with the patch.

I further used fault injection framework to force cpumask alloction failures for
CPUMASK_OFFSTACK=y cases and triggering the code using slub sys debug interface,
as well as running ./hackbench 400 for page_alloc, with no obvious falilures.

Gilad Ben-Yossef (8):
  smp: Introduce a generic on_each_cpu_mask function
  arm: Move arm over to generic on_each_cpu_mask
  tile: Move tile to use generic on_each_cpu_mask
  smp: Add func to IPI cpus based on parameter func
  slub: Only IPI CPUs that have per cpu obj to flush
  fs: only send IPI to invalidate LRU BH when needed
  mm: Only IPI CPUs to drain local pages if they exist
  mm: add vmstat counters for tracking PCP drains

 arch/arm/kernel/smp_tlb.c     |   20 ++++-------------
 arch/tile/include/asm/smp.h   |    7 ------
 arch/tile/kernel/smp.c        |   19 ----------------
 fs/buffer.c                   |   15 ++++++++++++-
 include/linux/smp.h           |   32 +++++++++++++++++++++++++++
 include/linux/vm_event_item.h |    1 +
 kernel/smp.c                  |   47 +++++++++++++++++++++++++++++++++++++++++
 mm/page_alloc.c               |   30 +++++++++++++++++++++++++-
 mm/slub.c                     |   10 +++++++-
 mm/vmstat.c                   |    2 +
 10 files changed, 139 insertions(+), 44 deletions(-)


^ permalink raw reply	[flat|nested] 43+ messages in thread

end of thread, other threads:[~2012-01-09 17:25 UTC | newest]

Thread overview: 43+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-01-02 10:24 [PATCH v5 0/8] Reduce cross CPU IPI interference Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 1/8] smp: Introduce a generic on_each_cpu_mask function Gilad Ben-Yossef
2012-01-03  7:51   ` Michal Nazarewicz
2012-01-03  8:12     ` Gilad Ben-Yossef
2012-01-03  8:57       ` Michal Nazarewicz
2012-01-03 22:26   ` Andrew Morton
2012-01-05 13:17     ` Michal Nazarewicz
2012-01-08 16:04     ` Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 2/8] arm: Move arm over to generic on_each_cpu_mask Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 3/8] tile: Move tile to use " Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 4/8] smp: Add func to IPI cpus based on parameter func Gilad Ben-Yossef
2012-01-03 22:34   ` Andrew Morton
2012-01-08 16:09     ` Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 5/8] slub: Only IPI CPUs that have per cpu obj to flush Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 6/8] fs: only send IPI to invalidate LRU BH when needed Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 7/8] mm: Only IPI CPUs to drain local pages if they exist Gilad Ben-Yossef
2012-01-03 17:45   ` KOSAKI Motohiro
2012-01-03 18:58     ` Gilad Ben-Yossef
2012-01-03 22:02       ` KOSAKI Motohiro
2012-01-05 14:20     ` Mel Gorman
2012-01-05 14:40       ` Russell King - ARM Linux
2012-01-05 15:24         ` Peter Zijlstra
2012-01-05 16:17         ` Mel Gorman
2012-01-05 16:35           ` Russell King - ARM Linux
2012-01-05 18:35             ` Paul E. McKenney
2012-01-05 22:21               ` Mel Gorman
2012-01-06  6:06                 ` Srivatsa S. Bhat
2012-01-06 10:46                   ` Mel Gorman
2012-01-06 13:28                 ` Greg KH
2012-01-06 14:09                   ` Mel Gorman
2012-01-05 22:06           ` Andrew Morton
2012-01-05 22:31             ` Mel Gorman
2012-01-05 23:19               ` Andrew Morton
2012-01-09 17:25                 ` Mel Gorman
2012-01-07 16:52           ` Paul E. McKenney
2012-01-07 17:05             ` Paul E. McKenney
2012-01-05 15:54   ` Mel Gorman
2012-01-08 16:01     ` Gilad Ben-Yossef
2012-01-02 10:24 ` [PATCH v5 8/8] mm: add vmstat counters for tracking PCP drains Gilad Ben-Yossef
2012-01-03 17:47   ` KOSAKI Motohiro
2012-01-03 19:00     ` Gilad Ben-Yossef
2012-01-03 22:13       ` KOSAKI Motohiro
2012-01-03 22:37       ` Andrew Morton

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox