public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
* [patch] notifiers: fix blocking_notifier_call_chain() scalability
@ 2007-01-23  9:45 Ingo Molnar
  2007-01-23 11:24 ` Peter Zijlstra
  0 siblings, 1 reply; 3+ messages in thread
From: Ingo Molnar @ 2007-01-23  9:45 UTC (permalink / raw)
  To: linux-kernel; +Cc: Linus Torvalds, Andrew Morton

Subject: [patch] notifiers: fix blocking_notifier_call_chain() scalability
From: Ingo Molnar <mingo@elte.hu>

while lock-profiling the -rt kernel i noticed weird contention during 
mmap-intense workloads, and the tracer showed the following gem, in one 
of our MM hotpaths:

 threaded-2771  1....   65us : sys_munmap (sysenter_do_call)
 threaded-2771  1....   66us : profile_munmap (sys_munmap)
 threaded-2771  1....   66us : blocking_notifier_call_chain (profile_munmap)
 threaded-2771  1....   66us : rt_down_read (blocking_notifier_call_chain)

ouch! a global rw-semaphore taken in one of the most 
performance-sensitive codepaths of the kernel. And i dont even have 
oprofile enabled! All distro kernels have CONFIG_PROFILING enabled, so 
this scalability problem affects the majority of Linux users.

The fix is to enhance blocking_notifier_call_chain() to only take the 
lock if there appears to be work on the call-chain.

With this patch applied i get nicely saturated system, and much higher 
munmap performance, on SMP systems.

And as a bonus this also fixes a similar scalability bottleneck in the 
thread-exit codepath: profile_task_exit() ...

Signed-off-by: Ingo Molnar <mingo@elte.hu>
---
 kernel/sys.c |   15 +++++++++++----
 1 file changed, 11 insertions(+), 4 deletions(-)

Index: linux/kernel/sys.c
===================================================================
--- linux.orig/kernel/sys.c
+++ linux/kernel/sys.c
@@ -325,11 +325,18 @@ EXPORT_SYMBOL_GPL(blocking_notifier_chai
 int blocking_notifier_call_chain(struct blocking_notifier_head *nh,
 		unsigned long val, void *v)
 {
-	int ret;
+	int ret = NOTIFY_DONE;
 
-	down_read(&nh->rwsem);
-	ret = notifier_call_chain(&nh->head, val, v);
-	up_read(&nh->rwsem);
+	/*
+	 * We check the head outside the lock, but if this access is
+	 * racy then it does not matter what the result of the test
+	 * is, we re-check the list after having taken the lock anyway:
+	 */
+	if (rcu_dereference(nh->head)) {
+		down_read(&nh->rwsem);
+		ret = notifier_call_chain(&nh->head, val, v);
+		up_read(&nh->rwsem);
+	}
 	return ret;
 }
 

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [patch] notifiers: fix blocking_notifier_call_chain() scalability
  2007-01-23  9:45 [patch] notifiers: fix blocking_notifier_call_chain() scalability Ingo Molnar
@ 2007-01-23 11:24 ` Peter Zijlstra
  2007-01-23 15:11   ` Nick Piggin
  0 siblings, 1 reply; 3+ messages in thread
From: Peter Zijlstra @ 2007-01-23 11:24 UTC (permalink / raw)
  To: Ingo Molnar; +Cc: linux-kernel, Linus Torvalds, Andrew Morton

On Tue, 2007-01-23 at 10:45 +0100, Ingo Molnar wrote:
> Subject: [patch] notifiers: fix blocking_notifier_call_chain() scalability
> From: Ingo Molnar <mingo@elte.hu>
> 
> while lock-profiling the -rt kernel i noticed weird contention during 
> mmap-intense workloads, and the tracer showed the following gem, in one 
> of our MM hotpaths:
> 
>  threaded-2771  1....   65us : sys_munmap (sysenter_do_call)
>  threaded-2771  1....   66us : profile_munmap (sys_munmap)
>  threaded-2771  1....   66us : blocking_notifier_call_chain (profile_munmap)
>  threaded-2771  1....   66us : rt_down_read (blocking_notifier_call_chain)
> 
> ouch! a global rw-semaphore taken in one of the most 
> performance-sensitive codepaths of the kernel. And i dont even have 
> oprofile enabled! All distro kernels have CONFIG_PROFILING enabled, so 
> this scalability problem affects the majority of Linux users.
> 
> The fix is to enhance blocking_notifier_call_chain() to only take the 
> lock if there appears to be work on the call-chain.
> 
> With this patch applied i get nicely saturated system, and much higher 
> munmap performance, on SMP systems.
> 
> And as a bonus this also fixes a similar scalability bottleneck in the 
> thread-exit codepath: profile_task_exit() ...
> 
> Signed-off-by: Ingo Molnar <mingo@elte.hu>

Acked-by: Peter Zijlstra <a.p.zijlstra@chello.nl>

> ---
>  kernel/sys.c |   15 +++++++++++----
>  1 file changed, 11 insertions(+), 4 deletions(-)
> 
> Index: linux/kernel/sys.c
> ===================================================================
> --- linux.orig/kernel/sys.c
> +++ linux/kernel/sys.c
> @@ -325,11 +325,18 @@ EXPORT_SYMBOL_GPL(blocking_notifier_chai
>  int blocking_notifier_call_chain(struct blocking_notifier_head *nh,
>  		unsigned long val, void *v)
>  {
> -	int ret;
> +	int ret = NOTIFY_DONE;
>  
> -	down_read(&nh->rwsem);
> -	ret = notifier_call_chain(&nh->head, val, v);
> -	up_read(&nh->rwsem);
> +	/*
> +	 * We check the head outside the lock, but if this access is
> +	 * racy then it does not matter what the result of the test
> +	 * is, we re-check the list after having taken the lock anyway:
> +	 */
> +	if (rcu_dereference(nh->head)) {
> +		down_read(&nh->rwsem);
> +		ret = notifier_call_chain(&nh->head, val, v);
> +		up_read(&nh->rwsem);
> +	}
>  	return ret;
>  }



^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [patch] notifiers: fix blocking_notifier_call_chain() scalability
  2007-01-23 11:24 ` Peter Zijlstra
@ 2007-01-23 15:11   ` Nick Piggin
  0 siblings, 0 replies; 3+ messages in thread
From: Nick Piggin @ 2007-01-23 15:11 UTC (permalink / raw)
  To: Peter Zijlstra; +Cc: Ingo Molnar, linux-kernel, Linus Torvalds, Andrew Morton

Peter Zijlstra wrote:
> On Tue, 2007-01-23 at 10:45 +0100, Ingo Molnar wrote:

>>The fix is to enhance blocking_notifier_call_chain() to only take the 
>>lock if there appears to be work on the call-chain.
>>
>>With this patch applied i get nicely saturated system, and much higher 
>>munmap performance, on SMP systems.
>>
>>And as a bonus this also fixes a similar scalability bottleneck in the 
>>thread-exit codepath: profile_task_exit() ...
>>
>>Signed-off-by: Ingo Molnar <mingo@elte.hu>
> 
> 
> Acked-by: Peter Zijlstra <a.p.zijlstra@chello.nl>
> 
>>---
>> kernel/sys.c |   15 +++++++++++----
>> 1 file changed, 11 insertions(+), 4 deletions(-)
>>
>>Index: linux/kernel/sys.c
>>===================================================================
>>--- linux.orig/kernel/sys.c
>>+++ linux/kernel/sys.c
>>@@ -325,11 +325,18 @@ EXPORT_SYMBOL_GPL(blocking_notifier_chai
>> int blocking_notifier_call_chain(struct blocking_notifier_head *nh,
>> 		unsigned long val, void *v)
>> {
>>-	int ret;
>>+	int ret = NOTIFY_DONE;
>> 
>>-	down_read(&nh->rwsem);
>>-	ret = notifier_call_chain(&nh->head, val, v);
>>-	up_read(&nh->rwsem);
>>+	/*
>>+	 * We check the head outside the lock, but if this access is
>>+	 * racy then it does not matter what the result of the test
>>+	 * is, we re-check the list after having taken the lock anyway:
>>+	 */

Great idea!

>>+	if (rcu_dereference(nh->head)) {

Except rcu_dereference() is not needed.

>>+		down_read(&nh->rwsem);
>>+		ret = notifier_call_chain(&nh->head, val, v);
>>+		up_read(&nh->rwsem);
>>+	}
>> 	return ret;
>> }

-- 
SUSE Labs, Novell Inc.
Send instant messages to your online friends http://au.messenger.yahoo.com 

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2007-01-23 15:11 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2007-01-23  9:45 [patch] notifiers: fix blocking_notifier_call_chain() scalability Ingo Molnar
2007-01-23 11:24 ` Peter Zijlstra
2007-01-23 15:11   ` Nick Piggin

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox