* [patch] notifiers: fix blocking_notifier_call_chain() scalability
From: Ingo Molnar @ 2007-01-23 9:45 UTC (permalink / raw)
To: linux-kernel; +Cc: Linus Torvalds, Andrew Morton
Subject: [patch] notifiers: fix blocking_notifier_call_chain() scalability
From: Ingo Molnar <mingo@elte.hu>

while lock-profiling the -rt kernel i noticed weird contention during
mmap-intense workloads, and the tracer showed the following gem, in one
of our MM hotpaths:

 threaded-2771 1.... 65us : sys_munmap (sysenter_do_call)
 threaded-2771 1.... 66us : profile_munmap (sys_munmap)
 threaded-2771 1.... 66us : blocking_notifier_call_chain (profile_munmap)
 threaded-2771 1.... 66us : rt_down_read (blocking_notifier_call_chain)

ouch! a global rw-semaphore taken in one of the most
performance-sensitive codepaths of the kernel. And i don't even have
oprofile enabled! All distro kernels have CONFIG_PROFILING enabled, so
this scalability problem affects the majority of Linux users.

The fix is to enhance blocking_notifier_call_chain() to only take the
lock if there appears to be work on the call-chain.

With this patch applied i get a nicely saturated system, and much higher
munmap performance, on SMP systems.

And as a bonus this also fixes a similar scalability bottleneck in the
thread-exit codepath: profile_task_exit() ...

Signed-off-by: Ingo Molnar <mingo@elte.hu>
---
 kernel/sys.c |   15 +++++++++++----
 1 file changed, 11 insertions(+), 4 deletions(-)

Index: linux/kernel/sys.c
===================================================================
--- linux.orig/kernel/sys.c
+++ linux/kernel/sys.c
@@ -325,11 +325,18 @@ EXPORT_SYMBOL_GPL(blocking_notifier_chai
 int blocking_notifier_call_chain(struct blocking_notifier_head *nh,
 		unsigned long val, void *v)
 {
-	int ret;
+	int ret = NOTIFY_DONE;
 
-	down_read(&nh->rwsem);
-	ret = notifier_call_chain(&nh->head, val, v);
-	up_read(&nh->rwsem);
+	/*
+	 * We check the head outside the lock, but if this access is
+	 * racy then it does not matter what the result of the test
+	 * is, we re-check the list after having taken the lock anyway:
+	 */
+	if (rcu_dereference(nh->head)) {
+		down_read(&nh->rwsem);
+		ret = notifier_call_chain(&nh->head, val, v);
+		up_read(&nh->rwsem);
+	}
 
 	return ret;
 }

* Re: [patch] notifiers: fix blocking_notifier_call_chain() scalability
From: Peter Zijlstra @ 2007-01-23 11:24 UTC (permalink / raw)
To: Ingo Molnar; +Cc: linux-kernel, Linus Torvalds, Andrew Morton
On Tue, 2007-01-23 at 10:45 +0100, Ingo Molnar wrote:
> Subject: [patch] notifiers: fix blocking_notifier_call_chain() scalability
> From: Ingo Molnar <mingo@elte.hu>
>
> while lock-profiling the -rt kernel i noticed weird contention during
> mmap-intense workloads, and the tracer showed the following gem, in one
> of our MM hotpaths:
>
> threaded-2771 1.... 65us : sys_munmap (sysenter_do_call)
> threaded-2771 1.... 66us : profile_munmap (sys_munmap)
> threaded-2771 1.... 66us : blocking_notifier_call_chain (profile_munmap)
> threaded-2771 1.... 66us : rt_down_read (blocking_notifier_call_chain)
>
> ouch! a global rw-semaphore taken in one of the most
> performance-sensitive codepaths of the kernel. And i don't even have
> oprofile enabled! All distro kernels have CONFIG_PROFILING enabled, so
> this scalability problem affects the majority of Linux users.
>
> The fix is to enhance blocking_notifier_call_chain() to only take the
> lock if there appears to be work on the call-chain.
>
> With this patch applied i get a nicely saturated system, and much higher
> munmap performance, on SMP systems.
>
> And as a bonus this also fixes a similar scalability bottleneck in the
> thread-exit codepath: profile_task_exit() ...
>
> Signed-off-by: Ingo Molnar <mingo@elte.hu>
Acked-by: Peter Zijlstra <a.p.zijlstra@chello.nl>

* Re: [patch] notifiers: fix blocking_notifier_call_chain() scalability
From: Nick Piggin @ 2007-01-23 15:11 UTC (permalink / raw)
To: Peter Zijlstra; +Cc: Ingo Molnar, linux-kernel, Linus Torvalds, Andrew Morton
Peter Zijlstra wrote:
> On Tue, 2007-01-23 at 10:45 +0100, Ingo Molnar wrote:
>>The fix is to enhance blocking_notifier_call_chain() to only take the
>>lock if there appears to be work on the call-chain.
>>
>>With this patch applied i get a nicely saturated system, and much higher
>>munmap performance, on SMP systems.
>>
>>And as a bonus this also fixes a similar scalability bottleneck in the
>>thread-exit codepath: profile_task_exit() ...
>>
>>Signed-off-by: Ingo Molnar <mingo@elte.hu>
>
>
> Acked-by: Peter Zijlstra <a.p.zijlstra@chello.nl>
>
>>---
>> kernel/sys.c | 15 +++++++++++----
>> 1 file changed, 11 insertions(+), 4 deletions(-)
>>
>>Index: linux/kernel/sys.c
>>===================================================================
>>--- linux.orig/kernel/sys.c
>>+++ linux/kernel/sys.c
>>@@ -325,11 +325,18 @@ EXPORT_SYMBOL_GPL(blocking_notifier_chai
>> int blocking_notifier_call_chain(struct blocking_notifier_head *nh,
>> unsigned long val, void *v)
>> {
>>- int ret;
>>+ int ret = NOTIFY_DONE;
>>
>>- down_read(&nh->rwsem);
>>- ret = notifier_call_chain(&nh->head, val, v);
>>- up_read(&nh->rwsem);
>>+ /*
>>+ * We check the head outside the lock, but if this access is
>>+ * racy then it does not matter what the result of the test
>>+ * is, we re-check the list after having taken the lock anyway:
>>+ */

Great idea!

>>+ if (rcu_dereference(nh->head)) {

Except rcu_dereference() is not needed.

>>+ down_read(&nh->rwsem);
>>+ ret = notifier_call_chain(&nh->head, val, v);
>>+ up_read(&nh->rwsem);
>>+ }
>> return ret;
>> }
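
[Editor's sketch, not part of the original message.] The reply does not say
why rcu_dereference() is unnecessary. One plausible reading, offered here as
an assumption, is that rcu_dereference() exists to order a pointer load
against later reads *through* that pointer, whereas the patch only compares
the pointer with NULL. A userspace C11 analogue of that distinction, with
hypothetical names throughout:

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

struct demo_item {
	int payload;
};

/* Hypothetical published pointer, playing the role of nh->head. */
static _Atomic(struct demo_item *) demo_slot;

/* Publisher: initialise the item first, then make the pointer visible. */
static void demo_publish(struct demo_item *p)
{
	assert(p != NULL);
	atomic_store_explicit(&demo_slot, p, memory_order_release);
}

/*
 * Only tests for NULL and never reads *p, so no ordering against the
 * item's contents is required: a relaxed load is sufficient.
 */
static bool demo_slot_is_set(void)
{
	return atomic_load_explicit(&demo_slot, memory_order_relaxed) != NULL;
}

/*
 * Reads through the pointer, so the load must be ordered after the
 * publisher's initialisation; acquire is used here for simplicity.
 */
static int demo_slot_read(void)
{
	struct demo_item *p =
		atomic_load_explicit(&demo_slot, memory_order_acquire);

	return p ? p->payload : -1;
}
```

In the patch, the code that actually walks the chain runs under the
rw-semaphore, so the unlocked test corresponds to demo_slot_is_set() rather
than demo_slot_read().
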
--
SUSE Labs, Novell Inc.