linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [patch] mm, thp: drop page_table_lock to uncharge memcg pages
@ 2012-04-26 22:57 David Rientjes
  2012-04-26 23:39 ` Andrew Morton
  2012-04-26 23:57 ` Johannes Weiner
  0 siblings, 2 replies; 5+ messages in thread
From: David Rientjes @ 2012-04-26 22:57 UTC (permalink / raw)
  To: Andrew Morton
  Cc: KAMEZAWA Hiroyuki, Andrea Arcangeli, Johannes Weiner, linux-mm

mm->page_table_lock is hotly contested for page fault tests and isn't
necessary to do mem_cgroup_uncharge_page() in do_huge_pmd_wp_page().

Signed-off-by: David Rientjes <rientjes@google.com>
---
 mm/huge_memory.c |    2 ++
 1 file changed, 2 insertions(+)

diff --git a/mm/huge_memory.c b/mm/huge_memory.c
--- a/mm/huge_memory.c
+++ b/mm/huge_memory.c
@@ -968,8 +968,10 @@ int do_huge_pmd_wp_page(struct mm_struct *mm, struct vm_area_struct *vma,
 	spin_lock(&mm->page_table_lock);
 	put_page(page);
 	if (unlikely(!pmd_same(*pmd, orig_pmd))) {
+		spin_unlock(&mm->page_table_lock);
 		mem_cgroup_uncharge_page(new_page);
 		put_page(new_page);
+		goto out;
 	} else {
 		pmd_t entry;
 		VM_BUG_ON(!PageHead(page));

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [patch] mm, thp: drop page_table_lock to uncharge memcg pages
  2012-04-26 22:57 [patch] mm, thp: drop page_table_lock to uncharge memcg pages David Rientjes
@ 2012-04-26 23:39 ` Andrew Morton
  2012-04-26 23:44   ` David Rientjes
  2012-04-26 23:57 ` Johannes Weiner
  1 sibling, 1 reply; 5+ messages in thread
From: Andrew Morton @ 2012-04-26 23:39 UTC (permalink / raw)
  To: David Rientjes
  Cc: KAMEZAWA Hiroyuki, Andrea Arcangeli, Johannes Weiner, linux-mm

On Thu, 26 Apr 2012 15:57:30 -0700 (PDT)
David Rientjes <rientjes@google.com> wrote:

> mm->page_table_lock is hotly contested for page fault tests and isn't
> necessary to do mem_cgroup_uncharge_page() in do_huge_pmd_wp_page().
> 
> ...
>
> --- a/mm/huge_memory.c
> +++ b/mm/huge_memory.c
> @@ -968,8 +968,10 @@ int do_huge_pmd_wp_page(struct mm_struct *mm, struct vm_area_struct *vma,
>  	spin_lock(&mm->page_table_lock);
>  	put_page(page);
>  	if (unlikely(!pmd_same(*pmd, orig_pmd))) {
> +		spin_unlock(&mm->page_table_lock);
>  		mem_cgroup_uncharge_page(new_page);
>  		put_page(new_page);
> +		goto out;
>  	} else {
>  		pmd_t entry;
>  		VM_BUG_ON(!PageHead(page));

But this is on the basically-never-happens race path and will surely have no
measurable benefit?

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [patch] mm, thp: drop page_table_lock to uncharge memcg pages
  2012-04-26 23:39 ` Andrew Morton
@ 2012-04-26 23:44   ` David Rientjes
  2012-04-27 19:00     ` Andrea Arcangeli
  0 siblings, 1 reply; 5+ messages in thread
From: David Rientjes @ 2012-04-26 23:44 UTC (permalink / raw)
  To: Andrew Morton
  Cc: KAMEZAWA Hiroyuki, Andrea Arcangeli, Johannes Weiner, linux-mm

On Thu, 26 Apr 2012, Andrew Morton wrote:

> > mm->page_table_lock is hotly contested for page fault tests and isn't
> > necessary to do mem_cgroup_uncharge_page() in do_huge_pmd_wp_page().
> > 
> > ...
> >
> > --- a/mm/huge_memory.c
> > +++ b/mm/huge_memory.c
> > @@ -968,8 +968,10 @@ int do_huge_pmd_wp_page(struct mm_struct *mm, struct vm_area_struct *vma,
> >  	spin_lock(&mm->page_table_lock);
> >  	put_page(page);
> >  	if (unlikely(!pmd_same(*pmd, orig_pmd))) {
> > +		spin_unlock(&mm->page_table_lock);
> >  		mem_cgroup_uncharge_page(new_page);
> >  		put_page(new_page);
> > +		goto out;
> >  	} else {
> >  		pmd_t entry;
> >  		VM_BUG_ON(!PageHead(page));
> 
> But this is on the basically-never-happens race path and will surely have no
> measurable benefit?
> 

It happens more often than you may think on page fault tests; how 
representative pft has ever been of actual workloads, especially with thp 
where the benfits of allocating the hugepage usually result in better 
performance in the long-term even for a short-term performance loss, is 
debatable.  However, all other thp code has always dropped 
mm->page_table_lock before calling mem_cgroup_uncharge_page() and this one 
seems to have been missed.  Worth correcting, in my opinion.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [patch] mm, thp: drop page_table_lock to uncharge memcg pages
  2012-04-26 22:57 [patch] mm, thp: drop page_table_lock to uncharge memcg pages David Rientjes
  2012-04-26 23:39 ` Andrew Morton
@ 2012-04-26 23:57 ` Johannes Weiner
  1 sibling, 0 replies; 5+ messages in thread
From: Johannes Weiner @ 2012-04-26 23:57 UTC (permalink / raw)
  To: David Rientjes
  Cc: Andrew Morton, KAMEZAWA Hiroyuki, Andrea Arcangeli, linux-mm

On Thu, Apr 26, 2012 at 03:57:30PM -0700, David Rientjes wrote:
> mm->page_table_lock is hotly contested for page fault tests and isn't
> necessary to do mem_cgroup_uncharge_page() in do_huge_pmd_wp_page().
> 
> Signed-off-by: David Rientjes <rientjes@google.com>

Probably rare, but since it's just two lines and the uncharge path
really does a ridiculous amount of things, I'm happy with moving it
out of the locked section.

Acked-by: Johannes Weiner <hannes@cmpxchg.org>

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [patch] mm, thp: drop page_table_lock to uncharge memcg pages
  2012-04-26 23:44   ` David Rientjes
@ 2012-04-27 19:00     ` Andrea Arcangeli
  0 siblings, 0 replies; 5+ messages in thread
From: Andrea Arcangeli @ 2012-04-27 19:00 UTC (permalink / raw)
  To: David Rientjes
  Cc: Andrew Morton, KAMEZAWA Hiroyuki, Johannes Weiner, linux-mm

On Thu, Apr 26, 2012 at 04:44:16PM -0700, David Rientjes wrote:
> On Thu, 26 Apr 2012, Andrew Morton wrote:
> 
> > > mm->page_table_lock is hotly contested for page fault tests and isn't
> > > necessary to do mem_cgroup_uncharge_page() in do_huge_pmd_wp_page().
> > > 
> > > ...
> > >
> > > --- a/mm/huge_memory.c
> > > +++ b/mm/huge_memory.c
> > > @@ -968,8 +968,10 @@ int do_huge_pmd_wp_page(struct mm_struct *mm, struct vm_area_struct *vma,
> > >  	spin_lock(&mm->page_table_lock);
> > >  	put_page(page);
> > >  	if (unlikely(!pmd_same(*pmd, orig_pmd))) {
> > > +		spin_unlock(&mm->page_table_lock);
> > >  		mem_cgroup_uncharge_page(new_page);
> > >  		put_page(new_page);
> > > +		goto out;
> > >  	} else {
> > >  		pmd_t entry;
> > >  		VM_BUG_ON(!PageHead(page));
> > 
> > But this is on the basically-never-happens race path and will surely have no
> > measurable benefit?
> > 

Even if it has no measurable benefit, it's still an ok
microoptimization as it can't slow down anything, it introduces a
slight different jump for the slow path but it shouldn't matter. So it
looks ok to me.

Reviewed-by: Andrea Arcangeli <aarcange@redhat.com>

> It happens more often than you may think on page fault tests; how 
> representative pft has ever been of actual workloads, especially with thp 
> where the benfits of allocating the hugepage usually result in better 
> performance in the long-term even for a short-term performance loss, is 
> debatable.  However, all other thp code has always dropped 
> mm->page_table_lock before calling mem_cgroup_uncharge_page() and this one 
> seems to have been missed.  Worth correcting, in my opinion.

If we take single threaded programs into account too, THP gives a
major boosts to the page faults too, a memset on a uninitialized area
with THP enabled on some CPUs it can run more than twice as fast
depending on the CPU cache sizes. If the access is random and not
sequential cache effects can make it slightly slower though.

I certainly agree the main focus here is not the page fault, but it's
still worth to optimize the page fault of course.

With concurrent threads and THP faults, the increased contention on
the page_table_lock on large-CPU systems could be mitigated with a
per-pmd lock but it would still be as coarse as 1G and it would
complicate the code a bit. If each thread address space is very big
and the threads aren't sharing much memory, it would make their page
faults SMP scale nicely though. Just an idea.

Thanks,
Andrea

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2012-04-27 19:00 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-04-26 22:57 [patch] mm, thp: drop page_table_lock to uncharge memcg pages David Rientjes
2012-04-26 23:39 ` Andrew Morton
2012-04-26 23:44   ` David Rientjes
2012-04-27 19:00     ` Andrea Arcangeli
2012-04-26 23:57 ` Johannes Weiner

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).