* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 5:52 ` [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary KOSAKI Motohiro
@ 2010-01-04 6:03 ` Minchan Kim
2010-01-04 6:11 ` Huang Shijie
` (2 subsequent siblings)
3 siblings, 0 replies; 17+ messages in thread
From: Minchan Kim @ 2010-01-04 6:03 UTC (permalink / raw)
To: KOSAKI Motohiro; +Cc: Minchan Kim, Huang Shijie, akpm, mel, linux-mm
On Mon, 4 Jan 2010 14:52:36 +0900 (JST)
KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> wrote:
> > Hi, Huang.
> >
> > On Mon, 4 Jan 2010 10:22:10 +0800
> > Huang Shijie <shijie8@gmail.com> wrote:
> >
> > > When the `page' returned by __rmqueue() is NULL, the origin code
> > > still adds -(1 << order) to zone's NR_FREE_PAGES item.
> > >
> > > The patch fixes it.
> > >
> > > Signed-off-by: Huang Shijie <shijie8@gmail.com>
> > > ---
> > > mm/page_alloc.c | 10 +++++++---
> > > 1 files changed, 7 insertions(+), 3 deletions(-)
> > >
> > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> > > index 4e9f5cc..620921d 100644
> > > --- a/mm/page_alloc.c
> > > +++ b/mm/page_alloc.c
> > > @@ -1222,10 +1222,14 @@ again:
> > > }
> > > spin_lock_irqsave(&zone->lock, flags);
> > > page = __rmqueue(zone, order, migratetype);
> > > - __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> > > - spin_unlock(&zone->lock);
> > > - if (!page)
> > > + if (likely(page)) {
> > > + __mod_zone_page_state(zone, NR_FREE_PAGES,
> > > + -(1 << order));
> > > + spin_unlock(&zone->lock);
> > > + } else {
> > > + spin_unlock(&zone->lock);
> > > goto failed;
> > > + }
> > > }
> > >
> > > __count_zone_vm_events(PGALLOC, zone, 1 << order);
> >
> > I think it's not desirable to add new branch in hot-path even though
> > we could avoid that.
> >
> > How about this?
> >
> > diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> > index 4e4b5b3..87976ad 100644
> > --- a/mm/page_alloc.c
> > +++ b/mm/page_alloc.c
> > @@ -1244,6 +1244,9 @@ again:
> > return page;
> >
> > failed:
> > + spin_lock(&zone->lock);
> > + __mod_zone_page_state(zone, NR_FREE_PAGES, 1 << order);
> > + spin_unlock(&zone->lock);
> > local_irq_restore(flags);
> > put_cpu();
> > return NULL;
>
> Why can't we write following? __mod_zone_page_state() only require irq
> disabling, it doesn't need spin lock. I think.
That's true. I missed that :)
>
>
>
> From 72011ff2b0bba6544ae35c6ee52715c8c824a34b Mon Sep 17 00:00:00 2001
> From: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
> Date: Mon, 4 Jan 2010 14:38:20 +0900
> Subject: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
>
> commit f2260e6b (page allocator: update NR_FREE_PAGES only as necessary)
> made one minor regression.
> if __rmqueue() was failed, NR_FREE_PAGES stat go wrong. this patch fixes
> it.
>
> Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
> Cc: Mel Gorman <mel@csn.ul.ie>
> Cc: Huang Shijie <shijie8@gmail.com>
> Cc: Minchan Kim <minchan.kim@gmail.com>
Reviewed-by : Minchan Kim <minchan.kim@gmail.com>
--
Kind regards,
Minchan Kim
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 5:52 ` [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary KOSAKI Motohiro
2010-01-04 6:03 ` Minchan Kim
@ 2010-01-04 6:11 ` Huang Shijie
2010-01-04 6:16 ` KOSAKI Motohiro
2010-01-04 9:58 ` Mel Gorman
2010-01-08 23:02 ` Andrew Morton
3 siblings, 1 reply; 17+ messages in thread
From: Huang Shijie @ 2010-01-04 6:11 UTC (permalink / raw)
To: KOSAKI Motohiro; +Cc: Minchan Kim, akpm, mel, linux-mm
> Why can't we write following? __mod_zone_page_state() only require irq
> disabling, it doesn't need spin lock. I think.
>
>
>
struct per_cpu_pageset {
.................................................
#ifdef CONFIG_SMP
s8 stat_threshold;
s8 vm_stat_diff[NR_VM_ZONE_STAT_ITEMS];
#endif
} ____cacheline_aligned_in_smp;
The field 'stat_threshold' is in the CONFIG_SMP macro, does it not need
the spinlock? I will read the code more carefully.
I saw the macro, so I thought it need the spinlock. :)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 6:11 ` Huang Shijie
@ 2010-01-04 6:16 ` KOSAKI Motohiro
2010-01-04 9:35 ` Huang Shijie
0 siblings, 1 reply; 17+ messages in thread
From: KOSAKI Motohiro @ 2010-01-04 6:16 UTC (permalink / raw)
To: Huang Shijie; +Cc: kosaki.motohiro, Minchan Kim, akpm, mel, linux-mm
>
> > Why can't we write following? __mod_zone_page_state() only require irq
> > disabling, it doesn't need spin lock. I think.
> >
> >
> >
> struct per_cpu_pageset {
> .................................................
> #ifdef CONFIG_SMP
> s8 stat_threshold;
> s8 vm_stat_diff[NR_VM_ZONE_STAT_ITEMS];
> #endif
> } ____cacheline_aligned_in_smp;
>
> The field 'stat_threshold' is in the CONFIG_SMP macro, does it not need
> the spinlock? I will read the code more carefully.
> I saw the macro, so I thought it need the spinlock. :)
Generally, per-cpu data isn't accessed from another cpu. it only need to care
process-context vs irq-context race.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 6:16 ` KOSAKI Motohiro
@ 2010-01-04 9:35 ` Huang Shijie
2010-01-05 7:24 ` KOSAKI Motohiro
0 siblings, 1 reply; 17+ messages in thread
From: Huang Shijie @ 2010-01-04 9:35 UTC (permalink / raw)
To: KOSAKI Motohiro; +Cc: Minchan Kim, akpm, mel, linux-mm
>
>> struct per_cpu_pageset {
>> .................................................
>> #ifdef CONFIG_SMP
>> s8 stat_threshold;
>> s8 vm_stat_diff[NR_VM_ZONE_STAT_ITEMS];
>> #endif
>> } ____cacheline_aligned_in_smp;
>>
>> The field 'stat_threshold' is in the CONFIG_SMP macro, does it not need
>> the spinlock? I will read the code more carefully.
>> I saw the macro, so I thought it need the spinlock. :)
>>
> Generally, per-cpu data isn't accessed from another cpu. it only need to care
> process-context vs irq-context race.
>
>
>
If the __mod_zone_page_state() can be used without caring about the
spinlock, I think there
are several places we can move __mod_zone_page_state() out the guard
area of spinlock to
release the pressure of the zone->lock,such as in rmqueue_bulk().
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 9:35 ` Huang Shijie
@ 2010-01-05 7:24 ` KOSAKI Motohiro
0 siblings, 0 replies; 17+ messages in thread
From: KOSAKI Motohiro @ 2010-01-05 7:24 UTC (permalink / raw)
To: Huang Shijie; +Cc: kosaki.motohiro, Minchan Kim, akpm, mel, linux-mm
> >
> >> struct per_cpu_pageset {
> >> .................................................
> >> #ifdef CONFIG_SMP
> >> s8 stat_threshold;
> >> s8 vm_stat_diff[NR_VM_ZONE_STAT_ITEMS];
> >> #endif
> >> } ____cacheline_aligned_in_smp;
> >>
> >> The field 'stat_threshold' is in the CONFIG_SMP macro, does it not need
> >> the spinlock? I will read the code more carefully.
> >> I saw the macro, so I thought it need the spinlock. :)
> >>
> > Generally, per-cpu data isn't accessed from another cpu. it only need to care
> > process-context vs irq-context race.
> >
> If the __mod_zone_page_state() can be used without caring about the
> spinlock, I think there
> are several places we can move __mod_zone_page_state() out the guard
> area of spinlock to
> release the pressure of the zone->lock,such as in rmqueue_bulk().
Welcome to your patch :)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 5:52 ` [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary KOSAKI Motohiro
2010-01-04 6:03 ` Minchan Kim
2010-01-04 6:11 ` Huang Shijie
@ 2010-01-04 9:58 ` Mel Gorman
2010-01-04 17:46 ` Christoph Lameter
2010-01-05 1:08 ` KOSAKI Motohiro
2010-01-08 23:02 ` Andrew Morton
3 siblings, 2 replies; 17+ messages in thread
From: Mel Gorman @ 2010-01-04 9:58 UTC (permalink / raw)
To: KOSAKI Motohiro
Cc: Minchan Kim, Huang Shijie, akpm, linux-mm, Christoph Lameter
On Mon, Jan 04, 2010 at 02:52:36PM +0900, KOSAKI Motohiro wrote:
> > Hi, Huang.
> >
> > On Mon, 4 Jan 2010 10:22:10 +0800
> > Huang Shijie <shijie8@gmail.com> wrote:
> >
> > > When the `page' returned by __rmqueue() is NULL, the origin code
> > > still adds -(1 << order) to zone's NR_FREE_PAGES item.
> > >
> > > The patch fixes it.
> > >
> > > Signed-off-by: Huang Shijie <shijie8@gmail.com>
> > > ---
> > > mm/page_alloc.c | 10 +++++++---
> > > 1 files changed, 7 insertions(+), 3 deletions(-)
> > >
> > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> > > index 4e9f5cc..620921d 100644
> > > --- a/mm/page_alloc.c
> > > +++ b/mm/page_alloc.c
> > > @@ -1222,10 +1222,14 @@ again:
> > > }
> > > spin_lock_irqsave(&zone->lock, flags);
> > > page = __rmqueue(zone, order, migratetype);
> > > - __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> > > - spin_unlock(&zone->lock);
> > > - if (!page)
> > > + if (likely(page)) {
> > > + __mod_zone_page_state(zone, NR_FREE_PAGES,
> > > + -(1 << order));
> > > + spin_unlock(&zone->lock);
> > > + } else {
> > > + spin_unlock(&zone->lock);
> > > goto failed;
> > > + }
> > > }
> > >
> > > __count_zone_vm_events(PGALLOC, zone, 1 << order);
> >
> > I think it's not desirable to add new branch in hot-path even though
> > we could avoid that.
> >
> > How about this?
> >
> > diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> > index 4e4b5b3..87976ad 100644
> > --- a/mm/page_alloc.c
> > +++ b/mm/page_alloc.c
> > @@ -1244,6 +1244,9 @@ again:
> > return page;
> >
> > failed:
> > + spin_lock(&zone->lock);
> > + __mod_zone_page_state(zone, NR_FREE_PAGES, 1 << order);
> > + spin_unlock(&zone->lock);
> > local_irq_restore(flags);
> > put_cpu();
> > return NULL;
>
> Why can't we write following? __mod_zone_page_state() only require irq
> disabling, it doesn't need spin lock. I think.
>
Adding Christoph to be sure but yes, as this is a per-cpu variable it
should be safe to update with __mod_zone_page_state() as long as
interrupts and preempt are disabled. If true, then this is a neater fix
and is also needed for -stable 2.6.31 and 2.6.32.
Well spotted and thanks.
> From 72011ff2b0bba6544ae35c6ee52715c8c824a34b Mon Sep 17 00:00:00 2001
> From: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
> Date: Mon, 4 Jan 2010 14:38:20 +0900
> Subject: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
>
> commit f2260e6b (page allocator: update NR_FREE_PAGES only as necessary)
> made one minor regression.
> if __rmqueue() was failed, NR_FREE_PAGES stat go wrong. this patch fixes
> it.
>
> Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
> Cc: Mel Gorman <mel@csn.ul.ie>
> Cc: Huang Shijie <shijie8@gmail.com>
> Cc: Minchan Kim <minchan.kim@gmail.com>
> ---
> mm/page_alloc.c | 2 +-
> 1 files changed, 1 insertions(+), 1 deletions(-)
>
> diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> index 11ae66e..ecf75a1 100644
> --- a/mm/page_alloc.c
> +++ b/mm/page_alloc.c
> @@ -1227,10 +1227,10 @@ again:
> }
> spin_lock_irqsave(&zone->lock, flags);
> page = __rmqueue(zone, order, migratetype);
> - __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> spin_unlock(&zone->lock);
> if (!page)
> goto failed;
> + __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> }
>
> __count_zone_vm_events(PGALLOC, zone, 1 << order);
> --
> 1.6.5.2
>
>
>
>
--
Mel Gorman
Part-time Phd Student Linux Technology Center
University of Limerick IBM Dublin Software Lab
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 9:58 ` Mel Gorman
@ 2010-01-04 17:46 ` Christoph Lameter
2010-01-05 1:08 ` KOSAKI Motohiro
1 sibling, 0 replies; 17+ messages in thread
From: Christoph Lameter @ 2010-01-04 17:46 UTC (permalink / raw)
To: Mel Gorman; +Cc: KOSAKI Motohiro, Minchan Kim, Huang Shijie, akpm, linux-mm
On Mon, 4 Jan 2010, Mel Gorman wrote:
> > Why can't we write following? __mod_zone_page_state() only require irq
> > disabling, it doesn't need spin lock. I think.
Correct.
> > commit f2260e6b (page allocator: update NR_FREE_PAGES only as necessary)
> > made one minor regression.
> > if __rmqueue() was failed, NR_FREE_PAGES stat go wrong. this patch fixes
> > it.
> >
> > Signed-off-by: KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>
> > Cc: Mel Gorman <mel@csn.ul.ie>
> > Cc: Huang Shijie <shijie8@gmail.com>
> > Cc: Minchan Kim <minchan.kim@gmail.com>
> > ---
> > mm/page_alloc.c | 2 +-
> > 1 files changed, 1 insertions(+), 1 deletions(-)
> >
> > diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> > index 11ae66e..ecf75a1 100644
> > --- a/mm/page_alloc.c
> > +++ b/mm/page_alloc.c
> > @@ -1227,10 +1227,10 @@ again:
> > }
> > spin_lock_irqsave(&zone->lock, flags);
> > page = __rmqueue(zone, order, migratetype);
> > - __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> > spin_unlock(&zone->lock);
> > if (!page)
> > goto failed;
> > + __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> > }
> >
> > __count_zone_vm_events(PGALLOC, zone, 1 << order);
Reviewed-by: Christoph Lameter <cl@linux-foundation.org>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 9:58 ` Mel Gorman
2010-01-04 17:46 ` Christoph Lameter
@ 2010-01-05 1:08 ` KOSAKI Motohiro
1 sibling, 0 replies; 17+ messages in thread
From: KOSAKI Motohiro @ 2010-01-05 1:08 UTC (permalink / raw)
To: Mel Gorman
Cc: kosaki.motohiro, Minchan Kim, Huang Shijie, akpm, linux-mm,
Christoph Lameter
> > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> > > index 4e4b5b3..87976ad 100644
> > > --- a/mm/page_alloc.c
> > > +++ b/mm/page_alloc.c
> > > @@ -1244,6 +1244,9 @@ again:
> > > return page;
> > >
> > > failed:
> > > + spin_lock(&zone->lock);
> > > + __mod_zone_page_state(zone, NR_FREE_PAGES, 1 << order);
> > > + spin_unlock(&zone->lock);
> > > local_irq_restore(flags);
> > > put_cpu();
> > > return NULL;
> >
> > Why can't we write following? __mod_zone_page_state() only require irq
> > disabling, it doesn't need spin lock. I think.
> >
>
> Adding Christoph to be sure but yes, as this is a per-cpu variable it
> should be safe to update with __mod_zone_page_state() as long as
> interrupts and preempt are disabled. If true, then this is a neater fix
> and is also needed for -stable 2.6.31 and 2.6.32.
>
> Well spotted and thanks.
Yes, it should be sent to -stable tree. I hope this fix also solve recent mysterious
allocation failure problem ;-)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread
* Re: [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary
2010-01-04 5:52 ` [PATCH] page allocator: fix update NR_FREE_PAGES only as necessary KOSAKI Motohiro
` (2 preceding siblings ...)
2010-01-04 9:58 ` Mel Gorman
@ 2010-01-08 23:02 ` Andrew Morton
3 siblings, 0 replies; 17+ messages in thread
From: Andrew Morton @ 2010-01-08 23:02 UTC (permalink / raw)
To: KOSAKI Motohiro; +Cc: Minchan Kim, Huang Shijie, mel, linux-mm
On Mon, 4 Jan 2010 14:52:36 +0900 (JST)
KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com> wrote:
> --- a/mm/page_alloc.c
> +++ b/mm/page_alloc.c
> @@ -1227,10 +1227,10 @@ again:
> }
> spin_lock_irqsave(&zone->lock, flags);
> page = __rmqueue(zone, order, migratetype);
> - __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> spin_unlock(&zone->lock);
> if (!page)
> goto failed;
> + __mod_zone_page_state(zone, NR_FREE_PAGES, -(1 << order));
> }
>
> __count_zone_vm_events(PGALLOC, zone, 1 << order);
hm, yes, OK, obviously better.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 17+ messages in thread