* [PATCH] improve speed of freeing bootmem
@ 2004-08-06 19:26 Josh Aas
2004-08-06 19:52 ` Andrew Morton
0 siblings, 1 reply; 4+ messages in thread
From: Josh Aas @ 2004-08-06 19:26 UTC (permalink / raw)
To: akpm, linux-kernel, linux-ia64
[-- Attachment #1: Type: text/plain, Size: 1361 bytes --]
Attached is a patch that greatly improves the speed of freeing boot
memory. On ia64 machines with 2GB or more memory (I didn't test with
less, but I can't imagine there being a problem), the speed improvement
is about 75% for the function free_all_bootmem_core. This translates to
savings on the order of 1 minute / TB of memory during boot time. That
number comes from testing on a machine with 512GB, and extrapolating
based on profiling of an unpatched 4TB machine. For 4 and 8 TB machines,
the time spent in this function is about 1 minutes/TB, which is painful
especially given that there is no indication of what is going on put to
the console (this issue to possibly be addressed later).
The basic idea is to free higher order pages instead of going through
every single one. Also, some unnecessary atomic operations are done away
with and replaced with non-atomic equivalents, and prefetching is done
where it helps the most. For a more in-depth discusion of this patch,
please see the linux-ia64 archives (topic is "free bootmem feedback patch").
The patch is originally Tony Luck's, and I added some further
optimizations (non-atomic ops improvements and prefetching).
Signed-off-by: Tony Luck <tony.luck@intel.com>
Signed-off-by: Josh Aas <josha@sgi.com>
--
Josh Aas
Silicon Graphics, Inc. (SGI)
Linux System Software
651-683-3068
[-- Attachment #2: bootmem4.patch --]
[-- Type: text/x-patch, Size: 2397 bytes --]
--- a/mm/bootmem.c 2004-08-05 15:33:39.000000000 -0500
+++ b/mm/bootmem.c 2004-08-06 13:42:33.000000000 -0500
@@ -259,6 +259,7 @@ static unsigned long __init free_all_boo
unsigned long i, count, total = 0;
unsigned long idx;
unsigned long *map;
+ int gofast = 0;
BUG_ON(!bdata->node_bootmem_map);
@@ -267,14 +268,32 @@ static unsigned long __init free_all_boo
page = virt_to_page(phys_to_virt(bdata->node_boot_start));
idx = bdata->node_low_pfn - (bdata->node_boot_start >> PAGE_SHIFT);
map = bdata->node_bootmem_map;
+ if (bdata->node_boot_start == 0 ||
+ ffs(bdata->node_boot_start) - PAGE_SHIFT > ffs(BITS_PER_LONG))
+ gofast = 1;
for (i = 0; i < idx; ) {
unsigned long v = ~map[i / BITS_PER_LONG];
- if (v) {
+ if (gofast && v == ~0UL) {
+ int j;
+
+ count += BITS_PER_LONG;
+ ClearPageReservedNoAtomic(page);
+ set_page_count(page, 1);
+ for (j = 1; j < BITS_PER_LONG; j++) {
+ if (j + 16 < BITS_PER_LONG) {
+ prefetchw(page + j + 16);
+ }
+ ClearPageReservedNoAtomic(page + j);
+ }
+ __free_pages(page, ffs(BITS_PER_LONG)-1);
+ i += BITS_PER_LONG;
+ page += BITS_PER_LONG;
+ } else if (v) {
unsigned long m;
for (m = 1; m && i < idx; m<<=1, page++, i++) {
if (v & m) {
count++;
- ClearPageReserved(page);
+ ClearPageReservedNoAtomic(page);
set_page_count(page, 1);
__free_page(page);
}
@@ -294,7 +313,7 @@ static unsigned long __init free_all_boo
count = 0;
for (i = 0; i < ((bdata->node_low_pfn-(bdata->node_boot_start >> PAGE_SHIFT))/8 + PAGE_SIZE-1)/PAGE_SIZE; i++,page++) {
count++;
- ClearPageReserved(page);
+ ClearPageReservedNoAtomic(page);
set_page_count(page, 1);
__free_page(page);
}
--- a/include/linux/page-flags.h 2004-08-06 13:43:36.000000000 -0500
+++ b/include/linux/page-flags.h 2004-08-06 13:43:27.000000000 -0500
@@ -236,6 +236,7 @@ extern unsigned long __read_page_state(u
#define PageReserved(page) test_bit(PG_reserved, &(page)->flags)
#define SetPageReserved(page) set_bit(PG_reserved, &(page)->flags)
#define ClearPageReserved(page) clear_bit(PG_reserved, &(page)->flags)
+#define ClearPageReservedNoAtomic(page) (page)->flags &= ~(1UL << PG_reserved)
#define SetPagePrivate(page) set_bit(PG_private, &(page)->flags)
#define ClearPagePrivate(page) clear_bit(PG_private, &(page)->flags)
^ permalink raw reply [flat|nested] 4+ messages in thread* Re: [PATCH] improve speed of freeing bootmem
2004-08-06 19:26 [PATCH] improve speed of freeing bootmem Josh Aas
@ 2004-08-06 19:52 ` Andrew Morton
2004-08-06 20:46 ` William Lee Irwin III
2004-08-06 22:02 ` Josh Aas
0 siblings, 2 replies; 4+ messages in thread
From: Andrew Morton @ 2004-08-06 19:52 UTC (permalink / raw)
To: Josh Aas; +Cc: linux-kernel, linux-ia64
Josh Aas <josha@sgi.com> wrote:
>
> Attached is a patch that greatly improves the speed of freeing boot
> memory.
hm, OK. I have a vague feeling that Bill Irwin had patches to fix this up
ages ago.
A few nits:
> --- a/mm/bootmem.c 2004-08-05 15:33:39.000000000 -0500
> +++ b/mm/bootmem.c 2004-08-06 13:42:33.000000000 -0500
> @@ -259,6 +259,7 @@ static unsigned long __init free_all_boo
> unsigned long i, count, total = 0;
> unsigned long idx;
> unsigned long *map;
> + int gofast = 0;
>
> BUG_ON(!bdata->node_bootmem_map);
>
> @@ -267,14 +268,32 @@ static unsigned long __init free_all_boo
> page = virt_to_page(phys_to_virt(bdata->node_boot_start));
> idx = bdata->node_low_pfn - (bdata->node_boot_start >> PAGE_SHIFT);
> map = bdata->node_bootmem_map;
> + if (bdata->node_boot_start == 0 ||
> + ffs(bdata->node_boot_start) - PAGE_SHIFT > ffs(BITS_PER_LONG))
> + gofast = 1;
A comment describing the above reasoning would be nice.
> for (i = 0; i < idx; ) {
> unsigned long v = ~map[i / BITS_PER_LONG];
> - if (v) {
> + if (gofast && v == ~0UL) {
> + int j;
> +
> + count += BITS_PER_LONG;
> + ClearPageReservedNoAtomic(page);
> + set_page_count(page, 1);
> + for (j = 1; j < BITS_PER_LONG; j++) {
> + if (j + 16 < BITS_PER_LONG) {
> + prefetchw(page + j + 16);
> + }
The whitespace/tabbing has gone funny here.
> +#define ClearPageReservedNoAtomic(page) (page)->flags &= ~(1UL << PG_reserved)
The naming convention we used in 2.4 for the nonatomic operation was
__ClearPageReserved(), so can we please stick with that?
And this macro can use __clear_bit() rather than open-coding it.
^ permalink raw reply [flat|nested] 4+ messages in thread* Re: [PATCH] improve speed of freeing bootmem
2004-08-06 19:52 ` Andrew Morton
@ 2004-08-06 20:46 ` William Lee Irwin III
2004-08-06 22:02 ` Josh Aas
1 sibling, 0 replies; 4+ messages in thread
From: William Lee Irwin III @ 2004-08-06 20:46 UTC (permalink / raw)
To: Andrew Morton; +Cc: Josh Aas, linux-kernel, linux-ia64
Josh Aas <josha@sgi.com> wrote:
>> Attached is a patch that greatly improves the speed of freeing boot
>> memory.
On Fri, Aug 06, 2004 at 12:52:16PM -0700, Andrew Morton wrote:
> hm, OK. I have a vague feeling that Bill Irwin had patches to fix this up
> ages ago.
I'm going to do some further work here and bring some patches that
are simplifications of the most widely distributed ones I wrote up to
current, get them tested on a few architectures, and so on. The testing
cycle for this will probably take a couple of weeks, and it's a much
larger change, so I think Josh's solution is a good interim solution.
Merging it also helps in the event that the data structure changes
I'm working on don't work out or end up going through an unusually long
testing cycle.
-- wli
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] improve speed of freeing bootmem
2004-08-06 19:52 ` Andrew Morton
2004-08-06 20:46 ` William Lee Irwin III
@ 2004-08-06 22:02 ` Josh Aas
1 sibling, 0 replies; 4+ messages in thread
From: Josh Aas @ 2004-08-06 22:02 UTC (permalink / raw)
To: Andrew Morton; +Cc: linux-kernel, linux-ia64
[-- Attachment #1: Type: text/plain, Size: 1970 bytes --]
New patch is attached.
Signed-off-by: Tony Luck <tony.luck@intel.com>
Signed-off-by: Josh Aas <josha@sgi.com>
--
Josh Aas
Silicon Graphics, Inc. (SGI)
Linux System Software
651-683-3068
Andrew Morton wrote:
> Josh Aas <josha@sgi.com> wrote:
>
>>Attached is a patch that greatly improves the speed of freeing boot
>>memory.
>
>
> hm, OK. I have a vague feeling that Bill Irwin had patches to fix this up
> ages ago.
>
>
> A few nits:
>
>
>>--- a/mm/bootmem.c 2004-08-05 15:33:39.000000000 -0500
>>+++ b/mm/bootmem.c 2004-08-06 13:42:33.000000000 -0500
>>@@ -259,6 +259,7 @@ static unsigned long __init free_all_boo
>> unsigned long i, count, total = 0;
>> unsigned long idx;
>> unsigned long *map;
>>+ int gofast = 0;
>>
>> BUG_ON(!bdata->node_bootmem_map);
>>
>>@@ -267,14 +268,32 @@ static unsigned long __init free_all_boo
>> page = virt_to_page(phys_to_virt(bdata->node_boot_start));
>> idx = bdata->node_low_pfn - (bdata->node_boot_start >> PAGE_SHIFT);
>> map = bdata->node_bootmem_map;
>>+ if (bdata->node_boot_start == 0 ||
>>+ ffs(bdata->node_boot_start) - PAGE_SHIFT > ffs(BITS_PER_LONG))
>>+ gofast = 1;
>
>
> A comment describing the above reasoning would be nice.
>
>
>> for (i = 0; i < idx; ) {
>> unsigned long v = ~map[i / BITS_PER_LONG];
>>- if (v) {
>>+ if (gofast && v == ~0UL) {
>>+ int j;
>>+
>>+ count += BITS_PER_LONG;
>>+ ClearPageReservedNoAtomic(page);
>>+ set_page_count(page, 1);
>>+ for (j = 1; j < BITS_PER_LONG; j++) {
>>+ if (j + 16 < BITS_PER_LONG) {
>>+ prefetchw(page + j + 16);
>>+ }
>
>
> The whitespace/tabbing has gone funny here.
>
>
>>+#define ClearPageReservedNoAtomic(page) (page)->flags &= ~(1UL << PG_reserved)
>
>
> The naming convention we used in 2.4 for the nonatomic operation was
> __ClearPageReserved(), so can we please stick with that?
>
> And this macro can use __clear_bit() rather than open-coding it.
[-- Attachment #2: bootmem5.patch --]
[-- Type: text/x-patch, Size: 2384 bytes --]
--- mm/bootmem.c.orig 2004-08-05 15:33:39.000000000 -0500
+++ mm/bootmem.c 2004-08-06 16:52:41.000000000 -0500
@@ -259,6 +259,7 @@ static unsigned long __init free_all_boo
unsigned long i, count, total = 0;
unsigned long idx;
unsigned long *map;
+ int gofast = 0;
BUG_ON(!bdata->node_bootmem_map);
@@ -267,14 +268,33 @@ static unsigned long __init free_all_boo
page = virt_to_page(phys_to_virt(bdata->node_boot_start));
idx = bdata->node_low_pfn - (bdata->node_boot_start >> PAGE_SHIFT);
map = bdata->node_bootmem_map;
+ /* Check physaddr is O(LOG2(BITS_PER_LONG)) page aligned */
+ if (bdata->node_boot_start == 0 ||
+ ffs(bdata->node_boot_start) - PAGE_SHIFT > ffs(BITS_PER_LONG))
+ gofast = 1;
for (i = 0; i < idx; ) {
unsigned long v = ~map[i / BITS_PER_LONG];
- if (v) {
+ if (gofast && v == ~0UL) {
+ int j;
+
+ count += BITS_PER_LONG;
+ __ClearPageReserved(page);
+ set_page_count(page, 1);
+ for (j = 1; j < BITS_PER_LONG; j++) {
+ if (j + 16 < BITS_PER_LONG) {
+ prefetchw(page + j + 16);
+ }
+ __ClearPageReserved(page + j);
+ }
+ __free_pages(page, ffs(BITS_PER_LONG)-1);
+ i += BITS_PER_LONG;
+ page += BITS_PER_LONG;
+ } else if (v) {
unsigned long m;
for (m = 1; m && i < idx; m<<=1, page++, i++) {
if (v & m) {
count++;
- ClearPageReserved(page);
+ __ClearPageReserved(page);
set_page_count(page, 1);
__free_page(page);
}
@@ -294,7 +314,7 @@ static unsigned long __init free_all_boo
count = 0;
for (i = 0; i < ((bdata->node_low_pfn-(bdata->node_boot_start >> PAGE_SHIFT))/8 + PAGE_SIZE-1)/PAGE_SIZE; i++,page++) {
count++;
- ClearPageReserved(page);
+ __ClearPageReserved(page);
set_page_count(page, 1);
__free_page(page);
}
--- include/linux/page-flags.h.orig 2004-08-06 13:43:36.000000000 -0500
+++ include/linux/page-flags.h 2004-08-06 15:16:29.000000000 -0500
@@ -236,6 +236,7 @@ extern unsigned long __read_page_state(u
#define PageReserved(page) test_bit(PG_reserved, &(page)->flags)
#define SetPageReserved(page) set_bit(PG_reserved, &(page)->flags)
#define ClearPageReserved(page) clear_bit(PG_reserved, &(page)->flags)
+#define __ClearPageReserved(page) __clear_bit(PG_reserved, &(page)->flags)
#define SetPagePrivate(page) set_bit(PG_private, &(page)->flags)
#define ClearPagePrivate(page) clear_bit(PG_private, &(page)->flags)
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2004-08-06 22:01 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-08-06 19:26 [PATCH] improve speed of freeing bootmem Josh Aas
2004-08-06 19:52 ` Andrew Morton
2004-08-06 20:46 ` William Lee Irwin III
2004-08-06 22:02 ` Josh Aas
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox