linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Herbert van den Bergh <herbert.van.den.bergh@oracle.com>
To: Mel Gorman <mgorman@suse.de>
Cc: linux-mm@kvack.org
Subject: Re: [BUG] 3.2.2 crash in isolate_migratepages
Date: Mon, 30 Jan 2012 10:16:21 -0800	[thread overview]
Message-ID: <4F26DE75.5050409@oracle.com> (raw)
In-Reply-To: <20120130090923.GD4065@suse.de>

On 1/30/12 1:09 AM, Mel Gorman wrote:
> On Fri, Jan 27, 2012 at 01:43:07PM -0800, Herbert van den Bergh wrote:
>> 3.2.2 panics on a 16GB i686 blade:
>>
>> BUG: unable to handle kernel paging request at 01c00008
>> IP: [<c0522399>] isolate_migratepages+0x119/0x390
>> *pdpt = 000000002f7ce001 *pde = 0000000000000000
>>
>> The crash happens on this line in mm/compaction.c::isolate_migratepages:
>>
>>     328                 page = pfn_to_page(low_pfn);
>>
> This is not line 328 on kernel 3.2.2. Can you double check what version
> you are using?

That's right, I was using 3.1, but reproduced the problem on 3.2.2.  The
source code line numbers are from 3.1.  Sorry for the confusion.
>> This macro finds the struct page pointer for a given pfn.  These struct
>> page pointers are stored in sections of 131072 pages if
>> CONFIG_SPARSEMEM=y.  If an entire section has no memory pages, the page
>> structs are not allocated for this section.  On this particular machine,
>> there is no RAM mapped from 2GB - 4GB:
>>
>> # dmesg|grep usable
>>  BIOS-e820: 0000000000000000 - 000000000009f400 (usable)
>>  BIOS-e820: 0000000000100000 - 000000007fe4e000 (usable)
>>  BIOS-e820: 000000007fe56000 - 000000007fe57000 (usable)
>>  BIOS-e820: 0000000100000000 - 000000047ffff000 (usable)
>>
>> So there are no page structs for the sections between 2GB and 4GB.
>>
>> I believe this check was intended to catch page numbers that point to holes:
>>
>>     323                 if (!pfn_valid_within(low_pfn))
>>     324                         continue;
> Can you try the following patch please?
The following patch fixes the crash on this system.

Thanks,
Herbert.

>
> ---8<---
> mm: compaction: Check pfn_valid when entering a new MAX_ORDER_NR_PAGES block during isolation for migration
>
> When isolating for migration, migration starts at the start of a zone
> which is not necessarily pageblock aligned. Further, it stops isolating
> when COMPACT_CLUSTER_MAX pages are isolated so migrate_pfn is generally
> not aligned.
>
> The problem is that pfn_valid is only called on the first PFN being
> checked. Lets say we have a case like this
>
> H = MAX_ORDER_NR_PAGES boundary
> | = pageblock boundary
> m = cc->migrate_pfn
> f = cc->free_pfn
> o = memory hole
>
> H------|------H------|----m-Hoooooo|ooooooH-f----|------H
>
> The migrate_pfn is just below a memory hole and the free scanner is
> beyond the hole. When isolate_migratepages started, it scans from
> migrate_pfn to migrate_pfn+pageblock_nr_pages which is now in a memory
> hole. It checks pfn_valid() on the first PFN but then scans into the
> hole where there are not necessarily valid struct pages.
>
> This patch ensures that isolate_migratepages calls pfn_valid when
> necessary.
>
> Signed-off-by: Mel Gorman <mgorman@suse.de>
> ---
>  mm/compaction.c |   13 +++++++++++++
>  1 files changed, 13 insertions(+), 0 deletions(-)
>
> diff --git a/mm/compaction.c b/mm/compaction.c
> index 899d956..edc1e26 100644
> --- a/mm/compaction.c
> +++ b/mm/compaction.c
> @@ -313,6 +313,19 @@ static isolate_migrate_t isolate_migratepages(struct zone *zone,
>  		} else if (!locked)
>  			spin_lock_irq(&zone->lru_lock);
>  
> +		/*
> +		 * migrate_pfn does not necessarily start aligned to a
> +		 * pageblock. Ensure that pfn_valid is called when moving
> +		 * into a new MAX_ORDER_NR_PAGES range in case of large
> +		 * memory holes within the zone
> +		 */
> +		if ((low_pfn & (MAX_ORDER_NR_PAGES - 1)) == 0) {
> +			if (!pfn_valid(low_pfn)) {
> +				low_pfn += MAX_ORDER_NR_PAGES - 1;
> +				continue;
> +			}
> +		}
> +
>  		if (!pfn_valid_within(low_pfn))
>  			continue;
>  		nr_scanned++;

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2012-01-30 18:16 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-01-27 21:43 [BUG] 3.2.2 crash in isolate_migratepages Herbert van den Bergh
2012-01-30  9:09 ` Mel Gorman
2012-01-30 18:16   ` Herbert van den Bergh [this message]
2012-01-30 18:28     ` Michal Nazarewicz

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4F26DE75.5050409@oracle.com \
    --to=herbert.van.den.bergh@oracle.com \
    --cc=linux-mm@kvack.org \
    --cc=mgorman@suse.de \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).