linux-arch.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Yinghai Lu <yinghai@kernel.org>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: Ingo Molnar <mingo@elte.hu>, Thomas Gleixner <tglx@linutronix.de>,
	"H. Peter Anvin" <hpa@zytor.com>,
	Andrew Morton <akpm@linux-foundation.org>,
	David Miller <davem@davemloft.net>,
	Benjamin Herrenschmidt <benh@kernel.crashing.org>,
	Linus Torvalds <torvalds@linux-foundation.org>,
	linux-kernel@vger.kernel.org, linux-arch@vger.kernel.org
Subject: Re: [PATCH 02/24] x86: Make sure free_init_pages() free pages in boundary
Date: Fri, 26 Mar 2010 16:45:56 -0700	[thread overview]
Message-ID: <4BAD4734.8020503@kernel.org> (raw)
In-Reply-To: <20100326230615.GB29222@cmpxchg.org>

On 03/26/2010 04:06 PM, Johannes Weiner wrote:
> On Fri, Mar 26, 2010 at 03:21:32PM -0700, Yinghai Lu wrote:
>> diff --git a/arch/x86/kernel/head32.c b/arch/x86/kernel/head32.c
>> index adedeef..fe3d953 100644
>> --- a/arch/x86/kernel/head32.c
>> +++ b/arch/x86/kernel/head32.c
>> @@ -47,6 +47,7 @@ void __init i386_start_kernel(void)
>>  		u64 ramdisk_image = boot_params.hdr.ramdisk_image;
>>  		u64 ramdisk_size  = boot_params.hdr.ramdisk_size;
>>  		u64 ramdisk_end   = ramdisk_image + ramdisk_size;
>> +		ramdisk_end = PFN_UP(ramdisk_end) << PAGE_SHIFT;
> 
> Why not PAGE_ALIGN()?

looks like PFN_UP is more clear.

>> +	size_aligned = ramdisk_end - ramdisk_image;
>> +
>>  	/* We need to move the initrd down into lowmem */
>> -	ramdisk_here = find_e820_area(0, end_of_lowmem, ramdisk_size,
>> +	ramdisk_here = find_e820_area(0, end_of_lowmem, size_aligned,
>>  					 PAGE_SIZE);
>>  
>>  	if (ramdisk_here == -1ULL)
>>  		panic("Cannot find place for new RAMDISK of size %lld\n",
>> -			 ramdisk_size);
>> +			 size_aligned);
>>  
>>  	/* Note: this includes all the lowmem currently occupied by
>>  	   the initrd, we rely on that fact to keep the data intact. */
>> -	reserve_early(ramdisk_here, ramdisk_here + ramdisk_size,
>> +	reserve_early(ramdisk_here, ramdisk_here + size_aligned,
>>  			 "NEW RAMDISK");
>>  	initrd_start = ramdisk_here + PAGE_OFFSET;
>>  	initrd_end   = initrd_start + ramdisk_size;
>>  	printk(KERN_INFO "Allocated new RAMDISK: %08llx - %08llx\n",
>> -			 ramdisk_here, ramdisk_here + ramdisk_size);
>> +			 ramdisk_here, ramdisk_here + size_aligned);
>>  
>>  	q = (char *)initrd_start;
>>  
>>  	/* Copy any lowmem portion of the initrd */
>> -	if (ramdisk_image < end_of_lowmem) {
>> -		clen = end_of_lowmem - ramdisk_image;
>> -		p = (char *)__va(ramdisk_image);
>> +	image = ramdisk_image;
>> +	if (image < end_of_lowmem) {
>> +		clen = end_of_lowmem - image;
>> +		p = (char *)__va(image);
>>  		memcpy(q, p, clen);
>>  		q += clen;
>> -		ramdisk_image += clen;
>> -		ramdisk_size  -= clen;
>> +		image += clen;
>> +		size_aligned  -= clen;
>>  	}
>>  
>>  	/* Copy the highmem portion of the initrd */
>> -	while (ramdisk_size) {
>> -		slop = ramdisk_image & ~PAGE_MASK;
>> -		clen = ramdisk_size;
>> +	while (size_aligned) {
>> +		slop = image & ~PAGE_MASK;
>> +		clen = size_aligned;
>>  		if (clen > MAX_MAP_CHUNK-slop)
>>  			clen = MAX_MAP_CHUNK-slop;
>> -		mapaddr = ramdisk_image & PAGE_MASK;
>> +		mapaddr = image & PAGE_MASK;
>>  		p = early_memremap(mapaddr, clen+slop);
>>  		memcpy(q, p+slop, clen);
>>  		early_iounmap(p, clen+slop);
>>  		q += clen;
>> -		ramdisk_image += clen;
>> -		ramdisk_size  -= clen;
>> +		image += clen;
>> +		size_aligned  -= clen;
> 
> Those appear to be a lot of spurious changes.  We don't need to
> copy the alignment padding as well, so it only matters that the
> arguments to find_e820_area() and reserve_early() are aligned, no?

we need to reserve that whole page, otherwise other user may use the same page.

then we can not use PFN_UP() later to free the whole page.

>> --- a/arch/x86/mm/init.c
>> +++ b/arch/x86/mm/init.c
>> @@ -332,6 +332,16 @@ int devmem_is_allowed(unsigned long pagenr)
>>  void free_init_pages(char *what, unsigned long begin, unsigned long end)
>>  {
>>  	unsigned long addr = begin;
>> +	unsigned long addr_aligned, end_aligned;
>> +
>> +	/* Make sure boundaries are page aligned */
>> +	addr_aligned = PFN_UP(addr) << PAGE_SHIFT;
>> +	end_aligned = PFN_DOWN(end) << PAGE_SHIFT;
>> +
>> +	if (WARN(addr_aligned != addr || end_aligned != end, "free_init_pages: range [%#lx, %#lx] is not aligned\n", addr, end)) {
>> +		addr = addr_aligned;
>> +		end = end_aligned;
>> +	}
> 
> Maybe realign only for when it is not aligned?  So to keep the fixup
> out of line.
> 
> I suppose WARN_ON() is enough as it will print a stack trace which
> should be enough clue of what went wrong.

we need to PFN_DOWN the end, and don't free the partial page.
otherwise could crash the system.

So just print out the trace, and system still can be used. reporter to get dmesg if
he don't have serial console.

> 
>>  	if (addr >= end)
>>  		return;
>> @@ -376,6 +386,18 @@ void free_initmem(void)
>>  #ifdef CONFIG_BLK_DEV_INITRD
>>  void free_initrd_mem(unsigned long start, unsigned long end)
>>  {
>> -	free_init_pages("initrd memory", start, end);
>> +	unsigned long end_aligned;
>> +
>> +	/*
>> +	 * end could be not aligned, and We can not align that,
>> +	 * decompresser could be confused by aligned initrd_end
>> +	 * We already reserve the end partial page before in
>> +	 *   - i386_start_kernel()
>> +	 *   - x86_64_start_kernel()
>> +	 *   - relocate_initrd()
>> +	 * So here we can do PFN_UP() safely to get partial page to be freed
>> +	 */
>> +	end_aligned = PFN_UP(end) << PAGE_SHIFT;
> 
> PAGE_ALIGN()

no PAGE_ALIGN is bad name, it is not clear UP or DOWN.

Thanks

Yinghai

  parent reply	other threads:[~2010-03-26 23:47 UTC|newest]

Thread overview: 72+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-03-26 22:21 [PATCH -v8 00/24] use lmb with x86 Yinghai Lu
2010-03-26 22:21 ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 01/24] x86: Make smp_locks end with page alignment Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 23:33   ` Johannes Weiner
2010-03-26 22:21 ` [PATCH 02/24] x86: Make sure free_init_pages() free pages in boundary Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 23:06   ` Johannes Weiner
2010-03-26 23:06     ` Johannes Weiner
2010-03-26 23:45     ` Yinghai Lu [this message]
2010-03-27  0:07       ` Johannes Weiner
2010-03-27  0:07         ` Johannes Weiner
2010-03-27  0:17         ` Yinghai Lu
2010-03-27  0:17           ` Yinghai Lu
2010-03-27  1:19         ` [PATCH -v3] " Yinghai Lu
2010-03-28  0:03           ` Johannes Weiner
2010-03-28  0:50             ` Yinghai Lu
2010-03-28  1:01               ` Johannes Weiner
2010-03-28  1:58                 ` Yinghai Lu
2010-03-28 23:35                   ` [patch v5] x86: page-alin initrd area size Johannes Weiner
2010-03-29  0:41                     ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 03/24] x86: Do not free zero sized per cpu areas Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 23:42   ` Johannes Weiner
2010-03-26 23:42     ` Johannes Weiner
2010-03-26 23:49     ` Yinghai Lu
2010-03-26 23:49       ` Yinghai Lu
2010-03-26 23:54       ` Johannes Weiner
2010-03-26 23:56         ` Yinghai Lu
2010-03-27  1:18   ` [PATCH -v8] " Yinghai Lu
2010-03-26 22:21 ` [PATCH 04/24] lmb: Move lmb.c to mm/ Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 05/24] lmb: Seperate region array from lmb_region struct Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 06/24] lmb: Add find_lmb_area() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 07/24] lmb: Add reserve_lmb/free_lmb Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 08/24] lmb: Add find_lmb_area_size() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 09/24] bootmem, x86: Add weak version of reserve_bootmem_generic Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 10/24] lmb: Add lmb_to_bootmem() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 11/24] lmb: Add get_free_all_memory_range() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 12/24] lmb: Add lmb_register_active_regions() and lmb_hole_size() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 13/24] lmb: Prepare to include linux/lmb.h in core file Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 14/24] lmb: Add find_memory_core_early() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 15/24] lmb: Add find_lmb_area_node() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 16/24] lmb: Add lmb_free_memory_size() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 17/24] lmb: Add lmb_memory_size() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 18/24] lmb: Add reserve_lmb_overlap_ok() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 19/24] x86: Add sanitize_e820_map() Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 20/24] x86: Use lmb to replace early_res Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 21/24] x86: Replace e820_/_early string with lmb_ Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 22/24] x86: Remove not used early_res code Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 23/24] x86, lmb: Use lmb_memory_size()/lmb_free_memory_size() to get correct dma_reserve Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu
2010-03-26 22:21 ` [PATCH 24/24] x86: Align e820 ram range to page Yinghai Lu
2010-03-26 22:21   ` Yinghai Lu

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4BAD4734.8020503@kernel.org \
    --to=yinghai@kernel.org \
    --cc=akpm@linux-foundation.org \
    --cc=benh@kernel.crashing.org \
    --cc=davem@davemloft.net \
    --cc=hannes@cmpxchg.org \
    --cc=hpa@zytor.com \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=tglx@linutronix.de \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).