From: Rik van Riel <riel@redhat.com>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: linux-mm@kvack.org, akpm@linux-foundation.org,
	aarcange@redhat.com, peterz@infradead.org, minchan@gmail.com,
	kosaki.motohiro@gmail.com, andi@firstfloor.org, mel@csn.ul.ie,
	linux-kernel@vger.kernel.org, Rik van Riel <riel@surriel.com>
Subject: Re: [PATCH -mm 2/7] mm: get unmapped area from VMA tree
Date: Thu, 21 Jun 2012 09:17:11 -0400
Message-ID: <4FE31ED7.4000305@redhat.com>
In-Reply-To: <20120621090157.GG27816@cmpxchg.org>

On 06/21/2012 05:01 AM, Johannes Weiner wrote:

>> diff --git a/include/linux/mm_types.h b/include/linux/mm_types.h
>> index bf56d66..8ccb4e1 100644
>> --- a/include/linux/mm_types.h
>> +++ b/include/linux/mm_types.h
>> @@ -307,6 +307,7 @@ struct mm_struct {
>>   	unsigned long task_size;		/* size of task vm space */
>>   	unsigned long cached_hole_size; 	/* if non-zero, the largest hole below free_area_cache */
>>   	unsigned long free_area_cache;		/* first hole of size cached_hole_size or larger */
>> +	unsigned long highest_vma;		/* highest vma end address */
>
> It's not clear from the name that this is an end address.  Would
> highest_vm_end be better?

Good idea.  Will fix.

>> +	/* Find the left-most free area of sufficient size. */
>> +	for (addr = 0, rb_node = mm->mm_rb.rb_node; rb_node; ) {
>> +		unsigned long vma_start;
>> +		int found_here = 0;
>> +
>> +		vma = rb_to_vma(rb_node);
>> +
>> +		if (vma->vm_start > len) {
>
> vmas can abut, and vma->vm_end == vma->vm_next->vm_start.  Should this
> be >=?

We do not want to mmap at address 0.
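
One reading of this reply, sketched as standalone C rather than kernel
code: a mapping of len bytes placed directly below a VMA starts at
vm_start - len, so a >= test would allow that address to be 0 when
vm_start == len. The helper names fits_below and addr_below are
hypothetical and do not appear in the patch.

```c
#include <stdbool.h>

/*
 * Hypothetical helper (not from the patch): can a mapping of len bytes
 * end at vm_start without starting at address 0 or wrapping around?
 */
static bool fits_below(unsigned long vm_start, unsigned long len)
{
	/* Strict '>' guarantees vm_start - len is at least 1. */
	return vm_start > len;
}

/* Address such a mapping would get; only meaningful if fits_below() holds. */
static unsigned long addr_below(unsigned long vm_start, unsigned long len)
{
	return vm_start - len;
}
```

With vm_start == len, fits_below() returns false, which is the case the
strict comparison is meant to exclude.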

>> +		/* Go left if it looks promising. */
>> +		if (node_free_hole(rb_node->rb_left) >= len &&
>> +					vma->vm_start - len >= lower_limit) {
>> +			rb_node = rb_node->rb_left;
>> +			continue;
>
> If we already are at a vma whose start has a lower address than the
> overall length, does it make sense to check for a left hole?
> I.e. shouldn't this be inside the if (vma->vm_start > len) block?

I am trying to preserve the same fragmentation
semantics as the current code, so we do not
get any regressions in that area.
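
For readers following along, the "go left if it looks promising" step
quoted above can be sketched against a plain binary search tree. This
is a simplified illustration, not the patch: struct vma_node, the gap
and max_gap fields, and leftmost_fit are hypothetical names, the tree
is not a kernel rbtree, and lower_limit and the fragmentation behaviour
discussed here are ignored. It assumes each node caches the free space
directly below it (gap) and the largest such gap in its subtree
(max_gap).

```c
#include <stddef.h>

/* Hypothetical node layout; the patch uses the kernel's augmented rbtree. */
struct vma_node {
	unsigned long start, end;
	unsigned long gap;	/* free space immediately below start */
	unsigned long max_gap;	/* largest gap anywhere in this subtree */
	struct vma_node *left, *right;
};

static unsigned long subtree_gap(const struct vma_node *n)
{
	return n ? n->max_gap : 0;
}

/*
 * Return the lowest-addressed node whose gap can hold len bytes,
 * or NULL if no gap in the tree is large enough.
 */
static const struct vma_node *leftmost_fit(const struct vma_node *n,
					   unsigned long len)
{
	while (n) {
		/* Anything in the left subtree lies at a lower address. */
		if (subtree_gap(n->left) >= len) {
			n = n->left;
			continue;
		}
		if (n->gap >= len)
			return n;
		/* Only descend right when a fit is known to exist there. */
		if (subtree_gap(n->right) >= len) {
			n = n->right;
			continue;
		}
		break;
	}
	return NULL;
}
```

Because every descent is guarded by max_gap, the walk visits at most
one node per tree level.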

>> +	/*
>> +	 * There is not enough space to the left of any VMA.
>> +	 * Check the far right-hand side of the VMA tree.
>> +	 */
>> +	rb_node = mm->mm_rb.rb_node;
>> +	while (rb_node->rb_right)
>> +		rb_node = rb_node->rb_right;
>> +	vma = rb_to_vma(rb_node);
>> +	addr = vma->vm_end;
>
> Unless I missed something, we only reach here when
> continue_next_right(rb_node) above returned NULL.  And if it does, the
> rb_node it was passed was the right-most node in the tree, so we could
> do something like

We break out of the large for () loop once rb_node
is NULL, due to falling off the end of the tree.

> 	} else if (!addr) {
> 		struct rb_node *rb_right = continue_next_right(rb_node);
> 		if (!rb_right)
> 			break;
> 		rb_node = rb_right;
> 		continue;
> 	}
>
> above and then save the lookup after the loop.

That might work, but I expect the situation to be rare
enough that I would rather pick the more readable option.

> Also, dereferencing mm->mm_rb.rb_node unconditionally after the loop
> assumes that the tree always contains at least one vma.  Is this
> guaranteed for all architectures?

When a process is execve'd, a stack VMA is set up.
This means every process has at least one VMA by the
time we can get to this code.
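
The post-loop lookup under discussion, sketched against a plain binary
tree. struct node and highest_end are hypothetical names; the quoted
patch dereferences the root without a check and relies on the guarantee
described above. The NULL guard here only makes visible what the
unguarded version assumes.

```c
#include <stddef.h>

/* Hypothetical stand-in for a VMA linked into a search tree. */
struct node {
	unsigned long vm_end;
	struct node *left, *right;
};

/* Return the end address of the right-most node, or 0 for an empty tree. */
static unsigned long highest_end(const struct node *root)
{
	if (!root)
		return 0;	/* no VMA at all; the patch assumes this cannot happen */
	while (root->right)
		root = root->right;
	return root->vm_end;
}
```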

>> -fail:
>> -	/*
>> -	 * if hint left us with no space for the requested
>> -	 * mapping then try again:
>> -	 *
>> -	 * Note: this is different with the case of bottomup
>> -	 * which does the fully line-search, but we use find_vma
>> -	 * here that causes some holes skipped.
>> -	 */
>> -	if (start_addr != mm->mmap_base) {
>> -		mm->free_area_cache = mm->mmap_base;
>> -		mm->cached_hole_size = 0;
>> -		goto try_again;
>> +		if (!found_here && node_free_hole(rb_node->rb_left) >= len) {
>> +			/* Last known hole is to the right of this subtree. */
>
> "to the left"

Thanks, will fix.

-- 
All rights reversed

