From: "H. Peter Anvin" <hpa@zytor.com>
To: Dave Hansen <dave@linux.vnet.ibm.com>
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org,
Gleb Natapov <gleb@redhat.com>, Avi Kivity <avi@redhat.com>,
Ingo Molnar <mingo@redhat.com>,
x86@kernel.org, Marcelo Tosatti <mtosatti@redhat.com>
Subject: Re: [RFCv3][PATCH 1/3] create slow_virt_to_phys()
Date: Tue, 15 Jan 2013 15:46:07 -0800 [thread overview]
Message-ID: <50F5EA3F.70002@zytor.com> (raw)
In-Reply-To: <50F5DD45.4060603@linux.vnet.ibm.com>
On 01/15/2013 02:50 PM, Dave Hansen wrote:
>
>> static inline unsigned long page_level_size(int level)
>> {
>> return (PAGE_SIZE/PGDIR_SIZE) << (PGDIR_SHIFT*level);
>> }
>> static inline unsigned long page_level_shift(int level)
>> {
>> return (PAGE_SHIFT-PGDIR_SHIFT) + (PGDIR_SHIFT*level);
>> }
>
> (PAGE_SHIFT-PGDIR_SHIFT) == -27, so this can't possibly work, right?
>
Ah right... sorry, got messed up in my head what that constant is about.
> How about something like this?
>
> /*
> * Note: this only holds true for pagetable levels where PTEs can be
> * present. It would break if you used it on the PGD level where PAE
> * is in use. It basically assumes that the shift between _all_
> * adjacent levels of the pagetables are the same as the lowest-level
> * shift.
> */
This comment is totally misleading. What it refers to is the separation
between various levels of the page hierarchy; in x86 it is always the same.
Perhaps a cleaner way to do this is:
#define PTRS_PER_PTE_SHIFT ilog2(PTRS_PER_PTE)
> #define PG_SHIFT_PER_LEVEL (PMD_SHIFT-PAGE_SHIFT)
>
> static inline unsigned long page_level_shift(int level)
> {
> return PAGE_SHIFT + (level - PG_LEVEL_4K) * PG_SHIFT_PER_LEVEL;
> }
> static inline unsigned long page_level_size(int level)
> {
> return 1 << page_level_shift(level);
> }
>
> The generated code for page_level_size() looks pretty good, despite it
> depending on page_level_shift(), so we might as well leave it defined
> this way for simplicity:
>
Make sure to make that 1UL instead of 1; page_level_shift() should
return int. See below.
> 0000000000400610 <plsize>:
> 400610: 8d 7c bf fb lea -0x5(%rdi,%rdi,4),%edi
> 400614: b8 01 00 00 00 mov $0x1,%eax
> 400619: 8d 4c 3f 0c lea 0xc(%rdi,%rdi,1),%ecx
> 40061d: d3 e0 shl %cl,%eax
> 40061f: c3 retq
We get better code with:
static inline int page_level_shift(int level)
{
return (PAGE_SHIFT - PTRS_PER_PTE_SHIFT) +
level * PTRS_PER_PTE_SHIFT;
}
static inline unsigned long page_level_size(int level)
{
return 1UL << page_level_shift(level);
}
... the resulting code has one lea instead of two:
0000000000000000 <plsize>:
0: 8d 4c ff 03 lea 0x3(%rdi,%rdi,8),%ecx
4: b8 01 00 00 00 mov $0x1,%eax
9: 48 d3 e0 shl %cl,%rax
c: c3 retq
-hpa
--
H. Peter Anvin, Intel Open Source Technology Center
I work for Intel. I don't speak on their behalf.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
WARNING: multiple messages have this Message-ID (diff)
From: "H. Peter Anvin" <hpa@zytor.com>
To: Dave Hansen <dave@linux.vnet.ibm.com>
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org,
Gleb Natapov <gleb@redhat.com>, Avi Kivity <avi@redhat.com>,
Ingo Molnar <mingo@redhat.com>,
x86@kernel.org, Marcelo Tosatti <mtosatti@redhat.com>
Subject: Re: [RFCv3][PATCH 1/3] create slow_virt_to_phys()
Date: Tue, 15 Jan 2013 15:46:07 -0800 [thread overview]
Message-ID: <50F5EA3F.70002@zytor.com> (raw)
In-Reply-To: <50F5DD45.4060603@linux.vnet.ibm.com>
On 01/15/2013 02:50 PM, Dave Hansen wrote:
>
>> static inline unsigned long page_level_size(int level)
>> {
>> return (PAGE_SIZE/PGDIR_SIZE) << (PGDIR_SHIFT*level);
>> }
>> static inline unsigned long page_level_shift(int level)
>> {
>> return (PAGE_SHIFT-PGDIR_SHIFT) + (PGDIR_SHIFT*level);
>> }
>
> (PAGE_SHIFT-PGDIR_SHIFT) == -27, so this can't possibly work, right?
>
Ah right... sorry, got messed up in my head what that constant is about.
> How about something like this?
>
> /*
> * Note: this only holds true for pagetable levels where PTEs can be
> * present. It would break if you used it on the PGD level where PAE
> * is in use. It basically assumes that the shift between _all_
> * adjacent levels of the pagetables are the same as the lowest-level
> * shift.
> */
This comment is totally misleading. What it refers to is the separation
between various levels of the page hierarchy; in x86 it is always the same.
Perhaps a cleaner way to do this is:
#define PTRS_PER_PTE_SHIFT ilog2(PTRS_PER_PTE)
> #define PG_SHIFT_PER_LEVEL (PMD_SHIFT-PAGE_SHIFT)
>
> static inline unsigned long page_level_shift(int level)
> {
> return PAGE_SHIFT + (level - PG_LEVEL_4K) * PG_SHIFT_PER_LEVEL;
> }
> static inline unsigned long page_level_size(int level)
> {
> return 1 << page_level_shift(level);
> }
>
> The generated code for page_level_size() looks pretty good, despite it
> depending on page_level_shift(), so we might as well leave it defined
> this way for simplicity:
>
Make sure to make that 1UL instead of 1; page_level_shift() should
return int. See below.
> 0000000000400610 <plsize>:
> 400610: 8d 7c bf fb lea -0x5(%rdi,%rdi,4),%edi
> 400614: b8 01 00 00 00 mov $0x1,%eax
> 400619: 8d 4c 3f 0c lea 0xc(%rdi,%rdi,1),%ecx
> 40061d: d3 e0 shl %cl,%eax
> 40061f: c3 retq
We get better code with:
static inline int page_level_shift(int level)
{
return (PAGE_SHIFT - PTRS_PER_PTE_SHIFT) +
level * PTRS_PER_PTE_SHIFT;
}
static inline unsigned long page_level_size(int level)
{
return 1UL << page_level_shift(level);
}
... the resulting code has one lea instead of two:
0000000000000000 <plsize>:
0: 8d 4c ff 03 lea 0x3(%rdi,%rdi,8),%ecx
4: b8 01 00 00 00 mov $0x1,%eax
9: 48 d3 e0 shl %cl,%rax
c: c3 retq
-hpa
--
H. Peter Anvin, Intel Open Source Technology Center
I work for Intel. I don't speak on their behalf.
next prev parent reply other threads:[~2013-01-15 23:46 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-01-09 18:59 [RFCv3][PATCH 1/3] create slow_virt_to_phys() Dave Hansen
2013-01-09 18:59 ` Dave Hansen
2013-01-09 18:59 ` [RFCv3][PATCH 2/3] fix kvm's use of __pa() on percpu areas Dave Hansen
2013-01-09 18:59 ` Dave Hansen
2013-01-15 18:38 ` Rik van Riel
2013-01-15 18:38 ` Rik van Riel
2013-01-09 18:59 ` [RFCv3][PATCH 3/3] make DEBUG_VIRTUAL work earlier in boot Dave Hansen
2013-01-09 18:59 ` Dave Hansen
2013-01-15 17:04 ` [RFCv3][PATCH 1/3] create slow_virt_to_phys() Rik van Riel
2013-01-15 19:46 ` H. Peter Anvin
2013-01-15 19:46 ` H. Peter Anvin
2013-01-15 22:50 ` Dave Hansen
2013-01-15 22:50 ` Dave Hansen
2013-01-15 23:46 ` H. Peter Anvin [this message]
2013-01-15 23:46 ` H. Peter Anvin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50F5EA3F.70002@zytor.com \
--to=hpa@zytor.com \
--cc=avi@redhat.com \
--cc=dave@linux.vnet.ibm.com \
--cc=gleb@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mingo@redhat.com \
--cc=mtosatti@redhat.com \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.