From: Andrew Morton <akpm@linux-foundation.org>
To: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>
Cc: Rusty Russell <rusty@rustcorp.com.au>,
Hugh Dickins <hughd@google.com>,
Madhavan Srinivasan <maddy@linux.vnet.ibm.com>,
linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org,
linux-mm@kvack.org, linux-arch@vger.kernel.org, x86@kernel.org,
benh@kernel.crashing.org, paulus@samba.org, riel@redhat.com,
mgorman@suse.de, ak@linux.intel.com, peterz@infradead.org,
mingo@kernel.org, dave.hansen@intel.com
Subject: Re: [PATCH V4 0/2] mm: FAULT_AROUND_ORDER patchset performance data for powerpc
Date: Tue, 20 May 2014 12:59:56 -0700 [thread overview]
Message-ID: <20140520125956.aa61a3bfd84d4d6190740ce2@linux-foundation.org> (raw)
In-Reply-To: <20140520102738.7F096E009B@blue.fi.intel.com>
On Tue, 20 May 2014 13:27:38 +0300 (EEST) "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com> wrote:
> Rusty Russell wrote:
> > "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com> writes:
> > > Andrew Morton wrote:
> > >> On Mon, 19 May 2014 16:23:07 -0700 (PDT) Hugh Dickins <hughd@google.com> wrote:
> > >>
> > >> > Shouldn't FAULT_AROUND_ORDER and fault_around_order be changed to be
> > >> > the order of the fault-around size in bytes, and fault_around_pages()
> > >> > use 1UL << (fault_around_order - PAGE_SHIFT)
> > >>
> > >> Yes. And shame on me for missing it (this time!) at review.
> > >>
> > >> There's still time to fix this. Patches, please.
> > >
> > > Here it is. Made at 3.30 AM, build tested only.
> >
> > Prefer on top of Maddy's patch which makes it always a variable, rather
> > than CONFIG_DEBUG_FS. It's got enough hair as it is.
>
> Something like this?
This appears to be against mainline, not against Madhavan's patch. As
mentioned previously, I'd prefer it that way but confused.
> From: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>
> Date: Tue, 20 May 2014 13:02:03 +0300
> Subject: [PATCH] mm: nominate faultaround area in bytes rather then page order
>
> There are evidences that faultaround feature is less relevant on
> architectures with page size bigger then 4k. Which makes sense since
> page fault overhead per byte of mapped area should be less there.
>
> Let's rework the feature to specify faultaround area in bytes instead of
> page order. It's 64 kilobytes for now.
>
> The patch effectively disables faultaround on architectures with
> page size >= 64k (like ppc64).
>
> It's possible that some other size of faultaround area is relevant for a
> platform. We can expose `fault_around_bytes' variable to arch-specific
> code once such platforms will be found.
>
> Signed-off-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
> ---
> mm/memory.c | 62 +++++++++++++++++++++++--------------------------------------
> 1 file changed, 23 insertions(+), 39 deletions(-)
>
> diff --git a/mm/memory.c b/mm/memory.c
> index 037b812a9531..252b319e8cdf 100644
> --- a/mm/memory.c
> +++ b/mm/memory.c
> @@ -3402,63 +3402,47 @@ void do_set_pte(struct vm_area_struct *vma, unsigned long address,
> update_mmu_cache(vma, address, pte);
> }
>
> -#define FAULT_AROUND_ORDER 4
> +static unsigned long fault_around_bytes = 65536;
> +
> +static inline unsigned long fault_around_pages(void)
> +{
> + return rounddown_pow_of_two(fault_around_bytes) / PAGE_SIZE;
> +}
I think we should round up, not down. So if the user asks for 1kb,
they get one page.
So this becomes
return PAGE_ALIGN(fault_around_bytes) / PAGE_SIZE;
> +static inline unsigned long fault_around_mask(void)
> +{
> + return ~(rounddown_pow_of_two(fault_around_bytes) - 1) & PAGE_MASK;
> +}
And this has me a bit stumped. It's not helpful that do_fault_around()
is undocumented. Does it fault in N/2 pages ahead and N/2 pages
behind? Or does it align the address down to the highest multiple of
fault_around_bytes? It appears to be the latter, so the location of
the faultaround window around the fault address is basically random,
depending on what address userspace happened to pick. I don't know why
we did this :(
Or something. Can we please get some code commentary over
do_fault_around() describing this design decision and explaining the
reasoning behind it?
Also, "neast" is not a word.
next prev parent reply other threads:[~2014-05-20 19:59 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-05-08 9:28 [PATCH V4 0/2] mm: FAULT_AROUND_ORDER patchset performance data for powerpc Madhavan Srinivasan
2014-05-08 9:28 ` Madhavan Srinivasan
2014-05-08 9:28 ` [PATCH V4 1/2] mm: move FAULT_AROUND_ORDER to arch/ Madhavan Srinivasan
2014-05-08 9:28 ` Madhavan Srinivasan
2014-05-08 9:28 ` [PATCH V4 2/2] powerpc/pseries: init fault_around_order for pseries Madhavan Srinivasan
2014-05-08 9:28 ` Madhavan Srinivasan
2014-05-20 7:28 ` Andrew Morton
2014-05-20 7:28 ` Andrew Morton
2014-05-20 8:03 ` Madhavan Srinivasan
2014-05-20 8:03 ` Madhavan Srinivasan
2014-05-15 8:25 ` [PATCH V4 0/2] mm: FAULT_AROUND_ORDER patchset performance data for powerpc Madhavan Srinivasan
2014-05-15 8:25 ` Madhavan Srinivasan
2014-05-15 17:28 ` Hugh Dickins
2014-05-15 17:28 ` Hugh Dickins
2014-05-19 0:12 ` Rusty Russell
2014-05-19 0:12 ` Rusty Russell
2014-05-19 3:05 ` Madhavan Srinivasan
2014-05-19 3:05 ` Madhavan Srinivasan
2014-05-19 23:23 ` Hugh Dickins
2014-05-19 23:23 ` Hugh Dickins
2014-05-19 23:43 ` Andrew Morton
2014-05-19 23:43 ` Andrew Morton
2014-05-20 0:44 ` Kirill A. Shutemov
2014-05-20 0:44 ` Kirill A. Shutemov
2014-05-20 6:22 ` Rusty Russell
2014-05-20 6:22 ` Rusty Russell
2014-05-20 7:32 ` Andrew Morton
2014-05-20 7:32 ` Andrew Morton
2014-05-20 7:53 ` Madhavan Srinivasan
2014-05-20 10:27 ` Kirill A. Shutemov
2014-05-20 19:59 ` Andrew Morton [this message]
2014-05-21 13:40 ` Kirill A. Shutemov
2014-05-21 13:40 ` Kirill A. Shutemov
2014-05-21 20:34 ` Andrew Morton
2014-05-23 12:28 ` Kirill A. Shutemov
2014-05-23 12:28 ` Kirill A. Shutemov
2014-05-27 6:24 ` Madhavan Srinivasan
2014-05-27 6:24 ` Madhavan Srinivasan
2014-05-27 10:21 ` Kirill A. Shutemov
2014-05-27 10:21 ` Kirill A. Shutemov
2014-05-27 10:44 ` Madhavan Srinivasan
2014-05-20 1:14 ` Rusty Russell
2014-05-20 1:14 ` Rusty Russell
2014-05-20 2:34 ` Hugh Dickins
2014-05-20 2:34 ` Hugh Dickins
2014-05-20 2:06 ` Madhavan Srinivasan
2014-05-20 2:06 ` Madhavan Srinivasan
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140520125956.aa61a3bfd84d4d6190740ce2@linux-foundation.org \
--to=akpm@linux-foundation.org \
--cc=ak@linux.intel.com \
--cc=benh@kernel.crashing.org \
--cc=dave.hansen@intel.com \
--cc=hughd@google.com \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-arch@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=linuxppc-dev@lists.ozlabs.org \
--cc=maddy@linux.vnet.ibm.com \
--cc=mgorman@suse.de \
--cc=mingo@kernel.org \
--cc=paulus@samba.org \
--cc=peterz@infradead.org \
--cc=riel@redhat.com \
--cc=rusty@rustcorp.com.au \
--cc=x86@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).