All of lore.kernel.org
 help / color / mirror / Atom feed
From: Seth Jennings <sjenning@linux.vnet.ibm.com>
To: Dan Magenheimer <dan.magenheimer@oracle.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>,
	Minchan Kim <minchan@kernel.org>,
	Greg Kroah-Hartman <gregkh@linuxfoundation.org>,
	Nitin Gupta <ngupta@vflare.org>,
	linux-kernel@vger.kernel.org, linux-mm@kvack.org,
	Thomas Gleixner <tglx@linutronix.de>,
	Ingo Molnar <mingo@redhat.com>, Tejun Heo <tj@kernel.org>,
	David Howells <dhowells@redhat.com>,
	x86@kernel.org, Nick Piggin <npiggin@gmail.com>,
	Konrad Rzeszutek Wilk <konrad@darnok.org>
Subject: Re: [PATCH v2 3/3] x86: Support local_flush_tlb_kernel_range
Date: Fri, 15 Jun 2012 11:48:12 -0500	[thread overview]
Message-ID: <4FDB674C.9070304@linux.vnet.ibm.com> (raw)
In-Reply-To: <7e925563-082b-468f-a7d8-829e819eeac0@default>

On 06/15/2012 11:35 AM, Dan Magenheimer wrote:

>> From: Seth Jennings [mailto:sjenning@linux.vnet.ibm.com]
>> Sent: Friday, June 15, 2012 9:13 AM
>> To: Peter Zijlstra
>> Cc: Minchan Kim; Greg Kroah-Hartman; Nitin Gupta; Dan Magenheimer; linux-kernel@vger.kernel.org;
>> linux-mm@kvack.org; Thomas Gleixner; Ingo Molnar; Tejun Heo; David Howells; x86@kernel.org; Nick
>> Piggin
>> Subject: Re: [PATCH v2 3/3] x86: Support local_flush_tlb_kernel_range
>>
>> On 05/17/2012 09:51 AM, Peter Zijlstra wrote:
>>
>>> On Thu, 2012-05-17 at 17:11 +0900, Minchan Kim wrote:
>>>>> +++ b/arch/x86/include/asm/tlbflush.h
>>>>> @@ -172,4 +172,16 @@ static inline void flush_tlb_kernel_range(unsigned long start,
>>>>>       flush_tlb_all();
>>>>>  }
>>>>>
>>>>> +static inline void local_flush_tlb_kernel_range(unsigned long start,
>>>>> +             unsigned long end)
>>>>> +{
>>>>> +     if (cpu_has_invlpg) {
>>>>> +             while (start < end) {
>>>>> +                     __flush_tlb_single(start);
>>>>> +                     start += PAGE_SIZE;
>>>>> +             }
>>>>> +     } else
>>>>> +             local_flush_tlb();
>>>>> +}
>>>
>>> It would be much better if you wait for Alex Shi's patch to mature.
>>> doing the invlpg thing for ranges is not an unconditional win.
>>
>> From what I can tell Alex's patches have stalled.  The last post was v6
>> on 5/17 and there wasn't a single reply to them afaict.
>>
>> According to Alex's investigation of this "tipping point", it seems that
>> a good generic value is 8.  In other words, on most x86 hardware, it is
>> cheaper to flush up to 8 tlb entries one by one rather than doing a
>> complete flush.
>>
>> So we can do something like:
>>
>>      if (cpu_has_invlpg && (end - start)/PAGE_SIZE <= 8) {
>>              while (start < end) {
>>
>> Would this be acceptable?
> 
> Hey Seth, Nitin --
> 
> After more work digging around zsmalloc and zbud, I really think
> this TLB flushing, as well as the "page pair mapping" code can be
> completely eliminated IFF zsmalloc is limited to items PAGE_SIZE or
> less.


To add to what Nitin just sent, without the page mapping, zsmalloc and
the late xvmalloc have the same issue.  Say you have a whole class of
objects that are 3/4 of a page.  Without the mapping, you can't cross
non-contiguous page boundaries and you'll have 25% fragmentation in the
memory pool.  This is the whole point of zsmalloc.

--
Seth

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

WARNING: multiple messages have this Message-ID (diff)
From: Seth Jennings <sjenning@linux.vnet.ibm.com>
To: Dan Magenheimer <dan.magenheimer@oracle.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>,
	Minchan Kim <minchan@kernel.org>,
	Greg Kroah-Hartman <gregkh@linuxfoundation.org>,
	Nitin Gupta <ngupta@vflare.org>,
	linux-kernel@vger.kernel.org, linux-mm@kvack.org,
	Thomas Gleixner <tglx@linutronix.de>,
	Ingo Molnar <mingo@redhat.com>, Tejun Heo <tj@kernel.org>,
	David Howells <dhowells@redhat.com>,
	x86@kernel.org, Nick Piggin <npiggin@gmail.com>,
	Konrad Rzeszutek Wilk <konrad@darnok.org>
Subject: Re: [PATCH v2 3/3] x86: Support local_flush_tlb_kernel_range
Date: Fri, 15 Jun 2012 11:48:12 -0500	[thread overview]
Message-ID: <4FDB674C.9070304@linux.vnet.ibm.com> (raw)
In-Reply-To: <7e925563-082b-468f-a7d8-829e819eeac0@default>

On 06/15/2012 11:35 AM, Dan Magenheimer wrote:

>> From: Seth Jennings [mailto:sjenning@linux.vnet.ibm.com]
>> Sent: Friday, June 15, 2012 9:13 AM
>> To: Peter Zijlstra
>> Cc: Minchan Kim; Greg Kroah-Hartman; Nitin Gupta; Dan Magenheimer; linux-kernel@vger.kernel.org;
>> linux-mm@kvack.org; Thomas Gleixner; Ingo Molnar; Tejun Heo; David Howells; x86@kernel.org; Nick
>> Piggin
>> Subject: Re: [PATCH v2 3/3] x86: Support local_flush_tlb_kernel_range
>>
>> On 05/17/2012 09:51 AM, Peter Zijlstra wrote:
>>
>>> On Thu, 2012-05-17 at 17:11 +0900, Minchan Kim wrote:
>>>>> +++ b/arch/x86/include/asm/tlbflush.h
>>>>> @@ -172,4 +172,16 @@ static inline void flush_tlb_kernel_range(unsigned long start,
>>>>>       flush_tlb_all();
>>>>>  }
>>>>>
>>>>> +static inline void local_flush_tlb_kernel_range(unsigned long start,
>>>>> +             unsigned long end)
>>>>> +{
>>>>> +     if (cpu_has_invlpg) {
>>>>> +             while (start < end) {
>>>>> +                     __flush_tlb_single(start);
>>>>> +                     start += PAGE_SIZE;
>>>>> +             }
>>>>> +     } else
>>>>> +             local_flush_tlb();
>>>>> +}
>>>
>>> It would be much better if you wait for Alex Shi's patch to mature.
>>> doing the invlpg thing for ranges is not an unconditional win.
>>
>> From what I can tell Alex's patches have stalled.  The last post was v6
>> on 5/17 and there wasn't a single reply to them afaict.
>>
>> According to Alex's investigation of this "tipping point", it seems that
>> a good generic value is 8.  In other words, on most x86 hardware, it is
>> cheaper to flush up to 8 tlb entries one by one rather than doing a
>> complete flush.
>>
>> So we can do something like:
>>
>>      if (cpu_has_invlpg && (end - start)/PAGE_SIZE <= 8) {
>>              while (start < end) {
>>
>> Would this be acceptable?
> 
> Hey Seth, Nitin --
> 
> After more work digging around zsmalloc and zbud, I really think
> this TLB flushing, as well as the "page pair mapping" code can be
> completely eliminated IFF zsmalloc is limited to items PAGE_SIZE or
> less.


To add to what Nitin just sent, without the page mapping, zsmalloc and
the late xvmalloc have the same issue.  Say you have a whole class of
objects that are 3/4 of a page.  Without the mapping, you can't cross
non-contiguous page boundaries and you'll have 25% fragmentation in the
memory pool.  This is the whole point of zsmalloc.

--
Seth


  parent reply	other threads:[~2012-06-15 16:50 UTC|newest]

Thread overview: 68+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-05-16  2:05 [PATCH v2 1/3] zsmalloc: support zsmalloc to ARM, MIPS, SUPERH Minchan Kim
2012-05-16  2:05 ` Minchan Kim
2012-05-16  2:05 ` [PATCH v2 2/3] remove dependency with x86 Minchan Kim
2012-05-16  2:05   ` Minchan Kim
2012-05-16 17:11   ` Seth Jennings
2012-05-16 17:11     ` Seth Jennings
2012-05-17  8:06     ` Minchan Kim
2012-05-17  8:06       ` Minchan Kim
2012-05-16  2:05 ` [PATCH v2 3/3] x86: Support local_flush_tlb_kernel_range Minchan Kim
2012-05-16  2:05   ` Minchan Kim
2012-05-17  8:11   ` Minchan Kim
2012-05-17  8:11     ` Minchan Kim
2012-05-17 14:46     ` Greg Kroah-Hartman
2012-05-17 14:46       ` Greg Kroah-Hartman
2012-05-18  8:35       ` Minchan Kim
2012-05-18  8:35         ` Minchan Kim
2012-05-17 14:51     ` Peter Zijlstra
2012-05-17 14:51       ` Peter Zijlstra
2012-05-17 15:08       ` Peter Zijlstra
2012-05-17 15:08         ` Peter Zijlstra
2012-05-19  0:13         ` Alex Shi
2012-05-19  0:13           ` Alex Shi
2012-05-18  8:36       ` Minchan Kim
2012-05-18  8:36         ` Minchan Kim
2012-06-15 15:13       ` Seth Jennings
2012-06-15 15:13         ` Seth Jennings
2012-06-15 16:35         ` Dan Magenheimer
2012-06-15 16:35           ` Dan Magenheimer
2012-06-15 16:45           ` Nitin Gupta
2012-06-15 16:45             ` Nitin Gupta
2012-06-15 17:29             ` Dan Magenheimer
2012-06-15 17:29               ` Dan Magenheimer
2012-06-15 19:07               ` Seth Jennings
2012-06-15 19:07                 ` Seth Jennings
2012-06-15 19:39                 ` Dan Magenheimer
2012-06-15 19:39                   ` Dan Magenheimer
2012-06-15 19:53                   ` Nitin Gupta
2012-06-15 19:53                     ` Nitin Gupta
2012-06-15 20:13                     ` Dan Magenheimer
2012-06-15 20:13                       ` Dan Magenheimer
2012-06-15 21:23                       ` Nitin Gupta
2012-06-15 21:23                         ` Nitin Gupta
2012-06-15 23:26                         ` Seth Jennings
2012-06-15 23:26                           ` Seth Jennings
2012-06-15 16:48           ` Seth Jennings [this message]
2012-06-15 16:48             ` Seth Jennings
2012-05-16  7:28 ` [PATCH v2 1/3] zsmalloc: support zsmalloc to ARM, MIPS, SUPERH Guan Xuetao
2012-05-16  7:28   ` Guan Xuetao
2012-05-17  0:07   ` Minchan Kim
2012-05-17  0:07     ` Minchan Kim
2012-05-17  0:56     ` Guan Xuetao
2012-05-17  0:56       ` Guan Xuetao
2012-05-17  8:04       ` Minchan Kim
2012-05-17  8:04         ` Minchan Kim
2012-05-18  1:45         ` Guan Xuetao
2012-05-18  1:45           ` Guan Xuetao
2012-05-18  8:38           ` Minchan Kim
2012-05-18  8:38             ` Minchan Kim
2012-05-17  8:32 ` Paul Mundt
2012-05-17  8:32   ` Paul Mundt
2012-05-17  9:06   ` Minchan Kim
2012-05-17  9:06     ` Minchan Kim
2012-05-17  9:19     ` Paul Mundt
2012-05-17  9:19       ` Paul Mundt
2012-05-17  9:08   ` Minchan Kim
2012-05-17  9:08     ` Minchan Kim
2012-05-23 20:51 ` Seth Jennings
2012-05-23 20:51   ` Seth Jennings

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4FDB674C.9070304@linux.vnet.ibm.com \
    --to=sjenning@linux.vnet.ibm.com \
    --cc=a.p.zijlstra@chello.nl \
    --cc=dan.magenheimer@oracle.com \
    --cc=dhowells@redhat.com \
    --cc=gregkh@linuxfoundation.org \
    --cc=konrad@darnok.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=minchan@kernel.org \
    --cc=mingo@redhat.com \
    --cc=ngupta@vflare.org \
    --cc=npiggin@gmail.com \
    --cc=tglx@linutronix.de \
    --cc=tj@kernel.org \
    --cc=x86@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.