From: Ingo Molnar <mingo@elte.hu>
To: Avi Kivity <avi@redhat.com>
Cc: Nick Piggin <npiggin@suse.de>,
Andrea Arcangeli <aarcange@redhat.com>,
Mike Galbraith <efault@gmx.de>,
Jason Garrett-Glaser <darkshikari@gmail.com>,
Linus Torvalds <torvalds@linux-foundation.org>,
Pekka Enberg <penberg@cs.helsinki.fi>,
Andrew Morton <akpm@linux-foundation.org>,
linux-mm@kvack.org, Marcelo Tosatti <mtosatti@redhat.com>,
Adam Litke <agl@us.ibm.com>, Izik Eidus <ieidus@redhat.com>,
Hugh Dickins <hugh.dickins@tiscali.co.uk>,
Rik van Riel <riel@redhat.com>, Mel Gorman <mel@csn.ul.ie>,
Dave Hansen <dave@linux.vnet.ibm.com>,
Benjamin Herrenschmidt <benh@kernel.crashing.org>,
Mike Travis <travis@sgi.com>,
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
Christoph Lameter <cl@linux-foundation.org>,
Chris Wright <chrisw@sous-sol.org>,
bpicco@redhat.com,
KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
Balbir Singh <balbir@linux.vnet.ibm.com>,
Arnd Bergmann <arnd@arndb.de>,
"Michael S. Tsirkin" <mst@redhat.com>,
Peter Zijlstra <peterz@infradead.org>,
Johannes Weiner <hannes@cmpxchg.org>,
Daisuke Nishimura <nishimura@mxp.nes.nec.co.jp>
Subject: Re: [PATCH 00 of 41] Transparent Hugepage Support #17
Date: Mon, 12 Apr 2010 10:07:48 +0200 [thread overview]
Message-ID: <20100412080748.GC18485@elte.hu> (raw)
In-Reply-To: <4BC2D0C9.3060201@redhat.com>
* Avi Kivity <avi@redhat.com> wrote:
> On 04/12/2010 10:21 AM, Nick Piggin wrote:
> >>
> >>All data I provided is very real, in addition to building a ton of
> >>packages and running emerge on /usr/portage I've been running all my
> >>real loads. Only problem I only run it for 1 day and half, but the
> >>load I kept it under was significant (surely a lot bigger inode/dentry
> >>load that any hypervisor usage would ever generate).
> >OK, but as a solution for some kind of very specific and highly
> >optimized application already like RDBMS, HPC, hypervisor or JVM,
> >they could just be using hugepages themselves, couldn't they?
> >
> > It seems more interesting as a more general speedup for applications that
> > can't afford such optimizations? (eg. the common case for most people)
>
> The problem with hugetlbfs is that you need to commit upfront to using it,
> and that you need to be the admin. For virtualization, you want to use
> hugepages when there is no memory pressure, but you want to use ksm,
> ballooning, and swapping when there is (and then go back to large pages when
> pressure is relieved, e.g. by live migration).
>
> HPC and databases can probably live with hugetlbfs. JVM is somewhere in the
> middle, they do allocate memory dynamically.
Even for HPC hugetlbfs is often not good enough: if the data is being
constantly acquired and put into a file and if it needs to be in persistent
storage then you dont want to (and cannot) copy it to hugetlbfs (on a poweroff
you would lose the file).
Furthermore there's also the deployment barrier of marginal improvements: not
many apps are willing to change for a +0.1% improvement - or even for a +0.9%
improvement - _especially_ if that improvement also needs admin access and per
distribution hackery. (each distribution tends to have their own slightly
different way of handing filesystems and other permission/configuration
matters)
We've seen that with sendfile() and splice() an it's no different with
hugetlbs either.
hugetlbfs is basically a non-default poor-man's solution for something that
the kernel should be providing transparently. It's a bad hack that is good
enough to prototype that something works, but it has serious deployment,
configuration and usage limitations. Only a kernel hacker detached from
everyday application development and packaging constraints can believe that
it's a high-quality technical solution.
Transparent hugepages eliminates most of the app-visible disadvantages by
shuffling the problems into the kernel [and no doubt causing follow-on
headaches there] and by utilizing the 'power of the default' - and thus
opening up hugetlbs to far more apps. [*]
It's a really simple mechanism.
Thanks,
Ingo
[*] Note, it would be even better if the kernel provided the C library [a'ka
klibc] and if hugetlbs could be utilized via malloc() et al more
transparently by us changing the user-space library in the kernel repo and
deploying it to apps via a new kernel that provides an updated C library.
We dont do that so we are stuck with crappier solutions and slower
propagation of changes.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2010-04-12 8:08 UTC|newest]
Thread overview: 205+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-04-02 0:41 [PATCH 00 of 41] Transparent Hugepage Support #17 Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 01 of 41] define MADV_HUGEPAGE Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 02 of 41] compound_lock Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 03 of 41] alter compound get_page/put_page Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 04 of 41] update futex compound knowledge Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 05 of 41] fix bad_page to show the real reason the page is bad Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 06 of 41] clear compound mapping Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 07 of 41] add native_set_pmd_at Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 08 of 41] add pmd paravirt ops Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 09 of 41] no paravirt version of pmd ops Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 10 of 41] export maybe_mkwrite Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 11 of 41] comment reminder in destroy_compound_page Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 12 of 41] config_transparent_hugepage Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 13 of 41] special pmd_trans_* functions Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 14 of 41] add pmd mangling generic functions Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 15 of 41] add pmd mangling functions to x86 Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 16 of 41] bail out gup_fast on splitting pmd Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 17 of 41] pte alloc trans splitting Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 18 of 41] add pmd mmu_notifier helpers Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 19 of 41] clear page compound Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 20 of 41] add pmd_huge_pte to mm_struct Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 21 of 41] split_huge_page_mm/vma Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 22 of 41] split_huge_page paging Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 23 of 41] clear_copy_huge_page Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 24 of 41] kvm mmu transparent hugepage support Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 25 of 41] _GFP_NO_KSWAPD Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 26 of 41] don't alloc harder for gfp nomemalloc even if nowait Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 27 of 41] transparent hugepage core Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 28 of 41] verify pmd_trans_huge isn't leaking Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 29 of 41] madvise(MADV_HUGEPAGE) Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 30 of 41] pmd_trans_huge migrate bugcheck Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 31 of 41] memcg compound Andrea Arcangeli
2010-04-02 0:41 ` [PATCH 32 of 41] memcg huge memory Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 33 of 41] transparent hugepage vmstat Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 34 of 41] khugepaged Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 35 of 41] skip transhuge pages in ksm for now Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 36 of 41] remove PG_buddy Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 37 of 41] add x86 32bit support Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 38 of 41] mincore transparent hugepage support Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 39 of 41] add pmd_modify Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 40 of 41] mprotect: pass vma down to page table walkers Andrea Arcangeli
2010-04-02 0:42 ` [PATCH 41 of 41] mprotect: transparent huge page support Andrea Arcangeli
2010-04-05 19:09 ` [PATCH 00 of 41] Transparent Hugepage Support #17 Andrew Morton
2010-04-05 19:36 ` Ingo Molnar
2010-04-05 20:26 ` Pekka Enberg
2010-04-05 20:32 ` Linus Torvalds
2010-04-05 20:46 ` Pekka Enberg
2010-04-05 20:58 ` Linus Torvalds
2010-04-05 21:54 ` Ingo Molnar
2010-04-05 23:21 ` Andrea Arcangeli
2010-04-06 0:26 ` Linus Torvalds
2010-04-06 1:08 ` [RFD] " Linus Torvalds
2010-04-06 1:26 ` Andrea Arcangeli
2010-04-06 1:35 ` Linus Torvalds
2010-04-06 1:13 ` Andrea Arcangeli
2010-04-06 1:38 ` Linus Torvalds
2010-04-06 2:23 ` Linus Torvalds
2010-04-06 5:25 ` Nick Piggin
2010-04-06 9:08 ` Ingo Molnar
2010-04-06 9:13 ` Ingo Molnar
2010-04-10 18:47 ` Andrea Arcangeli
2010-04-10 19:02 ` Ingo Molnar
2010-04-10 19:22 ` Avi Kivity
2010-04-10 19:47 ` Ingo Molnar
2010-04-10 20:00 ` Andrea Arcangeli
2010-04-10 20:10 ` Andrea Arcangeli
2010-04-10 20:21 ` Jason Garrett-Glaser
2010-04-10 20:24 ` Avi Kivity
2010-04-10 20:42 ` Avi Kivity
2010-04-10 20:47 ` Andrea Arcangeli
2010-04-10 21:00 ` Avi Kivity
2010-04-10 21:47 ` Andrea Arcangeli
2010-04-11 1:05 ` Andrea Arcangeli
2010-04-11 11:24 ` Ingo Molnar
2010-04-11 11:33 ` Avi Kivity
2010-04-11 12:11 ` Ingo Molnar
2010-04-25 19:27 ` Andrea Arcangeli
2010-04-26 18:01 ` Andrea Arcangeli
2010-04-30 9:55 ` Ingo Molnar
2010-04-30 15:19 ` Andrea Arcangeli
2010-05-02 12:17 ` Ingo Molnar
2010-04-10 20:49 ` Jason Garrett-Glaser
2010-04-10 20:53 ` Avi Kivity
2010-04-10 20:58 ` Jason Garrett-Glaser
2010-04-11 9:29 ` Avi Kivity
2010-04-11 9:37 ` Jason Garrett-Glaser
2010-04-11 9:40 ` Avi Kivity
2010-04-11 10:22 ` Jason Garrett-Glaser
2010-04-11 11:00 ` Ingo Molnar
2010-04-11 11:19 ` Avi Kivity
2010-04-11 11:30 ` Jason Garrett-Glaser
2010-04-11 11:52 ` hugepages will matter more in the future Ingo Molnar
2010-04-11 12:01 ` Avi Kivity
2010-04-11 12:35 ` Ingo Molnar
2010-04-11 15:22 ` Linus Torvalds
2010-04-11 15:43 ` Avi Kivity
2010-04-11 15:52 ` Linus Torvalds
2010-04-11 16:04 ` Avi Kivity
2010-04-12 7:45 ` Ingo Molnar
2010-04-12 8:14 ` Nick Piggin
2010-04-12 8:22 ` Ingo Molnar
2010-04-12 8:34 ` Nick Piggin
2010-04-12 8:47 ` Avi Kivity
2010-04-12 8:45 ` Andrea Arcangeli
2010-04-11 19:35 ` Andrea Arcangeli
2010-04-12 16:20 ` Rik van Riel
2010-04-12 16:40 ` Linus Torvalds
2010-04-12 16:56 ` Linus Torvalds
2010-04-12 17:06 ` Randy Dunlap
2010-04-12 17:36 ` Andrea Arcangeli
2010-04-12 17:46 ` Rik van Riel
2010-04-11 19:40 ` Andrea Arcangeli
2010-04-12 15:41 ` Linus Torvalds
2010-04-12 11:22 ` Arjan van de Ven
2010-04-12 11:29 ` Avi Kivity
2010-04-17 15:12 ` Arjan van de Ven
2010-04-17 18:18 ` Avi Kivity
2010-04-17 19:05 ` Arjan van de Ven
2010-04-17 19:05 ` Avi Kivity
2010-04-17 19:18 ` Arjan van de Ven
2010-04-17 19:20 ` Avi Kivity
2010-04-12 13:30 ` Andrea Arcangeli
2010-04-12 13:33 ` Avi Kivity
2010-04-12 13:39 ` Andrea Arcangeli
2010-04-12 13:53 ` Avi Kivity
2010-04-13 11:38 ` Ingo Molnar
2010-04-13 13:17 ` Andrea Arcangeli
2010-04-11 10:46 ` [PATCH 00 of 41] Transparent Hugepage Support #17 Ingo Molnar
2010-04-11 10:49 ` Ingo Molnar
2010-04-11 11:30 ` Avi Kivity
2010-04-11 12:08 ` Ingo Molnar
2010-04-11 12:24 ` Avi Kivity
2010-04-11 12:46 ` Ingo Molnar
2010-04-12 6:09 ` Nick Piggin
2010-04-12 6:18 ` Pekka Enberg
2010-04-12 6:48 ` Nick Piggin
2010-04-12 14:29 ` Christoph Lameter
2010-04-12 16:06 ` Nick Piggin
2010-04-12 6:36 ` Avi Kivity
2010-04-12 6:55 ` Ingo Molnar
2010-04-12 7:15 ` Nick Piggin
2010-04-12 7:45 ` Avi Kivity
2010-04-12 8:28 ` Nick Piggin
2010-04-12 9:01 ` Andrea Arcangeli
2010-04-12 9:03 ` Avi Kivity
2010-04-12 9:26 ` Nick Piggin
2010-04-12 9:39 ` Andrea Arcangeli
2010-04-12 10:02 ` Avi Kivity
2010-04-12 10:08 ` Andrea Arcangeli
2010-04-12 10:10 ` Avi Kivity
2010-04-12 10:23 ` Andrea Arcangeli
2010-04-12 10:37 ` Nick Piggin
2010-04-12 10:59 ` Avi Kivity
2010-04-12 12:23 ` Avi Kivity
2010-04-12 13:25 ` Andrea Arcangeli
2010-04-13 0:38 ` Andrew Morton
2010-04-13 6:18 ` Neil Brown
2010-04-13 13:31 ` Andrea Arcangeli
2010-04-13 13:40 ` Mel Gorman
2010-04-13 13:44 ` Andrea Arcangeli
2010-04-13 13:55 ` Mel Gorman
2010-04-13 14:03 ` Andrea Arcangeli
2010-04-12 7:51 ` Ingo Molnar
2010-04-12 7:18 ` Andrea Arcangeli
2010-04-12 6:49 ` Ingo Molnar
2010-04-12 7:35 ` Andrea Arcangeli
2010-04-12 7:08 ` Andrea Arcangeli
2010-04-12 7:21 ` Nick Piggin
2010-04-12 7:50 ` Avi Kivity
2010-04-12 8:07 ` Ingo Molnar [this message]
2010-04-12 8:21 ` Andrea Arcangeli
2010-04-12 10:27 ` Mel Gorman
2010-04-12 8:18 ` Andrea Arcangeli
2010-04-12 8:06 ` Andrea Arcangeli
2010-04-12 10:44 ` Mel Gorman
2010-04-12 11:12 ` Avi Kivity
2010-04-12 13:17 ` Andrea Arcangeli
2010-04-12 14:24 ` Christoph Lameter
2010-04-12 14:49 ` Avi Kivity
2010-04-06 9:55 ` Avi Kivity
2010-04-06 9:57 ` Avi Kivity
2010-04-06 11:55 ` Avi Kivity
2010-04-06 13:10 ` Nick Piggin
2010-04-06 13:22 ` Avi Kivity
2010-04-06 13:45 ` Nick Piggin
2010-04-06 13:57 ` Avi Kivity
2010-04-06 16:50 ` Andrea Arcangeli
2010-04-06 17:31 ` Avi Kivity
2010-04-06 18:00 ` Christoph Lameter
2010-04-06 18:04 ` Avi Kivity
2010-04-06 18:47 ` Avi Kivity
2010-04-06 14:44 ` Rik van Riel
2010-04-06 16:43 ` Andrea Arcangeli
2010-04-06 9:30 ` Mel Gorman
2010-04-06 10:32 ` Theodore Tso
2010-04-06 11:16 ` Mel Gorman
2010-04-06 13:13 ` Theodore Tso
2010-04-06 14:55 ` Mel Gorman
2010-04-06 16:46 ` Andrea Arcangeli
2010-04-05 21:01 ` Chris Mason
2010-04-05 21:18 ` Avi Kivity
2010-04-05 21:33 ` Linus Torvalds
2010-04-05 22:33 ` Chris Mason
2010-04-06 8:30 ` Mel Gorman
2010-04-06 11:35 ` Chris Mason
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100412080748.GC18485@elte.hu \
--to=mingo@elte.hu \
--cc=aarcange@redhat.com \
--cc=agl@us.ibm.com \
--cc=akpm@linux-foundation.org \
--cc=arnd@arndb.de \
--cc=avi@redhat.com \
--cc=balbir@linux.vnet.ibm.com \
--cc=benh@kernel.crashing.org \
--cc=bpicco@redhat.com \
--cc=chrisw@sous-sol.org \
--cc=cl@linux-foundation.org \
--cc=darkshikari@gmail.com \
--cc=dave@linux.vnet.ibm.com \
--cc=efault@gmx.de \
--cc=hannes@cmpxchg.org \
--cc=hugh.dickins@tiscali.co.uk \
--cc=ieidus@redhat.com \
--cc=kamezawa.hiroyu@jp.fujitsu.com \
--cc=kosaki.motohiro@jp.fujitsu.com \
--cc=linux-mm@kvack.org \
--cc=mel@csn.ul.ie \
--cc=mst@redhat.com \
--cc=mtosatti@redhat.com \
--cc=nishimura@mxp.nes.nec.co.jp \
--cc=npiggin@suse.de \
--cc=penberg@cs.helsinki.fi \
--cc=peterz@infradead.org \
--cc=riel@redhat.com \
--cc=torvalds@linux-foundation.org \
--cc=travis@sgi.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.