From: Mike Snitzer <snitzer@redhat.com>
To: David Rientjes <rientjes@google.com>
Cc: Mikulas Patocka <mpatocka@redhat.com>,
Edward Thornber <thornber@redhat.com>,
linux-kernel@vger.kernel.org, linux-mm@kvack.org,
dm-devel@redhat.com, Vivek Goyal <vgoyal@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
Linus Torvalds <torvalds@linux-foundation.org>,
"Alasdair G. Kergon" <agk@redhat.com>
Subject: Re: [PATCH 2/7] mm: introduce kvmalloc and kvmalloc_node
Date: Tue, 14 Jul 2015 17:19:18 -0400 [thread overview]
Message-ID: <20150714211918.GC7915@redhat.com> (raw)
In-Reply-To: <alpine.DEB.2.10.1507141401170.16182@chino.kir.corp.google.com>
On Tue, Jul 14 2015 at 5:13pm -0400,
David Rientjes <rientjes@google.com> wrote:
> On Thu, 9 Jul 2015, Mikulas Patocka wrote:
>
> > > > Index: linux-4.2-rc1/mm/util.c
> > > > ===================================================================
> > > > --- linux-4.2-rc1.orig/mm/util.c 2015-07-07 15:58:11.000000000 +0200
> > > > +++ linux-4.2-rc1/mm/util.c 2015-07-08 19:22:26.000000000 +0200
> > > > @@ -316,6 +316,61 @@ unsigned long vm_mmap(struct file *file,
> > > > }
> > > > EXPORT_SYMBOL(vm_mmap);
> > > >
> > > > +void *kvmalloc_node(size_t size, gfp_t gfp, int node)
> > > > +{
> > > > + void *p;
> > > > + unsigned uninitialized_var(noio_flag);
> > > > +
> > > > + /* vmalloc doesn't support no-wait allocations */
> > > > + WARN_ON_ONCE(!(gfp & __GFP_WAIT));
> > > > +
> > > > + if (likely(size <= KMALLOC_MAX_SIZE)) {
> > > > + /*
> > > > + * Use __GFP_NORETRY so that we don't loop waiting for the
> > > > + * allocation - we don't have to loop here, if the memory
> > > > + * is too fragmented, we fallback to vmalloc.
> > >
> > > I'm not sure about this decision. The direct reclaim retry code is the
> > > normal default behaviour and becomes more important with larger allocation
> > > attempts. So why turn it off, and make it more likely that we return
> > > vmalloc memory?
> >
> > It can avoid triggering the OOM killer in case of fragmented memory.
> >
> > This is general question - if the code can handle allocation failure
> > gracefully, what gfp flags should it use? Maybe add some flag
> > __GFP_MAYFAIL instead of __GFP_NORETRY that changes the behavior in
> > desired way?
> >
>
> There's a misunderstanding in regards to the comment: __GFP_NORETRY
> doesn't turn direct reclaim or compaction off, it is still attempted and
> with the same priority as any other allocation. This only stops the page
> allocator from calling the oom killer, which will free memory or panic the
> system, and looping when memory is available.
>
> In regards to the proposal in general, I think it's unnecessary because we
> are still left behind with other users who open code their call to
> vmalloc. I was interested in commit 058504edd026 ("fs/seq_file: fallback
> to vmalloc allocation") since it solved an issue with high memory
> fragmentation. Note how it falls back to vmalloc(): _without_ this
> __GFP_NORETRY. That's because we only want to fallback when high-order
> allocations fail and the page allocator doesn't implicitly loop due to the
> order. ext4_kvmalloc(), ext4_kzmalloc() does the same.
>
> The differences in implementations between those that do kmalloc() and
> fallback to vmalloc() are different enough that I don't think we need this
> addition.
Wouldn't mm benefit from acknowledging the pattern people are
open-coding and switching existing code over to official methods for
accomplishing the same?
It is always easier to shoehorn utility functions locally within a
subsystem (be it ext4, dm, etc) but once enough do something in a
similar but different way it really should get elevated.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
WARNING: multiple messages have this Message-ID (diff)
From: Mike Snitzer <snitzer@redhat.com>
To: David Rientjes <rientjes@google.com>
Cc: Mikulas Patocka <mpatocka@redhat.com>,
Edward Thornber <thornber@redhat.com>,
linux-kernel@vger.kernel.org, linux-mm@kvack.org,
dm-devel@redhat.com, Vivek Goyal <vgoyal@redhat.com>,
Andrew Morton <akpm@linux-foundation.org>,
Linus Torvalds <torvalds@linux-foundation.org>,
"Alasdair G. Kergon" <agk@redhat.com>
Subject: Re: [PATCH 2/7] mm: introduce kvmalloc and kvmalloc_node
Date: Tue, 14 Jul 2015 17:19:18 -0400 [thread overview]
Message-ID: <20150714211918.GC7915@redhat.com> (raw)
In-Reply-To: <alpine.DEB.2.10.1507141401170.16182@chino.kir.corp.google.com>
On Tue, Jul 14 2015 at 5:13pm -0400,
David Rientjes <rientjes@google.com> wrote:
> On Thu, 9 Jul 2015, Mikulas Patocka wrote:
>
> > > > Index: linux-4.2-rc1/mm/util.c
> > > > ===================================================================
> > > > --- linux-4.2-rc1.orig/mm/util.c 2015-07-07 15:58:11.000000000 +0200
> > > > +++ linux-4.2-rc1/mm/util.c 2015-07-08 19:22:26.000000000 +0200
> > > > @@ -316,6 +316,61 @@ unsigned long vm_mmap(struct file *file,
> > > > }
> > > > EXPORT_SYMBOL(vm_mmap);
> > > >
> > > > +void *kvmalloc_node(size_t size, gfp_t gfp, int node)
> > > > +{
> > > > + void *p;
> > > > + unsigned uninitialized_var(noio_flag);
> > > > +
> > > > + /* vmalloc doesn't support no-wait allocations */
> > > > + WARN_ON_ONCE(!(gfp & __GFP_WAIT));
> > > > +
> > > > + if (likely(size <= KMALLOC_MAX_SIZE)) {
> > > > + /*
> > > > + * Use __GFP_NORETRY so that we don't loop waiting for the
> > > > + * allocation - we don't have to loop here, if the memory
> > > > + * is too fragmented, we fallback to vmalloc.
> > >
> > > I'm not sure about this decision. The direct reclaim retry code is the
> > > normal default behaviour and becomes more important with larger allocation
> > > attempts. So why turn it off, and make it more likely that we return
> > > vmalloc memory?
> >
> > It can avoid triggering the OOM killer in case of fragmented memory.
> >
> > This is general question - if the code can handle allocation failure
> > gracefully, what gfp flags should it use? Maybe add some flag
> > __GFP_MAYFAIL instead of __GFP_NORETRY that changes the behavior in
> > desired way?
> >
>
> There's a misunderstanding in regards to the comment: __GFP_NORETRY
> doesn't turn direct reclaim or compaction off, it is still attempted and
> with the same priority as any other allocation. This only stops the page
> allocator from calling the oom killer, which will free memory or panic the
> system, and looping when memory is available.
>
> In regards to the proposal in general, I think it's unnecessary because we
> are still left behind with other users who open code their call to
> vmalloc. I was interested in commit 058504edd026 ("fs/seq_file: fallback
> to vmalloc allocation") since it solved an issue with high memory
> fragmentation. Note how it falls back to vmalloc(): _without_ this
> __GFP_NORETRY. That's because we only want to fallback when high-order
> allocations fail and the page allocator doesn't implicitly loop due to the
> order. ext4_kvmalloc(), ext4_kzmalloc() does the same.
>
> The differences in implementations between those that do kmalloc() and
> fallback to vmalloc() are different enough that I don't think we need this
> addition.
Wouldn't mm benefit from acknowledging the pattern people are
open-coding and switching existing code over to official methods for
accomplishing the same?
It is always easier to shoehorn utility functions locally within a
subsystem (be it ext4, dm, etc) but once enough do something in a
similar but different way it really should get elevated.
next prev parent reply other threads:[~2015-07-14 21:19 UTC|newest]
Thread overview: 40+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-07-07 15:08 [PATCH 0/7] mm: reliable memory allocation with kvmalloc Mikulas Patocka
2015-07-07 15:08 ` Mikulas Patocka
2015-07-07 15:09 ` [PATCH 1/7] mm/vmalloc: export __vmalloc_node_flags Mikulas Patocka
2015-07-07 15:09 ` Mikulas Patocka
2015-07-07 15:10 ` [PATCH 2/7] mm: introduce kvmalloc and kvmalloc_node Mikulas Patocka
2015-07-07 15:10 ` Mikulas Patocka
2015-07-07 21:41 ` Andrew Morton
2015-07-07 21:41 ` Andrew Morton
2015-07-08 7:34 ` [dm-devel] " Zdenek Kabelac
2015-07-08 7:34 ` Zdenek Kabelac
2015-07-08 23:03 ` Mikulas Patocka
2015-07-08 23:03 ` Mikulas Patocka
2015-07-08 23:18 ` Andrew Morton
2015-07-08 23:18 ` Andrew Morton
2015-07-09 14:45 ` Mikulas Patocka
2015-07-09 14:45 ` Mikulas Patocka
2015-07-14 21:13 ` David Rientjes
2015-07-14 21:13 ` David Rientjes
2015-07-14 21:19 ` Mike Snitzer [this message]
2015-07-14 21:19 ` Mike Snitzer
2015-07-14 21:24 ` David Rientjes
2015-07-14 21:24 ` David Rientjes
2015-07-14 21:54 ` Dave Chinner
2015-07-14 21:54 ` Dave Chinner
2015-07-14 22:45 ` David Rientjes
2015-07-14 22:45 ` David Rientjes
2015-07-15 0:25 ` Dave Chinner
2015-07-15 0:25 ` Dave Chinner
2015-07-14 21:24 ` Andrew Morton
2015-07-14 21:24 ` Andrew Morton
2015-07-07 15:10 ` [PATCH 3/7] dm-ioctl: join flags DM_PARAMS_KMALLOC and DM_PARAMS_VMALLOC Mikulas Patocka
2015-07-07 15:10 ` Mikulas Patocka
2015-07-07 15:11 ` [PATCH 4/7] dm: use kvmalloc Mikulas Patocka
2015-07-07 15:11 ` Mikulas Patocka
2015-07-07 15:11 ` [PATCH 5/7] dm-thin: " Mikulas Patocka
2015-07-07 15:11 ` Mikulas Patocka
2015-07-07 15:12 ` [PATCH 6/7] dm-stats: use kvmalloc_node Mikulas Patocka
2015-07-07 15:12 ` Mikulas Patocka
2015-07-07 15:13 ` [PATCH 7/7] dm: make dm_vcalloc use kvmalloc Mikulas Patocka
2015-07-07 15:13 ` Mikulas Patocka
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150714211918.GC7915@redhat.com \
--to=snitzer@redhat.com \
--cc=agk@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=dm-devel@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mpatocka@redhat.com \
--cc=rientjes@google.com \
--cc=thornber@redhat.com \
--cc=torvalds@linux-foundation.org \
--cc=vgoyal@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.