From: Matthew Rushton <mvrushton@gmail.com>
To: Matt Wilson <msw@linux.com>, Tim Deegan <tim@xen.org>
Cc: Keir Fraser <keir@xen.org>, Jan Beulich <jbeulich@suse.com>,
Andrew Cooper <andrew.cooper3@citrix.com>,
Matt Wilson <msw@amazon.com>,
xen-devel@lists.xenproject.org
Subject: Re: [RFC PATCH] page_alloc: use first half of higher order chunks when halving
Date: Tue, 25 Mar 2014 13:09:13 -0700 [thread overview]
Message-ID: <5331E269.9090708@gmail.com> (raw)
In-Reply-To: <20140325132740.GB11708@u109add4315675089e695.ant.amazon.com>
On 03/25/14 06:27, Matt Wilson wrote:
> On Tue, Mar 25, 2014 at 01:19:22PM +0100, Tim Deegan wrote:
>> At 13:22 +0200 on 25 Mar (1395750124), Matt Wilson wrote:
>>> From: Matt Rushton <mrushton@amazon.com>
>>>
>>> This patch makes the Xen heap allocator use the first half of higher
>>> order chunks instead of the second half when breaking them down for
>>> smaller order allocations.
>>>
>>> Linux currently remaps the memory overlapping PCI space one page at a
>>> time. Before this change this resulted in the mfns being allocated in
>>> reverse order and led to discontiguous dom0 memory. This forced dom0
>>> to use bounce buffers for doing DMA and resulted in poor performance.
>> This seems like something better fixed on the dom0 side, by asking
>> explicitly for contiguous memory in cases where it makes a difference.
>> On the Xen side, this change seems harmless, but we might like to keep
>> the explicitly reversed allocation on debug builds, to flush out
>> guests that rely on their memory being contiguous.
> Yes, I think that retaining the reverse allocation on debug builds is
> fine. I'd like Konrad's take on if it's better or possible to fix this
> on the Linux side.
I considered fixing it in Linux but this was a more straight forward
change with no downside as far as I can tell. I see no reason in not
fixing it in both places but this at least behaves more reasonably for
one potential use case. I'm also interested in other opinions.
>>> This change more gracefully handles the dom0 use case and returns
>>> contiguous memory for subsequent allocations.
>>>
>>> Cc: xen-devel@lists.xenproject.org
>>> Cc: Keir Fraser <keir@xen.org>
>>> Cc: Jan Beulich <jbeulich@suse.com>
>>> Cc: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
>>> Cc: Tim Deegan <tim@xen.org>
>>> Cc: Andrew Cooper <andrew.cooper3@citrix.com>
>>> Signed-off-by: Matt Rushton <mrushton@amazon.com>
>>> Signed-off-by: Matt Wilson <msw@amazon.com>
>>> ---
>>> xen/common/page_alloc.c | 5 +++--
>>> 1 file changed, 3 insertions(+), 2 deletions(-)
>>>
>>> diff --git a/xen/common/page_alloc.c b/xen/common/page_alloc.c
>>> index 601319c..27e7f18 100644
>>> --- a/xen/common/page_alloc.c
>>> +++ b/xen/common/page_alloc.c
>>> @@ -677,9 +677,10 @@ static struct page_info *alloc_heap_pages(
>>> /* We may have to halve the chunk a number of times. */
>>> while ( j != order )
>>> {
>>> - PFN_ORDER(pg) = --j;
>>> + struct page_info *pg2;
>>> + pg2 = pg + (1 << --j);
>>> + PFN_ORDER(pg) = j;
>>> page_list_add_tail(pg, &heap(node, zone, j));
>>> - pg += 1 << j;
>> AFAICT this uses the low half (pg) for the allocation _and_ puts it on
>> the freelist, and just leaks the high half (pg2). Am I missing something?
> Argh, oops. this is totally my fault (not Matt R.'s). I ported the
> patch out of our development tree incorrectly. The code should have
> read:
>
> while ( j != order )
> {
> struct page_info *pg2;
>
> pg2 = pg + (1 << --j);
> PFN_ORDER(pg2) = j;
> page_list_add_tail(pg2, &heap(node, zone, j));
> }
>
> Apologies to Matt for my mangling of his patch (which also already had
> the correct blank line per Andy's comment).
>
> --msw
No worries I was about to correct you:)
next prev parent reply other threads:[~2014-03-25 20:09 UTC|newest]
Thread overview: 55+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-03-25 11:22 [RFC PATCH] page_alloc: use first half of higher order chunks when halving Matt Wilson
2014-03-25 11:44 ` Andrew Cooper
2014-03-25 13:20 ` Matt Wilson
2014-03-25 20:18 ` Matthew Rushton
2014-03-25 12:19 ` Tim Deegan
2014-03-25 13:27 ` Matt Wilson
2014-03-25 20:09 ` Matthew Rushton [this message]
2014-03-26 9:55 ` Tim Deegan
2014-03-26 10:17 ` Matt Wilson
2014-03-26 10:44 ` David Vrabel
2014-03-26 10:48 ` Matt Wilson
2014-03-26 11:13 ` Ian Campbell
2014-03-26 11:41 ` Matt Wilson
2014-03-26 11:45 ` Andrew Cooper
2014-03-26 11:50 ` Matt Wilson
2014-03-26 12:43 ` David Vrabel
2014-03-26 12:48 ` Matt Wilson
2014-03-26 15:08 ` Konrad Rzeszutek Wilk
2014-03-26 15:15 ` Matt Wilson
2014-03-26 15:59 ` Matthew Rushton
2014-03-26 16:36 ` Konrad Rzeszutek Wilk
2014-03-26 17:47 ` Matthew Rushton
2014-03-26 17:56 ` Konrad Rzeszutek Wilk
2014-03-26 22:15 ` Matthew Rushton
2014-03-28 17:02 ` Konrad Rzeszutek Wilk
2014-03-28 22:06 ` Matthew Rushton
2014-03-31 14:15 ` Konrad Rzeszutek Wilk
2014-04-01 3:25 ` Matthew Rushton
2014-04-01 10:48 ` Konrad Rzeszutek Wilk
2014-04-01 12:22 ` Tim Deegan
2014-04-02 0:17 ` Matthew Rushton
2014-04-02 7:52 ` Jan Beulich
2014-04-02 10:06 ` Ian Campbell
2014-04-02 10:15 ` Jan Beulich
2014-04-02 10:20 ` Ian Campbell
2014-04-09 22:21 ` Matthew Rushton
2014-04-10 6:14 ` Jan Beulich
2014-04-11 20:20 ` Matthew Rushton
2014-04-11 17:05 ` Konrad Rzeszutek Wilk
2014-04-11 20:28 ` Matthew Rushton
2014-04-12 1:34 ` Konrad Rzeszutek Wilk
2014-04-13 21:32 ` Tim Deegan
2014-04-14 8:51 ` Jan Beulich
2014-04-14 14:40 ` Konrad Rzeszutek Wilk
2014-04-14 15:34 ` Jan Beulich
2014-04-16 14:15 ` Konrad Rzeszutek Wilk
2014-04-17 1:34 ` Matthew Rushton
2014-05-07 23:16 ` Matthew Rushton
2014-05-08 18:05 ` Konrad Rzeszutek Wilk
2014-05-14 15:06 ` Konrad Rzeszutek Wilk
2014-05-20 19:26 ` Matthew Rushton
2014-05-23 19:00 ` Konrad Rzeszutek Wilk
2014-06-04 22:25 ` Matthew Rushton
2014-06-05 9:32 ` David Vrabel
2014-03-26 16:34 ` Konrad Rzeszutek Wilk
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5331E269.9090708@gmail.com \
--to=mvrushton@gmail.com \
--cc=andrew.cooper3@citrix.com \
--cc=jbeulich@suse.com \
--cc=keir@xen.org \
--cc=msw@amazon.com \
--cc=msw@linux.com \
--cc=tim@xen.org \
--cc=xen-devel@lists.xenproject.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.