From: Mukesh Rathor <mukesh.rathor@oracle.com>
To: Tim Deegan <tim@xen.org>
Cc: xen-devel <xen-devel@lists.xenproject.org>,
Jan Beulich <JBeulich@suse.com>
Subject: Re: [RFC PATCH] PVH: cleanup of p2m upon p2m destroy
Date: Wed, 18 Dec 2013 18:01:55 -0800
Message-ID: <20131218180155.1dc09717@mantra.us.oracle.com>
In-Reply-To: <20131218165152.GO24792@deinos.phlegethon.org>
On Wed, 18 Dec 2013 17:51:52 +0100
Tim Deegan <tim@xen.org> wrote:
> At 11:09 +0100 on 18 Dec (1387361398), Tim Deegan wrote:
> > > An alternative might be to just create a link list then and walk
> > > it. In general, foreign mappings should be very small, so the
> > > overhead of 16 bytes per page for the link list might not be too
> > > bad. I will code it if there is no disagreement from any
> > > maintainer... everyone has different ideas :)...
> >
> > I think it would be best to walk the p2m trie (i.e. bounded by
> > amount of RAM, rather than max GFN) and do it preemptably. I'll
> > look into something like that for the mem_sharing loop today, and
> > foreign mapping code can reuse it.
>
> What I've ended up with is making p2m_change_entry_type_global()
> preemptible (which is a bigger task but will be needed as domains get
> bigger). Do you think that using that function to switch all mappings
> from p2m_foreign to p2m_invalid, appropriately late in the teardown,
> will be good enough for what you need?
No, not quite: I also need to know which mfns are foreign so that I can
do put_page() on them.

By reworking the EPT change-entry-type code a bit, I came up with the
following:
diff --git a/xen/arch/x86/mm/p2m-ept.c b/xen/arch/x86/mm/p2m-ept.c
index 0ba2365..c996aac 100644
--- a/xen/arch/x86/mm/p2m-ept.c
+++ b/xen/arch/x86/mm/p2m-ept.c
@@ -838,6 +838,80 @@ void setup_ept_dump(void)
     register_keyhandler('D', &ept_p2m_table);
 }
 
+typedef int (ept_walk_entry_callback_f)(ept_entry_t *, unsigned long);
+
+static int ept_change_entry_type(ept_entry_t *entry, unsigned long data)
+{
+    p2m_type_t new = (p2m_type_t)data;
+
+    entry->sa_p2mt = new;
+    ept_p2m_type_to_flags(entry, new, entry->access);
+    return 1;
+}
+
+static int ept_put_foreign_mfn(ept_entry_t *entry, unsigned long data)
+{
+    put_page(mfn_to_page(entry->mfn));
+    return 0;
+}
+
+static void ept_walk_entry_callback_recurse(
+    mfn_t ept_page_mfn, int ept_page_level, p2m_type_t p2mt,
+    ept_walk_entry_callback_f *fp, unsigned long data)
+{
+    ept_entry_t e, *epte = map_domain_page(mfn_x(ept_page_mfn));
+
+    for ( int i = 0; i < EPT_PAGETABLE_ENTRIES; i++ )
+    {
+        if ( !is_epte_valid(epte + i) )
+            continue;
+
+        if ( (ept_page_level > 0) && !is_epte_superpage(epte + i) )
+            ept_walk_entry_callback_recurse(_mfn(epte[i].mfn),
+                                            ept_page_level - 1, p2mt, fp, data);
+        else
+        {
+            e = atomic_read_ept_entry(&epte[i]);
+
+            if ( e.sa_p2mt == p2mt && fp(&e, data) )
+                atomic_write_ept_entry(&epte[i], &e);
+        }
+    }
+
+    unmap_domain_page(epte);
+}
+
+static void ept_walk_entry_callback(struct p2m_domain *p2m, p2m_type_t p2mt,
+                                    ept_walk_entry_callback_f *fp,
+                                    unsigned long data, int do_flush)
+{
+    struct ept_data *ept = &p2m->ept;
+
+    if ( ept_get_asr(ept) == 0 )
+        return;
+
+    ept_walk_entry_callback_recurse(_mfn(ept_get_asr(ept)), ept_get_wl(ept),
+                                    p2mt, fp, data);
+    if ( do_flush )
+        ept_sync_domain(p2m);
+}
+
+int ept_release_foreign_pages(struct p2m_domain *p2m)
+{
+    ept_walk_entry_callback(p2m, p2m_map_foreign, ept_put_foreign_mfn, 0, 0);
+    return 0;
+}
+
+int ept_change_entry_type_global_mine(struct p2m_domain *p2m,
+                                      p2m_type_t ot, p2m_type_t nt)
+{
+    BUG_ON(p2m_is_grant(ot) || p2m_is_grant(nt));
+    BUG_ON(ot != nt && (ot == p2m_mmio_direct || nt == p2m_mmio_direct));
+
+    ept_walk_entry_callback(p2m, ot, ept_change_entry_type, nt, 1);
+    return 0;
+}
+
 /*
  * Local variables:
  * mode: C

I can further make this preemptible by returning some "handle" that the
caller must stash somewhere, say in p2m_domain, so that the EPT level
doesn't need to be concerned with synchronization.

What do you think?
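
To make the "handle" idea concrete, here is a rough standalone sketch in
plain C. It is not Xen code: the toy_* names, the flat table, and the
per-call budget are all made up for illustration, and a real version
would walk the EPT tree rather than an array. The only point is that the
resume position lives in the per-domain structure, so the walker itself
keeps no state between calls.

```c
#include <assert.h>
#include <stddef.h>

/* Toy stand-ins for illustration only; none of these names exist in Xen. */
enum toy_type { TOY_INVALID, TOY_RAM, TOY_FOREIGN };

struct toy_entry {
    enum toy_type type;
    int refs;               /* stands in for the page reference count */
};

#define TOY_ENTRIES 8

struct toy_p2m {
    struct toy_entry table[TOY_ENTRIES];
    size_t resume_idx;      /* the "handle": where the next call resumes */
};

/*
 * Drop one reference on each foreign entry, visiting at most `budget`
 * entries per call.  Returns 1 if work remains (the caller is expected
 * to call again), 0 once the whole table has been walked.
 */
static int toy_release_foreign(struct toy_p2m *p2m, size_t budget)
{
    assert(budget > 0);

    while ( p2m->resume_idx < TOY_ENTRIES )
    {
        struct toy_entry *e = &p2m->table[p2m->resume_idx];

        if ( budget == 0 )
            return 1;       /* preempt; resume_idx records the progress */

        if ( e->type == TOY_FOREIGN )
        {
            e->refs--;
            /* Mark the entry so a repeated walk cannot drop it twice. */
            e->type = TOY_INVALID;
        }

        p2m->resume_idx++;
        budget--;
    }

    p2m->resume_idx = 0;    /* reset so a later walk starts from the top */
    return 0;
}
```

The caller would loop (or set up a continuation) until the function
returns 0. Unlike the patch above, this toy version also invalidates
each entry after the put, which is an assumption on my part about how a
restarted walk would be kept safe.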
thanks
mukesh