* mapping problems in xenpaging
@ 2011-09-29 14:55 zhen shi
2011-09-29 17:02 ` Olaf Hering
0 siblings, 1 reply; 8+ messages in thread
From: zhen shi @ 2011-09-29 14:55 UTC (permalink / raw)
To: Olaf Hering; +Cc: xen-devel
Hi Olaf,
While analyzing and testing xenpaging, we found some problems in the
interaction between foreign mappings and xenpaging.
1) If a page is paged out first and mapped afterwards, the existing code
paths handle it. That case is OK.
2) The problems arise when a page is mapped first and xenpaging pages it
out afterwards. Our questions are as follows:
a) If the domU's memory is mapped directly into dom0, for example by a
hypercall from a PV driver, a corresponding page-table entry is built in
dom0, and the p2m type does not change. If xenpaging then pages out domU
pages whose gfns are already mapped into dom0, dom0 runs into problems
when it accesses them: the pages are paged out, and dom0 cannot check the
p2mt before each access.
b) The other situation: Xen maps a domU page and gets its mfn via
pfn_to_mfn, but the page's p2mt is then changed by someone else. When Xen
accesses the page afterwards, this causes problems such as a BSOD or a
reboot, because getting the mfn and accessing the page are not atomic.
This situation exists in many code paths.
Given the points above, do you have any suggestions on how to avoid these
situations? We have thought about both problems but have not found a good
way to resolve them.
I am looking forward to hearing from you. Thank you very much! :)
* Re: mapping problems in xenpaging
2011-09-29 14:55 mapping problems in xenpaging zhen shi
@ 2011-09-29 17:02 ` Olaf Hering
2011-09-30 21:02 ` Adin Scannell
2011-10-01 3:52 ` zhen shi
0 siblings, 2 replies; 8+ messages in thread
From: Olaf Hering @ 2011-09-29 17:02 UTC (permalink / raw)
To: zhen shi; +Cc: xen-devel
On Thu, Sep 29, zhen shi wrote:
> Hi Olaf,
>
> While analyzing and testing xenpaging, we found some problems in the
> interaction between foreign mappings and xenpaging.
> 1) If a page is paged out first and mapped afterwards, the existing code
> paths handle it. That case is OK.
> 2) The problems arise when a page is mapped first and xenpaging pages it
> out afterwards. Our questions are as follows:
> a) If the domU's memory is mapped directly into dom0, for example by a
> hypercall from a PV driver, a corresponding page-table entry is built in
> dom0, and the p2m type does not change. If xenpaging then pages out domU
> pages whose gfns are already mapped into dom0, dom0 runs into problems
> when it accesses them: the pages are paged out, and dom0 cannot check the
> p2mt before each access.
I'm not entirely sure what you do. xenpaging runs in dom0 and is able to
map paged-out pages. It uses that to trigger a page-in, see
tools/xenpaging/pagein.c in xen-unstable.hg
> b) The other situation: Xen maps a domU page and gets its mfn via
> pfn_to_mfn, but the page's p2mt is then changed by someone else. When Xen
> accesses the page afterwards, this causes problems such as a BSOD or a
> reboot, because getting the mfn and accessing the page are not atomic.
> This situation exists in many code paths.
Can you be more specific about what you mean? Xen doesn't seem to have a
pfn_to_mfn function, only the tools have some helper macros of that name.
Olaf
* Re: Re: mapping problems in xenpaging
2011-09-29 17:02 ` Olaf Hering
@ 2011-09-30 21:02 ` Adin Scannell
2011-09-30 22:19 ` Tim Deegan
2011-10-03 14:56 ` Olaf Hering
2011-10-01 3:52 ` zhen shi
1 sibling, 2 replies; 8+ messages in thread
From: Adin Scannell @ 2011-09-30 21:02 UTC (permalink / raw)
To: Olaf Hering; +Cc: zhen shi, xen-devel
>> While analyzing and testing xenpaging, we found some problems in the
>> interaction between foreign mappings and xenpaging.
>> 1) If a page is paged out first and mapped afterwards, the existing code
>> paths handle it. That case is OK.
>> 2) The problems arise when a page is mapped first and xenpaging pages it
>> out afterwards. Our questions are as follows:
>> a) If the domU's memory is mapped directly into dom0, for example by a
>> hypercall from a PV driver, a corresponding page-table entry is built in
>> dom0, and the p2m type does not change. If xenpaging then pages out domU
>> pages whose gfns are already mapped into dom0, dom0 runs into problems
>> when it accesses them: the pages are paged out, and dom0 cannot check the
>> p2mt before each access.
>
> I'm not entirely sure what you do. xenpaging runs in dom0 and is able to
> map paged-out pages. It uses that to trigger a page-in, see
> tools/xenpaging/pagein.c in xen-unstable.hg
Here's my take...
Xenpaging doesn't allow selection of pages that have been mapped by
other domains (as in p2m.c):
int p2m_mem_paging_nominate(struct domain *d, unsigned long gfn)
....
    /* Check page count and type */
    page = mfn_to_page(mfn);
    if ( (page->count_info & (PGC_count_mask | PGC_allocated)) !=
         (1 | PGC_allocated) )
        goto out;
*However*, I think that the problem Zhen is describing still exists:
1) xenpaging nominates a page, it is successful.
2) dom0 maps the same page (a process other than xenpaging, which will
also map it).
3) xenpaging evicts the page, successfully.
I've experienced a few nasty crashes, and I think this could account
for a couple (but certainly not all)... I think that the solution may
be to repeat the refcount check in paging_evict, and roll back the
nomination gracefully if the race is detected. Thoughts?
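
The ordering in steps 1) to 3) and the proposed re-check can be sketched with a small stand-alone C model. This is only an illustration under stated assumptions: every `toy_` type and function is hypothetical and none of it is Xen code. It is single-threaded, so it shows the ordering of events and the rollback, not any locking.

```c
#include <stdbool.h>

/* Toy model only: these types and names are hypothetical, not Xen's. */
enum toy_p2m_type { TOY_RAM_RW, TOY_PAGING_OUT, TOY_PAGED };

struct toy_page {
    unsigned int refcount;      /* 1 == only the owning guest holds it */
    enum toy_p2m_type type;
};

/* Nominate: only allowed while nobody else holds a reference. */
static bool toy_nominate(struct toy_page *pg)
{
    if (pg->type != TOY_RAM_RW || pg->refcount != 1)
        return false;
    pg->type = TOY_PAGING_OUT;
    return true;
}

/* A foreign mapping (e.g. from another dom0 process) takes a reference. */
static void toy_foreign_map(struct toy_page *pg)
{
    pg->refcount++;
}

/*
 * Evict: repeat the refcount check. If a mapping appeared after the
 * nomination, roll the nomination back instead of evicting.
 */
static bool toy_evict(struct toy_page *pg)
{
    if (pg->type != TOY_PAGING_OUT)
        return false;
    if (pg->refcount != 1) {
        pg->type = TOY_RAM_RW;  /* graceful rollback */
        return false;
    }
    pg->type = TOY_PAGED;
    return true;
}
```

In this model, calling toy_nominate(), then toy_foreign_map(), then toy_evict() leaves the page in TOY_RAM_RW rather than evicting it.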
>> b) The other situation: Xen maps a domU page and gets its mfn via
>> pfn_to_mfn, but the page's p2mt is then changed by someone else. When Xen
>> accesses the page afterwards, this causes problems such as a BSOD or a
>> reboot, because getting the mfn and accessing the page are not atomic.
>> This situation exists in many code paths.
I believe I have recreated this problem a few times, resulting in
various crashes... unfortunately, there is somewhat of an implicit
assumption throughout the code that when you grab an mfn via
gfn_to_mfn, that mfn won't disappear underneath you (for example, see
vmx_load_pdptrs). Really, you want something like gfn_to_mfn_getpage,
where the underlying page has its refcount bumped so that it won't be
nominated/evicted while you map and use the page, then you must put it
back when you're done. I hope to look into helping fix some of these
paging bugs soon.
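
The get/use/put discipline described above might look roughly like the following stand-alone C sketch. It is an assumption-laden toy: `demo_get_page`, `demo_put_page`, `demo_try_page_out` and the `demo_p2m` table are invented for illustration and are not existing Xen functions. There is no locking, so it only shows how a held reference keeps the pager away.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical toy types; Xen's real p2m and page_info are far richer. */
struct demo_page {
    unsigned int refcount;   /* 1 == held only by the owning guest */
    bool paged_out;
};

#define DEMO_NR_GFNS 4
static struct demo_page demo_p2m[DEMO_NR_GFNS] = {
    { 1, false }, { 1, false }, { 1, false }, { 1, false },
};

/* Look up a gfn and take a reference; NULL if the page is unusable. */
static struct demo_page *demo_get_page(unsigned long gfn)
{
    if (gfn >= DEMO_NR_GFNS || demo_p2m[gfn].paged_out)
        return NULL;
    demo_p2m[gfn].refcount++;
    return &demo_p2m[gfn];
}

/* Drop the reference taken by demo_get_page(). */
static void demo_put_page(struct demo_page *pg)
{
    pg->refcount--;
}

/* The pager may only take pages that nobody else references. */
static bool demo_try_page_out(unsigned long gfn)
{
    if (gfn >= DEMO_NR_GFNS || demo_p2m[gfn].paged_out ||
        demo_p2m[gfn].refcount != 1)
        return false;
    demo_p2m[gfn].paged_out = true;
    return true;
}
```

While a caller holds the pointer returned by demo_get_page(), demo_try_page_out() on the same gfn fails; it can succeed once demo_put_page() has been called.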
Cheers,
-Adin
* Re: Re: mapping problems in xenpaging
2011-09-30 21:02 ` Adin Scannell
@ 2011-09-30 22:19 ` Tim Deegan
2011-10-03 14:56 ` Olaf Hering
1 sibling, 0 replies; 8+ messages in thread
From: Tim Deegan @ 2011-09-30 22:19 UTC (permalink / raw)
To: Adin Scannell; +Cc: Olaf Hering, xen-devel, zhen shi
At 17:02 -0400 on 30 Sep (1317402151), Adin Scannell wrote:
> I believe I have recreated this problem a few times, resulting in
> various crashes... unfortunately, there is somewhat of an implicit
> assumption throughout the code that when you grab an mfn via
> gfn_to_mfn, that mfn won't disappear underneath you (for example, see
> vmx_load_pdptrs). Really, you want something like gfn_to_mfn_getpage,
> where the underlying page has its refcount bumped so that it won't be
> nominated/evicted while you map and use the page, then you must put it
> back when you're done.
Quite right - there are a lot of places that assume that a p2m mapping
won't change underfoot, and the right answer will involve reference
counting of some kind - either better integration with the underlying
refcount/typecount system or some reference count in the p2m.
The tricky part will be what to do when a p2m update can't be made
because of one of those refcounts.
Tim.
* Re: mapping problems in xenpaging
2011-09-29 17:02 ` Olaf Hering
2011-09-30 21:02 ` Adin Scannell
@ 2011-10-01 3:52 ` zhen shi
1 sibling, 0 replies; 8+ messages in thread
From: zhen shi @ 2011-10-01 3:52 UTC (permalink / raw)
To: Olaf Hering; +Cc: xen-devel
Olaf, maybe I didn't describe the problems clearly. I will give examples.
a) xen_vga_vram_map() in vga.c (tools/ioemu-qemu-xen/hw) uses
xc_map_foreign_pages() to map a guest page, by gfn, into dom0. If xenpaging
then pages that page out and turns it into a zero page, and dom0 accesses
it through the already-mapped address, things go wrong. Am I right?
In brief, I mean there may be conflicts between callers of
xc_map_foreign_pages() and the xenpaging feature when they access the
same page.
b) create_grant_pte_mapping() in mm.c (xen/arch/x86) uses gmfn_to_mfn() to
get the mfn and then calls map_domain_page(mfn). If the page is paged out
at the same time, its mfn is changed to INVALID_MFN. When
create_grant_pte_mapping() then reaches mfn_to_page(mfn), it goes wrong,
because Xen does not re-check the mfn and assumes it is still the original
one.
I mean that operations performed after Xen has obtained the mfn may
conflict with the page being paged out in the meantime.
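
The window described in b) can be illustrated with a small stand-alone C model. This is a sketch only: the `ex_` table and functions are hypothetical and are not Xen's p2m code. It shows that an mfn obtained by a plain lookup becomes stale once the translation is invalidated.

```c
#include <stdbool.h>

/* Hypothetical toy p2m: one mfn per gfn, all names invented here. */
#define EX_NR_GFNS     4
#define EX_INVALID_MFN (~0UL)

static unsigned long ex_p2m[EX_NR_GFNS] = { 100, 101, 102, 103 };

/* Step 1 of the racy pattern: translate gfn to mfn, no reference taken. */
static unsigned long ex_lookup(unsigned long gfn)
{
    return gfn < EX_NR_GFNS ? ex_p2m[gfn] : EX_INVALID_MFN;
}

/* What the pager does in the meantime: the translation is invalidated. */
static void ex_page_out(unsigned long gfn)
{
    if (gfn < EX_NR_GFNS)
        ex_p2m[gfn] = EX_INVALID_MFN;
}

/* True if a previously looked-up mfn no longer matches the table. */
static bool ex_is_stale(unsigned long gfn, unsigned long cached_mfn)
{
    return ex_lookup(gfn) != cached_mfn;
}
```

A re-check such as ex_is_stale() only narrows the window. It does not close it, because the pager can still run right after the check and before the page is used.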
* Re: Re: mapping problems in xenpaging
2011-09-30 21:02 ` Adin Scannell
2011-09-30 22:19 ` Tim Deegan
@ 2011-10-03 14:56 ` Olaf Hering
2011-10-06 11:10 ` Tim Deegan
1 sibling, 1 reply; 8+ messages in thread
From: Olaf Hering @ 2011-10-03 14:56 UTC (permalink / raw)
To: Adin Scannell; +Cc: zhen shi, xen-devel
On Fri, Sep 30, Adin Scannell wrote:
> *However*, I think that the problem Zhen is describing still exists:
> 1) xenpaging nominates a page, it is successful.
> 2) dom0 maps the same page (a process other than xenpaging, which will
> also map it).
> 3) xenpaging evicts the page, successfully.
>
> I've experienced a few nasty crashes, and I think this could account
> for a couple (but certainly not all)... I think that the solution may
> be to repeat the refcount check in paging_evict, and roll back the
> nomination gracefully if the race is detected. Thoughts?
Are there really code paths that touch a mfn without going through the
p2m functions? If so I will copy the check and update xenpaging.
Olaf
* Re: Re: mapping problems in xenpaging
2011-10-03 14:56 ` Olaf Hering
@ 2011-10-06 11:10 ` Tim Deegan
2011-10-09 16:40 ` zhen shi
0 siblings, 1 reply; 8+ messages in thread
From: Tim Deegan @ 2011-10-06 11:10 UTC (permalink / raw)
To: Olaf Hering; +Cc: zhen shi, xen-devel, Adin Scannell
At 16:56 +0200 on 03 Oct (1317660976), Olaf Hering wrote:
> On Fri, Sep 30, Adin Scannell wrote:
>
> > *However*, I think that the problem Zhen is describing still exists:
> > 1) xenpaging nominates a page, it is successful.
> > 2) dom0 maps the same page (a process other than xenpaging, which will
> > also map it).
> > 3) xenpaging evicts the page, successfully.
> >
> > I've experienced a few nasty crashes, and I think this could account
> > for a couple (but certainly not all)... I think that the solution may
> > be to repeat the refcount check in paging_evict, and roll back the
> > nomination gracefully if the race is detected. Thoughts?
>
> Are there really code paths that touch a mfn without going through the
> p2m functions? If so I will copy the check and update xenpaging.
No, but there are race conditions where CPU A could do the p2m lookup,
then CPU B nominates the page and changes its p2m entry, then CPU A
completes the mapping. In the extreme case, detecting this in the
eviction code is also subject to the same race; some sort of atomic
lookup-and-get-reference operation seems like a better fix.
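
One way to picture an atomic lookup-and-get-reference is a single word that holds both a "nominated" flag and the reference count, updated with compare-and-swap. The following is a minimal sketch using C11 atomics, not a proposal for Xen's actual data structures: the `sk_` names and the bit layout are assumptions made for illustration.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical encoding: top bit = "nominated for paging", rest = refs. */
#define SK_NOMINATED 0x80000000u

struct sk_entry {
    _Atomic unsigned int word;
};

/*
 * Take a reference unless the entry is nominated. The check and the
 * increment happen in one atomic step. Counter overflow into the flag
 * bit is ignored in this sketch.
 */
static bool sk_get_ref(struct sk_entry *e)
{
    unsigned int old = atomic_load(&e->word);
    do {
        if (old & SK_NOMINATED)
            return false;
    } while (!atomic_compare_exchange_weak(&e->word, &old, old + 1));
    return true;
}

/* Drop a reference taken by sk_get_ref(). */
static void sk_put_ref(struct sk_entry *e)
{
    atomic_fetch_sub(&e->word, 1);
}

/* Nominate only if there are no references, again in one atomic step. */
static bool sk_nominate(struct sk_entry *e)
{
    unsigned int expected = 0;
    return atomic_compare_exchange_strong(&e->word, &expected, SK_NOMINATED);
}
```

Because both operations go through a compare-and-swap on the same word, a lookup that wins takes its reference before the nomination can succeed, and a nomination that wins makes later lookups fail, so neither side needs a separate re-check.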
Tim.
* Re: Re: mapping problems in xenpaging
2011-10-06 11:10 ` Tim Deegan
@ 2011-10-09 16:40 ` zhen shi
0 siblings, 0 replies; 8+ messages in thread
From: zhen shi @ 2011-10-09 16:40 UTC (permalink / raw)
To: Tim Deegan, Olaf Hering, Adin Scannell; +Cc: xen-devel
2011/10/6 Tim Deegan <tim@xen.org>
> At 16:56 +0200 on 03 Oct (1317660976), Olaf Hering wrote:
> > On Fri, Sep 30, Adin Scannell wrote:
> >
> > > Xenpaging doesn't allow selection of pages that have been mapped by
> > > other domains (as in p2m.c):
> > >
> > > int p2m_mem_paging_nominate(struct domain *d, unsigned long gfn)
> > > ....
> > >     /* Check page count and type */
> > >     page = mfn_to_page(mfn);
> > >     if ( (page->count_info & (PGC_count_mask | PGC_allocated)) !=
> > >          (1 | PGC_allocated) )
> > >         goto out;
I wonder whether page->count_info is really incremented when a page is
mapped by another domain. I have analyzed xc_map_foreign_pages() and have
not figured out how it increments page->count_info and how munmap()
decrements it.
> > > *However*, I think that the problem Zhen is describing still exists:
> > > 1) xenpaging nominates a page, it is successful.
> > > 2) dom0 maps the same page (a process other than xenpaging, which will
> > > also map it).
> > > 3) xenpaging evicts the page, successfully.
> > >
> > > I've experienced a few nasty crashes, and I think this could account
> > > for a couple (but certainly not all)... I think that the solution may
> > > be to repeat the refcount check in paging_evict, and roll back the
> > > nomination gracefully if the race is detected. Thoughts?
>
> > Are there really code paths that touch a mfn without going through the
> > p2m functions? If so I will copy the check and update xenpaging.
>
> No, but there are race conditions where CPU A could do the p2m lookup,
> then CPU B nominates the page and changes its p2m entry, then CPU A
> completes the mapping. In the extreme case, detecting this in the
> eviction code is also subject to the same race; some sort of atomic
> lookup-and-get-reference operation seems like a better fix.
Tim, Olaf and Adin, do you have any good ideas about how to handle the
situation above?