* get_page() vs __split_huge_page_refcount()
@ 2011-03-25 5:00 Michel Lespinasse
2011-03-25 16:48 ` Andrea Arcangeli
0 siblings, 1 reply; 2+ messages in thread
From: Michel Lespinasse @ 2011-03-25 5:00 UTC (permalink / raw)
To: Andrea Arcangeli, linux-mm
Hi,
I am getting up to speed with mainline THP code and was wondering
what's going on with reference counts within
__split_huge_page_refcount():
for (i = 1; i < HPAGE_PMD_NR; i++) {
struct page *page_tail = page + i;
/* tail_page->_count cannot change */
atomic_sub(atomic_read(&page_tail->_count), &page->_count);
BUG_ON(page_count(page) <= 0);
...
A look at get_page() gave a partial answer. First, the page refcount
is incremented, then, if this was a tail page, the head page is looked
up and its refcount is incremented too. __split_huge_page_refcount()
preserves the refcount of tail pages but substracts it from the head
page, as it'll be an independent page after the split. However this
comment lead to more head scratching:
/*
* This is safe only because
* __split_huge_page_refcount can't run under
* get_page().
*/
As I can see, follow_page() with a FOLL_GET flag is careful when it
encounters huge pages. It tests the _PAGE_SPLITTING bit in the pmd
(under protection of page_table_lock) to avoid racing with
__split_huge_page_refcount(). Then, it can safely call get_page() and
not worry about both refcounts updates being visible at once.
My question is this: After someone obtains a page reference using
get_user_pages(), what prevents them from getting additional
references with get_page() ? I always thought it was legal to
duplicate references that way, but now I don't see how it'd be safe
doing so on anon pages with THP enabled.
--
Michel "Walken" Lespinasse
A program is never fully debugged until the last user dies.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: get_page() vs __split_huge_page_refcount()
2011-03-25 5:00 get_page() vs __split_huge_page_refcount() Michel Lespinasse
@ 2011-03-25 16:48 ` Andrea Arcangeli
0 siblings, 0 replies; 2+ messages in thread
From: Andrea Arcangeli @ 2011-03-25 16:48 UTC (permalink / raw)
To: Michel Lespinasse; +Cc: linux-mm
Hi Michel,
On Thu, Mar 24, 2011 at 10:00:16PM -0700, Michel Lespinasse wrote:
> My question is this: After someone obtains a page reference using
> get_user_pages(), what prevents them from getting additional
> references with get_page() ? I always thought it was legal to
> duplicate references that way, but now I don't see how it'd be safe
> doing so on anon pages with THP enabled.
It's not legal anymore as you noticed, but I'm not aware of anything
doing that. I don't see an useful case where a driver could need to
take one extra refcount after GUP returned. The normal API is
GUP/put_page. We could make it legal again by taking the compound_lock
after a PageCompound check though. I hope it's not needed though. It's
unavoidable in put_page because put_page will run out of order with
regard to __split_huge_page_refcount. But serializing get_page in GUP
against __split_huge_page_refcount is automatic through the
pmd_trans_splitting bit and needed for all page table walkers anyway.
Maybe it's good idea to add a comment to transhuge.txt about that? I
don't think I added it.
Grepping for get_page in drivers doesn't show too many, they mostly
run through the vm_ops->fault handler. Most important I can't see how
possibly it could be useful to run a get_page after
get_user_pages(FOLL_GET) returns.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom internet charges in Canada: sign http://stopthemeter.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2011-03-25 16:48 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-03-25 5:00 get_page() vs __split_huge_page_refcount() Michel Lespinasse
2011-03-25 16:48 ` Andrea Arcangeli
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).