From: "Kirill A. Shutemov" <kirill@shutemov.name>
To: Vladimir Davydov <vdavydov@virtuozzo.com>
Cc: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
Andrew Morton <akpm@linux-foundation.org>,
Andrea Arcangeli <aarcange@redhat.com>,
Hugh Dickins <hughd@google.com>,
Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>,
Sasha Levin <sasha.levin@oracle.com>,
Minchan Kim <minchan@kernel.org>,
linux-kernel@vger.kernel.org, linux-mm@kvack.org
Subject: Re: [PATCH 4/4] mm: prepare page_referenced() and page_idle to new THP refcounting
Date: Thu, 5 Nov 2015 19:27:51 +0200 [thread overview]
Message-ID: <20151105172750.GA20014@node.shutemov.name> (raw)
In-Reply-To: <20151105160324.GF29259@esperanza>
On Thu, Nov 05, 2015 at 07:03:24PM +0300, Vladimir Davydov wrote:
> On Tue, Nov 03, 2015 at 05:26:15PM +0200, Kirill A. Shutemov wrote:
> ...
> > @@ -812,60 +812,104 @@ static int page_referenced_one(struct page *page, struct vm_area_struct *vma,
> > spinlock_t *ptl;
> > int referenced = 0;
> > struct page_referenced_arg *pra = arg;
> > + pgd_t *pgd;
> > + pud_t *pud;
> > + pmd_t *pmd;
> > + pte_t *pte;
> >
> > - if (unlikely(PageTransHuge(page))) {
> > - pmd_t *pmd;
> > -
> > - /*
> > - * rmap might return false positives; we must filter
> > - * these out using page_check_address_pmd().
> > - */
> > - pmd = page_check_address_pmd(page, mm, address, &ptl);
> > - if (!pmd)
> > + if (unlikely(PageHuge(page))) {
> > + /* when pud is not present, pte will be NULL */
> > + pte = huge_pte_offset(mm, address);
> > + if (!pte)
> > return SWAP_AGAIN;
> >
> > - if (vma->vm_flags & VM_LOCKED) {
> > + ptl = huge_pte_lockptr(page_hstate(page), mm, pte);
> > + goto check_pte;
> > + }
> > +
> > + pgd = pgd_offset(mm, address);
> > + if (!pgd_present(*pgd))
> > + return SWAP_AGAIN;
> > + pud = pud_offset(pgd, address);
> > + if (!pud_present(*pud))
> > + return SWAP_AGAIN;
> > + pmd = pmd_offset(pud, address);
> > +
> > + if (pmd_trans_huge(*pmd)) {
> > + int ret = SWAP_AGAIN;
> > +
> > + ptl = pmd_lock(mm, pmd);
> > + if (!pmd_present(*pmd))
> > + goto unlock_pmd;
> > + if (unlikely(!pmd_trans_huge(*pmd))) {
> > spin_unlock(ptl);
> > + goto map_pte;
> > + }
> > +
> > + if (pmd_page(*pmd) != page)
> > + goto unlock_pmd;
> > +
> > + if (vma->vm_flags & VM_LOCKED) {
> > pra->vm_flags |= VM_LOCKED;
> > - return SWAP_FAIL; /* To break the loop */
> > + ret = SWAP_FAIL; /* To break the loop */
> > + goto unlock_pmd;
> > }
> >
> > if (pmdp_clear_flush_young_notify(vma, address, pmd))
> > referenced++;
> > -
> > spin_unlock(ptl);
> > + goto found;
> > +unlock_pmd:
> > + spin_unlock(ptl);
> > + return ret;
> > } else {
> > - pte_t *pte;
> > -
> > - /*
> > - * rmap might return false positives; we must filter
> > - * these out using page_check_address().
> > - */
> > - pte = page_check_address(page, mm, address, &ptl, 0);
> > - if (!pte)
> > + pmd_t pmde = *pmd;
> > + barrier();
>
> This is supposed to be
>
> pmd_t pmde = READ_ONCE(*pmd);
>
> Right?
See e37c69827063. If I read this correctly, barrier() is less overhead for
some archs.
>
> I don't understand why we need a barrier here. Why can't we just do
>
> } else if (!pmd_present(*pmd))
> reutnr SWAP_AGAIN;
>
> ?
See f72e7dcdd252 too.
> > + if (!pmd_present(pmde) || pmd_trans_huge(pmde))
> > return SWAP_AGAIN;
> > + }
--
Kirill A. Shutemov
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2015-11-05 17:27 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-11-03 15:26 [PATCH 0/4] Bugfixes for THP refcounting Kirill A. Shutemov
2015-11-03 15:26 ` [PATCH 1/4] mm: do not crash on PageDoubleMap() for non-head pages Kirill A. Shutemov
2015-11-03 15:26 ` [PATCH 2/4] mm: duplicate rmap reference for hugetlb pages as compound Kirill A. Shutemov
2015-11-03 15:26 ` [PATCH 3/4] thp: fix split vs. unmap race Kirill A. Shutemov
2015-11-03 15:26 ` [PATCH 4/4] mm: prepare page_referenced() and page_idle to new THP refcounting Kirill A. Shutemov
2015-11-05 9:10 ` Vladimir Davydov
2015-11-05 9:24 ` Kirill A. Shutemov
2015-11-05 12:07 ` Vladimir Davydov
2015-11-05 12:36 ` Kirill A. Shutemov
2015-11-05 12:53 ` Vladimir Davydov
2015-11-05 12:58 ` Kirill A. Shutemov
2015-11-05 16:31 ` Vladimir Davydov
2015-11-06 14:37 ` [PATCH] mm: add page_check_address_transhuge helper Vladimir Davydov
2015-11-06 15:24 ` Kirill A. Shutemov
2015-11-05 16:03 ` [PATCH 4/4] mm: prepare page_referenced() and page_idle to new THP refcounting Vladimir Davydov
2015-11-05 17:27 ` Kirill A. Shutemov [this message]
2015-11-06 0:32 ` Andrew Morton
2015-11-06 10:29 ` Kirill A. Shutemov
2015-11-06 22:39 ` Andrew Morton
2015-11-08 23:40 ` Kirill A. Shutemov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20151105172750.GA20014@node.shutemov.name \
--to=kirill@shutemov.name \
--cc=aarcange@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=hughd@google.com \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=minchan@kernel.org \
--cc=n-horiguchi@ah.jp.nec.com \
--cc=sasha.levin@oracle.com \
--cc=vdavydov@virtuozzo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).