All of lore.kernel.org
 help / color / mirror / Atom feed
From: marc.zyngier@arm.com (Marc Zyngier)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH] arm64: mm: fix pmd_write CoW brokenness
Date: Wed, 28 May 2014 18:49:45 +0100	[thread overview]
Message-ID: <538621B9.9090503@arm.com> (raw)
In-Reply-To: <1401296836-16837-1-git-send-email-will.deacon@arm.com>

Hi Will,

On 28/05/14 18:07, Will Deacon wrote:
> Commit 9c7e535fcc17 ("arm64: mm: Route pmd thp functions through pte
> equivalents") changed the pmd manipulator and accessor functions to
> convert the target pmd to a pte, process it with the pte functions, then
> convert it back. Along the way, we gained support for PTE_WRITE, however
> this is completely ignored by set_pmd_at, and so we fail to set the
> PMD_SECT_RDONLY for PMDs, resulting in all sorts of lovely failures (like
> CoW not working).
> 
> Partially reverting the offending commit (by making use of
> PMD_SECT_RDONLY explicitly for pmd_{write,wrprotect,mkwrite} functions)
> leads to further issues because pmd_write can then return potentially
> incorrect values for page table entries marked as RDONLY, leading to
> BUG_ON(pmd_write(entry)) tripping under some THP workloads.
> 
> This patch fixes the issue by routing set_pmd_at through set_pte_at,
> which correctly takes the PTE_WRITE flag into account. Given that
> THP mappings are always anonymous, the additional cache-flushing code
> in __sync_icache_dcache won't impose any significant overhead as the
> flush will be skipped.
> 
> Cc: Steve Capper <steve.capper@arm.com>
> Cc: Catalin Marinas <catalin.marinas@arm.com>
> Signed-off-by: Will Deacon <will.deacon@arm.com>
> ---
> 
> Whilst this is a fairly scary change at this point in the cycle, I can't get
> through an LTP run without it. Furthermore, spurious CoW failures for tasks
> that happen to get transparent hugepages are a pretty significant regression.
> 
>  arch/arm64/include/asm/pgtable.h | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/arch/arm64/include/asm/pgtable.h b/arch/arm64/include/asm/pgtable.h
> index 90c811f05a2e..20785f9da95c 100644
> --- a/arch/arm64/include/asm/pgtable.h
> +++ b/arch/arm64/include/asm/pgtable.h
> @@ -266,7 +266,7 @@ static inline pmd_t pte_pmd(pte_t pte)
>  
>  #define pmd_page(pmd)           pfn_to_page(__phys_to_pfn(pmd_val(pmd) & PHYS_MASK))
>  
> -#define set_pmd_at(mm, addr, pmdp, pmd)	set_pmd(pmdp, pmd)
> +#define set_pmd_at(mm, addr, pmdp, pmd)	set_pte_at(mm, addr, pmdp, pmd_pte(pmd))
>  
>  static inline int has_transparent_hugepage(void)
>  {
> 

I managed to reliably reproduce the failure on my favourite arm64 board
running a KVM guest with THP enabled. Adding this patch to the guest
kernel made it behave correctly.

So FWIW:
Tested-by: Marc Zyngier <marc.zyngier@arm.com>

	M.
-- 
Jazz is not dead. It just smells funny...

      reply	other threads:[~2014-05-28 17:49 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-05-28 17:07 [PATCH] arm64: mm: fix pmd_write CoW brokenness Will Deacon
2014-05-28 17:49 ` Marc Zyngier [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=538621B9.9090503@arm.com \
    --to=marc.zyngier@arm.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.