* [PATCH] mm: softdirty: respect VM_SOFTDIRTY in PTE holes
From: Peter Feiner @ 2014-07-31 22:43 UTC (permalink / raw)
To: linux-mm
Cc: linux-kernel, Peter Feiner, Cyrill Gorcunov, Pavel Emelyanov,
Hugh Dickins, Naoya Horiguchi, Andrew Morton

After a VMA is created with the VM_SOFTDIRTY flag set,
/proc/pid/pagemap should report that the VMA's virtual pages are
soft-dirty until VM_SOFTDIRTY is cleared (i.e., by the next write of
"4" to /proc/pid/clear_refs). However, pagemap ignores the
VM_SOFTDIRTY flag for virtual addresses that fall in PTE holes (i.e.,
virtual addresses that don't have a PMD, PUD, or PGD allocated yet).

To observe this bug, use mmap to create a VMA large enough such that
there's a good chance that the VMA will occupy an unused PMD, then
test the soft-dirty bit on its pages. In practice, I found that a VMA
that covered a PMD's worth of address space was big enough.
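
The check described above could be sketched in userspace roughly as follows. This is only an illustration, not the author's test program: the helper names (pme_is_soft_dirty, count_soft_dirty, soft_dirty_in_fresh_mapping) are hypothetical, and it assumes a kernel built with CONFIG_MEM_SOFT_DIRTY and the documented pagemap layout in which bit 55 of each 64-bit entry is the soft-dirty flag.

```c
#define _GNU_SOURCE
#include <fcntl.h>
#include <stdint.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>

/* Bit 55 of a 64-bit pagemap entry is the soft-dirty flag. */
#define PM_SOFT_DIRTY_BIT 55

static int pme_is_soft_dirty(uint64_t pme)
{
	return (int)((pme >> PM_SOFT_DIRTY_BIT) & 1);
}

/*
 * Hypothetical helper: count the pages in [addr, addr + len) that
 * /proc/self/pagemap reports as soft-dirty. Returns -1 on I/O error.
 */
static long count_soft_dirty(const void *addr, size_t len)
{
	size_t page = (size_t)sysconf(_SC_PAGESIZE);
	size_t first = (size_t)((uintptr_t)addr / page);
	long dirty = 0;
	int fd = open("/proc/self/pagemap", O_RDONLY);

	if (fd < 0)
		return -1;
	for (size_t i = 0; i < len / page; i++) {
		uint64_t pme;
		/* pagemap holds one 8-byte entry per virtual page. */
		off_t off = (off_t)((first + i) * sizeof(pme));

		if (pread(fd, &pme, sizeof(pme), off) != (ssize_t)sizeof(pme)) {
			close(fd);
			return -1;
		}
		dirty += pme_is_soft_dirty(pme);
	}
	close(fd);
	return dirty;
}

/*
 * Map a fresh anonymous region and never touch it, so no page tables
 * are populated for it, then count its soft-dirty pages.
 */
static long soft_dirty_in_fresh_mapping(size_t len)
{
	void *p = mmap(NULL, len, PROT_READ | PROT_WRITE,
		       MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
	long dirty;

	if (p == MAP_FAILED)
		return -1;
	dirty = count_soft_dirty(p, len);
	munmap(p, len);
	return dirty;
}
```

Calling soft_dirty_in_fresh_mapping(2UL << 20) asks for a PMD's worth of address space, assuming x86-64 with 4 KiB pages. If the new VMA carries VM_SOFTDIRTY, a result smaller than len divided by the page size would suggest that some pages in PTE holes are being reported as clean.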

This patch adds the necessary VMA lookup to the PTE hole callback in
/proc/pid/pagemap's page walk and sets soft-dirty according to the
VMAs' VM_SOFTDIRTY flag.

Signed-off-by: Peter Feiner <pfeiner@google.com>
---
fs/proc/task_mmu.c | 27 +++++++++++++++++++++------
1 file changed, 21 insertions(+), 6 deletions(-)
diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
index cfa63ee..dfc791c 100644
--- a/fs/proc/task_mmu.c
+++ b/fs/proc/task_mmu.c
@@ -925,15 +925,30 @@ static int pagemap_pte_hole(unsigned long start, unsigned long end,
 				struct mm_walk *walk)
 {
 	struct pagemapread *pm = walk->private;
-	unsigned long addr;
+	unsigned long addr = start;
 	int err = 0;
-	pagemap_entry_t pme = make_pme(PM_NOT_PRESENT(pm->v2));
 
-	for (addr = start; addr < end; addr += PAGE_SIZE) {
-		err = add_to_pagemap(addr, &pme, pm);
-		if (err)
-			break;
+	while (addr < end) {
+		struct vm_area_struct *vma = find_vma(walk->mm, addr);
+		pagemap_entry_t pme = make_pme(PM_NOT_PRESENT(pm->v2));
+		unsigned long vm_end;
+
+		if (!vma) {
+			vm_end = end;
+		} else {
+			vm_end = min(end, vma->vm_end);
+			if (vma->vm_flags & VM_SOFTDIRTY)
+				pme.pme |= PM_STATUS2(pm->v2, __PM_SOFT_DIRTY);
+		}
+
+		for (; addr < vm_end; addr += PAGE_SIZE) {
+			err = add_to_pagemap(addr, &pme, pm);
+			if (err)
+				goto out;
+		}
 	}
+
+out:
 	return err;
 }
 
--
2.0.0.526.g5318336
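
For readers who want to step through the new loop outside the kernel, here is a minimal userspace model of it. It is a sketch under stated assumptions, not kernel code: struct vma, model_find_vma, model_pte_hole, and the MODEL_* constants are stand-ins invented for illustration, and the output array replaces add_to_pagemap(). Like find_vma(), the lookup returns the first VMA that ends above addr, even if that VMA starts above addr; the model keeps the posted loop's behaviour there and adds no vm_start check.

```c
#include <stddef.h>
#include <stdint.h>

#define MODEL_PAGE_SIZE    4096UL        /* assumed page size for the model */
#define MODEL_VM_SOFTDIRTY 0x1UL         /* stand-in flag, not the kernel's value */
#define MODEL_PME_SOFT_DIRTY (1ULL << 55) /* soft-dirty bit of a pagemap entry */

/* Stand-in for the fields of vm_area_struct that the loop uses. */
struct vma {
	unsigned long vm_start, vm_end, vm_flags;
};

/*
 * Stand-in for find_vma(): first VMA with addr < vm_end, or NULL.
 * Assumes vmas[] is sorted by address and non-overlapping.
 */
static const struct vma *model_find_vma(const struct vma *vmas, size_t n,
					unsigned long addr)
{
	for (size_t i = 0; i < n; i++)
		if (addr < vmas[i].vm_end)
			return &vmas[i];
	return NULL;
}

/*
 * Emit one entry per page of the hole [start, end) into out[],
 * one VMA-sized chunk at a time. Returns the number of entries.
 */
static size_t model_pte_hole(const struct vma *vmas, size_t n,
			     unsigned long start, unsigned long end,
			     uint64_t *out)
{
	unsigned long addr = start;
	size_t count = 0;

	while (addr < end) {
		const struct vma *vma = model_find_vma(vmas, n, addr);
		uint64_t pme = 0;	/* models a "not present" entry */
		unsigned long chunk_end;

		if (!vma) {
			/* No VMA at or above addr: rest of the hole is clean. */
			chunk_end = end;
		} else {
			chunk_end = vma->vm_end < end ? vma->vm_end : end;
			if (vma->vm_flags & MODEL_VM_SOFTDIRTY)
				pme |= MODEL_PME_SOFT_DIRTY;
		}

		for (; addr < chunk_end; addr += MODEL_PAGE_SIZE)
			out[count++] = pme;
	}
	return count;
}
```

The point of the chunking is that a single hole can span several VMAs with different flags, so the entry has to be recomputed at each VMA boundary instead of once per call.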
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
* Re: [PATCH] mm: softdirty: respect VM_SOFTDIRTY in PTE holes
From: Cyrill Gorcunov @ 2014-08-01 7:01 UTC (permalink / raw)
To: Peter Feiner
Cc: linux-mm, linux-kernel, Pavel Emelyanov, Hugh Dickins,
Naoya Horiguchi, Andrew Morton
On Thu, Jul 31, 2014 at 06:43:25PM -0400, Peter Feiner wrote:
> After a VMA is created with the VM_SOFTDIRTY flag set,
> /proc/pid/pagemap should report that the VMA's virtual pages are
> soft-dirty until VM_SOFTDIRTY is cleared (i.e., by the next write of
> "4" to /proc/pid/clear_refs). However, pagemap ignores the
> VM_SOFTDIRTY flag for virtual addresses that fall in PTE holes (i.e.,
> virtual addresses that don't have a PMD, PUD, or PGD allocated yet).
>
> To observe this bug, use mmap to create a VMA large enough such that
> there's a good chance that the VMA will occupy an unused PMD, then
> test the soft-dirty bit on its pages. In practice, I found that a VMA
> that covered a PMD's worth of address space was big enough.
>
> This patch adds the necessary VMA lookup to the PTE hole callback in
> /proc/pid/pagemap's page walk and sets soft-dirty according to the
> VMAs' VM_SOFTDIRTY flag.
>
> Signed-off-by: Peter Feiner <pfeiner@google.com>
Acked-by: Cyrill Gorcunov <gorcunov@openvz.org>
* Re: [PATCH] mm: softdirty: respect VM_SOFTDIRTY in PTE holes
From: Naoya Horiguchi @ 2014-08-01 19:53 UTC (permalink / raw)
To: Peter Feiner
Cc: linux-mm, linux-kernel, Cyrill Gorcunov, Pavel Emelyanov,
Hugh Dickins, Andrew Morton
On Thu, Jul 31, 2014 at 06:43:25PM -0400, Peter Feiner wrote:
> After a VMA is created with the VM_SOFTDIRTY flag set,
> /proc/pid/pagemap should report that the VMA's virtual pages are
> soft-dirty until VM_SOFTDIRTY is cleared (i.e., by the next write of
> "4" to /proc/pid/clear_refs). However, pagemap ignores the
> VM_SOFTDIRTY flag for virtual addresses that fall in PTE holes (i.e.,
> virtual addresses that don't have a PMD, PUD, or PGD allocated yet).
>
> To observe this bug, use mmap to create a VMA large enough such that
> there's a good chance that the VMA will occupy an unused PMD, then
> test the soft-dirty bit on its pages. In practice, I found that a VMA
> that covered a PMD's worth of address space was big enough.
>
> This patch adds the necessary VMA lookup to the PTE hole callback in
> /proc/pid/pagemap's page walk and sets soft-dirty according to the
> VMAs' VM_SOFTDIRTY flag.
>
> Signed-off-by: Peter Feiner <pfeiner@google.com>

It's unfortunate that we have to do this kind of VMA boundary calculation
inside pagemap_pte_hole(); it stems from poor VMA handling in mm/pagewalk.c.
I have been working on fixing that recently (I posted version 6 of the
patchset today), and if it is merged, your problem should be fixed implicitly.

In any case, if Andrew decides to merge your patch first, that's fine with me.

Acked-by: Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>

Thanks,
Naoya Horiguchi

> ---
> fs/proc/task_mmu.c | 27 +++++++++++++++++++++------
> 1 file changed, 21 insertions(+), 6 deletions(-)
>
> diff --git a/fs/proc/task_mmu.c b/fs/proc/task_mmu.c
> index cfa63ee..dfc791c 100644
> --- a/fs/proc/task_mmu.c
> +++ b/fs/proc/task_mmu.c
> @@ -925,15 +925,30 @@ static int pagemap_pte_hole(unsigned long start, unsigned long end,
>  				struct mm_walk *walk)
>  {
>  	struct pagemapread *pm = walk->private;
> -	unsigned long addr;
> +	unsigned long addr = start;
>  	int err = 0;
> -	pagemap_entry_t pme = make_pme(PM_NOT_PRESENT(pm->v2));
>
> -	for (addr = start; addr < end; addr += PAGE_SIZE) {
> -		err = add_to_pagemap(addr, &pme, pm);
> -		if (err)
> -			break;
> +	while (addr < end) {
> +		struct vm_area_struct *vma = find_vma(walk->mm, addr);
> +		pagemap_entry_t pme = make_pme(PM_NOT_PRESENT(pm->v2));
> +		unsigned long vm_end;
> +
> +		if (!vma) {
> +			vm_end = end;
> +		} else {
> +			vm_end = min(end, vma->vm_end);
> +			if (vma->vm_flags & VM_SOFTDIRTY)
> +				pme.pme |= PM_STATUS2(pm->v2, __PM_SOFT_DIRTY);
> +		}
> +
> +		for (; addr < vm_end; addr += PAGE_SIZE) {
> +			err = add_to_pagemap(addr, &pme, pm);
> +			if (err)
> +				goto out;
> +		}
>  	}
> +
> +out:
>  	return err;
>  }
>
> --
> 2.0.0.526.g5318336
>