* [BUG] BUG: unable to handle kernel paging request at fffba000
@ 2011-01-19 12:40 Ilya Dryomov
2011-01-19 20:01 ` Andrea Arcangeli
2011-01-19 22:19 ` Andrea Arcangeli
0 siblings, 2 replies; 4+ messages in thread
From: Ilya Dryomov @ 2011-01-19 12:40 UTC (permalink / raw)
To: linux-mm; +Cc: Andrea Arcangeli, Rik van Riel, linux-kernel, idryomov
Hello,
I just built a fresh 38-rc1 kernel with transparent huge page support
built-in (TRANSPARENT_HUGEPAGE=y) and it failed to boot with the
following bug. However after the reboot everything went fine. It turns
out it only happens when fsck checks one or more filesystems before they
are mounted.
It's easily reproducable it with touch /forcefsck and reboot on one of
my 32-bit machines. Haven't tried it on others yet.
Thanks,
Ilya
Checking file systems...fsck from util-linux-ng 2.17.2
/dev/mapper/vg_zmb-lv_home: 235/2992416 files (0.4% non-contiguous),
2461505/11968512 blocks
/dev/mapper/vg_zmb-lv_tmp: 13/62464 files (0.0% non-contiguous), 8334/249856
blocks
/dev/mapper/vg_zmb-lv_usr: 24821/187680 files (0.2% non-contiguous),
152556/749568 blocks
/dev/mapper/vg_zmb-lv_var: 2871/375360 files (1.1% non-contiguous),
222844/1499136 blocks
[ 13.716535] BUG: unable to handle kernel paging request at fffba000
[ 13.717402] IP: [<c1149f3d>] khugepaged+0x9dd/0xd00
[ 13.717402] *pde = 017da067 *pte = 00000000
[ 13.717402] Oops: 0000 [#1] PREEMPT SMP
[ 13.717402] last sysfs file: /sys/devices/virtual/net/lo/operstate
[ 13.717402] Modules linked in:
[ 13.717402]
[ 13.717402] Pid: 582, comm: khugepaged Not tainted 2.6.38-rc1-testbox2 #7
EP35-DS3/EP35-DS3
[ 13.717402] EIP: 0060:[<c1149f3d>] EFLAGS: 00010287 CPU: 0
[ 13.717402] EIP is at khugepaged+0x9dd/0xd00
[ 13.717402] EAX: 00000000 EBX: f307ef68 ECX: fffba000 EDX: fffbb000
[ 13.717402] ESI: f77eb4c0 EDI: f77d5000 EBP: f4731f9c ESP: f4731f1c
[ 13.885304] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068
[ 13.885304] Process khugepaged (pid: 582, ti=f4730000 task=f51a1f80
task.ti=f4730000)
[ 13.885304] Stack:
[ 13.885304] 00000000 f51a1f80 b7000000 f51a1f80 df826067 00000001 fffbb000
00000292
[ 13.885304] f307ef68 efe8bb70 f307ef68 f77d5000 f3874570 fffba000 00002000
f307ef68
[ 13.885304] f3874534 00000000 00000004 f3874500 f51a1f80 f77d5000 f07a7000
00000001
[ 13.885304] Call Trace:
[ 13.885304] [<c10948c0>] ? autoremove_wake_function+0x0/0x40
[ 13.885304] [<c1149560>] ? khugepaged+0x0/0xd00
[ 13.885304] [<c1094474>] kthread+0x74/0x80
[ 13.885304] [<c1094400>] ? kthread+0x0/0x80
[ 13.885304] [<c103977a>] kernel_thread_helper+0x6/0x10
[ 13.885304] Code: 1d 00 89 d8 e8 15 75 f1 ff 8b 7d bc 8b 07 ff 80 a0 01 00 00
83 45 ac 20 83 45 b4 04 8b 55 98 39 55 b4 0f 83 35 02 00 00 8b 4d b4 <8b> 19 85
db 74 bb c1 eb 0c c1 e3 05 03 1d 00 20 d8 c1 89 d8 e8
[ 13.885304] EIP: [<c1149f3d>] khugepaged+0x9dd/0xd00 SS:ESP 0068:f4731f1c
[ 13.885304] CR2: 00000000fffba000
[ 13.885304] ---[ end trace 7890962500b65912 ]---
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [BUG] BUG: unable to handle kernel paging request at fffba000
2011-01-19 12:40 [BUG] BUG: unable to handle kernel paging request at fffba000 Ilya Dryomov
@ 2011-01-19 20:01 ` Andrea Arcangeli
2011-01-19 22:19 ` Andrea Arcangeli
1 sibling, 0 replies; 4+ messages in thread
From: Andrea Arcangeli @ 2011-01-19 20:01 UTC (permalink / raw)
To: Ilya Dryomov; +Cc: linux-mm, Rik van Riel, linux-kernel
Hello,
thanks for the report!
On Wed, Jan 19, 2011 at 02:40:47PM +0200, Ilya Dryomov wrote:
> Hello,
>
> I just built a fresh 38-rc1 kernel with transparent huge page support
> built-in (TRANSPARENT_HUGEPAGE=y) and it failed to boot with the
> following bug. However after the reboot everything went fine. It turns
> out it only happens when fsck checks one or more filesystems before they
> are mounted.
>
> It's easily reproducable it with touch /forcefsck and reboot on one of
> my 32-bit machines. Haven't tried it on others yet.
Could you send me the vmlinux (or bzImage)? I can't see where it crash
otherwise.
Most certainly it's 32bit bug only.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [BUG] BUG: unable to handle kernel paging request at fffba000
2011-01-19 12:40 [BUG] BUG: unable to handle kernel paging request at fffba000 Ilya Dryomov
2011-01-19 20:01 ` Andrea Arcangeli
@ 2011-01-19 22:19 ` Andrea Arcangeli
2011-01-19 22:49 ` Ilya Dryomov
1 sibling, 1 reply; 4+ messages in thread
From: Andrea Arcangeli @ 2011-01-19 22:19 UTC (permalink / raw)
To: Ilya Dryomov; +Cc: linux-mm, Rik van Riel, linux-kernel
Hello Ilya,
thanks for sending me the gdb info too.
can you test this fix? Thanks a lot! (it only affected x86 32bit
builds with highpte enabled)
====
Subject: fix pte_unmap in khugepaged for highpte x86_32
From: Andrea Arcangeli <aarcange@redhat.com>
__collapse_huge_page_copy is still dereferencing the pte passed as parameter so
we must pte_unmap after __collapse_huge_page_copy returns, not before.
Signed-off-by: Andrea Arcangeli <aarcange@redhat.com>
---
diff --git a/mm/huge_memory.c b/mm/huge_memory.c
index 004c9c2..c4f634b 100644
--- a/mm/huge_memory.c
+++ b/mm/huge_memory.c
@@ -1837,9 +1837,9 @@ static void collapse_huge_page(struct mm_struct *mm,
spin_lock(ptl);
isolated = __collapse_huge_page_isolate(vma, address, pte);
spin_unlock(ptl);
- pte_unmap(pte);
if (unlikely(!isolated)) {
+ pte_unmap(pte);
spin_lock(&mm->page_table_lock);
BUG_ON(!pmd_none(*pmd));
set_pmd_at(mm, address, pmd, _pmd);
@@ -1856,6 +1856,7 @@ static void collapse_huge_page(struct mm_struct *mm,
anon_vma_unlock(vma->anon_vma);
__collapse_huge_page_copy(pte, new_page, vma, address, ptl);
+ pte_unmap(pte);
__SetPageUptodate(new_page);
pgtable = pmd_pgtable(_pmd);
VM_BUG_ON(page_count(pgtable) != 1);
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [BUG] BUG: unable to handle kernel paging request at fffba000
2011-01-19 22:19 ` Andrea Arcangeli
@ 2011-01-19 22:49 ` Ilya Dryomov
0 siblings, 0 replies; 4+ messages in thread
From: Ilya Dryomov @ 2011-01-19 22:49 UTC (permalink / raw)
To: Andrea Arcangeli; +Cc: linux-mm, Rik van Riel, linux-kernel, idryomov
On Wed, Jan 19, 2011 at 11:19:09PM +0100, Andrea Arcangeli wrote:
> Hello Ilya,
>
> thanks for sending me the gdb info too.
>
> can you test this fix? Thanks a lot! (it only affected x86 32bit
> builds with highpte enabled)
>
> ====
> Subject: fix pte_unmap in khugepaged for highpte x86_32
>
> From: Andrea Arcangeli <aarcange@redhat.com>
>
> __collapse_huge_page_copy is still dereferencing the pte passed as parameter so
> we must pte_unmap after __collapse_huge_page_copy returns, not before.
>
> Signed-off-by: Andrea Arcangeli <aarcange@redhat.com>
It fixes the above problem for me. Thanks a lot Andrea.
> ---
>
> diff --git a/mm/huge_memory.c b/mm/huge_memory.c
> index 004c9c2..c4f634b 100644
> --- a/mm/huge_memory.c
> +++ b/mm/huge_memory.c
> @@ -1837,9 +1837,9 @@ static void collapse_huge_page(struct mm_struct *mm,
> spin_lock(ptl);
> isolated = __collapse_huge_page_isolate(vma, address, pte);
> spin_unlock(ptl);
> - pte_unmap(pte);
>
> if (unlikely(!isolated)) {
> + pte_unmap(pte);
> spin_lock(&mm->page_table_lock);
> BUG_ON(!pmd_none(*pmd));
> set_pmd_at(mm, address, pmd, _pmd);
> @@ -1856,6 +1856,7 @@ static void collapse_huge_page(struct mm_struct *mm,
> anon_vma_unlock(vma->anon_vma);
>
> __collapse_huge_page_copy(pte, new_page, vma, address, ptl);
> + pte_unmap(pte);
> __SetPageUptodate(new_page);
> pgtable = pmd_pgtable(_pmd);
> VM_BUG_ON(page_count(pgtable) != 1);
>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Fight unfair telecom policy in Canada: sign http://dissolvethecrtc.ca/
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2011-01-19 22:50 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-01-19 12:40 [BUG] BUG: unable to handle kernel paging request at fffba000 Ilya Dryomov
2011-01-19 20:01 ` Andrea Arcangeli
2011-01-19 22:19 ` Andrea Arcangeli
2011-01-19 22:49 ` Ilya Dryomov
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).