All of lore.kernel.org
 help / color / mirror / Atom feed
From: wzab <wzab@ise.pw.edu.pl>
To: linux-kernel@vger.kernel.org
Cc: wzab@ise.pw.edu.pl
Subject: kmemcheck error and panic when booting 2.6.39.1 , however acpi=off allows to boot
Date: Sat, 11 Jun 2011 17:23:07 +0200	[thread overview]
Message-ID: <4DF3885B.2080900@ise.pw.edu.pl> (raw)
In-Reply-To: <4DEBE579.1070508@ise.pw.edu.pl>

Hi,

Today I've tried to investigate more thoroughly why one of my machine 
doesn't boot with 2.6.39.1
I've performed four reboots with different parameters, recording serial 
console output to files:

crash6.txt - booting with HT in BIOS on, parameters: kmemleak=on
crash7.txt - booting with HT in BIOS on, parameters: slub_debug kmemleak=on
crash8.txt - booting with HT in BIOS off, parameters: slub_debug kmemleak=on
crash9.txt - booting with HT in BIOS on, parameters: slub_debug 
kmemleak=on acpi=off

In all cases there was a problem detected in kmemcheck:

crash6.txt:
[   69.689737] ------------[ cut here ]------------
[   69.690605] WARNING: at arch/x86/mm/kmemcheck/kmemcheck.c:634 
kmemcheck_fault+0xa5/0xc0()
[   69.690605] Hardware name:
[   69.690605] Modules linked in:
[   69.690605] Pid: 1, comm: swapper Not tainted 2.6.39.1 #3
[   69.690605] Call Trace:
[   69.690605]  [<c013de0d>] warn_slowpath_common+0x6d/0xa0
[   69.690605]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   69.690605]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   69.690605]  [<c013de5d>] warn_slowpath_null+0x1d/0x20
[   69.690605]  [<c0128ce5>] kmemcheck_fault+0xa5/0xc0
[   69.690605]  [<c0124480>] do_page_fault+0x270/0x440
[   69.690605]  [<c0128cad>] ? kmemcheck_fault+0x6d/0xc0
[   69.690605]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.690605]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   69.690605]  [<c04663f3>] error_code+0x5f/0x64
[   69.690605]  [<c012007b>] ? io_apic_set_pci_routing+0x4b/0x60
[   69.690605]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   69.690605]  [<c0111dbf>] ? p4_pmu_handle_irq+0x7f/0x1b0
[   69.690605]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.690605]  [<c0128469>] ? kmemcheck_save_addr+0x19/0x40
[   69.690605]  [<c012880b>] ? kmemcheck_show_addr+0xb/0x20
[   69.690605]  [<c012896e>] ? kmemcheck_show_all+0x2e/0x40
[   69.690605]  [<c010f098>] perf_event_nmi_handler+0x28/0xa0
[   69.690605]  [<c015f1c5>] notifier_call_chain+0x75/0xe0
[   69.690605]  [<c015f700>] __atomic_notifier_call_chain+0x60/0x90
[   69.690605]  [<c015f6a0>] ? register_reboot_notifier+0x20/0x20
[   69.690605]  [<c015f74a>] atomic_notifier_call_chain+0x1a/0x20
[   69.690605]  [<c015f86d>] notify_die+0x2d/0x30
[   69.690605]  [<c0102d72>] default_do_nmi+0x32/0x280
[   69.690605]  [<c010369f>] do_nmi+0x7f/0x90
[   69.690605]  [<c04664a5>] nmi_stack_correct+0x28/0x2d
[   69.690605]  [<c010378c>] ? do_debug+0xc/0x190
[   69.690605]  [<c0466446>] debug_stack_correct+0x2e/0x34
[   69.690605]  [<c028f6ef>] ? prio_tree_insert+0xdf/0x190
[   69.690605]  [<c01c68e0>] create_object+0x140/0x230
[   69.690605]  [<c04543a7>] kmemleak_alloc+0x27/0x50
[   69.690605]  [<c01c3b79>] kmem_cache_alloc+0xc9/0x110
[   69.690605]  [<c02ac306>] dma_debug_init+0x96/0x140
[   69.690605]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690605]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690605]  [<c0624d50>] pci_iommu_init+0x13/0x48
[   69.690605]  [<c01011f0>] do_one_initcall+0x30/0x170
[   69.690605]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690605]  [<c0624d3d>] ? iommu_setup+0x1fd/0x1fd
[   69.690605]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690605]  [<c061f7f8>] kernel_init+0x9b/0x12f
[   69.690605]  [<c0466bba>] kernel_thread_helper+0x6/0xd
[   69.690605] ---[ end trace 93d72a36b9146f22 ]---

crash7.txt:
[   69.687496] ------------[ cut here ]------------
[   69.690015] WARNING: at arch/x86/mm/kmemcheck/kmemcheck.c:634 
kmemcheck_fault+0xa5/0xc0()
[   69.690015] Hardware name:
[   69.690015] Modules linked in:
[   69.690015] Pid: 1, comm: swapper Not tainted 2.6.39.1 #3
[   69.690015] Call Trace:
[   69.690015]  [<c013de0d>] warn_slowpath_common+0x6d/0xa0
[   69.690015]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   69.690015]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   69.690015]  [<c013de5d>] warn_slowpath_null+0x1d/0x20
[   69.690015]  [<c0128ce5>] kmemcheck_fault+0xa5/0xc0
[   69.690015]  [<c0124480>] do_page_fault+0x270/0x440
[   69.690015]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.690015]  [<c0128469>] ? kmemcheck_save_addr+0x19/0x40
[   69.690015]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.690015]  [<c0128469>] ? kmemcheck_save_addr+0x19/0x40
[   69.690015]  [<c0128713>] ? kmemcheck_read_strict+0x33/0x80
[   69.690015]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   69.690015]  [<c04663f3>] error_code+0x5f/0x64
[   69.690015]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   69.690015]  [<c0111dbf>] ? p4_pmu_handle_irq+0x7f/0x1b0
[   69.690015]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.690015]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.690015]  [<c010f098>] perf_event_nmi_handler+0x28/0xa0
[   69.690015]  [<c015f1c5>] notifier_call_chain+0x75/0xe0
[   69.690015]  [<c015f700>] __atomic_notifier_call_chain+0x60/0x90
[   69.690015]  [<c015f6a0>] ? register_reboot_notifier+0x20/0x20
[   69.690015]  [<c015f74a>] atomic_notifier_call_chain+0x1a/0x20
[   69.690015]  [<c015f86d>] notify_die+0x2d/0x30
[   69.690015]  [<c0102d72>] default_do_nmi+0x32/0x280
[   69.690015]  [<c010369f>] do_nmi+0x7f/0x90
[   69.690015]  [<c04664a5>] nmi_stack_correct+0x28/0x2d
[   69.690015]  [<c012007b>] ? io_apic_set_pci_routing+0x4b/0x60
[   69.690015]  [<c016ce46>] ? trace_hardirqs_off_caller+0xa6/0xf0
[   69.690015]  [<c0295b64>] trace_hardirqs_off_thunk+0xc/0x18
[   69.690015]  [<c0465de6>] ? ret_from_exception+0x6/0x6
[   69.690015]  [<c01c007b>] ? unuse_pte+0xfb/0x120
[   69.690015]  [<c01c67f2>] ? create_object+0x52/0x230
[   69.690015]  [<c04543a7>] kmemleak_alloc+0x27/0x50
[   69.690015]  [<c01c3b79>] kmem_cache_alloc+0xc9/0x110
[   69.690015]  [<c02ac306>] dma_debug_init+0x96/0x140
[   69.690015]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690015]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690015]  [<c0624d50>] pci_iommu_init+0x13/0x48
[   69.690015]  [<c01011f0>] do_one_initcall+0x30/0x170
[   69.690015]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690015]  [<c0624d3d>] ? iommu_setup+0x1fd/0x1fd
[   69.690015]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.690015]  [<c061f7f8>] kernel_init+0x9b/0x12f
[   69.690015]  [<c0466bba>] kernel_thread_helper+0x6/0xd
[   69.690015] ---[ end trace 93d72a36b9146f22 ]---

crash8.txt:
[   69.663411] ------------[ cut here ]------------
[   69.663411] WARNING: at arch/x86/mm/kmemcheck/kmemcheck.c:634 
kmemcheck_fault+0xa5/0xc0()
[   69.663411] Hardware name:
[   69.663411] Modules linked in:
[   69.663411] Pid: 1, comm: swapper Not tainted 2.6.39.1 #3
[   69.663411] Call Trace:
[   69.663411]  [<c013de0d>] warn_slowpath_common+0x6d/0xa0
[   69.663411]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   69.663411]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   69.663411]  [<c013de5d>] warn_slowpath_null+0x1d/0x20
[   69.663411]  [<c0128ce5>] kmemcheck_fault+0xa5/0xc0
[   69.663411]  [<c0124480>] do_page_fault+0x270/0x440
[   69.663411]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   69.663411]  [<c01093f6>] ? native_sched_clock+0x26/0x90
[   69.663411]  [<c015ffc3>] ? sched_clock_local+0xd3/0x1c0
[   69.663411]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   69.663411]  [<c04663f3>] error_code+0x5f/0x64
[   69.663411]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   69.663411]  [<c0111dbf>] ? p4_pmu_handle_irq+0x7f/0x1b0
[   69.663411]  [<c012896e>] ? kmemcheck_show_all+0x2e/0x40
[   69.663411]  [<c0128469>] ? kmemcheck_save_addr+0x19/0x40
[   69.663411]  [<c0128713>] ? kmemcheck_read_strict+0x33/0x80
[   69.663411]  [<c010f098>] perf_event_nmi_handler+0x28/0xa0
[   69.663411]  [<c015f1c5>] notifier_call_chain+0x75/0xe0
[   69.663411]  [<c012896e>] ? kmemcheck_show_all+0x2e/0x40
[   69.663411]  [<c015f700>] __atomic_notifier_call_chain+0x60/0x90
[   69.663411]  [<c015f6a0>] ? register_reboot_notifier+0x20/0x20
[   69.663411]  [<c015f74a>] atomic_notifier_call_chain+0x1a/0x20
[   69.663411]  [<c015f86d>] notify_die+0x2d/0x30
[   69.663411]  [<c0102d72>] default_do_nmi+0x32/0x280
[   69.663411]  [<c010369f>] do_nmi+0x7f/0x90
[   69.663411]  [<c04664a5>] nmi_stack_correct+0x28/0x2d
[   69.663411]  [<c046638c>] ? spurious_interrupt_bug+0xc/0xc
[   69.663411]  [<c028f42c>] ? prio_tree_replace+0x4c/0x60
[   69.663411]  [<c028f739>] prio_tree_insert+0x129/0x190
[   69.663411]  [<c01c68e0>] create_object+0x140/0x230
[   69.663411]  [<c04543a7>] kmemleak_alloc+0x27/0x50
[   69.663411]  [<c01c3b79>] kmem_cache_alloc+0xc9/0x110
[   69.663411]  [<c02ac306>] dma_debug_init+0x96/0x140
[   69.663411]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.663411]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.663411]  [<c0624d50>] pci_iommu_init+0x13/0x48
[   69.663411]  [<c01011f0>] do_one_initcall+0x30/0x170
[   69.663411]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.663411]  [<c0624d3d>] ? iommu_setup+0x1fd/0x1fd
[   69.663411]  [<c061f75d>] ? start_kernel+0x322/0x322
[   69.663411]  [<c061f7f8>] kernel_init+0x9b/0x12f
[   69.663411]  [<c0466bba>] kernel_thread_helper+0x6/0xd
[   69.663411] ---[ end trace 93d72a36b9146f22 ]---

crash9.txt:
[   61.373342] ------------[ cut here ]------------
[   61.373348] WARNING: at arch/x86/mm/kmemcheck/kmemcheck.c:634 
kmemcheck_fault+0xa5/0xc0()
[   61.373348] Hardware name:
[   61.373348] Modules linked in:
[   61.373348] Pid: 1, comm: swapper Not tainted 2.6.39.1 #3
[   61.373348] Call Trace:
[   61.373348]  [<c013de0d>] warn_slowpath_common+0x6d/0xa0
[   61.373348]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   61.373348]  [<c0128ce5>] ? kmemcheck_fault+0xa5/0xc0
[   61.373348]  [<c013de5d>] warn_slowpath_null+0x1d/0x20
[   61.373348]  [<c0128ce5>] kmemcheck_fault+0xa5/0xc0
[   61.373348]  [<c0124480>] do_page_fault+0x270/0x440
[   61.373348]  [<c0128713>] ? kmemcheck_read_strict+0x33/0x80
[   61.373348]  [<c0128469>] ? kmemcheck_save_addr+0x19/0x40
[   61.373348]  [<c0128713>] ? kmemcheck_read_strict+0x33/0x80
[   61.373348]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   61.373348]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   61.373348]  [<c04663f3>] error_code+0x5f/0x64
[   61.373348]  [<c0124210>] ? vmalloc_sync_all+0x100/0x100
[   61.373348]  [<c0111dbf>] ? p4_pmu_handle_irq+0x7f/0x1b0
[   61.373348]  [<c0128713>] ? kmemcheck_read_strict+0x33/0x80
[   61.373348]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   61.373348]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   61.373348]  [<c010f098>] perf_event_nmi_handler+0x28/0xa0
[   61.373348]  [<c015f1c5>] notifier_call_chain+0x75/0xe0
[   61.373348]  [<c015f700>] __atomic_notifier_call_chain+0x60/0x90
[   61.373348]  [<c015f6a0>] ? register_reboot_notifier+0x20/0x20
[   61.373348]  [<c015f74a>] atomic_notifier_call_chain+0x1a/0x20
[   61.373348]  [<c015f86d>] notify_die+0x2d/0x30
[   61.373348]  [<c0102d72>] default_do_nmi+0x32/0x280
[   61.373348]  [<c0128cad>] ? kmemcheck_fault+0x6d/0xc0
[   61.373348]  [<c010369f>] do_nmi+0x7f/0x90
[   61.373348]  [<c04664a5>] nmi_stack_correct+0x28/0x2d
[   61.373348]  [<c012007b>] ? io_apic_set_pci_routing+0x4b/0x60
[   61.373348]  [<c01093d0>] ? time_cpufreq_notifier+0x140/0x140
[   61.373348]  [<c015ffc3>] ? sched_clock_local+0xd3/0x1c0
[   61.373348]  [<c012880b>] ? kmemcheck_show_addr+0xb/0x20
[   61.373348]  [<c012896e>] ? kmemcheck_show_all+0x2e/0x40
[   61.373348]  [<c0128cad>] ? kmemcheck_fault+0x6d/0xc0
[   61.373348]  [<c0160259>] sched_clock_cpu+0xf9/0x190
[   61.373348]  [<c0128e2e>] ? kmemcheck_pte_lookup+0xe/0x40
[   61.373348]  [<c0160309>] local_clock+0x19/0x60
[   61.373348]  [<c016d33d>] lock_release_holdtime+0x2d/0x160
[   61.373348]  [<c0128d1a>] ? kmemcheck_trap+0x1a/0x30
[   61.373348]  [<c01728dc>] lock_release_nested+0x8c/0x110
[   61.373348]  [<c02a60da>] ? debug_object_deactivate+0x8a/0xf0
[   61.373348]  [<c01729ab>] __lock_release+0x4b/0xe0
[   61.373348]  [<c0172a89>] lock_release+0x49/0x70
[   61.373348]  [<c02a60da>] ? debug_object_deactivate+0x8a/0xf0
[   61.373348]  [<c0465cb9>] _raw_spin_unlock_irqrestore+0x19/0x70
[   61.373348]  [<c02a60da>] debug_object_deactivate+0x8a/0xf0
[   61.373348]  [<c015d4ed>] __run_hrtimer.clone.22+0x2d/0x120
[   61.373348]  [<c0465116>] ? _raw_spin_lock+0x66/0x70
[   61.373348]  [<c015dffd>] hrtimer_interrupt+0x17d/0x260
[   61.373348]  [<c012898b>] ? kmemcheck_hide_addr+0xb/0x20
[   61.373348]  [<c011c370>] smp_apic_timer_interrupt+0x50/0x90
[   61.373348]  [<c0295b64>] ? trace_hardirqs_off_thunk+0xc/0x18
[   61.373348]  [<c04661b7>] apic_timer_interrupt+0x2f/0x34
[   61.373348]  [<c029007b>] ? radix_tree_callback+0x4b/0x60
[   61.373348]  [<c0225eda>] ? sysfs_find_dirent+0x2a/0x50
[   61.373348]  [<c0226077>] __sysfs_add_one+0x27/0x90
[   61.373348]  [<c02260f8>] sysfs_add_one+0x18/0xb0
[   61.373348]  [<c022695a>] sysfs_do_create_link+0xea/0x1f0
[   61.373348]  [<c0226a72>] sysfs_create_link+0x12/0x20
[   61.373348]  [<c0341a44>] device_add+0x154/0x350
[   61.373348]  [<c0341c52>] device_register+0x12/0x20
[   61.373348]  [<c0341d01>] device_create_vargs+0xa1/0xc0
[   61.373348]  [<c0341d48>] device_create+0x28/0x30
[   61.373348]  [<c030386f>] tty_register_device+0x7f/0x100
[   61.373348]  [<c0290035>] ? radix_tree_callback+0x5/0x60
[   61.373348]  [<c0465ec8>] ? restore_all+0xf/0xf
[   61.373348]  [<c028da00>] ? kobject_cleanup+0x100/0x110
[   61.373348]  [<c0303d73>] tty_register_driver+0xf3/0x240
[   61.373348]  [<c0640a11>] legacy_pty_init+0x159/0x188
[   61.373348]  [<c061f75d>] ? start_kernel+0x322/0x322
[   61.373348]  [<c0640c6c>] pty_init+0x8/0x11
[   61.373348]  [<c01011f0>] do_one_initcall+0x30/0x170
[   61.373348]  [<c061f75d>] ? start_kernel+0x322/0x322
[   61.373348]  [<c0640c64>] ? unix98_pty_init+0x224/0x224
[   61.373348]  [<c061f75d>] ? start_kernel+0x322/0x322
[   61.373348]  [<c061f7f8>] kernel_init+0x9b/0x12f
[   61.373348]  [<c0466bba>] kernel_thread_helper+0x6/0xd
[   61.373348] ---[ end trace 93d72a36b9146f22 ]---

All above errors look very similar, however further operation of the kernel
depends on boot parameters.
Only with "acpi=off" the system started completely, and I was able to log in
into gdm, and later switch it off in normal way (crash8.txt).
With other parameters kernel panicked during the boot.

I attach the crashes.tar.z2 file containing logs (crash?.txt) and 
configuration
of my kernel (config). Hardware details of my machine were already provided
in previous messages in this thread.
-- 
Regards,
Wojtek


  reply	other threads:[~2011-06-11 15:23 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-06-04 20:04 Random kernel panics when booting 2.6.39.1 wzabolot@elektron.elka.pw.edu.pl
2011-06-04 20:07 ` wzabolot@elektron.elka.pw.edu.pl
2011-06-04 21:41 ` Machine panicking with 2.6.39.1 boots correctly with acpi=off wzabolot@elektron.elka.pw.edu.pl
2011-06-04 22:01   ` acpi=off switches off also HT support, maybe it is something SMP related? wzabolot@elektron.elka.pw.edu.pl
2011-06-05  9:40     ` Kernel panic with 2.6.39.1 - switching off HT in BIOS helps wzabolot@elektron.elka.pw.edu.pl
2011-06-05 20:22       ` Kernel panic on HT machine - full logs with debug from a few boots (some successful) wzab
2011-06-11 15:23         ` wzab [this message]
2011-06-11 15:28           ` Forgotten attachment Re: kmemcheck error and panic when booting 2.6.39.1 , however acpi=off allows to boot wzab
2011-06-11 16:15           ` wzab
2011-07-06 22:13             ` Tech

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4DF3885B.2080900@ise.pw.edu.pl \
    --to=wzab@ise.pw.edu.pl \
    --cc=linux-kernel@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.