From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755540Ab0CDBqx (ORCPT ); Wed, 3 Mar 2010 20:46:53 -0500 Received: from mail-fx0-f219.google.com ([209.85.220.219]:44168 "EHLO mail-fx0-f219.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755360Ab0CDBqq convert rfc822-to-8bit (ORCPT ); Wed, 3 Mar 2010 20:46:46 -0500 DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=date:from:to:cc:subject:message-id:references:mime-version :content-type:content-disposition:content-transfer-encoding :in-reply-to:user-agent; b=SZuIv8SWp0z9W1Te6XdhMfRwu8CrettjZD5KXFPcgTXEl6zAh/ODKrnl+XeM5W9cDy eikPEQW45REg33bphSGkoekyvCb0ooZ9vPslt2AzUonsapMGpYOzAZMo+MLEE3jIEo8V FdjNlQ7jXOvU+W+QmerfbCDzTY9w4LNtjHmxo= Date: Thu, 4 Mar 2010 02:46:43 +0100 From: Frederic Weisbecker To: =?iso-8859-1?Q?Am=E9rico?= Wang Cc: LKML , Steven Rostedt , Ingo Molnar Subject: Re: 2.6.33: ftrace triggers soft lockup Message-ID: <20100304014641.GH5194@nowhere> References: <2375c9f91003022204p5bdab1fdj3b3500998575fc28@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8BIT In-Reply-To: <2375c9f91003022204p5bdab1fdj3b3500998575fc28@mail.gmail.com> User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Mar 03, 2010 at 02:04:23PM +0800, Américo Wang wrote: > Mar 3 13:34:37 dhcp-66-70-5 kernel: BUG: soft lockup - CPU#3 stuck for 61s! [kstop/3:4876] > Mar 3 13:34:37 dhcp-66-70-5 kernel: Modules linked in: ipt_REJECT xt_tcpudp x_tables bridge stp deflate zlib_deflate ctr cast5 crypto_null ccm serpent blowfish twofish twofish_common ecb xcbc cbc md5 sha256_generic sha512_generic des_generic aes_x86_64 aes_generic ah6 ah4 esp6 esp4 xfrm4_tunnel tunnel4 xfrm4_mode_tunnel xfrm4_mode_transport xfrm6_mode_transport xfrm6_mode_beet xfrm6_mode_tunnel ipcomp ipcomp6 xfrm_ipcomp xfrm6_tunnel tunnel6 af_key autofs4 hidp rfcomm l2cap crc16 bluetooth lockd sunrpc ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_addr iscsi_tcp bnx2i cnic uio ipv6 cxgb3i iw_cxgb3 ib_core cxgb3 mdio libiscsi_tcp libiscsi scsi_transport_iscsi cpufreq_ondemand powernow_k8 freq_table dm_multipath scsi_dh video output sbs sbshc power_meter hwmon battery acpi_memhotplug ac parport_pc lp parport sg dcdbas sr_mod cdrom serio_raw rtc_cmos rtc_core rtc_lib tpm_tis tpm tpm_bios button tg3 libphy shpchp i2c_nforce2 i2c_core pcspkr dm_snapshot dm_zero dm_mirror dm_region_hash dm_log dm_mod sata_nv libata sd_mod s > Mar 3 13:34:37 dhcp-66-70-5 kernel: si_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd [last unloaded: ip_tables] > Mar 3 13:34:37 dhcp-66-70-5 kernel: irq event stamp: 51397 > Mar 3 13:34:37 dhcp-66-70-5 kernel: hardirqs last enabled at (51396): [] restore_args+0x0/0x30 > Mar 3 13:34:37 dhcp-66-70-5 kernel: hardirqs last disabled at (51397): [] save_args+0x67/0x70 > Mar 3 13:34:37 dhcp-66-70-5 kernel: softirqs last enabled at (6536): [] return_to_handler+0x0/0x2f > Mar 3 13:34:37 dhcp-66-70-5 kernel: softirqs last disabled at (6567): [] return_to_handler+0x0/0x2f > Mar 3 13:34:37 dhcp-66-70-5 kernel: CPU 3 > Mar 3 13:34:37 dhcp-66-70-5 kernel: Modules linked in: ipt_REJECT xt_tcpudp x_tables bridge stp deflate zlib_deflate ctr cast5 crypto_null ccm serpent blowfish twofish twofish_common ecb xcbc cbc md5 sha256_generic sha512_generic des_generic aes_x86_64 aes_generic ah6 ah4 esp6 esp4 xfrm4_tunnel tunnel4 xfrm4_mode_tunnel xfrm4_mode_transport xfrm6_mode_transport xfrm6_mode_beet xfrm6_mode_tunnel ipcomp ipcomp6 xfrm_ipcomp xfrm6_tunnel tunnel6 af_key autofs4 hidp rfcomm l2cap crc16 bluetooth lockd sunrpc ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_addr iscsi_tcp bnx2i cnic uio ipv6 cxgb3i iw_cxgb3 ib_core cxgb3 mdio libiscsi_tcp libiscsi scsi_transport_iscsi cpufreq_ondemand powernow_k8 freq_table dm_multipath scsi_dh video output sbs sbshc power_meter hwmon battery acpi_memhotplug ac parport_pc lp parport sg dcdbas sr_mod cdrom serio_raw rtc_cmos rtc_core rtc_lib tpm_tis tpm tpm_bios button tg3 libphy shpchp i2c_nforce2 i2c_core pcspkr dm_snapshot dm_zero dm_mirror dm_region_hash dm_log dm_mod sata_nv libata sd_mod s > Mar 3 13:34:37 dhcp-66-70-5 kernel: si_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd [last unloaded: ip_tables] > Mar 3 13:34:37 dhcp-66-70-5 kernel: > Mar 3 13:34:38 dhcp-66-70-5 kernel: Pid: 4876, comm: kstop/3 Not tainted 2.6.33 #31 0J001K/PowerEdge T105 > Mar 3 13:34:38 dhcp-66-70-5 kernel: RIP: 0010:[] [] native_sched_clock+0x2a/0x69 > Mar 3 13:34:38 dhcp-66-70-5 kernel: RSP: 0018:ffff88002fe03da0 EFLAGS: 00000246 > Mar 3 13:34:38 dhcp-66-70-5 kernel: RAX: 00000000b24e3e78 RBX: ffff88002fe03da0 RCX: ffff88002fe03ee0 > Mar 3 13:34:39 dhcp-66-70-5 kernel: RDX: 0000000000000053 RSI: ffffffff810dfde6 RDI: ffffffff810e138d > Mar 3 13:34:41 dhcp-66-70-5 kernel: RBP: ffffffff81003881 R08: 0000000000000110 R09: ffffffff81003896 > Mar 3 13:34:41 dhcp-66-70-5 kernel: R10: ffff88021697dfd8 R11: 0000000000000000 R12: ffff88002fe03d20 > Mar 3 13:34:41 dhcp-66-70-5 kernel: R13: ffffffff810e138d R14: ffff8802287a6680 R15: ffffffff81003881 > Mar 3 13:34:41 dhcp-66-70-5 kernel: FS: 00007fd9db23a720(0000) GS:ffff88002fe00000(0000) knlGS:0000000000000000 > Mar 3 13:34:41 dhcp-66-70-5 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b > Mar 3 13:34:41 dhcp-66-70-5 kernel: CR2: 00000000008e42c0 CR3: 0000000219d90000 CR4: 00000000000006e0 > Mar 3 13:34:41 dhcp-66-70-5 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > Mar 3 13:34:41 dhcp-66-70-5 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > Mar 3 13:34:41 dhcp-66-70-5 kernel: Process kstop/3 (pid: 4876, threadinfo ffff88021697c000, task ffff8802287a6680) > Mar 3 13:34:41 dhcp-66-70-5 kernel: Stack: > Mar 3 13:34:41 dhcp-66-70-5 kernel: ffff88002fe03db0 ffffffff810e7407 ffff88002fe03df0 ffffffff810ff331 > Mar 3 13:34:41 dhcp-66-70-5 kernel: <0> ffff88002ffceec0 ffff88002fe03e98 ffffffff810e138d ffff8802287a6680 > Mar 3 13:34:41 dhcp-66-70-5 kernel: <0> ffff88002fe03e00 ffffffff810dfde6 ffff88002fe03e40 ffffffff8102cd97 > Mar 3 13:34:41 dhcp-66-70-5 kernel: Call Trace: > Mar 3 13:34:41 dhcp-66-70-5 kernel: > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? trace_clock_local+0x2c/0x43 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? ftrace_push_return_trace+0xac/0x17a > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? __rcu_process_callbacks+0x1d4/0x4a9 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? cpu_needs_another_gp+0x4/0x46 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? prepare_ftrace_return+0xda/0x138 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? ftrace_graph_caller+0x4a/0x71 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x15/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? cpu_needs_another_gp+0x9/0x46 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] print_context_stack+0xb6/0x144 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] dump_trace+0x320/0x3bf > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] show_trace_log_lvl+0x5f/0x72 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? worker_thread+0x0/0x46c > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? exit_signals+0x7/0x253 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] show_stack_log_lvl+0x185/0x19b > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? do_softirq+0x8c/0x181 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] show_registers+0x102/0x3ac > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] show_regs+0x1d/0x53 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] softlockup_tick+0x268/0x2d2 > Mar 3 13:34:41 dhcp-66-70-5 kernel: > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? exit_signals+0x7/0x253 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? prepare_ftrace_return+0x2a/0x138 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? do_exit+0x160/0xad0 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? worker_thread+0x0/0x46c > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? worker_thread+0x0/0x46c > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? ftrace_graph_caller+0x4a/0x71 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? exit_signals+0xc/0x253 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? do_exit+0x16f/0xad0 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? autoremove_wake_function+0x0/0x52 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? worker_thread+0x0/0x46c > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] run_local_timers+0x32/0x3b > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? return_to_handler+0x0/0x2f > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] update_process_times+0x51/0xa5 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? kernel_thread_helper+0x4/0x10 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? restore_args+0x0/0x30 > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? kthread+0x0/0xec > Mar 3 13:34:41 dhcp-66-70-5 kernel: [] ? kernel_thread_helper+0x0/0x10 So it is stuck in stop machine. I wonder where exactly. I see some do_exit at the top but I wonder how much they are reliable. Anyway, as Steve said, we really need a full config to reproduce it. Thanks.