From mboxrd@z Thu Jan 1 00:00:00 1970
From: Carsten Emde
Subject: [OSADL QA 3.18.9-rt5 #1]
Date: Wed, 08 Apr 2015 00:52:56 +0200
Message-ID: <55245FC8.9090509@osadl.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Cc: Linux RT Users
To: Sebastian Andrzej Siewior
Return-path:
Received: from toro.web-alm.net ([62.245.132.31]:49411 "EHLO toro.web-alm.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753843AbbDGXBK (ORCPT ); Tue, 7 Apr 2015 19:01:10 -0400
Sender: linux-rt-users-owner@vger.kernel.org
List-ID:

Hi Sebastian,

an Intel Bay Trail board (Intel(R) Celeron(R) CPU J1900 @ 1.99GHz) at the
OSADL QA Farm rack #b/slot #6 (https://www.osadl.org/?id=1894) stops
working every 12 to 36 hours. The only way to get the board back to work
is to power cycle it. Such crashes did not happen with any of the
previously tested 3.12-rt kernels. About eight crashes have been observed
so far - the kernel message obtained at the serial console (see below)
was similar in all cases.

Thanks,
Carsten.

------------[ cut here ]------------
WARNING: CPU: 3 PID: 16574 at kernel/watchdog.c:298 watchdog_overflow_callback+0x10f/0x16c()
Watchdog detected hard LOCKUP on cpu 3
Modules linked in: rpcsec_gss_krb5 nfsv4 eeprom nfs cpufreq_stats fscache bnep bluetooth ip6t_REJECT nf_reject_ipv6 nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_filter ip6_tables cfg80211 rfkill it87 hwmon_vid pl2303 usbserial cdc_acm r8169 mii iTCO_wdt iTCO_vendor_support ppdev coretemp kvm_intel kvm crc32c_intel snd_hda_codec_hdmi ghash_clmulni_intel cryptd microcode snd_hda_codec_realtek snd_hda_codec_generic serio_raw snd_hda_intel snd_hda_controller pcspkr snd_hda_codec snd_hwdep lpc_ich i2c_i801 snd_seq mfd_core snd_seq_device snd_pcm snd_timer snd xhci_pci shpchp soundcore xhci_hcd parport_pc parport nfsd auth_rpcgss oid_registry exportfs nfs_acl lockd grace sunrpc i915 i2c_algo_bit drm_kms_helper drm i2c_core video ipv6 autofs4 [last unloaded: hwlat_detector]
CPU: 3 PID: 16574 Comm: cyclictest Not tainted 3.18.9-rt5 #30
Hardware name: Gigabyte Technology Co., Ltd. To be filled by O.E.M./J1900N-D3V, BIOS F2 03/06/2014
 0000000000000009 ffff88013fd85ba8 ffffffff814f89a4 00000000000003f8
 ffff88013fd85bf8 ffff88013fd85be8 ffffffff8103a27b 0000000000000000
 ffffffff810b6e65 0000000000000003 0000000000000000 ffff88013fd85d38
Call Trace:
 [] dump_stack+0x4f/0x9e
 [] warn_slowpath_common+0x81/0x9b
 [] ? watchdog_overflow_callback+0x10f/0x16c
 [] warn_slowpath_fmt+0x46/0x48
 [] watchdog_overflow_callback+0x10f/0x16c
 [] __perf_event_overflow+0x15a/0x1e8
 [] ? x86_perf_event_set_period+0xfa/0x10c
 [] perf_event_overflow+0x14/0x16
 [] intel_pmu_handle_irq+0x2bc/0x341
 [] perf_event_nmi_handler+0x25/0x3e
 [] nmi_handle+0x72/0x134
 [] ? cpumask_clear_cpu.constprop.4+0x11/0x11
 [] ? _raw_spin_unlock_irqrestore+0xe/0x4d
 [] default_do_nmi+0x78/0x14e
 [] do_nmi+0x63/0xa4
 [] end_repeat_nmi+0x1e/0x2e
 [] ? _raw_spin_unlock_irqrestore+0xe/0x4d
 [] ? _raw_spin_unlock_irqrestore+0xe/0x4d
 [] ? _raw_spin_unlock_irqrestore+0xe/0x4d
 <> [] hrtimer_try_to_cancel+0x55/0x5f
 [] hrtimer_cancel+0x16/0x28
 [] tick_nohz_restart+0x17/0x72
 [] __tick_nohz_full_check+0x8e/0x93
 [] nohz_full_kick_work_func+0xe/0x10
 [] irq_work_run_list+0x39/0x57
 [] ? tick_sched_do_timer+0x45/0x45
 [] irq_work_tick+0x60/0x67
 [] update_process_times+0x57/0x67
 [] tick_sched_handle+0x4a/0x59
 [] tick_sched_timer+0x3b/0x64
 [] __run_hrtimer+0x7a/0x149
 [] hrtimer_interrupt+0x1cc/0x2c5
 [] local_apic_timer_interrupt+0x54/0x58
 [] smp_apic_timer_interrupt+0x31/0x43
 [] apic_timer_interrupt+0x6a/0x70
 [] ? context_tracking_user_exit+0xa0/0xcd
 [] syscall_trace_leave+0xf9/0x134
 [] int_check_syscall_exit_work+0x34/0x3d
---[ end trace 0000000000000002 ]---