From mboxrd@z Thu Jan 1 00:00:00 1970
From: Joerg Roedel
Subject: Kernel crash with sky2
Date: Mon, 17 May 2010 20:52:28 +0200
Message-ID: <20100517185228.GG9007@amd.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Cc: netdev@vger.kernel.org
To: Stephen Hemminger
Return-path:
Received: from mail-va3.bigfish.com ([216.32.180.111]:46171 "EHLO mail10-va3-R.bigfish.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1754752Ab0EQTWz (ORCPT ); Mon, 17 May 2010 15:22:55 -0400
Content-Disposition: inline
Sender: netdev-owner@vger.kernel.org
List-ID:

Hi Stephen,

I am seeing the following crash in the sky2 code with 2.6.34 on my
laptop when I unplug the LAN cable and then unplug the power cable, so
that the machine switches to battery. It does not happen with acpi=off.
I haven't tested earlier kernels, but I can do that if necessary.

I did some initial research and found that the driver assumes that
port[1] is available when the status bits for it are set on the device.

Please let me know if you need any additional information or want me to
test anything.
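The port[1] assumption described above can be sketched in a small
user-space model. This is not the sky2 source: every name below
(model_hw, model_hw_intr, MODEL_PORT_MASK, the 8-bit shift between the
status bits of the two ports) is hypothetical and chosen only for
illustration. The sketch assumes that a powered-down device returns
all-ones on register reads, which would match the 0xffff and 0xffffffff
values in the log, and shows two possible guards: bail out on an
all-ones status, and skip any port that has no device attached.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* All names and values here are hypothetical, for illustration only. */
#define MODEL_MAX_PORTS  2      /* hardware may expose up to two ports */
#define MODEL_PORT_MASK  0x3fu  /* assumed per-port error bits */
#define MODEL_PORT_SHIFT 8      /* assumed offset of the port 1 bits */

struct model_netdev {
	int errors_seen;            /* counts errors attributed to this port */
};

struct model_hw {
	unsigned ports;                             /* ports actually present */
	struct model_netdev *dev[MODEL_MAX_PORTS];  /* NULL for absent ports */
};

/*
 * Attribute an error to one port.
 * Returns 1 if the error was recorded, 0 if the port does not exist.
 */
static int model_port_error(struct model_hw *hw, unsigned port)
{
	assert(hw != NULL);

	/* Guard: never touch dev[port] for a port that is not there. */
	if (port >= hw->ports || hw->dev[port] == NULL)
		return 0;

	hw->dev[port]->errors_seen++;
	return 1;
}

/*
 * Dispatch a hardware error status word to the affected ports.
 * Returns the number of ports that recorded an error, or -1 if the
 * status looks like a read from a device that is no longer reachable.
 */
static int model_hw_intr(struct model_hw *hw, uint32_t status)
{
	int handled = 0;
	unsigned port;

	/* Guard: all-ones is treated as "device gone", not as real errors. */
	if (status == 0xffffffffu)
		return -1;

	for (port = 0; port < MODEL_MAX_PORTS; port++) {
		uint32_t bits = (status >> (port * MODEL_PORT_SHIFT)) & MODEL_PORT_MASK;

		if (bits)
			handled += model_port_error(hw, port);
	}
	return handled;
}
```

With a single-port model, an all-ones status is rejected up front, and a
status that only carries port 1 bits is ignored instead of following a
NULL pointer. Whether the real driver should handle this in the
interrupt path or elsewhere is a question for the maintainer.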

The crash message is:

[ 107.010134] sky2 0000:02:00.0: PCI hardware error (0xffff)
[ 107.015614] sky2 0000:02:00.0: PCI Express error (0xffffffff)
[ 107.021355] sky2 0000:02:00.0: eth0: ram data read parity error
[ 107.027249] sky2 0000:02:00.0: eth0: ram data write parity error
[ 107.033253] sky2 0000:02:00.0: eth0: MAC parity error
[ 107.038283] sky2 0000:02:00.0: eth0: RX parity error
[ 107.043259] sky2 0000:02:00.0: eth0: TCP segmentation error
[ 107.048823] BUG: unable to handle kernel NULL pointer dereference at 0000000000000438
[ 107.053238] IP: [] sky2_hw_error+0x153/0x310 [sky2]
[ 107.053238] PGD 139600067 PUD 139643067 PMD 0
[ 107.053238] Oops: 0000 [#1] SMP
[ 107.053238] last sysfs file: /sys/devices/system/cpu/cpu0/cpufreq/scaling_max_freq
[ 107.053238] CPU 1
[ 107.053238] Modules linked in: snd_hda_codec_atihdmi snd_hda_codec_idt snd_hda_intel rfcomm snd_pcm_oss snd_hda_2
[ 107.053238]
[ 107.053238] Pid: 7, comm: ksoftirqd/1 Not tainted 2.6.34-default #1 307E/HP ProBook 6545b
[ 107.053238] RIP: 0010:[] [] sky2_hw_error+0x153/0x310 [sky2]
[ 107.053238] RSP: 0018:ffff880001e83d78 EFLAGS: 00010202
[ 107.053238] RAX: 0000000000000001 RBX: 0000000000ffffff RCX: 00000000000001f4
[ 107.053238] RDX: 000000000000000a RSI: 0000000000000202 RDI: ffffffff81a5dc80
[ 107.053238] RBP: ffff880001e83db8 R08: 00000000ffffffff R09: 0000000000000000
[ 107.053238] R10: 0000000000000000 R11: 0000000000000001 R12: ffff88012862da00
[ 107.053238] R13: 0000000000000001 R14: 0000000000000001 R15: 0000000000000000
[ 107.053238] FS: 00007ff07987b800(0000) GS:ffff880001e80000(0000) knlGS:0000000000000000
[ 107.053238] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 107.053238] CR2: 0000000000000438 CR3: 0000000139641000 CR4: 00000000000006e0
[ 107.053238] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 107.053238] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 107.053238] Process ksoftirqd/1 (pid: 7, threadinfo ffff88013badc000, task ffff88013bad1700)
[ 107.053238] Stack:
[ 107.053238] ffff88013b488b00 ffff8801284f7000 00000a8800000ad7 00000000ffffffff
[ 107.053238] <0> ffff88012862da00 00000000ffffffff ffff88013b4d1000 ffff88013b488b00
[ 107.053238] <0> ffff880001e83ed8 ffffffffa000720f 0000000000000082 0000000000000000
[ 107.053238] Call Trace:
[ 107.053238]
[ 107.053238] [] sky2_poll+0xeef/0x1020 [sky2]
[ 107.053238] [] ? lapic_timer_broadcast+0x1b/0x20
[ 107.053238] [] ? __queue_work+0x3f/0x50
[ 107.053238] [] ? delayed_work_timer_fn+0x39/0x50
[ 107.053238] [] net_rx_action+0xed/0x1f0
[ 107.053238] [] __do_softirq+0xb0/0x1d0
[ 107.053238] [] call_softirq+0x1c/0x30
[ 107.053238]
[ 107.053238] [] ? do_softirq+0x55/0x90
[ 107.053238] [] run_ksoftirqd+0x80/0x130
[ 107.053238] [] ? run_ksoftirqd+0x0/0x130
[ 107.053238] [] kthread+0x96/0xa0
[ 107.053238] [] kernel_thread_helper+0x4/0x10
[ 107.053238] [] ? kthread+0x0/0xa0
[ 107.053238] [] ? kernel_thread_helper+0x0/0x10
[ 107.053238] Code: e8 d3 a7 43 e1 85 c0 0f 85 f5 00 00 00 44 89 f0 ba 00 02 00 00 c1 e0 06 0d a0 01 00 00 89 c0 4
[ 107.053238] RIP [] sky2_hw_error+0x153/0x310 [sky2]
[ 107.053238] RSP
[ 107.053238] CR2: 0000000000000438
[ 107.392268] ---[ end trace 8a4d942e73cd8681 ]---
[ 107.396866] Kernel panic - not syncing: Fatal exception in interrupt
[ 107.403214] Pid: 7, comm: ksoftirqd/1 Tainted: G D 2.6.34-default #1
[ 107.410230] Call Trace:
[ 107.412695] [] panic+0x7d/0xf7
[ 107.418004] [] oops_end+0xe2/0xf0
[ 107.422970] [] no_context+0xfb/0x260
[ 107.428174] [] __bad_area_nosemaphore+0x10d/0x1c0
[ 107.434523] [] bad_area_nosemaphore+0x13/0x20
[ 107.440513] [] do_page_fault+0x26f/0x330
[ 107.446084] [] page_fault+0x1f/0x30
[ 107.451202] [] ? sky2_hw_error+0x153/0x310 [sky2]
[ 107.457554] [] ? sky2_hw_error+0x36/0x310 [sky2]
[ 107.463811] [] sky2_poll+0xeef/0x1020 [sky2]
[ 107.469706] [] ? lapic_timer_broadcast+0x1b/0x20
[ 107.475980] [] ? __queue_work+0x3f/0x50
[ 107.481457] [] ? delayed_work_timer_fn+0x39/0x50
[ 107.487698] [] net_rx_action+0xed/0x1f0
[ 107.493183] [] __do_softirq+0xb0/0x1d0
[ 107.498558] [] call_softirq+0x1c/0x30
[ 107.503868] [] ? do_softirq+0x55/0x90
[ 107.509788] [] run_ksoftirqd+0x80/0x130
[ 107.515275] [] ? run_ksoftirqd+0x0/0x130
[ 107.520823] [] kthread+0x96/0xa0
[ 107.525702] [] kernel_thread_helper+0x4/0x10
[ 107.531612] [] ? kthread+0x0/0xa0
[ 107.536563] [] ? kernel_thread_helper+0x0/0x10
[ 107.542657] [drm:drm_fb_helper_panic] *ERROR* panic occurred, switching back to text console
[ 107.551054] BUG: scheduling while atomic: ksoftirqd/1/7/0x10000100
[ 107.552642] Modules linked in: snd_hda_codec_atihdmi snd_hda_codec_idt snd_hda_intel rfcomm snd_pcm_oss snd_hda_2
[ 107.552642] Pid: 7, comm: ksoftirqd/1 Tainted: G D 2.6.34-default #1
[ 107.552642] Call Trace:
[ 107.552642] [] __schedule_bug+0x61/0x70
[ 107.552642] [] schedule+0x6cc/0x800
[ 107.552642] [] __cond_resched+0x2a/0x40
[ 107.552642] [] _cond_resched+0x30/0x40
[ 107.552642] [] __kmalloc+0xc1/0x190
[ 107.552642] [] ? T.687+0x13/0x20 [drm_kms_helper]
[ 107.552642] [] T.687+0x13/0x20 [drm_kms_helper]
[ 107.552642] [] drm_crtc_helper_set_config+0xe7/0x880 [drm_kms_helper]
[ 107.552642] [] drm_fb_helper_force_kernel_mode+0x74/0xa0 [drm_kms_helper]
[ 107.552642] [] drm_fb_helper_panic+0x23/0x30 [drm_kms_helper]
[ 107.552642] [] notifier_call_chain+0x56/0x80
[ 107.552642] [] atomic_notifier_call_chain+0x1a/0x20
[ 107.552642] [] panic+0xa9/0xf7
[ 107.552642] [] oops_end+0xe2/0xf0
[ 107.552642] [] no_context+0xfb/0x260
[ 107.552642] [] __bad_area_nosemaphore+0x10d/0x1c0
[ 107.552642] [] bad_area_nosemaphore+0x13/0x20
[ 107.552642] [] do_page_fault+0x26f/0x330
[ 107.552642] [] page_fault+0x1f/0x30
[ 107.552642] [] ? sky2_hw_error+0x153/0x310 [sky2]
[ 107.552642] [] ? sky2_hw_error+0x36/0x310 [sky2]
[ 107.552642] [] sky2_poll+0xeef/0x1020 [sky2]
[ 107.552642] [] ? lapic_timer_broadcast+0x1b/0x20
[ 107.552642] [] ? __queue_work+0x3f/0x50
[ 107.552642] [] ? delayed_work_timer_fn+0x39/0x50
[ 107.552642] [] net_rx_action+0xed/0x1f0
[ 107.552642] [] __do_softirq+0xb0/0x1d0
[ 107.552642] [] call_softirq+0x1c/0x30
[ 107.552642] [] ? do_softirq+0x55/0x90
[ 107.552642] [] run_ksoftirqd+0x80/0x130
[ 107.552642] [] ? run_ksoftirqd+0x0/0x130
[ 107.552642] [] kthread+0x96/0xa0
[ 107.552642] [] kernel_thread_helper+0x4/0x10
[ 107.552642] [] ? kthread+0x0/0xa0
[ 107.552642] [] ? kernel_thread_helper+0x0/0x10

lspci -vvv -n of the device:

02:00.0 0200: 11ab:436c (rev 10)
	Subsystem: 103c:3080
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR-
	Capabilities: [5c] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+
		Address: 00000000fee0100c Data: 4189
	Capabilities: [c0] Express (v2) Legacy Endpoint, MSI 00
		DevCap: MaxPayload 128 bytes, PhantFunc 0, Latency L0s unlimited, L1 unlimited
			ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset-
		DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
			RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop-
			MaxPayload 128 bytes, MaxReadReq 512 bytes
		DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq+ AuxPwr+ TransPend-
		LnkCap: Port #1, Speed 2.5GT/s, Width x1, ASPM L0s L1, Latency L0 <256ns, L1 unlimited
			ClockPM+ Suprise- LLActRep- BwNot-
		LnkCtl: ASPM L0s L1 Enabled; RCB 128 bytes Disabled- Retrain- CommClk+
			ExtSynch- ClockPM+ AutWidDis- BWInt- AutBWInt-
		LnkSta: Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
	Capabilities: [100] Advanced Error Reporting
	Capabilities: [130] Device Serial Number 70-5a-b6-ff-ff-97-a6-80
	Kernel driver in use: sky2
	Kernel modules: sky2

Thanks,

Joerg