From mboxrd@z Thu Jan 1 00:00:00 1970
Date: Wed, 21 Nov 2018 12:54:49 +0100
From: Pavel Machek
To: Joonas Lahtinen
Cc: bp@alien8.de, hpa@zytor.com, kernel list, mingo@redhat.com,
 tglx@linutronix.de, x86@kernel.org, jani.nikula@linux.intel.com,
 rodrigo.vivi@intel.com, intel-gfx@lists.freedesktop.org,
 chris@chris-wilson.co.uk
Subject: Re: v4.20-rc1: list_del corruption on thinkpad x220
Message-ID: <20181121115449.GA32455@amd>
References: <20181108175803.GA10785@amd>
 <154279919462.20217.14259089584802660420@jlahtine-desk.ger.corp.intel.com>
MIME-Version: 1.0
Content-Type: multipart/signed; micalg=pgp-sha1;
 protocol="application/pgp-signature"; boundary="zYM0uCDKw75PZbzx"
Content-Disposition: inline
In-Reply-To: <154279919462.20217.14259089584802660420@jlahtine-desk.ger.corp.intel.com>
User-Agent: Mutt/1.5.23 (2014-03-12)
Sender: linux-kernel-owner@vger.kernel.org
Precedence: bulk
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

--zYM0uCDKw75PZbzx
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

Hi!

> > My machine locked hard (thinkpad x220). After reboot, I found this in
> > syslog:
> >
> > Sounds like memory corruption..? Does not sound like easy to debug.
>
> Were you doing something GPU intense when you experienced the hard hang?
>
> And if so, have you been able to hit the issue more than once? At this
> point it doesn't look like anything we've hit previously, so would be
> great to have some more insight into how we could reproduce.

I've seen another crash since then, but I don't think it counts as
"easily reproducible".

I may have been running flightgear at that point. That's fairly GPU intensive.

> There's one similar for nouveau in Bugzilla, but it seems like a genuine
> memory corruption (1 bit flipped):
>
> https://bugs.freedesktop.org/show_bug.cgi?id=84880
>
> Any extra information would be of use :)
>
> Regards, Joonas
>
> PS. Could you open a bug to Bugzilla, it'll help to collect the
> information in one consolidated place:
>
> https://01.org/linuxgraphics/documentation/how-report-bugs

I prefer email... certainly for bugs that can't be reproduced.

Best regards,
								Pavel

> >
> > ...otoh, it still looks like an addres, so maybe it is "just" race in
> > GPU drivers?
> >
> > Any ideas?
> > 								Pavel
> >
> > Nov 8 18:35:01 duo CRON[28511]: (root) CMD (command -v debian-sa1 > /dev/null && debian-sa1 1 1 1)
> > Nov 8 18:42:57 duo kernel: list_del corruption. prev->next should be ffff8801742b8178, but was ffffc9000192fec8
> > Nov 8 18:42:57 duo kernel: ------------[ cut here ]------------
> > Nov 8 18:42:57 duo kernel: kernel BUG at /data/fast/l/k/lib/list_debug.c:53!
> > Nov 8 18:42:57 duo kernel: invalid opcode: 0000 [#1] SMP PTI
> > Nov 8 18:42:57 duo kernel: CPU: 2 PID: 1082 Comm: i915/signal:1 Not tainted 4.20.0-rc1+ #3
> > Nov 8 18:42:57 duo kernel: Hardware name: LENOVO 42872WU/42872WU, BIOS 8DET74WW (1.44 ) 03/13/2018
> > Nov 8 18:42:57 duo kernel: RIP: 0010:__list_del_entry_valid+0x8e/0x90
> > Nov 8 18:42:57 duo kernel: Code: 66 88 d1 ff 0f 0b 48 89 fe 31 c0 48 c7 c7 90 74 5e 85 e8 53 88 d1 ff 0f 0b 48 89 fe 31 c0 48 c7 c7 c8 74 5e 85 e8 40 88 d1 ff <0f> 0b 55 48 89 d0 48 8b 52 08 48 89 e5 48 39 f2 75 19 48 8b 32 48
> > Nov 8 18:42:57 duo kernel: RSP: 0000:ffffc9000196be78 EFLAGS: 00210086
> > Nov 8 18:42:57 duo kernel: RAX: 0000000000000054 RBX: ffff8801742b8178 RCX: 0000000000000000
> > Nov 8 18:42:57 duo kernel: RDX: 0000000000000000 RSI: ffff88019e2a53d8 RDI: ffff88019e2a53d8
> > Nov 8 18:42:57 duo kernel: RBP: ffffc9000196be78 R08: ffff880196e2cd10 R09: 0000000000000000
> > Nov 8 18:42:57 duo kernel: R10: 00000000e7684eb9 R11: 3863656632393101 R12: ffffc9000196bec8
> > Nov 8 18:42:57 duo kernel: R13: ffff88019707e000 R14: ffff8801742b8080 R15: ffffc9000192fdd0
> > Nov 8 18:42:57 duo kernel: FS: 0000000000000000(0000) GS:ffff88019e280000(0000) knlGS:0000000000000000
> > Nov 8 18:42:57 duo kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > Nov 8 18:42:57 duo kernel: CR2: 00000000ed2bf000 CR3: 000000000581e001 CR4: 00000000000606a0
> > Nov 8 18:42:57 duo kernel: Call Trace:
> > Nov 8 18:42:57 duo kernel: intel_breadcrumbs_signaler+0x162/0x330
> > Nov 8 18:42:57 duo kernel: kthread+0x116/0x150
> > Nov 8 18:42:57 duo kernel: ? intel_engine_wakeup+0x40/0x40
> > Nov 8 18:42:57 duo kernel: ? kthread_park+0x90/0x90
> > Nov 8 18:42:57 duo kernel: ret_from_fork+0x35/0x40
> > Nov 8 18:42:57 duo kernel: Modules linked in:
> > Nov 8 18:42:57 duo kernel: ---[ end trace 2f8da183a56f80f6 ]---
> > Nov 8 18:42:57 duo kernel: RIP: 0010:__list_del_entry_valid+0x8e/0x90
> > Nov 8 18:42:57 duo kernel: Code: 66 88 d1 ff 0f 0b 48 89 fe 31 c0 48 c7 c7 90 74 5e 85 e8 53 88 d1 ff 0f 0b 48 89 fe 31 c0 48 c7 c7 c8 74 5e 85 e8 40 88 d1 ff <0f> 0b 55 48 89 d0 48 8b 52 08 48 89 e5 48 39 f2 75 19 48 8b 32 48
> > Nov 8 18:42:57 duo kernel: RSP: 0000:ffffc9000196be78 EFLAGS: 00210086
> > Nov 8 18:42:57 duo kernel: RAX: 0000000000000054 RBX: ffff8801742b8178 RCX: 0000000000000000
> > Nov 8 18:42:57 duo kernel: RDX: 0000000000000000 RSI: ffff88019e2a53d8 RDI: ffff88019e2a53d8
> > Nov 8 18:42:57 duo kernel: RBP: ffffc9000196be78 R08: ffff880196e2cd10 R09: 0000000000000000
> > Nov 8 18:42:57 duo kernel: R10: 00000000e7684eb9 R11: 3863656632393101 R12: ffffc9000196bec8
> > Nov 8 18:42:57 duo kernel: R13: ffff88019707e000 R14: ffff8801742b8080 R15: ffffc9000192fdd0
> > Nov 8 18:42:57 duo kernel: FS: 0000000000000000(0000) GS:ffff88019e280000(0000) knlGS:0000000000000000
> > Nov 8 18:42:57 duo kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > Nov 8 18:42:57 duo kernel: CR2: 00000000ed2bf000 CR3: 000000000581e001 CR4: 00000000000606a0
> >
> > --
> > (english) http://www.livejournal.com/~pavelmachek
> > (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

--zYM0uCDKw75PZbzx
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: Digital signature

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1

iEYEARECAAYFAlv1R4kACgkQMOfwapXb+vIdZQCeI8QzZpkgOaaaEO0FIL1IpHC0
facAoJtWbLhs6EtAqPeI8xngYTG7z2nz
=yOuu
-----END PGP SIGNATURE-----

--zYM0uCDKw75PZbzx--
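
The thread weighs two explanations for the bad pointer: a single flipped bit (as in the nouveau report) or a value that is still a plausible address, which would point more towards a race. One quick way to tell these apart is to count how many bits differ between the expected and the observed `prev->next` values from the "list_del corruption" line. The following is a minimal sketch for triage only; the helper names `differing_bits` and `looks_like_single_bit_flip` are hypothetical and do not come from the kernel or from this thread, and the two constants are copied from the log above.

```python
MASK64 = (1 << 64) - 1  # treat both values as 64-bit kernel pointers


def differing_bits(expected: int, actual: int) -> int:
    """Return the number of bit positions in which the two values differ."""
    return bin((expected ^ actual) & MASK64).count("1")


def looks_like_single_bit_flip(expected: int, actual: int) -> bool:
    """True only if exactly one bit differs between the two values."""
    return differing_bits(expected, actual) == 1


if __name__ == "__main__":
    # Values taken from the "list_del corruption" line in the syslog.
    expected = 0xFFFF8801742B8178  # "prev->next should be"
    actual = 0xFFFFC9000192FEC8    # "but was"

    print("differing bits:", differing_bits(expected, actual))
    print("single bit flip:", looks_like_single_bit_flip(expected, actual))
```

If many bits differ, the single-bit-flip pattern does not apply to this report; that by itself does not establish whether the cause is a driver race or some other form of corruption.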