From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 9BE36C61DA4 for ; Thu, 16 Mar 2023 00:15:25 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230088AbjCPAPX (ORCPT ); Wed, 15 Mar 2023 20:15:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41124 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229459AbjCPAPW (ORCPT ); Wed, 15 Mar 2023 20:15:22 -0400 Received: from smtp-fw-33001.amazon.com (smtp-fw-33001.amazon.com [207.171.190.10]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E585141B6B; Wed, 15 Mar 2023 17:15:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amazon.com; i=@amazon.com; q=dns/txt; s=amazon201209; t=1678925721; x=1710461721; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=2oySJvO6i9+28hK9e7TgkInDJy9QN6sfEmwT+3wRJvY=; b=mrkk76qxG5+S5l9AWTivjbEzLnW80fWep8hYCfmIbuWhRrjenln74qIQ 0IezG/NYMtrVPV1semvrG9Msy6PsAECjkuX4ZOqoH63Cqrr8PPhd5fIWX JGT3lmQxPWOABF0FwB7V0ypyBW3meno308ykiSItH3TbIhVlvZzzGwV+6 A=; X-IronPort-AV: E=Sophos;i="5.98,262,1673913600"; d="scan'208";a="269902607" Received: from iad12-co-svc-p1-lb1-vlan3.amazon.com (HELO email-inbound-relay-pdx-2b-m6i4x-cadc3fbd.us-west-2.amazon.com) ([10.43.8.6]) by smtp-border-fw-33001.sea14.amazon.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 16 Mar 2023 00:15:13 +0000 Received: from EX19MTAUWC001.ant.amazon.com (pdx1-ws-svc-p6-lb9-vlan2.pdx.amazon.com [10.236.137.194]) by email-inbound-relay-pdx-2b-m6i4x-cadc3fbd.us-west-2.amazon.com (Postfix) with ESMTPS id B878CA06B9; Thu, 16 Mar 2023 00:15:10 +0000 (UTC) Received: from EX19D004ANA001.ant.amazon.com (10.37.240.138) by EX19MTAUWC001.ant.amazon.com (10.250.64.174) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.25; Thu, 16 Mar 2023 00:14:50 +0000 Received: from 88665a182662.ant.amazon.com (10.106.101.20) by EX19D004ANA001.ant.amazon.com (10.37.240.138) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.24; Thu, 16 Mar 2023 00:14:36 +0000 From: Kuniyuki Iwashima To: CC: , , , , , , , , , , Subject: Re: KERNEL BUG LIKELY: Kernel Panic! MKISS related Date: Wed, 15 Mar 2023 17:14:28 -0700 Message-ID: <20230316001428.54359-1-kuniyu@amazon.com> X-Mailer: git-send-email 2.30.2 In-Reply-To: References: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Content-Type: text/plain X-Originating-IP: [10.106.101.20] X-ClientProxiedBy: EX19D038UWC001.ant.amazon.com (10.13.139.213) To EX19D004ANA001.ant.amazon.com (10.37.240.138) Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org Hi, From: Chris Maness Date: Wed, 15 Mar 2023 09:54:03 -0700 > I noticed this on two systems except I did not get the kernel dump > readout as X was running. I thought it may have had to do with > VirtualBox, but then I am getting similar behavior on a real install. > I was lucky that I did not have X running and have nice output that I > OCR'd from a photo of the screen. > > How to reproduce this: > > Setup 3 pseudo tty's for loopback to FBB BBS > kissnetd -p 3 & > kissattach /dev/pts/1 > kissparms -c 1 -p lb0 > kissattach /dev/pts/2 > kissparms -c 1 -p lb1 > fbb configured with a real radio on /dev/ttyS0 for port 1 > and the lb0 AX.25 kernel interface for port 2. > > I leave the third pseudo tty for connecting JNOS to FBB over this > "loopback" net. > > If I try to pass bulls (or if I remember correctly even connect from > JNOS) it locks cold. > > Last night (where this screen shot kernel panic came from) was an > actual radio port where I was pushing bulls to a newly installed > instance of FBB on Arch Linux with the latest kernel 6.2.5. I have > not had issues in slackware (and I was also running the latest stable > kernel as I am too lazy to keep patching the LTS kernel, but I have > not played as much there. I think I may install slack on this box too > and see if I can get it to dump like this. If you want the photo, > just email me directly as it looks like the OCR is not that good, but > I do not know how photos will go over here. > > Here is the screen dump: > > 11 fm KOolIP: 1 to NoTJ-1 ctI DISC: 1100216.0098271 BUG: kernel MULL > > [106216.0109741 #PF supervisor read access in kernel mode > > 1106216.0121001 APE: error codetox0000) > > not present page > > [106216.0132301 PCD 0 P4D 0 > > [106216.0143551 Dops: 0000 [#11 PREEMPT SMP PTI > > [106216.0151781 CPU: 0 PID: 39178 Comm: xibbd Tainted: G > > DE > > 6.2.5-arch1-1 #1 Fel?0e9497e04500 > > [106216.0166331 Hardware name: Dell Inc. Optifiex 790/ONKUGV, BIOS A17 > 03/14/2013 > > [106216.0177931 RIP: 0010 :ax25_addr_ax25dev+0x44/0xb0 [ax251 > > [106216.0189621 Code: c1 53 ed 9 b3 61 P9 48 86 14 40 16 01 00 48 85 > db 74 41 41 bc 01 00 00 00 eb 08 48 Bb 16 > > 0 03 00 00 eB do fb M Tr 85 co 75 e1 48 Ba bb 90 00 00=C2=B0 > > [106216.0214491 RSP: 0018: Frffa52b0107bdeB EFLAGS: 00010286 > > [106216.022714] RAX: 0000000000000000 RBX: FIff9457c84126c0 RCX: > 0000000000000000 > > [106216.023976] RDX: 0000000000000001 RSI: ffffffffc1541100 RDI: > ffffa5260107be68 > > [106216.025229] RBP: ffFfa52b0107be68 ROB: 0000000000000009 R09: > 0000000000000000 > > [106216.0264951 R10: 0000000000000000 R11: 0000000000000000 R12: > 0000000000000001 > > [106216.0277661 R13: 0000000000000000 R14: 0000000000000000 R15: > 0000000000000000 > > [106216.029015] FS: > > 00007/7421a26740 (0000) GS:ffff9458a9c00000 (0000) knIGS: 0000000000000000 > > [106216.0302591 CS: > > 0010 DS: 0000 ES: 0000 CRO: 0000000080050033 > > [106216.031589] CR2: 0000000000000340 CR3: 000000008c804005 CR4: > 0000000000060650 > > [106216.0330441 Call Trace: > > [106216.0337581 > > [106216.0344601 ax25_bind+0x1e2/0x210 [ax25 > 0149579197c9004716ce47844d0cb0c56b9a4c841 It seems to be a new null-deref ? I think at least this patch is not related to the issue and same for other patches from Hyunwoom. All of these commits did not touch ax25. 611792920925 ("netrom: Fix use-after-free caused by accept on already connected socket") f2b0b5210f67 ("net/x25: Fix to not accept on connected socket") 14caefcf9837 ("net/rose: Fix to not accept on connected socket") Thanks, Kuniyuki > > [106216.0351811 > > _sys_bind+Oxe8/OxfO > > [106216.0358931 > > _x64_sus_bind+0x18/0=C3=9720 > > [106216.0365951 > > [106216.037296] > > do_suscal1_64+0x5f/0=C3=9790 > > ? syscall_exit_to_user_mode+0x1b/0=C3=9740 > > [106216.038000] > > ? do_syscal1_64+0x6b/0x90 [106216.0386931 > entry_SYSCALL_64_after_huframe+0x72/0xdc > > [106216.039390] RIP: 0033:0x717421b3791b > > [106216.0400931 Code: c3 66 0f 1 44 00 00 48 86 15 51 e4 0c 00 f7 d8 > 64 89 02 b8 ff fr if ff eb be of 11 44 00 00. 13 of 1e fa b8 31 00 00 > 00 of 05 <48> 34 01 1 > > 0 ff ff 73 01 c3 48 86 0d 25 e4 0c 00 f7 d8 64 89 01 48 > > [106216.041578] RSP: 0026:00007ffd15dab688 EFLAGS: 00000206 ORIG RAX: > 0000000000000031 > > [106216.0423321 RAX: fffffffffffffrda RBX: 00007ffd15dabac2 RCX: 000077421b= > 37916 > > [106216.043092] RDX: 0000000000000048 RSI: 00007ffd15dab700 RDI: > 0000000000000009 > > [106216.043853] RBP: 0000563ef2403900 ROB: 0000000000000004 R09: > 00000000ffffffff > > [106216.0446231 R10: 00007rd15dab6c0 R11: 0000000000000206 R12: 00000000000= > 00009 > > [106216.045395] R13: 0000000000000048 R14: 0000563ef2403c00 R15: > 0000000000000010 > > [106216.0461671 > > > > [106216.0469201 Modules linked in: mkiss ax25 crc16 cp210x tun rperdma > rdma_cm iw_cm ib_cm ib_core ufat fat intel_rap _msr intel_rapl common > =C3=9786_pkg_temp_therma I intel powerc lamp coretemp kum_ intel kum > (rqbupass cret10dif pcimul cre32 pcimul polyual cimulni poluual > generic grizimul ghash cimulni intel crypta sha512 53s e3 snd Ida > _codec_hdmi snd hda_codec_-realtek snd _hda_codec generic rapl ledtrig > _audio mousedev snd hda_intel intel _cstate at24 snd_intel_dspefg > snd_intel_sdu acp i snd_hda_codec snd_hda_core snd_hudep snd_pem > mei_pxp 12c_1801 intel_uncore mei_hdep snd_timer iTCO_wt snd dedbas > intel_pc_bxt iTCO_uendor_support mei wat cf gBO211 rikill soundcore > pespr 12c _smblis mei me mel lpe_ich mac hid e1000e nfsd auth epcyss > nfs acl lockd grace sunrpe uboxnetfit (OF) Uboxmetadp(OF) uboxdru(OF=C2=BB > dm _mod loop fuse bpf_preload ip_tables x tables btrfs blakeZb > _generic xor raid_pq liberc32c usbhid cre32c _generic 1915 drm_buddy > intel_gtt crc32c_intel drm_di splay_helper sr_mod cdrom cec ttm video > uni > > [106216.0526781 CRZ: 0000000000000340 > > [106216.053515] ---[ end trace 0000000000000000 1- > > [106216.0543461 RIP: 0010 :ax25_ addr_ax25deu+0x44/Oxb0 Lax25] > > [106216.055173] Code: c1 53 8 9 63 61 9 48 86 1d 40 16 01 00 48 85 ab > 74 4f 41 bc 01 00 00 00 eb 0B 48 8b 16 48 85 b 74 31 48 86 43 08 48. > 89 ef <48> 86 60 > > 0 03 00 00 e8 do fb fr ff 85 cO 75 e1 48 8d bb 90 00 00 > > [106216.056921] RSP: 0018:ffffa52b0107bded EFLAGS: 00010286 > > 1106216.0578041 RAX: 0000000000000000 RBX: fFff9457c84126c0 RCX: > 0000000000000000 > > [106216.058688] RDX: 0000000000000001 RSI: ffffffffc1541100 RDI: ffffa52010= > 7be68 > > [106216.059577] RBP: ffffa52b0107be68 ROB: 0000000000000009 R09: > 0000000000000000 > > [106216.060473] R10: 0000000000000000 R11: 0000000000000000 R12: > 0000000000000001 > > [106216.0613741 R13: 0000000000000000 R14: 0000000000000000 R15: > 0000000000000000 > > [106216.0622811 FS* > > 0000757421a26740 (0000) GS:ffFf9458a9c00000(0000) knIGS:0000000000000000 > > [106216.063200] C8: > > 0010 DS: 0000 ES: 0000 CRO: 0000000080050033 > > [106216.0641291 CR2: 0000000000000340 CR3: 000000008c804005 CR4: 0000000000= > 06060 > > [106216.0650721 Kernel panic - not syncing: Fatal exception in interrupt > > [106216.0662201 Kernel Offset: 0x38e00000 from Oxffffffff81000000 > (relocation range: Oxfffffff80000000-oxfrrrrrrrrrrrrr) > > [106216.0672191 > > ---[ end Kernel panic > > - not syncing: Fatal exception in interrupt ]-. > > Regards, > Chris KQ6UP