Date: Tue, 13 Jan 2026 17:15:48 -0400
From: Jason Gunthorpe
To: Thomas Gleixner
Cc: Bert Karwatzki, linux-kernel@vger.kernel.org,
 linux-next@vger.kernel.org, Mario Limonciello,
 Sebastian Andrzej Siewior, Clark Williams, Steven Rostedt,
 Christian König, regressions@lists.linux.dev,
 linux-pci@vger.kernel.org, linux-acpi@vger.kernel.org,
 "Rafael J . Wysocki", acpica-devel@lists.linux.dev, Robert Moore,
 Saket Dumbre, Bjorn Helgaas, Clemens Ladisch, Jinchao Wang,
 Yury Norov, Anna Schumaker, Baoquan He, "Darrick J. Wong",
 Dave Young, Doug Anderson, "Guilherme G. Piccoli", Helge Deller,
 Ingo Molnar, Joanthan Cameron, Joel Granados, John Ogness,
 Kees Cook, Li Huafei, "Luck, Tony", Luo Gengkun, Max Kellermann,
 Nam Cao, oushixiong, Petr Mladek, Qianqiang Liu,
 Sergey Senozhatsky, Sohil Mehta, Tejun Heo, Thomas Zimemrmann,
 Thorsten Blum, Ville Syrjala, Vivek Goyal, Yunhui Cui,
 Andrew Morton, W_Armin@gmx.de
Subject: Re: NMI stack overflow during resume of PCIe bridge with
 CONFIG_HARDLOCKUP_DETECTOR=y
Message-ID: <20260113211548.GV745888@ziepe.ca>
References: <20260113094129.3357-1-spasswolf@web.de>
 <87h5spk01t.ffs@tglx> <87v7h5ia3d.ffs@tglx>
In-Reply-To: <87v7h5ia3d.ffs@tglx>

On Tue, Jan 13, 2026 at 08:30:46PM +0100, Thomas Gleixner wrote:

> So gradually your machine just stalls on outstanding MMIO transactions
> w/o further notice... The NMI is just a red herring.

CPUs usually have timeouts for these things and they return 0xFF back
for the timed-out read. Beyond that, "it depends" if any other RAS
indications are raised.

> You need to figure out why that MMIO access to that device's
> configuration space stalls as anything else is just subsequent
> damage.

Given this is a resume, it seems likely the PCI routing inside the
bridge chip has been messed up somehow during the suspend/resume.
Possibly due to errata in the bridge; there are many weird bridge
errata :\

Jason
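
To make the timeout behavior mentioned above concrete, here is a
minimal sketch of how software might recognize a read that came back
as all ones. The helper names are hypothetical and do not come from
this thread or from any driver; the kernel has its own helpers for
this kind of check (PCI_POSSIBLE_ERROR() in include/linux/pci.h, if
memory serves). An all-ones value can also be a legitimate register
value, so treat this as a hint, not proof of a failed access.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper names, for illustration only. */

/*
 * A read that timed out or was aborted typically completes with every
 * bit set, so a 32-bit read shows up as 0xFFFFFFFF.
 */
static inline bool read_looks_failed32(uint32_t val)
{
	return val == UINT32_MAX;
}

/*
 * Config space offset 0 holds the Vendor ID in the low 16 bits.
 * 0xFFFF is not a valid Vendor ID, so seeing it there means nothing
 * answered the read.
 */
static inline bool device_answered(uint32_t vendor_device_dword)
{
	return (vendor_device_dword & 0xFFFFu) != 0xFFFFu;
}
```

Reading the Vendor ID of the bridge and of the device behind it before
and after resume, and passing the values through a check like
device_answered(), is one way to narrow down where the routing stops
working.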