From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id A0619C54FD0 for ; Mon, 27 Apr 2020 15:15:58 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 816CE2054F for ; Mon, 27 Apr 2020 15:15:58 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728282AbgD0PP5 (ORCPT ); Mon, 27 Apr 2020 11:15:57 -0400 Received: from mx2.suse.de ([195.135.220.15]:46072 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728148AbgD0PP5 (ORCPT ); Mon, 27 Apr 2020 11:15:57 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id 38B2AABCF; Mon, 27 Apr 2020 15:15:55 +0000 (UTC) Date: Mon, 27 Apr 2020 17:15:55 +0200 Message-ID: From: Takashi Iwai To: "Deucher, Alexander" Cc: Nicholas Johnson , "linux-kernel@vger.kernel.org" , "amd-gfx@lists.freedesktop.org" , Takashi Iwai , "alsa-devel@alsa-project.org" , Lukas Wunner , "Koenig, Christian" , "Zhou, David(ChunMing)" Subject: Re: [PATCH 0/1] Fiji GPU audio register timeout when in BACO state In-Reply-To: References: User-Agent: Wanderlust/2.15.9 (Almost Unreal) SEMI/1.14.6 (Maruoka) FLIM/1.14.9 (=?UTF-8?B?R29qxY0=?=) APEL/10.8 Emacs/25.3 (x86_64-suse-linux-gnu) MULE/6.0 (HANACHIRUSATO) MIME-Version: 1.0 (generated by SEMI 1.14.6 - "Maruoka") Content-Type: text/plain; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 27 Apr 2020 16:22:21 +0200, Deucher, Alexander wrote: > > [AMD Public Use] > > > -----Original Message----- > > From: Nicholas Johnson > > Sent: Sunday, April 26, 2020 12:02 PM > > To: linux-kernel@vger.kernel.org > > Cc: Deucher, Alexander ; Koenig, Christian > > ; Zhou, David(ChunMing) > > ; Nicholas Johnson > opensource@outlook.com.au> > > Subject: [PATCH 0/1] Fiji GPU audio register timeout when in BACO state > > > > Hi all, > > > > Since Linux v5.7-rc1 / commit 4fdda2e66de0 ("drm/amdgpu/runpm: enable > > runpm on baco capable VI+ asics"), my AMD R9 Nano has been using runpm / > > BACO. You can tell visually when it sleeps, because the fan on the graphics > > card is switched off to save power. It did not spin down the fan in v5.6.x. > > > > This is great (I love it), except that when it is sleeping, the PCIe audio function > > of the GPU has issues if anything tries to access it. You get dmesg errors such > > as these: > > > > snd_hda_intel 0000:08:00.1: spurious response 0x0:0x0, last cmd=0x170500 > > snd_hda_intel 0000:08:00.1: azx_get_response timeout, switching to polling > > mode: last cmd=0x001f0500 snd_hda_intel 0000:08:00.1: No response from > > codec, disabling MSI: last cmd=0x001f0500 snd_hda_intel 0000:08:00.1: No > > response from codec, resetting bus: last cmd=0x001f0500 > > snd_hda_codec_hdmi hdaudioC1D0: Unable to sync register 0x2f0d00. -11 > > > > The above is with the Fiji XT GPU at 0000:08:00.0 in a Thunderbolt enclosure > > (not that Thunderbolt should affect it, but I feel I should mention it just in > > case). I dropped a lot of duplicate dmesg lines, as some of them repeated a > > lot of times before the driver gave up. > > > > I offer this patch to disable runpm for Fiji while a fix is found, if you decide > > that is the best approach. Regardless, I will gladly test any patches you come > > up with instead and confirm that the above issue has been fixed. > > > > I cannot tell if any other GPUs are affected. The only other cards to which I > > have access are a couple of AMD R9 280X (Tahiti XT), which use radeon driver > > instead of amdgpu driver. > > Adding a few more people. Do you know what is accessing the audio? The audio should have a dependency on the GPU device. The GPU won't enter runtime pm until the audio has entered runtime pm and vice versa on resume. Please attach a copy of your dmesg output and lspci output. Also, please retest with the fresh 5.7-rc3. There was a known regression regarding HD-audio PM in 5.7-rc1/rc2, and it's been fixed there (commit 8d6762af302d). thanks, Takashi