AMD-GFX Archive on lore.kernel.org
 help / color / mirror / Atom feed
From: "Christian König" <christian.koenig@amd.com>
To: Alexey Klimov <alexey.klimov@linaro.org>,
	Alex Deucher <alexdeucher@gmail.com>
Cc: Fugang Duan <fugang.duan@cixtech.com>,
	"alexander.deucher@amd.com" <alexander.deucher@amd.com>,
	"frank.min@amd.com" <frank.min@amd.com>,
	"amd-gfx@lists.freedesktop.org" <amd-gfx@lists.freedesktop.org>,
	"stable@vger.kernel.org" <stable@vger.kernel.org>,
	"david.belanger@amd.com" <david.belanger@amd.com>,
	Peter Chen <peter.chen@cixtech.com>,
	cix-kernel-upstream <cix-kernel-upstream@cixtech.com>,
	"linux-arm-kernel@lists.infradead.org"
	<linux-arm-kernel@lists.infradead.org>
Subject: Re: 回复: [REGRESSION] amdgpu: async system error exception from hdp_v5_0_flush_hdp()
Date: Wed, 23 Apr 2025 16:32:48 +0200	[thread overview]
Message-ID: <e19f13a0-90e0-4e24-b547-607496d8983d@amd.com> (raw)
In-Reply-To: <D9DAIUZXIWH3.1L7CV6GEX4C9M@linaro.org>

On 4/22/25 17:59, Alexey Klimov wrote:
> On Tue Apr 22, 2025 at 2:00 PM BST, Alex Deucher wrote:
>> On Mon, Apr 21, 2025 at 10:21 PM Alexey Klimov <alexey.klimov@linaro.org> wrote:
>>>
>>> On Thu Apr 17, 2025 at 2:08 PM BST, Alex Deucher wrote:
>>>> On Wed, Apr 16, 2025 at 8:43 PM Fugang Duan <fugang.duan@cixtech.com> wrote:
>>>>>
>>>>> 发件人: Alex Deucher <alexdeucher@gmail.com> 发送时间: 2025年4月16日 22:49
>>>>>> 收件人: Alexey Klimov <alexey.klimov@linaro.org>
>>>>>> On Wed, Apr 16, 2025 at 9:48 AM Alexey Klimov <alexey.klimov@linaro.org> wrote:
>>>>>>>
>>>>>>> On Wed Apr 16, 2025 at 4:12 AM BST, Fugang Duan wrote:
>>>>>>>> 发件人: Alexey Klimov <alexey.klimov@linaro.org> 发送时间: 2025年4月16
>>>>>> 日 2:28
>>>>>>>>> #regzbot introduced: v6.12..v6.13
>>>>>>>>> The only change related to hdp_v5_0_flush_hdp() was
>>>>>>>>> cf424020e040 drm/amdgpu/hdp5.0: do a posting read when flushing HDP
>>>>>>>>>
>>>>>>>>> Reverting that commit ^^ did help and resolved that problem. Before
> 
> [..]
> 
>>>> OK.  that patch won't change anything then.  Can you try this patch instead?
>>>
>>> Config I am using is basically defconfig wrt memory parameters, yeah, i use 4k.
>>>
>>> So I tested that patch, thank you, and some other different configurations --
>>> nothing helped. Exactly the same behaviour with the same backtrace.
>>
>> Did you test the first (4k check) or the second (don't remap on ARM) patch?
> 
> The second one. I think you mentioned that first one won't help for 4k pages.
> 
> 
>>> So it seems that it is firmware problem after all?
>>
>> There is no GPU firmware involved in this operation.  It's just a
>> posted write.  E.g., we write to a register to flush the HDP write
>> queue and then read the register back to make sure the write posted.
>> If the second patch didn't help, then perhaps there is some issue with
>> MMIO access on your platform?
> 
> I didn't mean GPU firmware at all. I only had uefi/EL3 firmwares in mind.
> 
> Completely out of the blue, based on nothing, do you think that
> adding delay/some mem barrier between write and read might help?

That would still be quite some platform bug.

> I wonder if host data path code should be executed during common desktop
> usage as a common user then why it doesn't break later.

Maybe it's some kind of write/read re-ordering issue.

 But yeah, I also think this is this motherboard problem. Thank you.

You should probably ping some ARM guys to figure out what the fault code actually means.

Regards,
Christian.

> 
> Thanks,
> Alexey
> 


  reply	other threads:[~2025-04-23 14:33 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-04-15 18:28 [REGRESSION] amdgpu: async system error exception from hdp_v5_0_flush_hdp() Alexey Klimov
2025-04-16  3:12 ` 回复: " Fugang Duan
2025-04-16 11:25   ` Alexey Klimov
2025-04-16 14:49     ` Alex Deucher
2025-04-17  0:42       ` 回复: " Fugang Duan
2025-04-17 13:08         ` Alex Deucher
2025-04-18  0:30           ` 回复: " Fugang Duan
2025-04-18  1:10             ` Alex Deucher
2025-04-22  2:20           ` Alexey Klimov
2025-04-22 13:00             ` Alex Deucher
2025-04-22 15:59               ` Alexey Klimov
2025-04-23 14:32                 ` Christian König [this message]
2025-04-24 15:44                 ` Alex Deucher
2025-04-27  1:01                   ` Alexey Klimov
2025-04-30 16:55                     ` Alex Deucher
2025-05-11 23:24                       ` Alexey Klimov
2025-05-12 14:46                         ` Alex Deucher
2025-04-16 11:44 ` Christian König
2025-04-22  2:49   ` Alexey Klimov
2025-04-24 11:41     ` Peter Chen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=e19f13a0-90e0-4e24-b547-607496d8983d@amd.com \
    --to=christian.koenig@amd.com \
    --cc=alexander.deucher@amd.com \
    --cc=alexdeucher@gmail.com \
    --cc=alexey.klimov@linaro.org \
    --cc=amd-gfx@lists.freedesktop.org \
    --cc=cix-kernel-upstream@cixtech.com \
    --cc=david.belanger@amd.com \
    --cc=frank.min@amd.com \
    --cc=fugang.duan@cixtech.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=peter.chen@cixtech.com \
    --cc=stable@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox