public inbox for linux-media@vger.kernel.org
 help / color / mirror / Atom feed
From: Mario Limonciello <mario.limonciello@amd.com>
To: Sultan Alsawaf <sultan@kerneltoast.com>, "Du, Bin" <bin.du@amd.com>
Cc: mchehab@kernel.org, hverkuil@xs4all.nl,
	laurent.pinchart+renesas@ideasonboard.com,
	bryan.odonoghue@linaro.org, sakari.ailus@linux.intel.com,
	prabhakar.mahadev-lad.rj@bp.renesas.com,
	linux-media@vger.kernel.org, linux-kernel@vger.kernel.org,
	pratap.nirujogi@amd.com, benjamin.chan@amd.com, king.li@amd.com,
	gjorgji.rosikopulos@amd.com, Phil.Jawich@amd.com,
	Dominic.Antony@amd.com, richard.gong@amd.com, anson.tsao@amd.com
Subject: Re: [PATCH v5 0/7] Add AMD ISP4 driver
Date: Fri, 21 Nov 2025 08:32:34 -0600	[thread overview]
Message-ID: <546a8d8e-514b-4f6b-bf46-1ec0c9974e7e@amd.com> (raw)
In-Reply-To: <aSAg1MUVZtDlCC96@sultan-box>



On 11/21/2025 2:20 AM, Sultan Alsawaf wrote:
> On Wed, Nov 19, 2025 at 06:14:17PM +0800, Du, Bin wrote:
>>
>>
>> On 11/18/2025 3:35 PM, Sultan Alsawaf wrote:
>>> On Wed, Nov 12, 2025 at 06:21:33PM +0800, Du, Bin wrote:
>>>>
>>>>
>>>> On 11/12/2025 3:38 PM, Sultan Alsawaf wrote:
>>>>> On Tue, Nov 11, 2025 at 11:06:41PM -0800, Sultan Alsawaf wrote:
>>>>>> On Wed, Nov 12, 2025 at 02:32:51PM +0800, Du, Bin wrote:
>>>>>>> Thanks Sultan for your information.
>>>>>>>
>>>>>>> On 11/12/2025 9:21 AM, Sultan Alsawaf wrote:
>>>>>>>> On Tue, Nov 11, 2025 at 03:33:42PM -0800, Sultan Alsawaf wrote:
>>>>>>>>> On Tue, Nov 11, 2025 at 05:58:10PM +0800, Du, Bin wrote:
>>>>>>>>>>
>>>>>>>>>> On 11/11/2025 5:05 PM, Sultan Alsawaf wrote:
>>>>>>>>>>
>>>>>>>>>>> On Mon, Nov 10, 2025 at 05:46:28PM +0800, Du, Bin wrote:
>>>>>>>>>>>> Thank you, Sultan, for your time, big effort, and constant support.
>>>>>>>>>>>> Apologies for my delayed reply for being occupied a little with other
>>>>>>>>>>>> matters.
>>>>>>>>>>>>
>>>>>>>>>>>> On 11/10/2025 4:33 PM, Sultan Alsawaf wrote:
>>>>>>>>>>>>> Hi Bin,
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Wed, Nov 05, 2025 at 01:25:58AM -0800, Sultan Alsawaf wrote:
>>>>>>>>>>>>>> Hi Bin,
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> To expedite review, I've attached a patch containing a bunch of fixes I've made
>>>>>>>>>>>>>> on top of v5. Most of my changes should be self-explanatory; feel free to ask
>>>>>>>>>>>>>> further about specific changes if you have any questions.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I haven't had a chance to review all of the v4 -> v5 changes yet, but I figured
>>>>>>>>>>>>>> I should send what I've got so far.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> FYI, there is a regression in isp4if_dequeue_buffer() where the bufq lock isn't
>>>>>>>>>>>>>> protecting the list_del() anymore. I also checked the compiler output when using
>>>>>>>>>>>>>> guard() versus scoped_guard() in that function and there is no difference:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>         sha1sum:
>>>>>>>>>>>>>>         5652a0306da22ea741d80a9e03a787d0f71758a8  isp4_interface.o // guard()
>>>>>>>>>>>>>>         5652a0306da22ea741d80a9e03a787d0f71758a8  isp4_interface.o // scoped_guard()
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> So guard() should be used there again, which I've done in my patch.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I also have a few questions:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> 1. Does ISP FW provide a register interface to disable the IRQ? If so, it is
>>>>>>>>>>>>>>          faster to use that than using disable_irq_nosync() to disable the IRQ from
>>>>>>>>>>>>>>          the CPU's side.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> 2. When the IRQ is re-enabled in isp4sd_fw_resp_func(), is there anything
>>>>>>>>>>>>>>          beforehand to mask all pending interrupts from the ISP so that there isn't a
>>>>>>>>>>>>>>          bunch of stale interrupts firing as soon the IRQ is re-enabled?
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> 3. Is there any risk of deadlock due to the stream kthread racing with the ISP
>>>>>>>>>>>>>>          when the ISP posts a new response _after_ the kthread determines there are no
>>>>>>>>>>>>>>          more new responses but _before_ the enable_irq() in isp4sd_fw_resp_func()?
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> 4. Why are some lines much longer than before? It seems inconsistent that now
>>>>>>>>>>>>>>          there is a mix of several lines wrapped to 80 cols and many lines going
>>>>>>>>>>>>>>          beyond 80 cols. And there are multiple places where code is wrapped before
>>>>>>>>>>>>>>          reaching 80 cols even with lots of room left, specifically for cases where it
>>>>>>>>>>>>>>          wouldn't hurt readability to put more characters onto each line.
>>>>>>>>>>>>>
>>>>>>>>>>>>> I've attached a new, complete patch of changes to apply on top of v5. You may
>>>>>>>>>>>>> ignore the incomplete patch from my previous email and use the new one instead.
>>>>>>>>>>>>>
>>>>>>>>>>>>> I made many changes and also answered questions 1-3 myself.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Please apply this on top of v5 and let me know if you have any questions.
>>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> Sure, will review, apply and test your patch accordingly. Your contribution
>>>>>>>>>>>> is greatly appreciated, will let you know if there is any question or
>>>>>>>>>>>> problem.
>>>>>>>>>>>>
>>>>>>>>>>>>> BTW, I noticed a strange regression in v5 even without any of my changes: every
>>>>>>>>>>>>> time you use cheese after using it one time, the video will freeze after 30-60
>>>>>>>>>>>>> seconds with this message printed to dmesg:
>>>>>>>>>>>>>         [ 2032.716559] amd_isp_capture amd_isp_capture: -><- fail respid unknown respid(0x30002)
>>>>>>>>>>>>>
>>>>>>>>>>>>> And the video never unfreezes. I haven't found the cause for the regression yet;
>>>>>>>>>>>>> can you try to reproduce it?
>>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>> Really weird, we don't see this issue either in dev or QA test. Is it 100%
>>>>>>>>>>>> reproducible and any other fail or err in the log?
>>>>>>>>>>>
>>>>>>>>>>> Yes, it's 100% reproducible. There's no other message in dmesg, only that one.
>>>>>>>>>>>
>>>>>>>>>>> Sometimes there is a stop stream error when I close cheese after it froze:
>>>>>>>>>>>
>>>>>>>>>>>        [  656.540307] amd_isp_capture amd_isp_capture: fail to disable stream
>>>>>>>>>>>        [  657.046633] amd_isp_capture amd_isp_capture: fail to stop steam
>>>>>>>>>>>        [  657.047224] amd_isp_capture amd_isp_capture: disabling streaming failed (-110)
>>>>>>>>>>>
>>>>>>>>>>> When I revert to v4 I cannot reproduce it at all. It seems to be something in
>>>>>>>>>>> v4 -> v5 that is not fixed by any of my changes.
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Hi Sultan, could you please try following modifications on top of v5 to see
>>>>>>>>>> if it helps?
>>>>>>>>>>
>>>>>>>>>> diff --git a/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>>> b/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>>> index 39c2265121f9..d571b3873edb 100644
>>>>>>>>>> --- a/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>>> +++ b/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>>> @@ -97,7 +97,7 @@
>>>>>>>>>>
>>>>>>>>>> #define ADDR_SPACE_TYPE_GPU_VA          4
>>>>>>>>>>
>>>>>>>>>> -#define FW_MEMORY_POOL_SIZE             (200 * 1024 * 1024)
>>>>>>>>>> +#define FW_MEMORY_POOL_SIZE             (100 * 1024 * 1024)
>>>>>>>>>>
>>>>>>>>>> /*
>>>>>>>>>>       * standard ISP mipicsi=>isp
>>>>>>>>>> diff --git a/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>>> b/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>>> index 248d10076ae8..acbc80aa709e 100644
>>>>>>>>>> --- a/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>>> +++ b/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>>> @@ -697,7 +697,7 @@ static int isp4sd_stop_resp_proc_threads(struct
>>>>>>>>>> isp4_subdev *isp_subdev)
>>>>>>>>>>             return 0;
>>>>>>>>>> }
>>>>>>>>>>
>>>>>>>>>> -static int isp4sd_pwroff_and_deinit(struct isp4_subdev *isp_subdev)
>>>>>>>>>> +static int isp4sd_pwroff_and_deinit(struct isp4_subdev *isp_subdev, bool
>>>>>>>>>> irq_enabled)
>>>>>>>>>> {
>>>>>>>>>>             struct isp4sd_sensor_info *sensor_info = &isp_subdev->sensor_info;
>>>>>>>>>>             unsigned int perf_state = ISP4SD_PERFORMANCE_STATE_LOW;
>>>>>>>>>> @@ -716,8 +716,9 @@ static int isp4sd_pwroff_and_deinit(struct isp4_subdev
>>>>>>>>>> *isp_subdev)
>>>>>>>>>>                     return 0;
>>>>>>>>>>             }
>>>>>>>>>>
>>>>>>>>>> -       for (int i = 0; i < ISP4SD_MAX_FW_RESP_STREAM_NUM; i++)
>>>>>>>>>> -               disable_irq(isp_subdev->irq[i]);
>>>>>>>>>> +       if (irq_enabled)
>>>>>>>>>> +               for (int i = 0; i < ISP4SD_MAX_FW_RESP_STREAM_NUM; i++)
>>>>>>>>>> +                       disable_irq(isp_subdev->irq[i]);
>>>>>>>>>>
>>>>>>>>>>             isp4sd_stop_resp_proc_threads(isp_subdev);
>>>>>>>>>>             dev_dbg(dev, "isp_subdev stop resp proc streads suc");
>>>>>>>>>> @@ -813,7 +814,7 @@ static int isp4sd_pwron_and_init(struct isp4_subdev
>>>>>>>>>> *isp_subdev)
>>>>>>>>>>
>>>>>>>>>>             return 0;
>>>>>>>>>> err_unlock_and_close:
>>>>>>>>>> -       isp4sd_pwroff_and_deinit(isp_subdev);
>>>>>>>>>> +       isp4sd_pwroff_and_deinit(isp_subdev, false);
>>>>>>>>>>             return -EINVAL;
>>>>>>>>>> }
>>>>>>>>>>
>>>>>>>>>> @@ -985,7 +986,7 @@ static int isp4sd_set_power(struct v4l2_subdev *sd, int
>>>>>>>>>> on)
>>>>>>>>>>             if (on)
>>>>>>>>>>                     return isp4sd_pwron_and_init(isp_subdev);
>>>>>>>>>>             else
>>>>>>>>>> -               return isp4sd_pwroff_and_deinit(isp_subdev);
>>>>>>>>>> +               return isp4sd_pwroff_and_deinit(isp_subdev, true);
>>>>>>>>>> }
>>>>>>>>>>
>>>>>>>>>> static const struct v4l2_subdev_core_ops isp4sd_core_ops = {
>>>>>>>>>
>>>>>>>>> No difference sadly; same symptoms as before. FYI, your email client broke the
>>>>>>>>> patch formatting so I couldn't apply it; it hard wrapped some lines to 80 cols,
>>>>>>>>> replaced tabs with spaces, and removed leading spaces on each context line, so I
>>>>>>>>> had to apply it manually. To confirm I applied it correctly, here is my diff:
>>>>>>>>>
>>>>>>>>> diff --git a/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h b/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>> index 39c2265121f9..d571b3873edb 100644
>>>>>>>>> --- a/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>> +++ b/drivers/media/platform/amd/isp4/isp4_fw_cmd_resp.h
>>>>>>>>> @@ -97,7 +97,7 @@
>>>>>>>>>      #define ADDR_SPACE_TYPE_GPU_VA          4
>>>>>>>>> -#define FW_MEMORY_POOL_SIZE             (200 * 1024 * 1024)
>>>>>>>>> +#define FW_MEMORY_POOL_SIZE             (100 * 1024 * 1024)
>>>>>>>>>      /*
>>>>>>>>>       * standard ISP mipicsi=>isp
>>>>>>>>> diff --git a/drivers/media/platform/amd/isp4/isp4_subdev.c b/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>> index 4bd2ebf0f694..500ef0af8a41 100644
>>>>>>>>> --- a/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>> +++ b/drivers/media/platform/amd/isp4/isp4_subdev.c
>>>>>>>>> @@ -669,7 +669,7 @@ static int isp4sd_stop_resp_proc_threads(struct isp4_subdev *isp_subdev)
>>>>>>>>>      	return 0;
>>>>>>>>>      }
>>>>>>>>> -static int isp4sd_pwroff_and_deinit(struct isp4_subdev *isp_subdev)
>>>>>>>>> +static int isp4sd_pwroff_and_deinit(struct isp4_subdev *isp_subdev, bool irq_enabled)
>>>>>>>>>      {
>>>>>>>>>      	struct isp4sd_sensor_info *sensor_info = &isp_subdev->sensor_info;
>>>>>>>>>      	unsigned int perf_state = ISP4SD_PERFORMANCE_STATE_LOW;
>>>>>>>>> @@ -688,8 +688,9 @@ static int isp4sd_pwroff_and_deinit(struct isp4_subdev *isp_subdev)
>>>>>>>>>      		return 0;
>>>>>>>>>      	}
>>>>>>>>> -	for (int i = 0; i < ISP4SD_MAX_FW_RESP_STREAM_NUM; i++)
>>>>>>>>> -		disable_irq(isp_subdev->irq[i]);
>>>>>>>>> +	if (irq_enabled)
>>>>>>>>> +		for (int i = 0; i < ISP4SD_MAX_FW_RESP_STREAM_NUM; i++)
>>>>>>>>> +			disable_irq(isp_subdev->irq[i]);
>>>>>>>>>      	isp4sd_stop_resp_proc_threads(isp_subdev);
>>>>>>>>>      	dev_dbg(dev, "isp_subdev stop resp proc streads suc");
>>>>>>>>> @@ -785,7 +786,7 @@ static int isp4sd_pwron_and_init(struct isp4_subdev *isp_subdev)
>>>>>>>>>      	return 0;
>>>>>>>>>      err_unlock_and_close:
>>>>>>>>> -	isp4sd_pwroff_and_deinit(isp_subdev);
>>>>>>>>> +	isp4sd_pwroff_and_deinit(isp_subdev, false);
>>>>>>>>>      	return -EINVAL;
>>>>>>>>>      }
>>>>>>>>> @@ -957,7 +958,7 @@ static int isp4sd_set_power(struct v4l2_subdev *sd, int on)
>>>>>>>>>      	if (on)
>>>>>>>>>      		return isp4sd_pwron_and_init(isp_subdev);
>>>>>>>>>      	else
>>>>>>>>> -		return isp4sd_pwroff_and_deinit(isp_subdev);
>>>>>>>>> +		return isp4sd_pwroff_and_deinit(isp_subdev, true);
>>>>>>>>>      }
>>>>>>>>>      static const struct v4l2_subdev_core_ops isp4sd_core_ops = {
>>>>>>>>>
>>>>>>>>>> On the other hand, please also add the patch in amdgpu which sets *bo to
>>>>>>>>>> NULL in isp_kernel_buffer_alloc() which you mentioned in another thread.
>>>>>>>>>
>>>>>>>>> Yep, I have been doing all v5 testing with that patch applied.
>>>>>>>>>
>>>>>>>>> BTW, I have verified the IRQ changes are not the cause for the regression; I
>>>>>>>>> tested with IRQs kept enabled all the time and the issue still occurs.
>>>>>>>>>
>>>>>>>>> I did observe that ISP stops sending interrupts when the video stream freezes.
>>>>>>>>> And, if I replicate the bug enough times, it seems to permanently break FW until
>>>>>>>>> a full machine reboot. Reloading amd_capture with the v4 driver doesn't recover
>>>>>>>>> the ISP when this happens.
>>>>>>>>>
>>>>>>>>> As an improvement to the driver, can we do a hard reset of ISP on driver probe?
>>>>>>>>> I am assuming hardware doesn't need too long to settle after hard reset, maybe
>>>>>>>>> a few hundred milliseconds? This would ensure ISP FW is always in a working
>>>>>>>>> state when the driver is loaded.
>>>>>>>>>
>>>>>>>
>>>>>>> Actually, each time the camera is activated, the ISP driver powers on the
>>>>>>> ISP and initiates its firmware from the beginning; when the camera is
>>>>>>> closed, the ISP is powered off..
>>>>>>
>>>>>> Hmm, well I am able to put the ISP in a completely unusable state that doesn't
>>>>>> recover when I unload and reload amd_capture. Or maybe it is the sensor that is
>>>>>> stuck in a broken state?
>>>>>
>>>>> Here is the log output when I try to start cheese after unloading and reloading
>>>>> amd_capture, where the ISP is in the broken state that requires rebooting the
>>>>> laptop (annotated with notes of what I saw/did at each point in time):
>>>>>
>>>>> ==> opened cheese
>>>>> ==> cheese froze after a few seconds
>>>>> ==> closed cheese
>>>>>      [   34.570823] amd_isp_capture amd_isp_capture: fail to disable stream
>>>>>      [   35.077503] amd_isp_capture amd_isp_capture: fail to stop steam
>>>>>      [   35.077525] amd_isp_capture amd_isp_capture: disabling streaming failed (-110)
>>>>> ==> unloaded amd_capture
>>>>> ==> loaded amd_capture
>>>>> ==> opened cheese
>>>>>      [  308.039721] amd_isp_capture amd_isp_capture: ISP CCPU FW boot failed
>>>>>      [  308.043961] amd_isp_capture amd_isp_capture: fail to start isp_subdev interface
>>>>>      [  308.044188] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044194] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044196] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044197] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044198] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044198] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044199] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044200] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044200] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044201] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.044202] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.065226] amd_isp_capture amd_isp_capture: power up isp fail -22
>>>>>      [  308.174814] amd_isp_capture amd_isp_capture: ISP CCPU FW boot failed
>>>>>      [  308.177807] amd_isp_capture amd_isp_capture: fail to start isp_subdev interface
>>>>>      [  308.178036] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178043] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178044] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178045] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178046] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178047] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178048] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178048] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178049] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178050] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.178050] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.198776] amd_isp_capture amd_isp_capture: power up isp fail -22
>>>>>      [  308.306835] amd_isp_capture amd_isp_capture: ISP CCPU FW boot failed
>>>>>      [  308.311533] amd_isp_capture amd_isp_capture: fail to start isp_subdev interface
>>>>>      [  308.311723] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311723] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311724] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311725] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311725] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311726] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311726] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311726] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311727] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311727] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.311727] amd_isp_capture amd_isp_capture: invalid mem_info
>>>>>      [  308.335281] amd_isp_capture amd_isp_capture: power up isp fail -22
>>>>>
>>>>
>>>> Thanks Sultan for the detailed info, I agree with you, tend to believe it
>>>> may related to the sensor, I will follow up with the FW team to investigate
>>>> further.
>>>>
>>>>>>>>> Thanks,
>>>>>>>>> Sultan
>>>>>>>>
>>>>>>>> A small update: I reproduced the issue on v4, but it took several more cycles of
>>>>>>>> closing/opening cheese and waiting 30s compared to v5.
>>>>>>>>
>>>>>>>> Right now my best guess is that this is a timing issue with respect to FW that
>>>>>>>> was exposed by the v5 changes. v5 introduced slight changes in code timing, like
>>>>>>>> with the mutex locks getting replaced by spin locks.
>>>>>>>>
>>>>>>>> I'll try to insert mdelays to see if I can expose the issue that way on v4.
>>>>>>>>
>>>>>>>
>>>>>>> Could you kindly provide the FW used?
>>>>>>
>>>>>>      commit a89515d3ff79f12099f7a51b0f4932b881b7720e
>>>>>>      Author: Pratap Nirujogi <pratap.nirujogi@amd.com>
>>>>>>      Date:   Thu Aug 21 15:40:45 2025 -0400
>>>>>>
>>>>>>          amdgpu: Update ISP FW for isp v4.1.1
>>>>>>          From internal git commit:
>>>>>>          24557b7326e539183b3bc44cf8e1496c74d383d6
>>>>>>          Signed-off-by: Pratap Nirujogi <pratap.nirujogi@amd.com>
>>>>>>
>>>>>> Sultan
>>>>>
>>>>> Sultan
>>>>
>>>> -- 
>>>> Regards,
>>>> Bin
>>>>
>>>
>>> Thanks, Bin. I looked deeper at the code and didn't find any reason the issue
>>> could be due to the driver. Also, the problem happens outside of cheese for me
>>> (like in Chromium with Zoom meetings), so v5 of the driver is pretty much
>>> unusable for me until this issue is fixed. :(
>>>
>>
>> Oh, really sad to hear that :(, there must be some difference between our
>> platforms because we still can't reproduce the issue you mentioned, to help
>> on this, would you share more info like your Ubuntu version, Kernel
>> version/source, ISP driver version, BIOS version, .config used to build the
>> kernel, FW: commit a89515d3ff79f12099f7a51b0f4932b881b7720e.
>> Just wondering, if possible, could you provide the kernel image either so we
>> can directly test on it. Also, the HW is not broken, right?
> 
> I figured out why you cannot reproduce the issue. You need to pass amd_iommu=off
> on the kernel command line to trigger the issue.
> 
> The reason I am using amd_iommu=off is because this laptop otherwise doesn't
> ever wake from suspend under Linux once it reaches the S0i3 state. The keyboard,
> power button, and lid do not respond to wake up the laptop from suspend. This
> happens 100% of the time once S0i3 is reached, and occurs on the OEM Ubuntu
> image from HP as well. The only fix I have found is to pass amd_iommu=off, which
> other owners of this laptop also found fixes the issue.

You're the first report I've heard of this.

Are you using storage encryption or a storage password of any kind 
(software or hardware) by chance?

If you are can you please describe it?

Also can you generate a report using amd-s2idle?  I don't think it's 
going to flag anything but I would like to confirm.

> 
>>
>>> I've attached a v2 of my big patch, which includes more improvements and another
>>> ringbuffer bug fix. Please check the ringbuffer logic on the FW side to make
>>> sure FW doesn't have the same bug, where it may let rd_ptr == wr_ptr upon
>>> filling the ringbuffer, even though rd_ptr == wr_ptr is supposed to indicate the
>>> ringbuffer is empty.
>>>
>>
>> Thank you for your huge contribution. We'll review, verify, and merge as
>> much as possible into v6 and hope this marks the final significant update.
>> We typically avoid large changes to reduce regression risks and minimize
>> additional debugging, testing, and schedule impact.
> 
> I understand. Don't worry, that is indeed the final significant update. Please
> let me know which changes you don't merge in, since it could be an important
> change. And thank you for considering my changes! :)
> 
> And please be sure to check the ringbuffer code in FW to see if it has the same
> bug I found in the driver.
> 
>>> Also, I have a suggestion for a FW change to improve IRQ handling in the driver:
>>>
>>> 1. Change ISP_SYS_INT0_ACK behavior so that it also clears the acked interrupts
>>>      from ISP_SYS_INT0_EN.
>>>
>>
>> Based on my understanding, this pertains to the hardware design and cannot
>> be modified.
>>
>>> 2. Change ISP_SYS_INT0_EN behavior so that it only enables the interrupts in the
>>>      provided mask, so RMW of ISP_SYS_INT0_EN in the driver won't be necessary to
>>>      re-enable interrupts for a stream.
>>>
>>
>> I'm not sure I understand your point. Are you saying that ISP_SYS_INT0_EN
>> only considers '1' in the mask and ignores '0'? If that's the case, I'm
>> curious about how to disable an interrupt. Also, this is also determined by
>> the hardware design.
> 
> No worries, it's not a big deal. Just a small optimization I thought of.
> 
> My idea was:
> 
> 1. Current behavior of ISP_SYS_INT0_ACK:
>       isp4hw_wreg(mmio, ISP_SYS_INT0_ACK, intr_ack) results in:
> 
>           regmap[ISP_SYS_INT0_STATUS] &= ~intr_ack;
> 
>     Proposed behavior of ISP_SYS_INT0_ACK:
>       isp4hw_wreg(mmio, ISP_SYS_INT0_ACK, intr_ack) results in:
> 
>           regmap[ISP_SYS_INT0_STATUS] &= ~intr_ack;
>           regmap[ISP_SYS_INT0_EN] &= ~intr_ack;
> 
> 2. Current behavior of ISP_SYS_INT0_EN:
>       isp4hw_wreg(mmio, ISP_SYS_INT0_EN, intr_en) results in:
> 
>           regmap[ISP_SYS_INT0_EN] = intr_en;
> 
>     Proposed behavior of ISP_SYS_INT0_EN:
>       isp4hw_wreg(mmio, ISP_SYS_INT0_EN, intr_en) results in:
> 
>           regmap[ISP_SYS_INT0_EN] |= intr_en;
> 
> 
> And to disable an interrupt with this design, you can write to ISP_SYS_INT0_ACK.
> 
> Sultan


  reply	other threads:[~2025-11-21 14:33 UTC|newest]

Thread overview: 39+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-10-24  9:06 [PATCH v5 0/7] Add AMD ISP4 driver Bin Du
2025-10-24  9:06 ` [PATCH v5 1/7] media: platform: amd: Introduce amd isp4 capture driver Bin Du
2025-10-24 13:57   ` Krzysztof Kozlowski
2025-10-28  8:30     ` Du, Bin
2025-10-28  8:41       ` Krzysztof Kozlowski
2025-10-28  9:00         ` Du, Bin
2025-10-28  9:06           ` Krzysztof Kozlowski
2025-10-28  9:10             ` Du, Bin
2025-11-03  6:17               ` Du, Bin
2025-10-24  9:06 ` [PATCH v5 2/7] media: platform: amd: low level support for isp4 firmware Bin Du
2025-10-24  9:06 ` [PATCH v5 3/7] media: platform: amd: Add isp4 fw and hw interface Bin Du
2025-10-24  9:06 ` [PATCH v5 4/7] media: platform: amd: isp4 subdev and firmware loading handling added Bin Du
2025-10-24  9:06 ` [PATCH v5 5/7] media: platform: amd: isp4 video node and buffers " Bin Du
2025-10-24  9:06 ` [PATCH v5 6/7] media: platform: amd: isp4 debug fs logging and more descriptive errors Bin Du
2025-10-24  9:06 ` [PATCH v5 7/7] Documentation: add documentation of AMD isp 4 driver Bin Du
2025-11-05  9:25 ` [PATCH v5 0/7] Add AMD ISP4 driver Sultan Alsawaf
2025-11-10  8:33   ` Sultan Alsawaf
2025-11-10  9:46     ` Du, Bin
2025-11-11  9:05       ` Sultan Alsawaf
2025-11-11  9:58         ` Du, Bin
2025-11-11 23:33           ` Sultan Alsawaf
2025-11-12  1:21             ` Sultan Alsawaf
2025-11-12  6:32               ` Du, Bin
2025-11-12  7:06                 ` Sultan Alsawaf
2025-11-12  7:38                   ` Sultan Alsawaf
2025-11-12 10:21                     ` Du, Bin
2025-11-18  7:35                       ` Sultan Alsawaf
2025-11-19 10:14                         ` Du, Bin
2025-11-21  8:20                           ` Sultan Alsawaf
2025-11-21 14:32                             ` Mario Limonciello [this message]
2025-11-21 15:39                               ` Sultan Alsawaf
2025-11-21 15:46                                 ` Mario Limonciello
2025-11-21 16:31                                   ` Sultan Alsawaf
2025-11-21 17:52                                     ` Mario Limonciello
2025-11-22  2:55                             ` Sultan Alsawaf
2025-11-22  3:19                               ` Sultan Alsawaf
2025-11-25  7:55                                 ` Sultan Alsawaf
2025-11-27  6:16                                   ` Du, Bin
2025-11-27  6:40                                     ` Sultan Alsawaf

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=546a8d8e-514b-4f6b-bf46-1ec0c9974e7e@amd.com \
    --to=mario.limonciello@amd.com \
    --cc=Dominic.Antony@amd.com \
    --cc=Phil.Jawich@amd.com \
    --cc=anson.tsao@amd.com \
    --cc=benjamin.chan@amd.com \
    --cc=bin.du@amd.com \
    --cc=bryan.odonoghue@linaro.org \
    --cc=gjorgji.rosikopulos@amd.com \
    --cc=hverkuil@xs4all.nl \
    --cc=king.li@amd.com \
    --cc=laurent.pinchart+renesas@ideasonboard.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-media@vger.kernel.org \
    --cc=mchehab@kernel.org \
    --cc=prabhakar.mahadev-lad.rj@bp.renesas.com \
    --cc=pratap.nirujogi@amd.com \
    --cc=richard.gong@amd.com \
    --cc=sakari.ailus@linux.intel.com \
    --cc=sultan@kerneltoast.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox