All of lore.kernel.org
 help / color / mirror / Atom feed
From: Pekka Paalanen <pekka.paalanen@collabora.com>
To: Alex Hung <alex.hung@amd.com>
Cc: <dri-devel@lists.freedesktop.org>,
	<amd-gfx@lists.freedesktop.org>,
	<wayland-devel@lists.freedesktop.org>, <harry.wentland@amd.com>,
	<leo.liu@amd.com>, <ville.syrjala@linux.intel.com>,
	<contact@emersion.fr>, <mwen@igalia.com>, <jadahl@redhat.com>,
	<sebastian.wick@redhat.com>, <shashank.sharma@amd.com>,
	<agoins@nvidia.com>, <joshua@froggi.es>, <mdaenzer@redhat.com>,
	<aleixpol@kde.org>, <xaver.hugl@gmail.com>,
	<victoria@system76.com>, <daniel@ffwll.ch>,
	<uma.shankar@intel.com>, <quic_naseer@quicinc.com>,
	<quic_cbraga@quicinc.com>, <quic_abhinavk@quicinc.com>,
	<marcan@marcan.st>, <Liviu.Dudau@arm.com>,
	<sashamcintosh@google.com>, <chaitanya.kumar.borah@intel.com>,
	<louis.chauvet@bootlin.com>, <mcanal@igalia.com>,
	<nfraprado@collabora.com>, Daniel Stone <daniels@collabora.com>
Subject: Re: [PATCH V11 17/47] drm/vkms: Use s32 for internal color pipeline precision
Date: Tue, 30 Sep 2025 10:07:40 +0300	[thread overview]
Message-ID: <20250930100740.40b718bb@eldfell> (raw)
In-Reply-To: <20250815035047.3319284-18-alex.hung@amd.com>

[-- Attachment #1: Type: text/plain, Size: 4901 bytes --]

On Thu, 14 Aug 2025 21:50:06 -0600
Alex Hung <alex.hung@amd.com> wrote:

> From: Harry Wentland <harry.wentland@amd.com>
> 
> Certain operations require us to preserve values below 0.0 and
> above 1.0 (0x0 and 0xffff respectively in 16 bpc unorm). One
> such operation is a BT709 encoding operation followed by its
> decoding operation, or the reverse.
> 
> We'll use s32 values as intermediate in and outputs of our
> color operations, for the operations where it matters.
> 
> For now this won't apply to LUT operations. We might want to
> update those to work on s32 as well, but it's unclear how
> that should work for unorm LUT definitions. We'll revisit
> that once we add LUT + CTM tests.
> 
> In order to allow for this we'll also invert the nesting of our
> colorop processing loops. We now use the pixel iteration loop
> on the outside and the colorop iteration on the inside.

Hi Alex,

is this an out-dated paragraph in the commit message?

I don't see the patch inverting the nesting of loops.

That statement worried me, because changing the loop structures has
tanked the performance before.


Thanks,
pq

> 
> Reviewed-by: Louis Chauvet <louis.chauvet@bootlin.com>
> Signed-off-by: Alex Hung <alex.hung@amd.com>
> Signed-off-by: Harry Wentland <harry.wentland@amd.com>
> Reviewed-by: Daniel Stone <daniels@collabora.com>
> ---
> v7:
>  - Fix checkpatch warnings
>   - Add a commit messages
>   - Fix code styles by adding and removing spaces (new lines, tabs and so on)
> 
> v6:
>  - use clamp_val instead of manual clamping (Louis Chauvet)
> 
> v4:
>  - Clarify that we're packing 16-bit UNORM into s32, not
>    converting values to a different representation (Pekka)
> 
> v3:
>  - Use new colorop->next pointer
> 
>  drivers/gpu/drm/vkms/vkms_composer.c | 27 +++++++++++++++++++++++++--
>  drivers/gpu/drm/vkms/vkms_drv.h      |  4 ++++
>  2 files changed, 29 insertions(+), 2 deletions(-)
> 
> diff --git a/drivers/gpu/drm/vkms/vkms_composer.c b/drivers/gpu/drm/vkms/vkms_composer.c
> index 0f3fcd6a5925..6630dccd68a4 100644
> --- a/drivers/gpu/drm/vkms/vkms_composer.c
> +++ b/drivers/gpu/drm/vkms/vkms_composer.c
> @@ -128,7 +128,7 @@ static void apply_lut(const struct vkms_crtc_state *crtc_state, struct line_buff
>  	}
>  }
>  
> -static void apply_colorop(struct pixel_argb_u16 *pixel, struct drm_colorop *colorop)
> +static void apply_colorop(struct pixel_argb_s32 *pixel, struct drm_colorop *colorop)
>  {
>  	struct drm_colorop_state *colorop_state = colorop->state;
>  	struct drm_device *dev = colorop->dev;
> @@ -157,9 +157,26 @@ static void apply_colorop(struct pixel_argb_u16 *pixel, struct drm_colorop *colo
>  static void pre_blend_color_transform(const struct vkms_plane_state *plane_state,
>  				      struct line_buffer *output_buffer)
>  {
> +	struct pixel_argb_s32 pixel;
> +
>  	for (size_t x = 0; x < output_buffer->n_pixels; x++) {
>  		struct drm_colorop *colorop = plane_state->base.base.color_pipeline;
>  
> +		/*
> +		 * Some operations, such as applying a BT709 encoding matrix,
> +		 * followed by a decoding matrix, require that we preserve
> +		 * values above 1.0 and below 0.0 until the end of the pipeline.
> +		 *
> +		 * Pack the 16-bit UNORM values into s32 to give us head-room to
> +		 * avoid clipping until we're at the end of the pipeline. Clip
> +		 * intentionally at the end of the pipeline before packing
> +		 * UNORM values back into u16.
> +		 */
> +		pixel.a = output_buffer->pixels[x].a;
> +		pixel.r = output_buffer->pixels[x].r;
> +		pixel.g = output_buffer->pixels[x].g;
> +		pixel.b = output_buffer->pixels[x].b;
> +
>  		while (colorop) {
>  			struct drm_colorop_state *colorop_state;
>  
> @@ -169,10 +186,16 @@ static void pre_blend_color_transform(const struct vkms_plane_state *plane_state
>  				return;
>  
>  			if (!colorop_state->bypass)
> -				apply_colorop(&output_buffer->pixels[x], colorop);
> +				apply_colorop(&pixel, colorop);
>  
>  			colorop = colorop->next;
>  		}
> +
> +		/* clamp values */
> +		output_buffer->pixels[x].a = clamp_val(pixel.a, 0, 0xffff);
> +		output_buffer->pixels[x].r = clamp_val(pixel.r, 0, 0xffff);
> +		output_buffer->pixels[x].g = clamp_val(pixel.g, 0, 0xffff);
> +		output_buffer->pixels[x].b = clamp_val(pixel.b, 0, 0xffff);
>  	}
>  }
>  
> diff --git a/drivers/gpu/drm/vkms/vkms_drv.h b/drivers/gpu/drm/vkms/vkms_drv.h
> index 30941714cd0f..55440ec6db52 100644
> --- a/drivers/gpu/drm/vkms/vkms_drv.h
> +++ b/drivers/gpu/drm/vkms/vkms_drv.h
> @@ -45,6 +45,10 @@ struct vkms_frame_info {
>  	unsigned int rotation;
>  };
>  
> +struct pixel_argb_s32 {
> +	s32 a, r, g, b;
> +};
> +
>  /**
>   * struct pixel_argb_u16 - Internal representation of a pixel color.
>   * @a: Alpha component value, stored in 16 bits, without padding, using


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 833 bytes --]

  reply	other threads:[~2025-10-03  7:03 UTC|newest]

Thread overview: 94+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-08-15  3:49 [PATCH V11 00/47] Color Pipeline API w/ VKMS Alex Hung
2025-08-15  3:49 ` [PATCH V11 01/47] drm: Add helper for conversion from signed-magnitude Alex Hung
2025-08-15  3:49 ` [PATCH V11 02/47] drm/vkms: Add kunit tests for VKMS LUT handling Alex Hung
2025-08-15 18:48   ` kernel test robot
2025-08-15 21:26   ` kernel test robot
2025-09-19 12:43   ` Louis Chauvet
2025-08-15  3:49 ` [PATCH V11 03/47] drm/doc/rfc: Describe why prescriptive color pipeline is needed Alex Hung
2025-08-15  3:49 ` [PATCH V11 04/47] drm/colorop: Introduce new drm_colorop mode object Alex Hung
2025-08-15  3:49 ` [PATCH V11 05/47] drm/colorop: Add TYPE property Alex Hung
2025-08-15  3:49 ` [PATCH V11 06/47] drm/colorop: Add 1D Curve subtype Alex Hung
2025-08-19 15:11   ` Sebastian Wick
2025-08-21 12:23     ` Xaver Hugl
2025-08-21 17:54       ` Alex Hung
2025-08-26  9:03         ` Pekka Paalanen
2025-09-16 23:01           ` Alex Hung
2025-09-18  8:40             ` Pekka Paalanen
2025-09-23  3:16               ` Alex Hung
2025-09-23  7:59                 ` Pekka Paalanen
2025-09-23 14:03                   ` Alex Hung
2025-09-23 16:16                   ` Alex Hung
2025-09-23 17:41                     ` Alex Hung
2025-09-25  8:11                       ` Pekka Paalanen
2025-09-25 18:22                         ` Harry Wentland
2025-09-27 18:35                           ` Shengyu Qu
2025-08-15  3:49 ` [PATCH V11 07/47] drm/colorop: Add BYPASS property Alex Hung
2025-08-19 15:15   ` Sebastian Wick
2025-08-20 17:57     ` Alex Hung
2025-08-15  3:49 ` [PATCH V11 08/47] drm/colorop: Add NEXT property Alex Hung
2025-08-15  3:49 ` [PATCH V11 09/47] drm/colorop: Add atomic state print for drm_colorop Alex Hung
2025-08-15  3:49 ` [PATCH V11 10/47] drm/plane: Add COLOR PIPELINE property Alex Hung
2025-08-15  3:50 ` [PATCH V11 11/47] drm/colorop: Introduce DRM_CLIENT_CAP_PLANE_COLOR_PIPELINE Alex Hung
2025-09-15 18:43   ` Nícolas F. R. A. Prado
2025-09-17  2:23     ` Alex Hung
2025-08-15  3:50 ` [PATCH V11 12/47] Documentation/gpu: document drm_colorop Alex Hung
2025-08-15  3:50 ` [PATCH V11 13/47] drm/colorop: Add destroy functions for color pipeline Alex Hung
2025-09-05 17:12   ` Louis Chauvet
2025-09-17  2:01     ` Alex Hung
2025-09-17 15:31       ` Nícolas F. R. A. Prado
2025-09-18  0:50         ` Alex Hung
2025-08-15  3:50 ` [PATCH V11 14/47] drm/vkms: Add enumerated 1D curve colorop Alex Hung
2025-09-05 17:12   ` Louis Chauvet
2025-09-17  1:54     ` Alex Hung
2025-09-17 14:47       ` Nícolas F. R. A. Prado
2025-09-18  0:45         ` Alex Hung
2025-09-19 12:49           ` Louis Chauvet
2025-09-20  3:31             ` Alex Hung
2025-09-22  8:27               ` Louis Chauvet
2025-08-15  3:50 ` [PATCH V11 15/47] drm/vkms: Add kunit tests for linear and sRGB LUTs Alex Hung
2025-08-15 20:34   ` kernel test robot
2025-08-15  3:50 ` [PATCH V11 16/47] drm/colorop: Add 3x4 CTM type Alex Hung
2025-08-15  3:50 ` [PATCH V11 17/47] drm/vkms: Use s32 for internal color pipeline precision Alex Hung
2025-09-30  7:07   ` Pekka Paalanen [this message]
2025-09-30 13:58     ` Harry Wentland
2025-08-15  3:50 ` [PATCH V11 18/47] drm/vkms: add 3x4 matrix in color pipeline Alex Hung
2025-08-15  3:50 ` [PATCH V11 19/47] drm/tests: Add a few tests around drm_fixed.h Alex Hung
2025-08-15  3:50 ` [PATCH V11 20/47] drm/vkms: Add tests for CTM handling Alex Hung
2025-08-15  3:50 ` [PATCH V11 21/47] drm/colorop: pass plane_color_pipeline client cap to atomic check Alex Hung
2025-08-15  3:50 ` [PATCH V11 22/47] drm/colorop: define a new macro for_each_new_colorop_in_state Alex Hung
2025-08-15  3:50 ` [PATCH V11 23/47] drm/amd/display: Ignore deprecated props when plane_color_pipeline set Alex Hung
2025-08-15  3:50 ` [PATCH V11 24/47] drm/amd/display: Add bypass COLOR PIPELINE Alex Hung
2025-08-15  3:50 ` [PATCH V11 25/47] drm/amd/display: Skip color pipeline initialization for cursor plane Alex Hung
2025-08-15  3:50 ` [PATCH V11 26/47] drm/amd/display: Add support for sRGB EOTF in DEGAM block Alex Hung
2025-08-15  3:50 ` [PATCH V11 27/47] drm/amd/display: Add support for sRGB Inverse EOTF in SHAPER block Alex Hung
2025-08-15  3:50 ` [PATCH V11 28/47] drm/amd/display: Add support for sRGB EOTF in BLND block Alex Hung
2025-08-16  2:21   ` kernel test robot
2025-08-15  3:50 ` [PATCH V11 29/47] drm/colorop: Add PQ 125 EOTF and its inverse Alex Hung
2025-08-15  3:50 ` [PATCH V11 30/47] drm/amd/display: Enable support for PQ 125 EOTF and Inverse Alex Hung
2025-08-15  3:50 ` [PATCH V11 31/47] drm/colorop: add BT2020/BT709 OETF and Inverse OETF Alex Hung
2025-08-15 17:54   ` Qu Shengyu
2025-08-15 19:26     ` Alex Hung
2025-08-16  2:45       ` Shengyu Qu
2025-08-16  3:28         ` Alex Hung
2025-09-18  9:26           ` Pekka Paalanen
2025-08-15  3:50 ` [PATCH V11 32/47] drm/amd/display: Add support for BT.709 and BT.2020 TFs Alex Hung
2025-08-15  3:50 ` [PATCH V11 33/47] drm: Add Enhanced LUT precision structure Alex Hung
2025-08-15  3:50 ` [PATCH V11 34/47] drm: Add helper to extract lut from struct drm_color_lut32 Alex Hung
2025-08-15  3:50 ` [PATCH V11 35/47] drm/colorop: Add 1D Curve Custom LUT type Alex Hung
2025-08-19 15:31   ` Sebastian Wick
2025-08-20 18:16     ` Alex Hung
2025-08-20 19:40       ` Sebastian Wick
2025-08-21 12:18         ` Xaver Hugl
2025-08-15  3:50 ` [PATCH V11 36/47] drm/amd/display: add shaper and blend colorops for 1D Curve Custom LUT Alex Hung
2025-08-15  3:50 ` [PATCH V11 37/47] drm/amd/display: add 3x4 matrix colorop Alex Hung
2025-08-15  3:50 ` [PATCH V11 38/47] drm/colorop: Add multiplier type Alex Hung
2025-08-15  3:50 ` [PATCH V11 39/47] drm/amd/display: add multiplier colorop Alex Hung
2025-08-15  3:50 ` [PATCH V11 40/47] drm/amd/display: Swap matrix and multiplier Alex Hung
2025-08-15  3:50 ` [PATCH V11 41/47] drm/colorop: Define LUT_1D interpolation Alex Hung
2025-08-15  3:50 ` [PATCH V11 42/47] drm/colorop: allow non-bypass colorops Alex Hung
2025-08-15  3:50 ` [PATCH V11 43/47] drm/colorop: Add 3D LUT support to color pipeline Alex Hung
2025-08-15  3:50 ` [PATCH V11 44/47] drm/amd/display: add 3D LUT colorop Alex Hung
2025-08-15  3:50 ` [PATCH V11 45/47] drm/amd/display: Add AMD color pipeline doc Alex Hung
2025-08-15  3:50 ` [PATCH V11 46/47] drm/amd/display: Ensure 3D LUT for color pipeline Alex Hung
2025-08-15  3:50 ` [PATCH V11 47/47] drm/amd/display: Disable CRTC degamma when color pipeline is enabled Alex Hung
2025-08-20 19:43 ` [PATCH V11 00/47] Color Pipeline API w/ VKMS Sebastian Wick

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20250930100740.40b718bb@eldfell \
    --to=pekka.paalanen@collabora.com \
    --cc=Liviu.Dudau@arm.com \
    --cc=agoins@nvidia.com \
    --cc=aleixpol@kde.org \
    --cc=alex.hung@amd.com \
    --cc=amd-gfx@lists.freedesktop.org \
    --cc=chaitanya.kumar.borah@intel.com \
    --cc=contact@emersion.fr \
    --cc=daniel@ffwll.ch \
    --cc=daniels@collabora.com \
    --cc=dri-devel@lists.freedesktop.org \
    --cc=harry.wentland@amd.com \
    --cc=jadahl@redhat.com \
    --cc=joshua@froggi.es \
    --cc=leo.liu@amd.com \
    --cc=louis.chauvet@bootlin.com \
    --cc=marcan@marcan.st \
    --cc=mcanal@igalia.com \
    --cc=mdaenzer@redhat.com \
    --cc=mwen@igalia.com \
    --cc=nfraprado@collabora.com \
    --cc=quic_abhinavk@quicinc.com \
    --cc=quic_cbraga@quicinc.com \
    --cc=quic_naseer@quicinc.com \
    --cc=sashamcintosh@google.com \
    --cc=sebastian.wick@redhat.com \
    --cc=shashank.sharma@amd.com \
    --cc=uma.shankar@intel.com \
    --cc=victoria@system76.com \
    --cc=ville.syrjala@linux.intel.com \
    --cc=wayland-devel@lists.freedesktop.org \
    --cc=xaver.hugl@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.