From: Eric Anholt <eric@anholt.net>
To: Boris Brezillon <boris.brezillon@bootlin.com>
Cc: dri-devel@lists.freedesktop.org
Subject: Re: [PATCH v2 3/3] drm/vc4: Add a load tracker to prevent HVS underflow errors
Date: Thu, 08 Nov 2018 08:26:49 -0800
Message-ID: <8736sbo7qu.fsf@anholt.net>
In-Reply-To: <20181105123648.4fe6d838@bbrezillon>
Boris Brezillon <boris.brezillon@bootlin.com> writes:
> Hi Eric,
>
> On Tue, 30 Oct 2018 16:12:55 -0700
> Eric Anholt <eric@anholt.net> wrote:
>> > +static void vc4_load_tracker_destroy_state(struct drm_private_obj *obj,
>> > +					   struct drm_private_state *state)
>> > +{
>> > +	struct vc4_load_tracker_state *load_state;
>> > +
>> > +	load_state = to_vc4_load_tracker_state(state);
>> > +	kfree(load_state);
>> > +}
>>
>> Optional: just kfree(state) for simplicity.
>
> Hm, not sure that's a good idea. kfree(state) works as long as
> drm_private_state is the first field in vc4_load_tracker_state, but it
> sounds a bit fragile.
>
> I can do
>
> 	kfree(to_vc4_load_tracker_state(state));
>
> if you prefer.

I said optional for that reason :) Just keep it as is.

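The fragility Boris describes can be shown with a minimal user-space sketch. Everything in it is a stand-in, not the vc4 code: `base_state` plays the role of drm_private_state, `tracker_state` plays vc4_load_tracker_state, `container_of` is a simplified version of the kernel macro, and it assumes to_vc4_load_tracker_state() is a container_of() wrapper. The base member is deliberately not first, which is the case where freeing the base pointer directly would hand the allocator the wrong address.

```c
#include <stddef.h>
#include <stdlib.h>

/* Stand-in for struct drm_private_state (hypothetical, user-space only). */
struct base_state {
	int dummy;
};

/* Stand-in for vc4_load_tracker_state; 'base' is deliberately NOT first. */
struct tracker_state {
	unsigned long long hvs_load;
	struct base_state base;
};

/* Simplified container_of: step back from a member to its enclosing struct. */
#define container_of(ptr, type, member) \
	((type *)((char *)(ptr) - offsetof(type, member)))

static struct tracker_state *to_tracker_state(struct base_state *state)
{
	return container_of(state, struct tracker_state, base);
}

/* Allocate a tracker and hand back only the embedded base pointer. */
static struct base_state *tracker_create(void)
{
	struct tracker_state *t = calloc(1, sizeof(*t));

	return t ? &t->base : NULL;
}

/* Correct wherever 'base' sits: frees the address the allocator returned. */
static void tracker_destroy(struct base_state *state)
{
	free(to_tracker_state(state));
}
```

With this layout, free(state) would pass a pointer offset into the allocation, while tracker_destroy() recovers the original address first.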
>> > +static void vc4_plane_calc_load(struct drm_plane_state *state)
>> > +{
>> > +	unsigned int hvs_load_shift, vrefresh, i;
>> > +	struct drm_framebuffer *fb = state->fb;
>> > +	struct vc4_plane_state *vc4_state;
>> > +	struct drm_crtc_state *crtc_state;
>> > +	unsigned int vscale_factor;
>> > +
>> > +	vc4_state = to_vc4_plane_state(state);
>> > +	crtc_state = drm_atomic_get_existing_crtc_state(state->state,
>> > +							state->crtc);
>> > +	vrefresh = drm_mode_vrefresh(&crtc_state->adjusted_mode);
>> > +
>> > +	/* The HVS is able to process 2 pixels/cycle when scaling the source,
>> > +	 * 4 pixels/cycle otherwise.
>> > +	 * Alpha blending step seems to be pipelined and it's always operating
>> > +	 * at 4 pixels/cycle, so the limiting aspect here seems to be the
>> > +	 * scaler block.
>> > +	 * HVS load is expressed in clk-cycles/sec (AKA Hz).
>> > +	 */
>> > +	if (vc4_state->x_scaling[0] != VC4_SCALING_NONE ||
>> > +	    vc4_state->x_scaling[1] != VC4_SCALING_NONE ||
>> > +	    vc4_state->y_scaling[0] != VC4_SCALING_NONE ||
>> > +	    vc4_state->y_scaling[1] != VC4_SCALING_NONE)
>> > +		hvs_load_shift = 1;
>> > +	else
>> > +		hvs_load_shift = 2;
>> > +
>> > +	vc4_state->membus_load = 0;
>> > +	vc4_state->hvs_load = 0;
>> > +	for (i = 0; i < fb->format->num_planes; i++) {
>> > +		unsigned long pixels_load;
>>
>> I'm scared any time I see longs. Do you want 32 or 64 bits here?
>
> I just assumed a 32bit unsigned var would be enough, so unsigned long
> seemed just fine. I can use u32 or u64 if you prefer.

Yes, please. See also Maxime's recent trouble with a 64-bit kernel.

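To spell out the concern: unsigned long is 32 bits on 32-bit ARM and 64 bits on arm64, so the same expression can wrap on one build and not on the other. Here is a rough sketch with fixed-width types, assuming the final load is pixels * cpp * vrefresh as the comment further down in the patch suggests. The helper names and the numbers are made up for illustration and are not from the patch.

```c
#include <stdint.h>

/*
 * Hypothetical helper: every multiply is done in 32 bits, so the
 * result is the true product modulo 2^32.
 */
static uint32_t plane_load_u32(uint32_t src_w, uint32_t src_h,
			       uint32_t vscale_factor, uint32_t cpp,
			       uint32_t vrefresh)
{
	return src_w * src_h * vscale_factor * cpp * vrefresh;
}

/*
 * Hypothetical helper: widening the first operand makes the whole
 * chain of multiplies happen in 64 bits.
 */
static uint64_t plane_load_u64(uint32_t src_w, uint32_t src_h,
			       uint32_t vscale_factor, uint32_t cpp,
			       uint32_t vrefresh)
{
	return (uint64_t)src_w * src_h * vscale_factor * cpp * vrefresh;
}
```

With a 3840x2160 source, cpp = 4, 60 Hz and a vscale_factor of 3, the 64-bit version gives 5971968000 while the 32-bit one wraps to 1677000704.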
>> > +		/* Even if the bandwidth/plane required for a single frame is
>> > +		 *
>> > +		 * vc4_state->src_w[i] * vc4_state->src_h[i] * cpp * vrefresh
>> > +		 *
>> > +		 * when downscaling, we have to read more pixels per line in
>> > +		 * the time frame reserved for a single line, so the bandwidth
>> > +		 * demand can be punctually higher. To account for that, we
>> > +		 * calculate the down-scaling factor and multiply the plane
>> > +		 * load by this number. We're likely over-estimating the read
>> > +		 * demand, but that's better than under-estimating it.
>> > +		 */
>> > +		vscale_factor = DIV_ROUND_UP(vc4_state->src_h[i],
>> > +					     vc4_state->crtc_h);
>> > +		pixels_load = vc4_state->src_w[i] * vc4_state->src_h[i] *
>> > +			      vscale_factor;
>>
>> If we're upscaling (common for video, right?), aren't we under-counting
>> the cost? You need to scale/colorspace-convert crtc_w * crtc_h at 2
>> pixels per cycle.
>
> That's not entirely clear to me. I'm not sure what the scaler does when
> upscaling. Are the same pixels read several times from the memory? If
> that's the case, then the membus load should indeed be based on the
> crtc_w,h.

I'm going to punt on this question because that would be a *lot* of
Verilog tracing to figure out for me (and I'm not sure I'd even trust
what I came up with).

> Also, when the spec says the HVS can process 4pixels/cycles, is it 4
> input pixels or 4 output pixels per cycle?

Well, it's 4 pixels per cycle when not scaling, so both :)
I think the scaling pipeline is doing two output pixels per cycle.
Nothing else would make sense to me.
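
For a sense of scale, here is a rough sketch of what that guess would mean for the HVS load. It is not code from the patch: `hvs_load_hz` is a made-up helper, and the 2 output pixels per cycle figure for the scaler is only the guess above, not something confirmed from the hardware. The shift values mirror hvs_load_shift in the quoted code.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper: HVS load in clk-cycles/sec for a w x h area
 * refreshed vrefresh times per second, assuming 2 pixels/cycle when
 * scaling and 4 pixels/cycle otherwise.
 */
static uint64_t hvs_load_hz(uint32_t w, uint32_t h, uint32_t vrefresh,
			    bool scaling)
{
	unsigned int shift = scaling ? 1 : 2;

	return ((uint64_t)w * h * vrefresh) >> shift;
}
```

Taking a 1280x720 source upscaled to 1920x1080 at 60 Hz: counting source pixels gives hvs_load_hz(1280, 720, 60, true) = 27648000, while counting output pixels gives hvs_load_hz(1920, 1080, 60, true) = 62208000, more than twice as much. An unscaled 1920x1080 plane comes out at 31104000.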