From: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
To: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com>,
Chris Wilson <chris@chris-wilson.co.uk>,
Intel-gfx@lists.freedesktop.org,
Tvrtko Ursulin <tursulin@ursulin.net>
Subject: Re: [CI 2/2] drm/i915/perf: reuse intel_lrc ctx regs macro
Date: Wed, 15 Aug 2018 11:50:55 +0100 [thread overview]
Message-ID: <1f21f8a3-a84d-16fd-b70b-8fdbf2670ae2@intel.com> (raw)
In-Reply-To: <b16b0452-daf9-fcd6-14a3-e892d8b02c4e@linux.intel.com>
On 15/08/18 09:49, Tvrtko Ursulin wrote:
>
> On 14/08/2018 19:57, Chris Wilson wrote:
>> Quoting Tvrtko Ursulin (2018-08-14 19:49:46)
>>>
>>> On 13/08/2018 10:16, Chris Wilson wrote:
>>>> Quoting Tvrtko Ursulin (2018-08-13 10:11:44)
>>>>>
>>>>> On 13/08/2018 09:16, Chris Wilson wrote:
>>>>>> Quoting Tvrtko Ursulin (2018-08-13 09:02:18)
>>>>>>> From: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
>>>>>>>
>>>>>>> Abstract the context image access a bit.
>>>>>>>
>>>>>>> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
>>>>>>> Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>>>>>>> ---
>>>>>>> drivers/gpu/drm/i915/i915_perf.c | 34
>>>>>>> +++++++++++++++-----------------
>>>>>>> 1 file changed, 16 insertions(+), 18 deletions(-)
>>>>>>>
>>>>>>> diff --git a/drivers/gpu/drm/i915/i915_perf.c
>>>>>>> b/drivers/gpu/drm/i915/i915_perf.c
>>>>>>> index 49597cf31707..ccb20230df2c 100644
>>>>>>> --- a/drivers/gpu/drm/i915/i915_perf.c
>>>>>>> +++ b/drivers/gpu/drm/i915/i915_perf.c
>>>>>>> @@ -210,6 +210,7 @@
>>>>>>> #include "i915_oa_cflgt3.h"
>>>>>>> #include "i915_oa_cnl.h"
>>>>>>> #include "i915_oa_icl.h"
>>>>>>> +#include "intel_lrc_reg.h"
>>>>>>> /* HW requires this to be a power of two, between 128k
>>>>>>> and 16M, though driver
>>>>>>> * is currently generally designed assuming the largest 16M
>>>>>>> size is used such
>>>>>>> @@ -1636,27 +1637,25 @@ static void
>>>>>>> gen8_update_reg_state_unlocked(struct i915_gem_context *ctx,
>>>>>>> u32 ctx_oactxctrl =
>>>>>>> dev_priv->perf.oa.ctx_oactxctrl_offset;
>>>>>>> u32 ctx_flexeu0 = dev_priv->perf.oa.ctx_flexeu0_offset;
>>>>>>> /* The MMIO offsets for Flex EU registers aren't
>>>>>>> contiguous */
>>>>>>> - u32 flex_mmio[] = {
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL0),
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL1),
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL2),
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL3),
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL4),
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL5),
>>>>>>> - i915_mmio_reg_offset(EU_PERF_CNTL6),
>>>>>>> + i915_reg_t flex_regs[] = {
>>>>>>> + EU_PERF_CNTL0,
>>>>>>> + EU_PERF_CNTL1,
>>>>>>> + EU_PERF_CNTL2,
>>>>>>> + EU_PERF_CNTL3,
>>>>>>> + EU_PERF_CNTL4,
>>>>>>> + EU_PERF_CNTL5,
>>>>>>> + EU_PERF_CNTL6,
>>>>>>> };
>>>>>>> int i;
>>>>>>> - reg_state[ctx_oactxctrl] =
>>>>>>> i915_mmio_reg_offset(GEN8_OACTXCONTROL);
>>>>>>> - reg_state[ctx_oactxctrl+1] =
>>>>>>> (dev_priv->perf.oa.period_exponent <<
>>>>>>> - GEN8_OA_TIMER_PERIOD_SHIFT) |
>>>>>>> - (dev_priv->perf.oa.periodic ?
>>>>>>> - GEN8_OA_TIMER_ENABLE : 0) |
>>>>>>> - GEN8_OA_COUNTER_RESUME;
>>>>>>> + CTX_REG(reg_state, ctx_oactxctrl, GEN8_OACTXCONTROL,
>>>>>>> + (dev_priv->perf.oa.period_exponent <<
>>>>>>> GEN8_OA_TIMER_PERIOD_SHIFT) |
>>>>>>> + (dev_priv->perf.oa.periodic ?
>>>>>>> GEN8_OA_TIMER_ENABLE : 0) |
>>>>>>> + GEN8_OA_COUNTER_RESUME);
>>>>>>
>>>>>> I'll be honest but, I don't think it's CTX_REG() that helps
>>>>>> improve the
>>>>>> readability here.
>>>>>>
>>>>>> The really odd part is that this sticks itself into a bare part
>>>>>> of the
>>>>>> register state not surrounded by any LRI and after a BB_END. This
>>>>>> routine can only work for established contexts, it should not
>>>>>> work for
>>>>>> execlists_init_reg_state.
>>>>>
>>>>> Unless I am missing something change is completely mechanical, so any
>>>>> question marks you have are already there, right? What do you
>>>>> suggest is
>>>>> the action here?
>>>>
>>>> Sure, the only thing I question of this patch itself is whether
>>>> CTX_REG() is simply too much horrible obfuscating magic.
>>>
>>> Turn a blind eye if the perceived badness factor is below some
>>> threshold? Following patch depends on this one, so if I have to drop
>>> this one, then I have to rework the next one etc.. well, it's not the
>>> worst problem, so yeah, whatever. Make a call and let me know.
>>
>> The patch was fine, just worrying about the surrounding code.
>
> I misunderstood. So only about ctx_oactxctrl_offset and
> ctx_flexeu0_offset from i915_perf.c? Maybe that is some OA magic, I
> have not idea. CC-ing Lionel in case he can shed some light on it.
Those are the offsets at which the hardware will store the
OA_CTXCTRL/FLEX_EU registers values as documented.
I can see that's it's a bit odd not to have the MI_LRI written before we
do the first restore.
I'm 99% sure I've verified in practice that application started after
i915/perf is opened have the right values programmed.
Not completely sure that the IGT tests cover that case though.
So maybe there is a problem with the first restore...
What's the value set into most register that aren't explicitly
programmed in intel_lrc.c ?
-
Lionel
>
> Regards,
>
> Tvrtko
>
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx
next prev parent reply other threads:[~2018-08-15 10:51 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-08-13 8:02 [CI 0/2] Reviewed perf cleanups Tvrtko Ursulin
2018-08-13 8:02 ` [CI 1/2] drm/i915/perf: simplify configure all context function Tvrtko Ursulin
2018-08-13 8:02 ` [CI 2/2] drm/i915/perf: reuse intel_lrc ctx regs macro Tvrtko Ursulin
2018-08-13 8:16 ` Chris Wilson
2018-08-13 9:11 ` Tvrtko Ursulin
2018-08-13 9:16 ` Chris Wilson
2018-08-14 18:49 ` Tvrtko Ursulin
2018-08-14 18:57 ` Chris Wilson
2018-08-15 8:49 ` Tvrtko Ursulin
2018-08-15 10:50 ` Lionel Landwerlin [this message]
2018-08-15 10:56 ` Chris Wilson
2018-08-15 11:15 ` Lionel Landwerlin
2018-08-13 8:33 ` ✓ Fi.CI.BAT: success for Reviewed perf cleanups Patchwork
2018-08-13 10:00 ` ✓ Fi.CI.IGT: " Patchwork
2018-08-31 15:27 ` Chris Wilson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1f21f8a3-a84d-16fd-b70b-8fdbf2670ae2@intel.com \
--to=lionel.g.landwerlin@intel.com \
--cc=Intel-gfx@lists.freedesktop.org \
--cc=chris@chris-wilson.co.uk \
--cc=tursulin@ursulin.net \
--cc=tvrtko.ursulin@linux.intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).