From: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com>
To: Umesh Nerlige Ramappa <umesh.nerlige.ramappa@intel.com>
Cc: intel-gfx@lists.freedesktop.org
Subject: Re: [Intel-gfx] [PATCH 5/6] drm/i915/pmu: Prepare for multi-tile non-engine counters
Date: Mon, 15 May 2023 11:10:56 +0100 [thread overview]
Message-ID: <45f66bdd-7a67-d6a2-e1cb-111c78c0a70f@linux.intel.com> (raw)
In-Reply-To: <ZF6oV4FT73yWtWD7@orsosgc001.jf.intel.com>
On 12/05/2023 21:57, Umesh Nerlige Ramappa wrote:
> On Fri, May 12, 2023 at 11:56:18AM +0100, Tvrtko Ursulin wrote:
>>
>> On 12/05/2023 02:08, Dixit, Ashutosh wrote:
>>> On Fri, 05 May 2023 17:58:15 -0700, Umesh Nerlige Ramappa wrote:
>>>>
>>>> From: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>>>>
>>>> Reserve some bits in the counter config namespace which will carry the
>>>> tile id and prepare the code to handle this.
>>>>
>>>> No per tile counters have been added yet.
>>>>
>>>> v2:
>>>> - Fix checkpatch issues
>>>> - Use 4 bits for gt id in non-engine counters. Drop FIXME.
>>>> - Set MAX GTs to 4. Drop FIXME.
>>>>
>>>> Signed-off-by: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
>>>> Signed-off-by: Umesh Nerlige Ramappa <umesh.nerlige.ramappa@intel.com>
8<
>>>> +static u64 frequency_enabled_mask(void)
>>>> +{
>>>> + unsigned int i;
>>>> + u64 mask = 0;
>>>> +
>>>> + for (i = 0; i < I915_PMU_MAX_GTS; i++)
>>>> + mask |= config_mask(__I915_PMU_ACTUAL_FREQUENCY(i)) |
>>>> + config_mask(__I915_PMU_REQUESTED_FREQUENCY(i));
>>>> +
>>>> + return mask;
>>>> +}
>>>> +
>>>> static bool pmu_needs_timer(struct i915_pmu *pmu, bool gpu_active)
>>>> {
>>>> struct drm_i915_private *i915 = container_of(pmu, typeof(*i915),
>>>> pmu);
>>>> @@ -120,9 +147,7 @@ static bool pmu_needs_timer(struct i915_pmu
>>>> *pmu, bool gpu_active)
>>>> * Mask out all the ones which do not need the timer, or in
>>>> * other words keep all the ones that could need the timer.
>>>> */
>>>> - enable &= config_mask(I915_PMU_ACTUAL_FREQUENCY) |
>>>> - config_mask(I915_PMU_REQUESTED_FREQUENCY) |
>>>> - ENGINE_SAMPLE_MASK;
>>>> + enable &= frequency_enabled_mask() | ENGINE_SAMPLE_MASK;
>>>
>>> u32 enable & u64 frequency_enabled_mask
>>>
>>> ugly but ok I guess? Or change enable to u64?
>
> making pmu->enable u64 as well as other places where it is assigned to
> local variables.
>
>>
>> Hmm.. yes very ugly. Could have been an accident which happened to
>> work because there is a single timer (not per tile).
>
> Happened to work because the frequency mask does not spill over to the
> upper 32 bits (even for multi tile).
>
> --------------------- START_SECTION ----------------
>>
>> Similar issue in frequency_sampling_enabled too. Gt_id argument to it
>> seems pointless.
>
> Not sure why it's pointless. We need the gt_id to determine the right
> mask for that specific gt. If it's not enabled, then we just return
> without pm_get and async put (like you mention later).
> And this piece of code is called within for_each_gt.
I think I got a little confused cross referencing the code and patches
last week and did not mentally see all the changes.
Because the hunk in other_bit() is correctly adding support for per gt bits.
The layout of pmu->enable ends up like this:
bits 0 - 2: engine events
bits 3 - 5: gt0 other events
bits 6 - 8: gt1 other events
bits 9 - 11: gt2 other events
bits 12 - 14: gt3 other events
>> So I now think whole frequency_enabled_mask() is just pointless and
>> should be removed. And then pmu_needs_time code can stay as is.
>> Possibly add a config_mask_32 helper which ensures no bits in upper 32
>> bits are returned.
>>
>> That is if we are happy for the frequency_sampling_enabled returning
>> true for all gts, regardless of which ones actually have frequency
>> sampling enabled.
>>
>> Or if we want to implement it as I probably have intended, we will
>> need to add some gt bits into pmu->enable. Maybe reserve top four same
>> as with config counters.
>
> Nope. What you have here works just fine. pmu->enable should not include
> any gt id info. gt_id[63:60] is only a concept for pmu config sent by
> user. config_mask and pmu->enable are i915 internal bookkeeping (bit
> masks) just to track what events need to be sampled. The 'other' bit
> masks are a function of gt_id because we use gt_id to calculate a
> contiguous numerical value for these 'other' events. That's all. Once
> the numerical value is calculated, there is no need for gt_id because
> config_mask is BIT_ULL(numerical_value). Since the numerical values
> never exceeded 31 (even for multi-gts), everything worked even with 32
> bit pmu->enable.
Yep.
So question then is why make pmu->enable u64?
Instead frequency_enabled_mask() should be made u32 since the bitwise or
composition of config_masks() is guaranteed to fit.
At most it can have an internal u64 for the mask, assert upper_32_bits
are zero and return lower_32_bits.
Did I get it right this time round? :)
Regards,
Tvrtko
next prev parent reply other threads:[~2023-05-15 10:11 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-05-06 0:58 [Intel-gfx] [PATCH 0/6] Add MTL PMU support for multi-gt Umesh Nerlige Ramappa
2023-05-06 0:58 ` [Intel-gfx] [PATCH 1/6] drm/i915/pmu: Support PMU for all engines Umesh Nerlige Ramappa
2023-05-08 17:52 ` Umesh Nerlige Ramappa
2023-05-09 12:26 ` Tvrtko Ursulin
2023-05-06 0:58 ` [Intel-gfx] [PATCH 2/6] drm/i915/pmu: Skip sampling engines with no enabled counters Umesh Nerlige Ramappa
2023-05-08 17:53 ` Umesh Nerlige Ramappa
2023-05-06 0:58 ` [Intel-gfx] [PATCH 3/6] drm/i915/pmu: Transform PMU parking code to be GT based Umesh Nerlige Ramappa
2023-05-08 17:55 ` Umesh Nerlige Ramappa
2023-05-09 15:10 ` Umesh Nerlige Ramappa
2023-05-06 0:58 ` [Intel-gfx] [PATCH 4/6] drm/i915/pmu: Add reference counting to the sampling timer Umesh Nerlige Ramappa
2023-05-08 17:58 ` Umesh Nerlige Ramappa
2023-05-09 17:25 ` Dixit, Ashutosh
2023-05-10 6:02 ` Dixit, Ashutosh
2023-05-12 22:29 ` Dixit, Ashutosh
2023-05-12 22:44 ` Umesh Nerlige Ramappa
2023-05-12 23:20 ` Dixit, Ashutosh
2023-05-12 23:44 ` Umesh Nerlige Ramappa
2023-05-15 9:52 ` Tvrtko Ursulin
2023-05-15 21:24 ` Dixit, Ashutosh
2023-05-16 7:12 ` Tvrtko Ursulin
2023-05-16 16:29 ` Dixit, Ashutosh
2023-05-06 0:58 ` [Intel-gfx] [PATCH 5/6] drm/i915/pmu: Prepare for multi-tile non-engine counters Umesh Nerlige Ramappa
2023-05-08 18:07 ` Umesh Nerlige Ramappa
2023-05-12 1:08 ` Dixit, Ashutosh
2023-05-12 10:56 ` Tvrtko Ursulin
2023-05-12 20:57 ` Umesh Nerlige Ramappa
2023-05-12 22:37 ` Umesh Nerlige Ramappa
2023-05-13 1:09 ` Dixit, Ashutosh
2023-05-15 10:10 ` Tvrtko Ursulin [this message]
2023-05-15 22:07 ` Dixit, Ashutosh
2023-05-16 8:35 ` Tvrtko Ursulin
2023-05-06 0:58 ` [Intel-gfx] [PATCH 6/6] drm/i915/pmu: Export counters from all tiles Umesh Nerlige Ramappa
2023-05-08 18:08 ` Umesh Nerlige Ramappa
2023-05-09 12:38 ` Tvrtko Ursulin
2023-05-11 18:57 ` Dixit, Ashutosh
2023-05-12 10:57 ` Tvrtko Ursulin
2023-05-12 17:08 ` Dixit, Ashutosh
2023-05-12 18:53 ` Umesh Nerlige Ramappa
2023-05-12 20:10 ` Dixit, Ashutosh
2023-05-06 2:20 ` [Intel-gfx] ✗ Fi.CI.CHECKPATCH: warning for Add MTL PMU support for multi-gt (rev2) Patchwork
2023-05-06 2:21 ` [Intel-gfx] ✗ Fi.CI.SPARSE: " Patchwork
2023-05-06 2:38 ` [Intel-gfx] ✗ Fi.CI.BAT: failure " Patchwork
-- strict thread matches above, loose matches on Subject: below --
2023-05-13 1:55 [Intel-gfx] [PATCH 0/6] Add MTL PMU support for multi-gt Umesh Nerlige Ramappa
2023-05-13 1:55 ` [Intel-gfx] [PATCH 5/6] drm/i915/pmu: Prepare for multi-tile non-engine counters Umesh Nerlige Ramappa
2023-05-13 4:41 ` Dixit, Ashutosh
2023-05-15 6:16 ` Umesh Nerlige Ramappa
2023-05-15 6:44 [Intel-gfx] [PATCH v4 0/6] Add MTL PMU support for multi-gt Umesh Nerlige Ramappa
2023-05-15 6:44 ` [Intel-gfx] [PATCH 5/6] drm/i915/pmu: Prepare for multi-tile non-engine counters Umesh Nerlige Ramappa
2023-05-15 10:12 ` Tvrtko Ursulin
2023-05-15 18:39 ` Umesh Nerlige Ramappa
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=45f66bdd-7a67-d6a2-e1cb-111c78c0a70f@linux.intel.com \
--to=tvrtko.ursulin@linux.intel.com \
--cc=intel-gfx@lists.freedesktop.org \
--cc=umesh.nerlige.ramappa@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox