From: Tvrtko Ursulin <tvrtko.ursulin@linux.intel.com>
To: "Goel, Akash" <akash.goel@intel.com>, intel-gfx@lists.freedesktop.org
Subject: Re: [PATCH 05/20] drm/i915: Support for GuC interrupts
Date: Fri, 12 Aug 2016 14:31:36 +0100 [thread overview]
Message-ID: <57ADCFB8.9090603@linux.intel.com> (raw)
In-Reply-To: <8a99de0d-6547-8158-55a4-8ad6777bd428@intel.com>
On 12/08/16 14:10, Goel, Akash wrote:
> On 8/12/2016 5:24 PM, Tvrtko Ursulin wrote:
>>
>> On 12/08/16 07:25, akash.goel@intel.com wrote:
>>> From: Sagar Arun Kamble <sagar.a.kamble@intel.com>
>>>
>>> There are certain types of interrupts which Host can recieve from GuC.
>>> GuC ukernel sends an interrupt to Host for certain events, like for
>>> example retrieve/consume the logs generated by ukernel.
>>> This patch adds support to receive interrupts from GuC but currently
>>> enables & partially handles only the interrupt sent by GuC ukernel.
>>> Future patches will add support for handling other interrupt types.
>>>
>>> v2:
>>> - Use common low level routines for PM IER/IIR programming (Chris)
>>> - Rename interrupt functions to gen9_xxx from gen8_xxx (Chris)
>>> - Replace disabling of wake ref asserts with rpm get/put (Chris)
>>>
>>> v3:
>>> - Update comments for more clarity. (Tvrtko)
>>> - Remove the masking of GuC interrupt, which was kept masked till the
>>> start of bottom half, its not really needed as there is only a
>>> single instance of work item & wq is ordered. (Tvrtko)
>>>
>>> v4:
>>> - Rebase.
>>> - Rename guc_events to pm_guc_events so as to be indicative of the
>>> register/control block it is associated with. (Chris)
>>> - Add handling for back to back log buffer flush interrupts.
>>>
>>> v5:
>>> - Move the read & clearing of register, containing Guc2Host message
>>> bits, outside the irq spinlock. (Tvrtko)
>>>
>>> Signed-off-by: Sagar Arun Kamble <sagar.a.kamble@intel.com>
>>> Signed-off-by: Akash Goel <akash.goel@intel.com>
>>> ---
>>> drivers/gpu/drm/i915/i915_drv.h | 1 +
>>> drivers/gpu/drm/i915/i915_guc_submission.c | 5 ++
>>> drivers/gpu/drm/i915/i915_irq.c | 100
>>> +++++++++++++++++++++++++++--
>>> drivers/gpu/drm/i915/i915_reg.h | 11 ++++
>>> drivers/gpu/drm/i915/intel_drv.h | 3 +
>>> drivers/gpu/drm/i915/intel_guc.h | 4 ++
>>> drivers/gpu/drm/i915/intel_guc_loader.c | 4 ++
>>> 7 files changed, 124 insertions(+), 4 deletions(-)
>>>
>>> diff --git a/drivers/gpu/drm/i915/i915_drv.h
>>> b/drivers/gpu/drm/i915/i915_drv.h
>>> index a608a5c..28ffac5 100644
>>> --- a/drivers/gpu/drm/i915/i915_drv.h
>>> +++ b/drivers/gpu/drm/i915/i915_drv.h
>>> @@ -1779,6 +1779,7 @@ struct drm_i915_private {
>>> u32 pm_imr;
>>> u32 pm_ier;
>>> u32 pm_rps_events;
>>> + u32 pm_guc_events;
>>> u32 pipestat_irq_mask[I915_MAX_PIPES];
>>>
>>> struct i915_hotplug hotplug;
>>> diff --git a/drivers/gpu/drm/i915/i915_guc_submission.c
>>> b/drivers/gpu/drm/i915/i915_guc_submission.c
>>> index ad3b55f..c7c679f 100644
>>> --- a/drivers/gpu/drm/i915/i915_guc_submission.c
>>> +++ b/drivers/gpu/drm/i915/i915_guc_submission.c
>>> @@ -1071,6 +1071,8 @@ int intel_guc_suspend(struct drm_device *dev)
>>> if (guc->guc_fw.guc_fw_load_status != GUC_FIRMWARE_SUCCESS)
>>> return 0;
>>>
>>> + gen9_disable_guc_interrupts(dev_priv);
>>> +
>>> ctx = dev_priv->kernel_context;
>>>
>>> data[0] = HOST2GUC_ACTION_ENTER_S_STATE;
>>> @@ -1097,6 +1099,9 @@ int intel_guc_resume(struct drm_device *dev)
>>> if (guc->guc_fw.guc_fw_load_status != GUC_FIRMWARE_SUCCESS)
>>> return 0;
>>>
>>> + if (i915.guc_log_level >= 0)
>>> + gen9_enable_guc_interrupts(dev_priv);
>>> +
>>> ctx = dev_priv->kernel_context;
>>>
>>> data[0] = HOST2GUC_ACTION_EXIT_S_STATE;
>>> diff --git a/drivers/gpu/drm/i915/i915_irq.c
>>> b/drivers/gpu/drm/i915/i915_irq.c
>>> index 5f93309..5f1974f 100644
>>> --- a/drivers/gpu/drm/i915/i915_irq.c
>>> +++ b/drivers/gpu/drm/i915/i915_irq.c
>>> @@ -170,6 +170,7 @@ static void gen5_assert_iir_is_zero(struct
>>> drm_i915_private *dev_priv,
>>> } while (0)
>>>
>>> static void gen6_rps_irq_handler(struct drm_i915_private *dev_priv,
>>> u32 pm_iir);
>>> +static void gen9_guc_irq_handler(struct drm_i915_private *dev_priv,
>>> u32 pm_iir);
>>>
>>> /* For display hotplug interrupt */
>>> static inline void
>>> @@ -411,6 +412,38 @@ void gen6_disable_rps_interrupts(struct
>>> drm_i915_private *dev_priv)
>>> gen6_reset_rps_interrupts(dev_priv);
>>> }
>>>
>>> +void gen9_reset_guc_interrupts(struct drm_i915_private *dev_priv)
>>> +{
>>> + spin_lock_irq(&dev_priv->irq_lock);
>>> + gen6_reset_pm_iir(dev_priv, dev_priv->pm_guc_events);
>>> + spin_unlock_irq(&dev_priv->irq_lock);
>>> +}
>>> +
>>> +void gen9_enable_guc_interrupts(struct drm_i915_private *dev_priv)
>>> +{
>>> + spin_lock_irq(&dev_priv->irq_lock);
>>> + if (!dev_priv->guc.interrupts_enabled) {
>>> + WARN_ON_ONCE(I915_READ(gen6_pm_iir(dev_priv)) &
>>> + dev_priv->pm_guc_events);
>>> + dev_priv->guc.interrupts_enabled = true;
>>> + gen6_enable_pm_irq(dev_priv, dev_priv->pm_guc_events);
>>> + }
>>> + spin_unlock_irq(&dev_priv->irq_lock);
>>> +}
>>> +
>>> +void gen9_disable_guc_interrupts(struct drm_i915_private *dev_priv)
>>> +{
>>> + spin_lock_irq(&dev_priv->irq_lock);
>>> + dev_priv->guc.interrupts_enabled = false;
>>> +
>>> + gen6_disable_pm_irq(dev_priv, dev_priv->pm_guc_events);
>>> +
>>> + spin_unlock_irq(&dev_priv->irq_lock);
>>> + synchronize_irq(dev_priv->drm.irq);
>>> +
>>> + gen9_reset_guc_interrupts(dev_priv);
>>> +}
>>> +
>>> /**
>>> * bdw_update_port_irq - update DE port interrupt
>>> * @dev_priv: driver private
>>> @@ -1167,6 +1200,21 @@ static void gen6_pm_rps_work(struct work_struct
>>> *work)
>>> mutex_unlock(&dev_priv->rps.hw_lock);
>>> }
>>>
>>> +static void gen9_guc2host_events_work(struct work_struct *work)
>>> +{
>>> + struct drm_i915_private *dev_priv =
>>> + container_of(work, struct drm_i915_private, guc.events_work);
>>> +
>>> + spin_lock_irq(&dev_priv->irq_lock);
>>> + /* Speed up work cancellation during disabling guc interrupts. */
>>> + if (!dev_priv->guc.interrupts_enabled) {
>>> + spin_unlock_irq(&dev_priv->irq_lock);
>>> + return;
>>
>> I suppose locking for early exit is something about ensuring the worker
>> sees the update to dev_priv->guc.interrupts_enabled done on another CPU?
>
> Yes locking (providing implicit barrier) will ensure that update made
> from another CPU is immediately visible to the worker.
What if the disable happens after the unlock above? It would wait in
disable until the irq handler exits. So the same as if not bothering
with the spinlock above, no?
>> synchronize_irq there is not enough for some reason?
>>
> synchronize_irq would not be enough, its for a different purpose, to
> ensure that any ongoing handling of irq completes (after the caller has
> disabled the irq).
>
> As per my understanding synchronize_irq won't have an effect on the
> worker, with respect to the moment when the update of
> 'interrupts_enabled' flag is visible to the worker.
>
>>> + }
>>> + spin_unlock_irq(&dev_priv->irq_lock);
>>> +
>>> + /* TODO: Handle the events for which GuC interrupted host */
>>> +}
>>>
>>> /**
>>> * ivybridge_parity_work - Workqueue called when a parity error
>>> interrupt
>>> @@ -1339,11 +1387,13 @@ static irqreturn_t gen8_gt_irq_ack(struct
>>> drm_i915_private *dev_priv,
>>> DRM_ERROR("The master control interrupt lied (GT3)!\n");
>>> }
>>>
>>> - if (master_ctl & GEN8_GT_PM_IRQ) {
>>> + if (master_ctl & (GEN8_GT_PM_IRQ | GEN8_GT_GUC_IRQ)) {
>>> gt_iir[2] = I915_READ_FW(GEN8_GT_IIR(2));
>>> - if (gt_iir[2] & dev_priv->pm_rps_events) {
>>> + if (gt_iir[2] & (dev_priv->pm_rps_events |
>>> + dev_priv->pm_guc_events)) {
>>> I915_WRITE_FW(GEN8_GT_IIR(2),
>>> - gt_iir[2] & dev_priv->pm_rps_events);
>>> + gt_iir[2] & (dev_priv->pm_rps_events |
>>> + dev_priv->pm_guc_events));
>>> ret = IRQ_HANDLED;
>>> } else
>>> DRM_ERROR("The master control interrupt lied (PM)!\n");
>>> @@ -1375,6 +1425,9 @@ static void gen8_gt_irq_handler(struct
>>> drm_i915_private *dev_priv,
>>>
>>> if (gt_iir[2] & dev_priv->pm_rps_events)
>>> gen6_rps_irq_handler(dev_priv, gt_iir[2]);
>>> +
>>> + if (gt_iir[2] & dev_priv->pm_guc_events)
>>> + gen9_guc_irq_handler(dev_priv, gt_iir[2]);
>>> }
>>>
>>> static bool bxt_port_hotplug_long_detect(enum port port, u32 val)
>>> @@ -1621,6 +1674,41 @@ static void gen6_rps_irq_handler(struct
>>> drm_i915_private *dev_priv, u32 pm_iir)
>>> }
>>> }
>>>
>>> +static void gen9_guc_irq_handler(struct drm_i915_private *dev_priv,
>>> u32 gt_iir)
>>> +{
>>> + bool interrupts_enabled;
>>> +
>>> + if (gt_iir & GEN9_GUC_TO_HOST_INT_EVENT) {
>>> + spin_lock(&dev_priv->irq_lock);
>>> + interrupts_enabled = dev_priv->guc.interrupts_enabled;
>>> + spin_unlock(&dev_priv->irq_lock);
>>
>> Not sure that taking a lock around only this read is needed.
>>
> Again same reason as above, to make sure an update made on another CPU
> is immediately visible to the irq handler.
I don't get it, see above. :)
>>> + if (interrupts_enabled) {
>>> + /* Sample the log buffer flush related bits & clear them
>>> + * out now itself from the message identity register to
>>> + * minimize the probability of losing a flush interrupt,
>>> + * when there are back to back flush interrupts.
>>> + * There can be a new flush interrupt, for different log
>>> + * buffer type (like for ISR), whilst Host is handling
>>> + * one (for DPC). Since same bit is used in message
>>> + * register for ISR & DPC, it could happen that GuC
>>> + * sets the bit for 2nd interrupt but Host clears out
>>> + * the bit on handling the 1st interrupt.
>>> + */
>>> + u32 msg = I915_READ(SOFT_SCRATCH(15)) &
>>> + (GUC2HOST_MSG_CRASH_DUMP_POSTED |
>>> + GUC2HOST_MSG_FLUSH_LOG_BUFFER);
>>> + if (msg) {
>>> + /* Clear the message bits that are handled */
>>> + I915_WRITE(SOFT_SCRATCH(15),
>>> + I915_READ(SOFT_SCRATCH(15)) & ~msg);
>>
>> Cache full value of SOFT_SCRATCH(15) so you don't have to mmio read it
>> twice?
>>
> Thought reading it again (just before the update) is bit safer compared
> to reading it once, as there is a potential race problem here.
> GuC could also write to the SOFT_SCRATCH(15) register, set new events
> bit, while Host clears off the bit of handled events.
Don't get it. If there is a race between read and write there still is,
don't see how a second read makes it safer.
>> Also, is the RMW outside any locks safe?
>>
>
> Ideally need a way to atomically do the RMW, i.e. read the register
> value, clear off the handled events bit and update the register with the
> modified value.
>
> Please kindly suggest how to address the above.
> Or can this be left as a TODO, when we do start handling other events also.
From the comment in code above it sounds like a GuC fw interface
shortcoming - that there is a single bit for two different interrupt
sources, is that right? Is there any other register or something that
you can read to detect that the interrupt has been re-asserted while in
the irq handler? Although I thought you said before that the GuC will
not do that - that it won't re-assert the interrupt before we send the
flush command.
>>> +
>>> + /* Handle flush interrupt event in bottom half */
>>> + queue_work(dev_priv->wq, &dev_priv->guc.events_work);
>>
>> IMHO it would be nicer if the code started straight away with a final wq
>> solution.
>>
>> Especially since the next patch in the series is called "Handle log
>> buffer flush interrupt event from GuC" and the actual handling of the
>> log buffer flush interrupt is split between this one
>> (GUC2HOST_MSG_FLUSH_LOG_BUFFER above) and that one.
>>
>> So it would almost be nicer that the above chunk which handles
>> GUC2HOST_MSG_FLUSH_LOG_BUFFER and the worker init is only added in the
>> next patch and this one only does the generic bits.
>>
>
> Fine will move the log buffer flush interrupt event related stuff to the
> next patch and so irq handler in this patch will just be a
> placeholder.
Great thanks!
>> I don't know.. I'll leave it on your conscience - if you think the split
>> (series) can't be done any nicer or it makes sense to have it in this
>> order then ok.
>>
>>> + }
>>
>> Mabye:
>>
>> } else
>>
>> And log something unexpected has happened in the !msg case?
>>
>> Since it won't clear the message in that case so would it keep
>> triggering?
>>
>
> Actually after enabling of GuC interrupt, there can be interrupts from
> GuC side for some other events which are right now not handled by Host.
>
> But not clearing of unhandled event bits won't result in re-triggering
> of the interrupt.
Ok I suggest documenting that as a comment in code then.
Regards,
Tvrtko
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/intel-gfx
next prev parent reply other threads:[~2016-08-12 13:31 UTC|newest]
Thread overview: 68+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-08-12 6:25 [PATCH v5 00/20] Support for sustained capturing of GuC firmware logs akash.goel
2016-08-12 6:25 ` [PATCH 01/20] drm/i915: Decouple GuC log setup from verbosity parameter akash.goel
2016-08-12 6:25 ` [PATCH 02/20] drm/i915: Add GuC ukernel logging related fields to fw interface file akash.goel
2016-08-12 6:25 ` [PATCH 03/20] drm/i915: New structure to contain GuC logging related fields akash.goel
2016-08-12 6:25 ` [PATCH 04/20] drm/i915: Add low level set of routines for programming PM IER/IIR/IMR register set akash.goel
2016-08-12 11:15 ` Tvrtko Ursulin
2016-08-12 6:25 ` [PATCH 05/20] drm/i915: Support for GuC interrupts akash.goel
2016-08-12 11:54 ` Tvrtko Ursulin
2016-08-12 13:10 ` Goel, Akash
2016-08-12 13:31 ` Tvrtko Ursulin [this message]
2016-08-12 14:31 ` Goel, Akash
2016-08-12 15:05 ` Tvrtko Ursulin
2016-08-12 15:32 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 06/20] drm/i915: Handle log buffer flush interrupt event from GuC akash.goel
2016-08-12 6:28 ` Chris Wilson
2016-08-12 6:44 ` Goel, Akash
2016-08-12 6:51 ` Chris Wilson
2016-08-12 7:00 ` Goel, Akash
2016-08-12 13:17 ` Tvrtko Ursulin
2016-08-12 13:45 ` Goel, Akash
2016-08-12 14:07 ` Tvrtko Ursulin
2016-08-12 16:17 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 07/20] relay: Use per CPU constructs for the relay channel buffer pointers akash.goel
2016-08-12 6:25 ` [PATCH 08/20] drm/i915: Add a relay backed debugfs interface for capturing GuC logs akash.goel
2016-08-12 13:53 ` Tvrtko Ursulin
2016-08-12 16:10 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 09/20] drm/i915: New lock to serialize the Host2GuC actions akash.goel
2016-08-12 13:55 ` Tvrtko Ursulin
2016-08-12 15:01 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 10/20] drm/i915: Add stats for GuC log buffer flush interrupts akash.goel
2016-08-12 14:26 ` Tvrtko Ursulin
2016-08-12 14:56 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 11/20] drm/i915: Optimization to reduce the sampling time of GuC log buffer akash.goel
2016-08-12 14:42 ` Tvrtko Ursulin
2016-08-12 14:48 ` Goel, Akash
2016-08-12 15:06 ` Tvrtko Ursulin
2016-08-12 6:25 ` [PATCH 12/20] drm/i915: Increase GuC log buffer size to reduce flush interrupts akash.goel
2016-08-12 6:25 ` [PATCH 13/20] drm/i915: Augment i915 error state to include the dump of GuC log buffer akash.goel
2016-08-12 15:20 ` Tvrtko Ursulin
2016-08-12 15:32 ` Chris Wilson
2016-08-12 15:46 ` Goel, Akash
2016-08-12 15:52 ` Chris Wilson
2016-08-12 16:04 ` Goel, Akash
2016-08-12 16:09 ` Chris Wilson
2016-08-12 6:25 ` [PATCH 14/20] drm/i915: Forcefully flush GuC log buffer on reset akash.goel
2016-08-12 6:33 ` Chris Wilson
2016-08-12 7:02 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 15/20] drm/i915: Debugfs support for GuC logging control akash.goel
2016-08-12 15:57 ` Tvrtko Ursulin
2016-08-12 17:08 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 16/20] drm/i915: Support to create write combined type vmaps akash.goel
2016-08-12 10:49 ` Tvrtko Ursulin
2016-08-12 15:13 ` Goel, Akash
2016-08-12 15:16 ` Chris Wilson
2016-08-12 16:46 ` Goel, Akash
2016-08-12 6:25 ` [PATCH 17/20] drm/i915: Use uncached(WC) mapping for acessing the GuC log buffer akash.goel
2016-08-12 16:05 ` Tvrtko Ursulin
2016-08-12 6:25 ` [PATCH 18/20] drm/i915: Use SSE4.1 movntdqa to accelerate reads from WC memory akash.goel
2016-08-12 10:54 ` Tvrtko Ursulin
2016-08-12 12:22 ` Chris Wilson
2016-08-12 6:25 ` [PATCH 19/20] drm/i915: Use SSE4.1 movntdqa based memcpy for sampling GuC log buffer akash.goel
2016-08-12 16:06 ` Tvrtko Ursulin
2016-08-12 6:25 ` [PATCH 20/20] drm/i915: Early creation of relay channel for capturing boot time logs akash.goel
2016-08-12 16:22 ` Tvrtko Ursulin
2016-08-12 16:31 ` Goel, Akash
2016-08-15 9:20 ` Tvrtko Ursulin
2016-08-15 10:20 ` Goel, Akash
2016-08-12 6:58 ` ✗ Ro.CI.BAT: warning for Support for sustained capturing of GuC firmware logs (rev6) Patchwork
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=57ADCFB8.9090603@linux.intel.com \
--to=tvrtko.ursulin@linux.intel.com \
--cc=akash.goel@intel.com \
--cc=intel-gfx@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.