public inbox for intel-gfx@lists.freedesktop.org
 help / color / mirror / Atom feed
From: "O'Rourke, Tom" <Tom.O'Rourke@intel.com>
To: Daniel Vetter <daniel@ffwll.ch>
Cc: Michael Auchter <a@phire.org>,
	Daniel Vetter <daniel.vetter@ffwll.ch>,
	intel-gfx@lists.freedesktop.org
Subject: Re: i915: WARN_ON(val > dev_priv->rps.max_freq_softlimit)
Date: Wed, 11 Feb 2015 08:57:29 -0800	[thread overview]
Message-ID: <20150211165729.GA89231@torourke-desk1> (raw)
In-Reply-To: <20150211073044.GU24485@phenom.ffwll.local>

On Wed, Feb 11, 2015 at 08:30:44AM +0100, Daniel Vetter wrote:
> On Tue, Feb 10, 2015 at 10:26:02PM -0800, O'Rourke, Tom wrote:
> > On Thu, Jan 29, 2015 at 08:56:06PM -0500, Michael Auchter wrote:
> > > On Thu, Jan 29, 2015 at 06:12:31PM +0100, Daniel Vetter wrote:
> > > > On Wed, Jan 28, 2015 at 10:36:02PM -0800, O'Rourke, Tom wrote:
> > > > > On Wed, Jan 28, 2015 at 01:28:58PM +0200, Ville Syrjälä wrote:
> > > > > > On Wed, Jan 28, 2015 at 09:58:15AM +0000, Chris Wilson wrote:
> > > > > > > On Wed, Jan 28, 2015 at 12:43:21AM -0500, Michael Auchter wrote:
> > > > > > > > Testing out 3.19-rc6 on my 2014 Thinkpad X1 Carbon (Haswell) resulted in
> > > > > > > > this WARN at boot (and pretty frequently afterwards):
> > > > > > > > 
> > > > > > > > WARNING: CPU: 0 PID: 989 at drivers/gpu/drm/i915/intel_pm.c:4377 gen6_set_rps+0x371/0x3c0()
> > > > > > > > WARN_ON(val > dev_priv->rps.max_freq_softlimit)
> > > > > > > 
> > > > > > > [snip]
> > > > > > >  
> > > > > > > > I'm not at all familiar with this hardware, but I took a quick look into
> > > > > > > > what changed with this commit for my laptop. Before the commit,
> > > > > > > > rps.min_freq_softlimit is 4 (from rps.min_freq) and
> > > > > > > > rps.max_freq_softlimit is 22.
> > > > > > > > 
> > > > > > > > After the commit, rps.min_freq_softlimit is set to the
> > > > > > > > rps.efficient_freq value read from pcode, which is 34 on my laptop.
> > > > > > > > So later when gen6_set_rps() is called with rps.min_freq_softlimit that
> > > > > > > > warning is hit.
> > > > > > > > 
> > > > > > > > Any thoughts? It certainly seems fishy that this commit causes
> > > > > > > > rps.min_freq_softlimit to be greater than rps.max_freq_softlimit.
> > > > > > > 
> > > > > > > Very fishy indeed. Moral of this story, never trust hw.
> > > > > > > 
> > > > > > > diff --git a/drivers/gpu/drm/i915/intel_pm.c b/drivers/gpu/drm/i915/intel_pm.c
> > > > > > > index 3e630feb18e4..bbedd2901c54 100644
> > > > > > > --- a/drivers/gpu/drm/i915/intel_pm.c
> > > > > > > +++ b/drivers/gpu/drm/i915/intel_pm.c
> > > > > > > @@ -4007,7 +4007,10 @@ static void gen6_init_rps_frequencies(struct drm_device *dev)
> > > > > > >                                         &ddcc_status);
> > > > > > >                 if (0 == ret)
> > > > > > >                         dev_priv->rps.efficient_freq =
> > > > > > > -                               (ddcc_status >> 8) & 0xff;
> > > > > > > +                               clamp_t(u8,
> > > > > > > +                                       (ddcc_status >> 8) & 0xff,
> > > > > > > +                                       dev_priv->rps.min_freq,
> > > > > > > +                                       dev_priv->rps.max_freq);
> > > > > > 
> > > > > > Maybe better to fall back to rp1_freq if this is bogus?
> > > > > >
> > > > > [TOR:] Michael, Thank you for bringing this problem to our attention.
> > > > > 
> > > > > Yes, this function needs some range checking to maintain
> > > > > RPn <= RPe <= RP0.
> > > > > 
> > > > > A value of 34 seems too high for RPe.  
> > > > > What values does the Carbon X1 (Haswell) have for RPn and RP0?
> > > > 
> > > > 4 & 22, already in Micheal's original bug report.
> > > > 
> > > > Tom, can you pls polish the clamping into a proper patch with m-l
> > > > references?
> > > > 
> > > > Micheal, can you please test the first hunk from Chris (the one that adds
> > > > the clamp) to make sure it does indeed address the WARN_ON you're seeing?
> > > 
> > > The clamp suggested by Chris does indeed fix the WARN_ON.
> > > 
> > > In the case where RPe is greater than RP0, RPe will now be clamped to
> > > RP0. Is this likely to result in increased power consumption?
> > > 
> > > At a quick glance on my laptop it does not (idling around 5W before and
> > > after) but Ville had suggested earlier to fall back to RP1, which would
> > > be more consistent with previous kernels.
> > > 
> > > Thanks again for the quick responses,
> > >  Michael
> > 
> > [TOR:] Michael,  I discussed this report with a pcode architect here.
> > 
> > The RPe value is clamped to the [RPn, RP0] range by pcode before
> > returning the value to the driver on Broadwell but not on Haswell.
> > 
> > On Haswell, an efficient frequency value above RP0 is not a garbage
> > value and could result from a relatively flat efficiency curve.  In
> > this situation, leakage power would dominate the efficiency curve
> > such that running at lower frequencies may not save power overall.
> > Higher leakage power may result from a higher package temperature.
> > 
> > Running at RP0 may actually save power compared to RP1 by allowing
> > more time in RC6.  
> 
> Hm, I'd just go with clamping since that's what bdw does too. So Chris
> diff essentially.
> -Daniel

[TOR:] Yes, we should go with clamping, essentially the diff from Chris.
I am trying to provide an explanation of why that is the right thing to do.

Thanks,
Tom
_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

      reply	other threads:[~2015-02-11 16:57 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-01-28  5:43 i915: WARN_ON(val > dev_priv->rps.max_freq_softlimit) Michael Auchter
2015-01-28  9:44 ` Daniel Vetter
2015-01-28  9:58 ` Chris Wilson
2015-01-28 11:28   ` Ville Syrjälä
2015-01-29  6:36     ` O'Rourke, Tom
2015-01-29 14:53       ` Michael Auchter
2015-01-29 17:12       ` Daniel Vetter
2015-01-30  1:56         ` Michael Auchter
2015-02-03  2:01           ` O'Rourke, Tom
2015-02-11  6:26           ` O'Rourke, Tom
2015-02-11  7:30             ` Daniel Vetter
2015-02-11 16:57               ` O'Rourke, Tom [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20150211165729.GA89231@torourke-desk1 \
    --to=tom.o'rourke@intel.com \
    --cc=a@phire.org \
    --cc=daniel.vetter@ffwll.ch \
    --cc=daniel@ffwll.ch \
    --cc=intel-gfx@lists.freedesktop.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox