From: Mika Kuoppala <mika.kuoppala@linux.intel.com>
To: Chris Wilson <chris@chris-wilson.co.uk>, intel-gfx@lists.freedesktop.org
Cc: Chris Wilson <chris@chris-wilson.co.uk>
Subject: Re: [Intel-gfx] [PATCH 2/2] drm/i915: Make GEM errors non-fatal by default
Date: Tue, 19 Jan 2021 13:33:43 +0200
Message-ID: <87bldlgnzc.fsf@gaia.fi.intel.com>
In-Reply-To: <20210114113434.8229-2-chris@chris-wilson.co.uk>

Chris Wilson <chris@chris-wilson.co.uk> writes:

> While immensely convenient for development to only tackle the first
> error, and not be flooded by repeated or secondary issues, many more
> casual testers are not set up to remotely capture debug traces. For those
> testers, it is more beneficial to keep the system running in the remote
> chance that they are able to extract the original debug logs.
>
> Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk>
> Cc: Mika Kuoppala <mika.kuoppala@linux.intel.com>
> Cc: Tvrtko Ursulin <tvrtko.ursulin@intel.com>
> ---
> drivers/gpu/drm/i915/Kconfig.debug | 16 ++++++++++++++++
> drivers/gpu/drm/i915/i915_gem.h | 9 ++++++++-
> 2 files changed, 24 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/gpu/drm/i915/Kconfig.debug b/drivers/gpu/drm/i915/Kconfig.debug
> index 3701bae5b855..4005f6619bec 100644
> --- a/drivers/gpu/drm/i915/Kconfig.debug
> +++ b/drivers/gpu/drm/i915/Kconfig.debug
> @@ -41,6 +41,7 @@ config DRM_I915_DEBUG
> select SW_SYNC # signaling validation framework (igt/syncobj*)
> select DRM_I915_WERROR
> select DRM_I915_DEBUG_GEM
> + select DRM_I915_DEBUG_GEM_ONCE
> select DRM_I915_DEBUG_MMIO
> select DRM_I915_DEBUG_RUNTIME_PM
> select DRM_I915_SW_FENCE_DEBUG_OBJECTS
> @@ -80,6 +81,21 @@ config DRM_I915_DEBUG_GEM
>
> If in doubt, say "N".
>
> +config DRM_I915_DEBUG_GEM_ONCE
> + bool "Make a GEM debug failure fatal"
> + default n
> + depends on DRM_I915_DEBUG_GEM
> + help
> + During development, we often only want the very first failure
> + as that would otherwise be lost in the deluge of subsequent
> + failures. However, more casual testers may not want to trigger
> + a hard BUG_ON and hope that the system remains sufficiently usable
> + to capture a bug report in situ.

Yes. It also sometimes lets us check the state the hardware ended up in,
e.g. unusual conditions that are not captured by hang/hangcheck.

Reviewed-by: Mika Kuoppala <mika.kuoppala@linux.intel.com>

> +
> + Recommended for driver developers only.
> +
> + If in doubt, say "N".
> +
> config DRM_I915_ERRLOG_GEM
> bool "Insert extra logging (very verbose) for common GEM errors"
> default n
> diff --git a/drivers/gpu/drm/i915/i915_gem.h b/drivers/gpu/drm/i915/i915_gem.h
> index a4cad3f154ca..e622aee6e4be 100644
> --- a/drivers/gpu/drm/i915/i915_gem.h
> +++ b/drivers/gpu/drm/i915/i915_gem.h
> @@ -38,11 +38,18 @@ struct drm_i915_private;
>
> #define GEM_SHOW_DEBUG() drm_debug_enabled(DRM_UT_DRIVER)
>
> +#ifdef CONFIG_DRM_I915_DEBUG_GEM_ONCE
> +#define __GEM_BUG(cond) BUG()
> +#else
> +#define __GEM_BUG(cond) \
> + WARN(1, "%s:%d GEM_BUG_ON(%s)\n", __func__, __LINE__, __stringify(cond))
> +#endif
> +
> #define GEM_BUG_ON(condition) do { if (unlikely((condition))) { \
> GEM_TRACE_ERR("%s:%d GEM_BUG_ON(%s)\n", \
> __func__, __LINE__, __stringify(condition)); \
> GEM_TRACE_DUMP(); \
> - BUG(); \
> + __GEM_BUG(condition); \
> } \
> } while(0)
> #define GEM_WARN_ON(expr) WARN_ON(expr)
> --
> 2.20.1
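
For readers less familiar with the pattern, here is a minimal userspace
sketch of the compile-time switch the patch introduces: the assertion
macro stays the same and only the failure action changes. All names here
(DEMO_DEBUG_ONCE, DEMO_BUG, DEMO_BUG_ON, demo_warn_count, demo_run) are
hypothetical stand-ins and not the i915 definitions; abort() and
fprintf() are assumed as rough userspace analogues of BUG() and WARN().

```c
#include <stdio.h>
#include <stdlib.h>

/* Counts non-fatal reports so the behaviour can be observed. */
static int demo_warn_count;

#ifdef DEMO_DEBUG_ONCE
/* Developer mode: stop at the very first failure. */
#define DEMO_BUG(cond) abort()
#else
/* Default mode: report the failure and keep running. */
#define DEMO_BUG(cond) \
	(demo_warn_count++, \
	 fprintf(stderr, "%s:%d DEMO_BUG_ON(%s)\n", \
		 __func__, __LINE__, #cond))
#endif

/* The check itself is identical in both modes. */
#define DEMO_BUG_ON(condition) do { \
	if (condition) \
		DEMO_BUG(condition); \
} while (0)

/* Trips one check out of two and returns the warning count. */
static int demo_run(void)
{
	int ring_space = -1;

	DEMO_BUG_ON(ring_space < 0);    /* fires */
	DEMO_BUG_ON(ring_space > 4096); /* does not fire */

	return demo_warn_count;
}
```

Built without -DDEMO_DEBUG_ONCE, a call to demo_run() logs the failed
check to stderr and returns normally; built with it, the first failed
check aborts the process.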