Re: [Intel-xe] [PATCH 0/1] Remove uses of BUG_ON

Intel-XE Archive on lore.kernel.org
 help / color / mirror / Atom feed

From: Rodrigo Vivi <rodrigo.vivi@intel.com>
To: Matthew Brost <matthew.brost@intel.com>
Cc: Francois Dugast <francois.dugast@intel.com>,
	intel-xe@lists.freedesktop.org
Subject: Re: [Intel-xe] [PATCH 0/1] Remove uses of BUG_ON
Date: Mon, 24 Jul 2023 14:24:34 -0400	[thread overview]
Message-ID: <ZL7B4iij9xU5hXz1@intel.com> (raw)
In-Reply-To: <ZLqpUr90JRk4fQIR@DUT025-TGLU.fm.intel.com>

On Fri, Jul 21, 2023 at 03:50:42PM +0000, Matthew Brost wrote:
> On Fri, Jul 21, 2023 at 01:20:29PM +0000, Francois Dugast wrote:
> > This is a first pass on removing BUG_ON. It is replaced with a call to
> > drm_err() and a return. Feedback on this is welcome before removing
> > remaining uses of BUG_ON, which will require more manual and specific
> > work.
> > 
> 
> Personally I don't like this at all, agree the BUG_ON should be removed
> but IMO we just replace these with a XE_WARN_ON that gets compiled out
> of non-debug builds.

I disagree here. The BUG_ON cases are in general cases that were marked
as possibly catastrophic, hence the execution should be stopped. But
instead, we need to avoid the code after and exit gracefully with some
error message.

With this in mind I like the initial proposal of this patch here.
I just believe we need to go and change message by message.

>
I like WARN/BUG as they act more like a self
> documenting assert of the codes usage plus if they get trigger we get

The if conditions here are pretty clear in the code as well.
no reason to be some macro in capital letters.

> the call stack which usuaully help diagnosis the issue and disappear in
> non-debug builds.

well, the bug cases should be the catastrophic cases, so in general
I don't see why we should avoid in production mode the cases where
we simply print error and return gracefully and avoiding the
catastrophic scenarios,.

For warns, we should use that in cases where the information of the
stack of the callers is a helpful information for debugging or
triage. And this is usually helpful on production cases as well
to receive bug reports with helpful information.

Also, in general we shouldn't need to disable any kind of message
in production because of the bug report case mentioned above. What
we need to do instead is to carefully mark the debug level for the
drm debug to take care of the messages. So a bug reporter could
enable the right level of debug and collect all the useful info
for us.

That said, I need to say that I hate that we have XE_ variants of
BUG_ON() and WARN_ON(). We need to get rid of this things and use
only the drm and kernel standards.

> 
> Let's see if the team agrees.
> 
> Matt
> 
> > For this pass, most of the changes were automated with coccinelle using:
> > 
> > 	@notpossible@
> > 	@@
> > 
> > 	- XE_BUG_ON("NOT POSSIBLE");
> > 	+ drm_err(&vm->xe->drm, "NOT POSSIBLE");
> > 	+ return -EINVAL;
> > 
> > 	@e@
> > 	identifier macro =~ "^XE_BUG_ON$";
> > 	expression cond;
> > 	@@
> > 	macro(cond)
> > 
> > 	@script : python q@
> > 	cond << e.cond;
> > 	cond_expr;
> > 	@@
> > 	coccinelle.cond_expr = cocci.make_expr("\""+cond.replace(" ", "")+"\"");
> > 
> > 	@replace_in_func_return_struct@
> > 	identifier e.macro;
> > 	expression e.cond;
> > 	expression q.cond_expr;
> > 	identifier func;
> > 	identifier a;
> > 	@@
> > 
> > 	struct a *func(...) {
> > 	...
> > 	- macro(cond);
> > 	+ if (cond) {
> > 	+ drm_err(&xe->drm, cond_expr);
> > 	+ return NULL;
> > 	+ }
> > 	...
> > 	}
> > 
> > 	@replace_in_func_return_void@
> > 	identifier e.macro;
> > 	expression e.cond;
> > 	expression q.cond_expr;
> > 	identifier func;
> > 	@@
> > 
> > 	void func(...) {
> > 	...
> > 	- macro(cond);
> > 	+ if (cond) {
> > 	+ drm_err(&xe->drm, cond_expr);
> > 	+ return;
> > 	+ }
> > 	...
> > 	}
> > 
> > 	@replace_in_func_return_other@
> > 	identifier e.macro;
> > 	expression e.cond;
> > 	expression q.cond_expr;
> > 	@@
> > 
> > 	- macro(cond);
> > 	+ if (cond) {
> > 	+ drm_err(&xe->drm, cond_expr);
> > 	+ return -EINVAL;
> > 	+ }
> > 
> > Francois Dugast (1):
> >   drm/xe: Remove uses of BUG_ON
> > 
> >  drivers/gpu/drm/xe/xe_bo.c                  | 105 ++++++++++++----
> >  drivers/gpu/drm/xe/xe_bo_evict.c            |  10 +-
> >  drivers/gpu/drm/xe/xe_execlist.c            |  22 +++-
> >  drivers/gpu/drm/xe/xe_force_wake.c          |  10 +-
> >  drivers/gpu/drm/xe/xe_gt_clock.c            |   5 +-
> >  drivers/gpu/drm/xe/xe_gt_debugfs.c          |   5 +-
> >  drivers/gpu/drm/xe/xe_gt_tlb_invalidation.c |  31 ++++-
> >  drivers/gpu/drm/xe/xe_guc.c                 |  32 +++--
> >  drivers/gpu/drm/xe/xe_guc_ads.c             |  35 ++++--
> >  drivers/gpu/drm/xe/xe_guc_hwconfig.c        |   5 +-
> >  drivers/gpu/drm/xe/xe_guc_submit.c          |  95 +++++++++++----
> >  drivers/gpu/drm/xe/xe_huc.c                 |   5 +-
> >  drivers/gpu/drm/xe/xe_migrate.c             |  61 ++++++++--
> >  drivers/gpu/drm/xe/xe_sched_job.c           |   9 +-
> >  drivers/gpu/drm/xe/xe_ttm_stolen_mgr.c      |  10 +-
> >  drivers/gpu/drm/xe/xe_vm.c                  | 125 +++++++++++++++-----
> >  drivers/gpu/drm/xe/xe_wopcm.c               |  45 +++++--
> >  17 files changed, 482 insertions(+), 128 deletions(-)
> > 
> > -- 
> > 2.34.1
> >

     prev parent reply	other threads:[~2023-07-24 18:24 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-07-21 13:20 [Intel-xe] [PATCH 0/1] Remove uses of BUG_ON Francois Dugast
2023-07-21 13:20 ` [Intel-xe] [PATCH 1/1] drm/xe: " Francois Dugast
2023-07-24 18:26   ` Rodrigo Vivi
2023-07-21 13:34 ` [Intel-xe] ✓ CI.Patch_applied: success for " Patchwork
2023-07-21 13:34 ` [Intel-xe] ✗ CI.checkpatch: warning " Patchwork
2023-07-21 13:35 ` [Intel-xe] ✓ CI.KUnit: success " Patchwork
2023-07-21 13:39 ` [Intel-xe] ✓ CI.Build: " Patchwork
2023-07-21 13:39 ` [Intel-xe] ✓ CI.Hooks: " Patchwork
2023-07-21 13:40 ` [Intel-xe] ✓ CI.checksparse: " Patchwork
2023-07-21 14:08 ` [Intel-xe] ○ CI.BAT: info " Patchwork
2023-07-21 15:50 ` [Intel-xe] [PATCH 0/1] " Matthew Brost
2023-07-24 18:24   ` Rodrigo Vivi [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ZL7B4iij9xU5hXz1@intel.com \
    --to=rodrigo.vivi@intel.com \
    --cc=francois.dugast@intel.com \
    --cc=intel-xe@lists.freedesktop.org \
    --cc=matthew.brost@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox