From: "Ville Syrjälä" <ville.syrjala@linux.intel.com>
To: Ben Widawsky <ben@bwidawsk.net>
Cc: Intel GFX <intel-gfx@lists.freedesktop.org>,
Ben Widawsky <benjamin.widawsky@intel.com>
Subject: Re: [PATCH 06/13] drm/i915/bdw: implement semaphore signal
Date: Wed, 12 Feb 2014 11:29:20 +0200 [thread overview]
Message-ID: <20140212092920.GA3891@intel.com> (raw)
In-Reply-To: <20140211230131.GA6449@bwidawsk.net>
On Tue, Feb 11, 2014 at 03:01:31PM -0800, Ben Widawsky wrote:
> On Tue, Feb 11, 2014 at 02:22:37PM -0800, Ben Widawsky wrote:
> > On Tue, Feb 11, 2014 at 02:11:04PM -0800, Ben Widawsky wrote:
> > > On Thu, Jan 30, 2014 at 02:38:17PM +0200, Ville Syrjälä wrote:
> > > > On Wed, Jan 29, 2014 at 11:55:26AM -0800, Ben Widawsky wrote:
> > > > > Semaphore signalling works similarly to previous GENs with the exception
> > > > > that the per ring mailboxes no longer exist. Instead you must define
> > > > > your own space, somewhere in the GTT.
> > > > >
> > > > > The comments in the code define the layout I've opted for, which should
> > > > > be fairly future proof. Ie. I tried to define offsets in abstract terms
> > > > > (NUM_RINGS, seqno size, etc).
> > > > >
> > > > > NOTE: If one wanted to move this to the HWSP they could. I've decided
> > > > > one 4k object would be easier to deal with, and provide potential wins
> > > > > with cache locality, but that's all speculative.
> > > > >
> > > > > v2: Update the macro to not need the other ring's ring->id (Chris)
> > > > > Update the comment to use the correct formula (Chris)
> > > > >
> > > > > Signed-off-by: Ben Widawsky <ben@bwidawsk.net>
> > > > > ---
> > > > > drivers/gpu/drm/i915/i915_drv.h | 1 +
> > > > > drivers/gpu/drm/i915/i915_reg.h | 5 +-
> > > > > drivers/gpu/drm/i915/intel_ringbuffer.c | 199 +++++++++++++++++++++++++-------
> > > > > drivers/gpu/drm/i915/intel_ringbuffer.h | 38 +++++-
> > > > > 4 files changed, 197 insertions(+), 46 deletions(-)
> > > > >
> > > > > diff --git a/drivers/gpu/drm/i915/i915_drv.h b/drivers/gpu/drm/i915/i915_drv.h
> > > > > index 3673ba1..f521059 100644
> > > > > --- a/drivers/gpu/drm/i915/i915_drv.h
> > > > > +++ b/drivers/gpu/drm/i915/i915_drv.h
> > > > > @@ -1380,6 +1380,7 @@ typedef struct drm_i915_private {
> > > > >
> > > > > struct pci_dev *bridge_dev;
> > > > > struct intel_ring_buffer ring[I915_NUM_RINGS];
> > > > > + struct drm_i915_gem_object *semaphore_obj;
> > > > > uint32_t last_seqno, next_seqno;
> > > > >
> > > > > drm_dma_handle_t *status_page_dmah;
> > > > > diff --git a/drivers/gpu/drm/i915/i915_reg.h b/drivers/gpu/drm/i915/i915_reg.h
> > > > > index cbbaf26..8b745dc 100644
> > > > > --- a/drivers/gpu/drm/i915/i915_reg.h
> > > > > +++ b/drivers/gpu/drm/i915/i915_reg.h
> > > > > @@ -216,7 +216,7 @@
> > > > > #define MI_DISPLAY_FLIP_IVB_SPRITE_B (3 << 19)
> > > > > #define MI_DISPLAY_FLIP_IVB_PLANE_C (4 << 19)
> > > > > #define MI_DISPLAY_FLIP_IVB_SPRITE_C (5 << 19)
> > > > > -#define MI_SEMAPHORE_MBOX MI_INSTR(0x16, 1) /* gen6+ */
> > > > > +#define MI_SEMAPHORE_MBOX MI_INSTR(0x16, 1) /* gen6, gen7 */
> > > > > #define MI_SEMAPHORE_GLOBAL_GTT (1<<22)
> > > > > #define MI_SEMAPHORE_UPDATE (1<<21)
> > > > > #define MI_SEMAPHORE_COMPARE (1<<20)
> > > > > @@ -241,6 +241,8 @@
> > > > > #define MI_RESTORE_EXT_STATE_EN (1<<2)
> > > > > #define MI_FORCE_RESTORE (1<<1)
> > > > > #define MI_RESTORE_INHIBIT (1<<0)
> > > > > +#define MI_SEMAPHORE_SIGNAL MI_INSTR(0x1b, 0) /* GEN8+ */
> > > > > +#define MI_SEMAPHORE_TARGET(engine) ((engine)<<15)
> > > > > #define MI_STORE_DWORD_IMM MI_INSTR(0x20, 1)
> > > > > #define MI_MEM_VIRTUAL (1 << 22) /* 965+ only */
> > > > > #define MI_STORE_DWORD_INDEX MI_INSTR(0x21, 1)
> > > > > @@ -329,6 +331,7 @@
> > > > > #define PIPE_CONTROL_TEXTURE_CACHE_INVALIDATE (1<<10) /* GM45+ only */
> > > > > #define PIPE_CONTROL_INDIRECT_STATE_DISABLE (1<<9)
> > > > > #define PIPE_CONTROL_NOTIFY (1<<8)
> > > > > +#define PIPE_CONTROL_FLUSH_ENABLE (1<<7) /* gen7+ */
> > > > > #define PIPE_CONTROL_VF_CACHE_INVALIDATE (1<<4)
> > > > > #define PIPE_CONTROL_CONST_CACHE_INVALIDATE (1<<3)
> > > > > #define PIPE_CONTROL_STATE_CACHE_INVALIDATE (1<<2)
> > > > > diff --git a/drivers/gpu/drm/i915/intel_ringbuffer.c b/drivers/gpu/drm/i915/intel_ringbuffer.c
> > > > > index 37ae2b1..b750835 100644
> > > > > --- a/drivers/gpu/drm/i915/intel_ringbuffer.c
> > > > > +++ b/drivers/gpu/drm/i915/intel_ringbuffer.c
> > > > > @@ -619,6 +619,13 @@ static int init_render_ring(struct intel_ring_buffer *ring)
> > > > > static void render_ring_cleanup(struct intel_ring_buffer *ring)
> > > > > {
> > > > > struct drm_device *dev = ring->dev;
> > > > > + struct drm_i915_private *dev_priv = dev->dev_private;
> > > > > +
> > > > > + if (dev_priv->semaphore_obj) {
> > > > > + i915_gem_object_ggtt_unpin(dev_priv->semaphore_obj);
> > > > > + drm_gem_object_unreference(&dev_priv->semaphore_obj->base);
> > > > > + dev_priv->semaphore_obj = NULL;
> > > > > + }
> > > > >
> > > > > if (ring->scratch.obj == NULL)
> > > > > return;
> > > > > @@ -632,6 +639,86 @@ static void render_ring_cleanup(struct intel_ring_buffer *ring)
> > > > > ring->scratch.obj = NULL;
> > > > > }
> > > > >
> > > > > +static int gen8_rcs_signal(struct intel_ring_buffer *signaller,
> > > > > + unsigned int num_dwords)
> > > > > +{
> > > > > +#define MBOX_UPDATE_DWORDS 8
> > > > > + struct drm_device *dev = signaller->dev;
> > > > > + struct drm_i915_private *dev_priv = dev->dev_private;
> > > > > + struct intel_ring_buffer *waiter;
> > > > > + int i, ret, num_rings;
> > > > > +
> > > > > + num_rings = hweight_long(INTEL_INFO(dev)->ring_mask);
> > > > > + num_dwords = (num_rings-1) * MBOX_UPDATE_DWORDS;
> > > >
> > > > Again num_dwords +=
> > > >
> > > > > +#undef MBOX_UPDATE_DWORDS
> > > > > +
> > > > > + /* XXX: + 4 for the caller */
> > > > > + ret = intel_ring_begin(signaller, num_dwords + 4);
> > > >
> > > > and the +4 goes away.
> > > >
> > > > > + if (ret)
> > > > > + return ret;
> > > > > +
> > > > > + for_each_ring(waiter, dev_priv, i) {
> > > > > + u64 gtt_offset = signaller->semaphore.signal_ggtt[i];
> > > > > + if (gtt_offset == MI_SEMAPHORE_SYNC_INVALID)
> > > > > + continue;
> > > > > +
> > > > > + intel_ring_emit(signaller, GFX_OP_PIPE_CONTROL(6));
> > > > > + intel_ring_emit(signaller, PIPE_CONTROL_GLOBAL_GTT_IVB |
> > > > > + PIPE_CONTROL_QW_WRITE |
> > > > > + PIPE_CONTROL_FLUSH_ENABLE);
> > > > > + intel_ring_emit(signaller, lower_32_bits(gtt_offset));
> > > > > + intel_ring_emit(signaller, upper_32_bits(gtt_offset));
> > > > > + intel_ring_emit(signaller, signaller->outstanding_lazy_seqno);
> > > > > + intel_ring_emit(signaller, 0);
> > > > > + intel_ring_emit(signaller, MI_SEMAPHORE_SIGNAL |
> > > > > + MI_SEMAPHORE_TARGET(waiter->id));
> > > > > + intel_ring_emit(signaller, 0);
> > > > > + }
> > > > > +
> > > > > + WARN_ON(i != num_rings);
> > > > > +
> > > > > + return 0;
> > > > > +}
> > > >
> > > > <snip>
> > >
> > > Got those, thanks.
> > >
> > > >
> > > > > diff --git a/drivers/gpu/drm/i915/intel_ringbuffer.h b/drivers/gpu/drm/i915/intel_ringbuffer.h
> > > > > index c69ae10..f1e7a66 100644
> > > > > --- a/drivers/gpu/drm/i915/intel_ringbuffer.h
> > > > > +++ b/drivers/gpu/drm/i915/intel_ringbuffer.h
> > > > > @@ -111,6 +111,39 @@ struct intel_ring_buffer {
> > > > > #define I915_DISPATCH_PINNED 0x2
> > > > > void (*cleanup)(struct intel_ring_buffer *ring);
> > > > >
> > > > > + /* GEN8 signal/wait table
> > > > > + * signal to signal to signal to signal to
> > > > > + * RCS VCS BCS VECS
> > > > > + * ------------------------------------------------------
> > > > > + * RCS | NOP (0x00) | BCS (0x08) | VCS (0x10) | VECS (0x18) |
> > > > > + * |-----------------------------------------------------
> > > > > + * VCS | RCS (0x20) | NOP (0x28) | BCS (0x30) | VECS (0x38) |
> > > > > + * |-----------------------------------------------------
> > > > > + * BCS | RCS (0x40) | VCS (0x48) | NOP (0x50) | VECS (0x58) |
> > > > > + * |-----------------------------------------------------
> > > > > + * VECS | RCS (0x60) | VCS (0x68) | BCS (0x70) | NOP (0x78) |
> > > > > + * |-----------------------------------------------------
> > > > > + *
> > > > > + * Generalization:
> > > > > + * f(x, y) := (x->id * NUM_RINGS * seqno_size) + (seqno_size * y->id)
> > > > > + * ie. transpose of g(x, y)
> > > > > + *
> > > > > + * sync from sync from sync from sync from
> > > > > + * RCS VCS BCS VECS
> > > > > + * ------------------------------------------------------
> > > > > + * RCS | NOP (0x00) | BCS (0x20) | VCS (0x40) | VECS (0x60) |
> > > > > + * |-----------------------------------------------------
> > > > > + * VCS | RCS (0x08) | NOP (0x28) | BCS (0x48) | VECS (0x68) |
> > > > > + * |-----------------------------------------------------
> > > > > + * BCS | RCS (0x10) | VCS (0x30) | NOP (0x50) | VECS (0x60) |
> > > > > + * |-----------------------------------------------------
> > > > > + * VECS | RCS (0x18) | VCS (0x38) | BCS (0x58) | NOP (0x78) |
> > > > > + * |-----------------------------------------------------
> > > > > + *
> > > > > + * Generalization:
> > > > > + * g(x, y) := (y->id * NUM_RINGS * seqno_size) + (seqno_size * x->id)
> > > > > + * ie. transpose of f(x, y)
> > > > > + */
> > > > > struct {
> > > > > u32 sync_seqno[I915_NUM_RINGS-1];
> > > > > /* AKA wait() */
> > > > > @@ -120,7 +153,10 @@ struct intel_ring_buffer {
> > > > > /* our mbox written by others */
> > > > > u32 mbox[I915_NUM_RINGS];
> > > >
> > > > mbox should also get a u64 friend, right?
> > >
> > > mbox should be gen6 only, given the change to using the gtt on gen8. In
> > > this point in the series, semaphores should be forcibly disabled on
> > > gen8, so the code looks wrong, but the path cannot [should not] be
> > > taken.
> > >
> > > I suppose I should kill the initialization of mbox for gen8, or somehow
> > > consolidate with a union to prevent confusion.
> > >
> >
> > Just to clarify it should be
> >
> > gen6:
> > signal uses signal_mbox for signal
> > wait uses mbox
> >
> > gen8:
> > signal uses signal_ggtt for signal
> > wait uses arithmetic to figure out the offset
> >
>
> Ok, I've fixed this up to make things clearer, but it ends up in the
> next patch. So look there when I repost.
I was confused by these:
+ ring->semaphore.mbox[RCS] = GEN8_WAIT_OFFSET(ring, RCS);
So you did store the wait offset into mbox, but then you didn't use the
precomputed values and instead recomputed on the spot in gen8
ring_sync().
>
> > > >
> > > > > /* mboxes this ring signals to */
> > > > > - u32 signal_mbox[I915_NUM_RINGS];
> > > > > + union {
> > > > > + u32 signal_mbox[I915_NUM_RINGS];
> > > > > + u64 signal_ggtt[I915_NUM_RINGS];
> > > > > + };
> > > > >
> > > > > /* num_dwords is space the caller will need for atomic update */
> > > > > int (*signal)(struct intel_ring_buffer *signaller,
> > > > > --
> > > > > 1.8.5.3
> > > > >
> > > > > _______________________________________________
> > > > > Intel-gfx mailing list
> > > > > Intel-gfx@lists.freedesktop.org
> > > > > http://lists.freedesktop.org/mailman/listinfo/intel-gfx
> > > >
> > > > --
> > > > Ville Syrjälä
> > > > Intel OTC
> > >
> > > --
> > > Ben Widawsky, Intel Open Source Technology Center
> >
> > --
> > Ben Widawsky, Intel Open Source Technology Center
>
> --
> Ben Widawsky, Intel Open Source Technology Center
--
Ville Syrjälä
Intel OTC
next prev parent reply other threads:[~2014-02-12 9:29 UTC|newest]
Thread overview: 43+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-01-29 19:55 [PATCH 00/13] [REPOST] Broadwell HW semaphores Ben Widawsky
2014-01-29 19:55 ` [PATCH 01/13] drm/i915: Move semaphore specific ring members to struct Ben Widawsky
2014-01-29 19:55 ` [PATCH 02/13] drm/i915: Virtualize the ringbuffer signal func Ben Widawsky
2014-01-29 19:55 ` [PATCH 03/13] drm/i915: Move ring_begin to signal() Ben Widawsky
2014-01-29 19:55 ` [PATCH 04/13] drm/i915: Make semaphore updates more precise Ben Widawsky
2014-01-30 11:25 ` Ville Syrjälä
2014-02-11 16:08 ` Ben Widawsky
2014-02-11 17:13 ` Ville Syrjälä
2014-02-11 20:20 ` [PATCH] [v2] " Ben Widawsky
2014-02-11 20:53 ` Ville Syrjälä
2014-02-11 21:50 ` Ben Widawsky
2014-01-29 19:55 ` [PATCH 05/13] drm/i915: gen specific ring init Ben Widawsky
2014-01-29 19:55 ` [PATCH 06/13] drm/i915/bdw: implement semaphore signal Ben Widawsky
2014-01-30 12:38 ` Ville Syrjälä
2014-01-30 12:46 ` Chris Wilson
2014-01-30 13:18 ` Daniel Vetter
2014-01-30 13:25 ` Chris Wilson
2014-01-30 13:35 ` Chris Wilson
2014-02-11 21:48 ` Ben Widawsky
2014-02-11 22:23 ` Chris Wilson
2014-02-11 22:25 ` Ben Widawsky
2014-02-11 22:28 ` Chris Wilson
2014-02-11 22:11 ` Ben Widawsky
2014-02-11 22:22 ` Ben Widawsky
2014-02-11 23:01 ` Ben Widawsky
2014-02-12 9:29 ` Ville Syrjälä [this message]
2014-01-29 19:55 ` [PATCH 07/13] drm/i915/bdw: implement semaphore wait Ben Widawsky
2014-01-30 12:48 ` Ville Syrjälä
2014-01-29 19:55 ` [PATCH 08/13] drm/i915: FORCE_RESTORE for gen8 semaphores Ben Widawsky
2014-01-29 19:55 ` [PATCH 09/13] drm/i915/bdw: poll semaphores Ben Widawsky
2014-01-30 13:26 ` Ville Syrjälä
2014-01-29 19:55 ` [PATCH 10/13] drm/i915: Extract semaphore error collection Ben Widawsky
2014-01-29 19:55 ` [PATCH 11/13] drm/i915/bdw: collect semaphore error state Ben Widawsky
2014-01-30 14:53 ` Ville Syrjälä
2014-01-30 14:58 ` Chris Wilson
2014-02-12 0:19 ` Ben Widawsky
2014-02-12 0:23 ` Ben Widawsky
2014-01-29 19:55 ` [PATCH 12/13] drm/i915: unleash semaphores on gen8 Ben Widawsky
2014-01-29 19:55 ` [PATCH 13/13] drm/i915: semaphore debugfs Ben Widawsky
-- strict thread matches above, loose matches on Subject: below --
2014-02-20 6:19 [PATCH 01/13] drm/i915: Move semaphore specific ring members to struct Ben Widawsky
2014-02-20 6:19 ` [PATCH 06/13] drm/i915/bdw: implement semaphore signal Ben Widawsky
2014-02-24 13:10 ` Ville Syrjälä
2014-03-05 12:24 ` Ville Syrjälä
2014-04-29 21:52 [PATCH 00/13] [REPOST] BDW Semaphores Ben Widawsky
2014-04-29 21:52 ` [PATCH 06/13] drm/i915/bdw: implement semaphore signal Ben Widawsky
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20140212092920.GA3891@intel.com \
--to=ville.syrjala@linux.intel.com \
--cc=ben@bwidawsk.net \
--cc=benjamin.widawsky@intel.com \
--cc=intel-gfx@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.