From: "Thomas Hellström" <thomas.hellstrom@linux.intel.com>
To: "Christian König" <christian.koenig@amd.com>,
dri-devel@lists.freedesktop.org
Subject: Re: [RFC PATCH 1/2] drm/amdgpu: Move dma_resv waiting outside the notifier lock
Date: Fri, 08 Oct 2021 08:43:16 +0200 [thread overview]
Message-ID: <571f25bb979922f84c12cd0e98ab5870c9f63448.camel@linux.intel.com> (raw)
In-Reply-To: <d56974c3-c887-2086-5e7f-ba46a7f82319@amd.com>
Hi, Christian,
On Thu, 2021-10-07 at 16:57 +0200, Christian König wrote:
> Am 07.10.21 um 15:26 schrieb Thomas Hellström:
> > While the range notifier is executing, we have the write-side mmu
> > interval
> > seqlock, and mmu_interval_read_retry() is always returning true,
> > which means that if amdgpu_cs_submit grabs the notifier lock during
> > the
> > fence wait, it will retry anyway when checking the userptr validity
> > and block when retrying in mmu_interval_read_begin().
> > (See the extensive comments in mmu_interval_read_begin())
> >
> > Hence we can release the notifier lock before the fence wait and
> > avoid
> > a device-wide command submission block during invalidation.
>
> First of all I'm not convinced that this works and second blocking
> the
> CS while an MMU invalidation is underway is completely intentional.
>
> In other words when for example fork() is called in parallel with a
> CS
> the CS should be blocked until the invalidation caused by the fork()
> operation is completed and *NOT* risk that the CS succeeds and adds
> another dependency to the MMU invalidation.
The point is that command submission can't succeed since while the
notifer runs, we have the write-side seqlock. It's released when the
notifier ends, and we can rely on that. The following will happen:
Thread1 Thread 2
enter_cs(); enter_invalidation();
notifier_lock()
update_seqno();
notifier_unlock();
notifier_lock(); fence_wait();
validate_userptr();
(invalid_seqno)
notifier_unlock();
(retry)
mmu_interval_read_begin();
(blocks)
fence_signal();
(notifier ends)
mn_itree_inv_end();
(unblocks, new seqno);
enter_cs();
(succeeds)
So the difference is that you block only CS that affect that particular
userptr, not CS across the entire device.
So for the sake of discussion, is there a particular situation where
you see a race that might happen here?
Thanks,
Thomas
>
> Christian.
>
> >
> > Cc: Christian König <christian.koenig@amd.com>
> > Signed-off-by: Thomas Hellström <thomas.hellstrom@linux.intel.com>
> > ---
> > drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c | 3 ++-
> > 1 file changed, 2 insertions(+), 1 deletion(-)
> >
> > diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c
> > b/drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c
> > index 4b153daf283d..d3d340a6129c 100644
> > --- a/drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c
> > +++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c
> > @@ -75,9 +75,10 @@ static bool amdgpu_mn_invalidate_gfx(struct
> > mmu_interval_notifier *mni,
> >
> > mmu_interval_set_seq(mni, cur_seq);
> >
> > + mutex_unlock(&adev->notifier_lock);
> > +
> > r = dma_resv_wait_timeout(bo->tbo.base.resv, true, false,
> > MAX_SCHEDULE_TIMEOUT);
> > - mutex_unlock(&adev->notifier_lock);
> > if (r <= 0)
> > DRM_ERROR("(%ld) failed to wait for user bo\n", r);
> > return true;
>
next prev parent reply other threads:[~2021-10-08 6:43 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-10-07 13:26 [RFC PATCH 0/2] drm/amdgpu: Reduce the notifier_lock contention Thomas Hellström
2021-10-07 13:26 ` [RFC PATCH 1/2] drm/amdgpu: Move dma_resv waiting outside the notifier lock Thomas Hellström
2021-10-07 14:57 ` Christian König
2021-10-08 6:43 ` Thomas Hellström [this message]
2021-10-07 13:26 ` [RFC PATCH 2/2] drm/amdgpu: Use an rwsem for " Thomas Hellström
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=571f25bb979922f84c12cd0e98ab5870c9f63448.camel@linux.intel.com \
--to=thomas.hellstrom@linux.intel.com \
--cc=christian.koenig@amd.com \
--cc=dri-devel@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox