From: "Thomas Hellström" <thomas.hellstrom@linux.intel.com>
To: "Christian König" <christian.koenig@amd.com>,
intel-xe@lists.freedesktop.org, intel-gfx@list.freedesktop.org
Cc: dri-devel@lists.freedesktop.org
Subject: Re: [PATCH 0/4] TTM unlockable restartable LRU list iteration
Date: Fri, 16 Feb 2024 15:20:56 +0100 [thread overview]
Message-ID: <47fc8e42dcfd868341ffc32754c302e58ac49484.camel@linux.intel.com> (raw)
In-Reply-To: <29000a0d-19ce-4727-945b-d5734313c7f1@amd.com>
On Fri, 2024-02-16 at 15:00 +0100, Christian König wrote:
> Am 16.02.24 um 14:13 schrieb Thomas Hellström:
> > This patch-set is a prerequisite for a standalone TTM shrinker
> > and for exhaustive TTM eviction using sleeping dma_resv locks,
> > which is the motivation it.
> >
> > Currently when unlocking the TTM lru list lock, iteration needs
> > to be restarted from the beginning, rather from the next LRU list
> > node. This can potentially be a big problem, because if eviction
> > or shrinking fails for whatever reason after unlock, restarting
> > is likely to cause the same failure over and over again.
>
> Oh, yes please. I have been working on that problem before as well,
> but
> wasn't able to come up with something working.
>
> > There are various schemes to be able to continue the list
> > iteration from where we left off. One such scheme used by the
> > GEM LRU list traversal is to pull items already considered off
> > the LRU list and reinsert them when iteration is done.
> > This has the drawback that concurrent list iteration doesn't see
> > the complete list (which is bad for exhaustive eviction) and also
> > doesn't lend itself well to bulk-move sublists since these will
> > be split in the process where items from those lists are
> > temporarily pulled from the list and moved to the list tail.
>
> Completely agree that this is not a desirable solution.
>
> > The approach taken here is that list iterators insert themselves
> > into the list next position using a special list node. Iteration
> > is then using that list node as starting point when restarting.
> > Concurrent iterators just skip over the special list nodes.
> >
> > This is implemented in patch 1 and 2.
> >
> > For bulk move sublist the approach is the same, but when a bulk
> > move sublist is moved to the tail, the iterator is also moved,
> > causing us to skip parts of the list. That is undesirable.
> > Patch 3 deals with that, and when iterator detects it is
> > traversing a sublist, it inserts a second restarting point just
> > after the sublist and if the sublist is moved to the tail,
> > it just uses the second restarting point instead.
> >
> > This is implemented in patch 3.
>
> Interesting approach, that is probably even better than what I tried.
>
> My approach was basically to not only lock the current BO, but also
> the
> next one. Since only a locked BO can move on the LRU we effectively
> created an anchor.
>
> Before I dig into the code a couple of questions:
These are described in the patches but brief comments inline.
> 1. How do you distinct BOs and iteration anchors on the LRU?
Using a struct ttm_lru_item, containing a struct list_head and the
type. List nodes embeds this instead of a struct list_head. This is
larger than the list head but makes it explicit what we're doing.
> 2. How do you detect that a bulk list moved on the LRU?
An age u64 counter on the bulk move that we're comparing against. It's
updated each time it moves.
> 3. How do you remove the iteration anchors from the bulk list?
A function call at the end of iteration, that the function iterating is
requried to call.
/Thomas
>
> Regards,
> Christian.
>
> >
> > The restartable property is used in patch 4 to restart swapout if
> > needed, but the main purpose is this paves the way for
> > shrinker- and exhaustive eviction.
> >
> > Cc: Christian König <christian.koenig@amd.com>
> > Cc: <dri-devel@lists.freedesktop.org>
> >
> > Thomas Hellström (4):
> > drm/ttm: Allow TTM LRU list nodes of different types
> > drm/ttm: Use LRU hitches
> > drm/ttm: Consider hitch moves within bulk sublist moves
> > drm/ttm: Allow continued swapout after -ENOSPC falure
> >
> > drivers/gpu/drm/ttm/ttm_bo.c | 1 +
> > drivers/gpu/drm/ttm/ttm_device.c | 33 +++--
> > drivers/gpu/drm/ttm/ttm_resource.c | 202 +++++++++++++++++++++++-
> > -----
> > include/drm/ttm/ttm_resource.h | 91 +++++++++++--
> > 4 files changed, 267 insertions(+), 60 deletions(-)
> >
>
next prev parent reply other threads:[~2024-02-16 14:21 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-02-16 13:13 [PATCH 0/4] TTM unlockable restartable LRU list iteration Thomas Hellström
2024-02-16 13:13 ` [PATCH 1/4] drm/ttm: Allow TTM LRU list nodes of different types Thomas Hellström
2024-02-16 13:13 ` [PATCH 2/4] drm/ttm: Use LRU hitches Thomas Hellström
2024-02-16 13:13 ` [PATCH 3/4] drm/ttm: Consider hitch moves within bulk sublist moves Thomas Hellström
2024-02-16 13:13 ` [PATCH 4/4] drm/ttm: Allow continued swapout after -ENOSPC falure Thomas Hellström
2024-02-16 13:19 ` ✓ CI.Patch_applied: success for TTM unlockable restartable LRU list iteration Patchwork
2024-02-16 13:20 ` ✓ CI.checkpatch: " Patchwork
2024-02-16 13:20 ` ✓ CI.KUnit: " Patchwork
2024-02-16 13:31 ` ✓ CI.Build: " Patchwork
2024-02-16 13:31 ` ✓ CI.Hooks: " Patchwork
2024-02-16 13:33 ` ✗ CI.checksparse: warning " Patchwork
2024-02-16 14:00 ` [PATCH 0/4] " Christian König
2024-02-16 14:20 ` Thomas Hellström [this message]
2024-02-29 15:08 ` Christian König
2024-02-29 17:34 ` Thomas Hellström
2024-03-01 13:45 ` Thomas Hellström
2024-03-01 14:20 ` Christian König
2024-03-01 14:41 ` Thomas Hellström
2024-02-16 14:05 ` ✓ CI.BAT: success for " Patchwork
-- strict thread matches above, loose matches on Subject: below --
2024-02-16 13:14 [PATCH 0/4] " Thomas Hellström
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=47fc8e42dcfd868341ffc32754c302e58ac49484.camel@linux.intel.com \
--to=thomas.hellstrom@linux.intel.com \
--cc=christian.koenig@amd.com \
--cc=dri-devel@lists.freedesktop.org \
--cc=intel-gfx@list.freedesktop.org \
--cc=intel-xe@lists.freedesktop.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.