public inbox for gfs2@lists.linux.dev
 help / color / mirror / Atom feed
* [PATCH dlm/next 0/9] dlm: sand fix, rhashtable, timers and lookup hotpath speedup
@ 2024-04-12 13:43 Alexander Aring
  2024-04-12 13:43 ` [PATCH dlm/next 1/9] dlm: increment ls_count on find_ls_to_scan() Alexander Aring
                   ` (8 more replies)
  0 siblings, 9 replies; 12+ messages in thread
From: Alexander Aring @ 2024-04-12 13:43 UTC (permalink / raw)
  To: teigland; +Cc: gfs2, aahringo

Hi,

this is a patch series for an immense change in DLM rsb hashtable logic.
It removes the double lookup functionality for rsb hashtables, convert to
rhashtable instead of own bucket hlist hash implementation.

At first there is a fix for scand that I detected while I was
implementing this patch series. It could be that remove messages are
still send when the lockspace is releasing the resource that could occur
into a use after free.

There is a conversion to use lists (keep/toss lists) instead of
iterating over the hash bucket. As we do a transition to rhashtable,
they don't like to being iterated regarding to their own bucket sizing
implementation that is sitting in the rhashtable implementation. We just
introduce the lists to do the iteration, as advantage we have a huge
reduce of code in the debugfs dump functionality as we use the dump list
helpers of debugfs. There is also a potential refcount bug when holding
rsb references of rsbs in toss state as receive remove message requires
no rsb references being hold. Another issue is also holding rsb in keep
state as they are not going into toss state when they required to.

It is now forbidden to hold references of rsbs in toss state. The
refcounter must be only functional in rsb keep state. That hopefully
will show is more invalid usage of the rsb refcounter if the rsb is in
toss state.

The scand was being fixed but now also it's removed. The scand process
was holding the hashtable/hash bucket lock for a longer timer because it
iterated over the whole hash. We use timers now to reduce the held time
of the hashtable lock. We introduce a per lockspace toss queue with tossed
timer rsb and the first item is the earliest rsb that will be expired by
the timer vice versa the last item. This makes it easy to change the
timer expiration to the next one in the queue.

The last two patches we move very likely lookup hotpath to read lock
mostly. This should for sure avoid contention in the most cases. Unlikely
path need to still hold the write lock and do some extra relookup and check
if the state of an rsb changed. However I think we hit over 90% the likely
path that we only need to hold the read lock and avoid contention between
processing DLM messages and the user triggers new DLM requests.

- Alex

changes since v2:
 - hold the write_lock in find_rsb_dir/nodir when hitting the do_toss
   path and then do the lookup and check on the do_toss rsb fields
 - move from a per rsb timer to a per lockspace timer and introduce
   a per rsb toss queue.

Alexander Aring (9):
  dlm: increment ls_count on find_ls_to_scan()
  dlm: change to non per bucket hashtable lock
  dlm: merge toss and keep hash into one
  dlm: fix avoid rsb hold during debugfs dump
  dlm: switch to use rhashtable for rsbs
  dlm: remove refcounting if rsb is on toss
  dlm: drop scand kthread and use timers
  dlm: likely read lock path for rsb lookup
  dlm: convert lkbidr to rwlock

 fs/dlm/config.c       |   8 +
 fs/dlm/config.h       |   2 +
 fs/dlm/debug_fs.c     | 212 ++---------
 fs/dlm/dir.c          |  14 +-
 fs/dlm/dlm_internal.h |  40 +-
 fs/dlm/lock.c         | 836 +++++++++++++++++++++++-------------------
 fs/dlm/lock.h         |   4 +-
 fs/dlm/lockspace.c    | 151 ++------
 fs/dlm/member.c       |   2 +
 fs/dlm/recover.c      |  29 +-
 fs/dlm/recoverd.c     |  54 +--
 11 files changed, 615 insertions(+), 737 deletions(-)

-- 
2.43.0


^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2024-04-12 19:24 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-04-12 13:43 [PATCH dlm/next 0/9] dlm: sand fix, rhashtable, timers and lookup hotpath speedup Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 1/9] dlm: increment ls_count on find_ls_to_scan() Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 2/9] dlm: change to non per bucket hashtable lock Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 3/9] dlm: merge toss and keep hash into one Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 4/9] dlm: fix avoid rsb hold during debugfs dump Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 5/9] dlm: switch to use rhashtable for rsbs Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 6/9] dlm: remove refcounting if rsb is on toss Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 7/9] dlm: drop scand kthread and use timers Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 8/9] dlm: likely read lock path for rsb lookup Alexander Aring
2024-04-12 14:53   ` Alexander Aring
2024-04-12 19:24     ` Alexander Aring
2024-04-12 13:43 ` [PATCH dlm/next 9/9] dlm: convert lkbidr to rwlock Alexander Aring

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox