linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v2 0/3] fs: introduce file_ref_t
@ 2024-10-07 14:23 Christian Brauner
  2024-10-07 14:23 ` [PATCH v2 1/3] fs: protect backing files with rcu Christian Brauner
                   ` (3 more replies)
  0 siblings, 4 replies; 14+ messages in thread
From: Christian Brauner @ 2024-10-07 14:23 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: linux-fsdevel, Thomas Gleixner, Jens Axboe, Christian Brauner

As atomic_inc_not_zero() is implemented with a try_cmpxchg() loop it has
O(N^2) behaviour under contention with N concurrent operations and it is
in a hot path in __fget_files_rcu().

The rcuref infrastructures remedies this problem by using an
unconditional increment relying on safe- and dead zones to make this
work and requiring rcu protection for the data structure in question.
This not just scales better it also introduces overflow protection.

However, in contrast to generic rcuref, files require a memory barrier
and thus cannot rely on *_relaxed() atomic operations and also require
to be built on atomic_long_t as having massive amounts of reference
isn't unheard of even if it is just an attack.

As suggested by Linus, add a file specific variant instead of making
this a generic library.

I've been testing this with will-it-scale using a multi-threaded fstat()
on the same file descriptor on a machine that Jens gave me access (thank
you very much!):

processor       : 511
vendor_id       : AuthenticAMD
cpu family      : 25
model           : 160
model name      : AMD EPYC 9754 128-Core Processor

and I consistently get a 3-5% improvement on workloads with 256+ and
more threads comparing v6.12-rc1 as base with and without these patches
applied.

In vfs.file now.

Signed-off-by: Christian Brauner <brauner@kernel.org>
---
Changes in v2:
- Don't introduce a separate rcuref_long_t library just implement it
  only for files for now.
- Retain memory barrier by using atomic_long_add_negative() instead of
  atomic_long_add_negative_relaxed() to order the loads before and after
  the increment in __fget_files_rcu().
- Link to v1: https://lore.kernel.org/r/20241005-brauner-file-rcuref-v1-0-725d5e713c86@kernel.org

---
Christian Brauner (3):
      fs: protect backing files with rcu
      fs: add file_ref
      fs: port files to file_ref

 drivers/gpu/drm/i915/gt/shmem_utils.c |   2 +-
 drivers/gpu/drm/vmwgfx/ttm_object.c   |   2 +-
 fs/eventpoll.c                        |   2 +-
 fs/file.c                             | 120 ++++++++++++++++++++++++++++++++--
 fs/file_table.c                       |  23 +++++--
 include/linux/file_ref.h              | 116 ++++++++++++++++++++++++++++++++
 include/linux/fs.h                    |   9 +--
 7 files changed, 253 insertions(+), 21 deletions(-)
---
base-commit: 9852d85ec9d492ebef56dc5f229416c925758edc
change-id: 20240927-brauner-file-rcuref-bfa4a4ba915b


^ permalink raw reply	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2024-10-29 17:31 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-10-07 14:23 [PATCH v2 0/3] fs: introduce file_ref_t Christian Brauner
2024-10-07 14:23 ` [PATCH v2 1/3] fs: protect backing files with rcu Christian Brauner
2024-10-07 14:23 ` [PATCH v2 2/3] fs: add file_ref Christian Brauner
2024-10-07 18:07   ` Linus Torvalds
2024-10-08 10:12     ` Christian Brauner
2024-10-08 17:29       ` Linus Torvalds
2024-10-07 14:23 ` [PATCH v2 3/3] fs: port files to file_ref Christian Brauner
2024-10-25 20:45   ` Jann Horn
2024-10-25 23:55     ` Jann Horn
2024-10-28 11:17       ` Christian Brauner
2024-10-28 18:30         ` Linus Torvalds
2024-10-29 14:18           ` Christian Brauner
2024-10-29 17:30           ` Endorsing __randomize_layout for projects! " Cedric Blancher
2024-10-07 18:27 ` [PATCH v2 0/3] fs: introduce file_ref_t Jens Axboe

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).