All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] btrfs: use lockless read in nr_cached_objects shrinker callback
@ 2026-05-29 21:23 Boris Burkov
  2026-05-29 22:11 ` Shakeel Butt
  2026-05-29 22:23 ` Qu Wenruo
  0 siblings, 2 replies; 3+ messages in thread
From: Boris Burkov @ 2026-05-29 21:23 UTC (permalink / raw)
  To: linux-btrfs, kernel-team

From: Ben Maurer <bmaurer@meta.com>

Under heavy memcg-driven slab reclaim with many memcgs and CPUs,
shrink_slab_memcg() invokes the per-superblock count callback once
per (memcg, NUMA node) tuple. For btrfs that callback reaches
percpu_counter_sum_positive() on fs_info->evictable_extent_maps,
which takes the percpu_counter's raw spinlock with IRQs disabled and
walks every online CPU. With hundreds of memcgs driving reclaim on a
host with dozens of CPUs, this counter lock becomes a global
serialization point: profiles show CPU pinned in the spin_lock_irqsave
acquire under __percpu_counter_sum, with cross-CPU IPIs hitting
csd_lock_wait_toolong while waiting for spinning vCPUs.

The shrinker count is advisory -- super_cache_count() already notes
"counts can change between super_cache_count and super_cache_scan, so
we really don't need locks here." Use percpu_counter_read_positive(),
which is lockless. Worst-case skew is bounded by batch * num_online_cpus
(a few thousand), negligible compared to the millions of extent maps a
busy filesystem accumulates and well within the noise that the shrinker
already tolerates.

Tested-by: Boris Burkov <boris@bur.io>
Signed-off-by: Ben Maurer <bmaurer@meta.com>
---
 fs/btrfs/super.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/fs/btrfs/super.c b/fs/btrfs/super.c
index 958dd185c0d6..c946bccf0748 100644
--- a/fs/btrfs/super.c
+++ b/fs/btrfs/super.c
@@ -2432,7 +2432,7 @@ static int btrfs_show_devname(struct seq_file *m, struct dentry *root)
 static long btrfs_nr_cached_objects(struct super_block *sb, struct shrink_control *sc)
 {
 	struct btrfs_fs_info *fs_info = btrfs_sb(sb);
-	const s64 nr = percpu_counter_sum_positive(&fs_info->evictable_extent_maps);
+	const s64 nr = percpu_counter_read_positive(&fs_info->evictable_extent_maps);
 
 	trace_btrfs_extent_map_shrinker_count(fs_info, nr);
 
-- 
2.54.0


^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH] btrfs: use lockless read in nr_cached_objects shrinker callback
  2026-05-29 21:23 [PATCH] btrfs: use lockless read in nr_cached_objects shrinker callback Boris Burkov
@ 2026-05-29 22:11 ` Shakeel Butt
  2026-05-29 22:23 ` Qu Wenruo
  1 sibling, 0 replies; 3+ messages in thread
From: Shakeel Butt @ 2026-05-29 22:11 UTC (permalink / raw)
  To: Boris Burkov; +Cc: linux-btrfs, kernel-team

On Fri, May 29, 2026 at 02:23:46PM -0700, Boris Burkov wrote:
> From: Ben Maurer <bmaurer@meta.com>
> 
> Under heavy memcg-driven slab reclaim with many memcgs and CPUs,
> shrink_slab_memcg() invokes the per-superblock count callback once
> per (memcg, NUMA node) tuple. For btrfs that callback reaches
> percpu_counter_sum_positive() on fs_info->evictable_extent_maps,
> which takes the percpu_counter's raw spinlock with IRQs disabled and
> walks every online CPU. With hundreds of memcgs driving reclaim on a
> host with dozens of CPUs, this counter lock becomes a global
> serialization point: profiles show CPU pinned in the spin_lock_irqsave
> acquire under __percpu_counter_sum, with cross-CPU IPIs hitting
> csd_lock_wait_toolong while waiting for spinning vCPUs.
> 
> The shrinker count is advisory -- super_cache_count() already notes
> "counts can change between super_cache_count and super_cache_scan, so
> we really don't need locks here." Use percpu_counter_read_positive(),
> which is lockless. Worst-case skew is bounded by batch * num_online_cpus
> (a few thousand), negligible compared to the millions of extent maps a
> busy filesystem accumulates and well within the noise that the shrinker
> already tolerates.
> 
> Tested-by: Boris Burkov <boris@bur.io>
> Signed-off-by: Ben Maurer <bmaurer@meta.com>

Reviewed-by: Shakeel Butt <shakeel.butt@linux.dev>

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] btrfs: use lockless read in nr_cached_objects shrinker callback
  2026-05-29 21:23 [PATCH] btrfs: use lockless read in nr_cached_objects shrinker callback Boris Burkov
  2026-05-29 22:11 ` Shakeel Butt
@ 2026-05-29 22:23 ` Qu Wenruo
  1 sibling, 0 replies; 3+ messages in thread
From: Qu Wenruo @ 2026-05-29 22:23 UTC (permalink / raw)
  To: Boris Burkov, linux-btrfs, kernel-team



在 2026/5/30 06:53, Boris Burkov 写道:
> From: Ben Maurer <bmaurer@meta.com>
> 
> Under heavy memcg-driven slab reclaim with many memcgs and CPUs,
> shrink_slab_memcg() invokes the per-superblock count callback once
> per (memcg, NUMA node) tuple. For btrfs that callback reaches
> percpu_counter_sum_positive() on fs_info->evictable_extent_maps,
> which takes the percpu_counter's raw spinlock with IRQs disabled and
> walks every online CPU. With hundreds of memcgs driving reclaim on a
> host with dozens of CPUs, this counter lock becomes a global
> serialization point: profiles show CPU pinned in the spin_lock_irqsave
> acquire under __percpu_counter_sum, with cross-CPU IPIs hitting
> csd_lock_wait_toolong while waiting for spinning vCPUs.
> 
> The shrinker count is advisory -- super_cache_count() already notes
> "counts can change between super_cache_count and super_cache_scan, so
> we really don't need locks here." Use percpu_counter_read_positive(),
> which is lockless. Worst-case skew is bounded by batch * num_online_cpus
> (a few thousand), negligible compared to the millions of extent maps a
> busy filesystem accumulates and well within the noise that the shrinker
> already tolerates.
> 
> Tested-by: Boris Burkov <boris@bur.io>
> Signed-off-by: Ben Maurer <bmaurer@meta.com>

Reviewed-by: Qu Wenruo <wqu@suse.com>

Thanks,
Qu

> ---
>   fs/btrfs/super.c | 2 +-
>   1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/fs/btrfs/super.c b/fs/btrfs/super.c
> index 958dd185c0d6..c946bccf0748 100644
> --- a/fs/btrfs/super.c
> +++ b/fs/btrfs/super.c
> @@ -2432,7 +2432,7 @@ static int btrfs_show_devname(struct seq_file *m, struct dentry *root)
>   static long btrfs_nr_cached_objects(struct super_block *sb, struct shrink_control *sc)
>   {
>   	struct btrfs_fs_info *fs_info = btrfs_sb(sb);
> -	const s64 nr = percpu_counter_sum_positive(&fs_info->evictable_extent_maps);
> +	const s64 nr = percpu_counter_read_positive(&fs_info->evictable_extent_maps);
>   
>   	trace_btrfs_extent_map_shrinker_count(fs_info, nr);
>   


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2026-05-29 22:23 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-05-29 21:23 [PATCH] btrfs: use lockless read in nr_cached_objects shrinker callback Boris Burkov
2026-05-29 22:11 ` Shakeel Butt
2026-05-29 22:23 ` Qu Wenruo

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.