From: Roman Gushchin <roman.gushchin@linux.dev>
To: "Leonardo Brás" <leobras@redhat.com>
Cc: Michal Hocko <mhocko@suse.com>,
Marcelo Tosatti <mtosatti@redhat.com>,
Johannes Weiner <hannes@cmpxchg.org>,
Shakeel Butt <shakeelb@google.com>,
Muchun Song <muchun.song@linux.dev>,
Andrew Morton <akpm@linux-foundation.org>,
cgroups@vger.kernel.org, linux-mm@kvack.org,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH v2 0/5] Introduce memcg_stock_pcp remote draining
Date: Sun, 5 Feb 2023 11:49:13 -0800 [thread overview]
Message-ID: <Y+AIOQy0HdVXCw8m@P9FQF9L96D> (raw)
In-Reply-To: <28e08669302ad1e7a41bdf8b9988de6a352b5fe1.camel@redhat.com>
Hi Leonardo!
> Yes, but we are exchanging an "always schedule_work_on()", which is a kind of
> contention, for a "sometimes we hit spinlock contention".
>
> For the spinlock proposal, on the local cpu side, the *worst case* contention
> is:
> 1 - wait the spin_unlock() for a complete <percpu cache drain process>,
> 2 - wait a cache hit for local per-cpu cacheline
>
> What is current implemented (schedule_work_on() approach), for the local
> cpu side there is *always* this contention:
> 1 - wait for a context switch,
> 2 - wait a cache hit from it's local per-cpu cacheline,
> 3 - wait a complete <percpu cache drain process>,
> 4 - then for a new context switch to the current thread.
I think both Michal and me are thinking of a more generic case in which the cpu
is not exclusively consumed by 1 special process, so that the draining work can
be executed during an idle time. In this case the work is basically free.
And the introduction of a spin_lock() on the hot path is what we're are concerned
about. I agree, that on some hardware platforms it won't be that expensive, but
in general not having any spinlocks is so much better.
>
> So moving from schedule_work_on() to spinlocks will save 2 context switches per
> cpu every time drain_all_stock() is called.
>
> On the remote cpu side, my tests point that doing the remote draining is faster
> than scheduling a local draining, so it's also a gain.
>
> Also, IIUC the possible contention in the spinlock approach happens only on
> page-faulting and syscalls, versus the schedule_work_on() approach that can
> interrupt user workload at any time.
>
> In fact, not interrupting the user workload in isolated cpus is just a bonus of
> using spinlocks.
I believe it significantly depends on the preemption model: you're right regarding
fully preemptive kernels, but with voluntary/none preemption it's exactly opposite:
the draining work will be executed at some point later (probably with 0 cost),
while the remote access from another cpu will potentially cause delays on the
spin lock as well as a need to refill the stock.
Overall I'd expect a noticeable performance regression from an introduction of
spin locks and remote draining. Maybe not on all platforms, but at least on some.
That's my main concern. And I don't think the problem we're aiming to solve here
justifies this potential regression.
Thanks!
next prev parent reply other threads:[~2023-02-05 19:49 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-01-25 7:34 [PATCH v2 0/5] Introduce memcg_stock_pcp remote draining Leonardo Bras
[not found] ` <20230125073502.743446-1-leobras-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-01-25 7:34 ` [PATCH v2 1/5] mm/memcontrol: Align percpu memcg_stock to cache Leonardo Bras
2023-01-25 7:34 ` [PATCH v2 2/5] mm/memcontrol: Change stock_lock type from local_lock_t to spinlock_t Leonardo Bras
2023-01-25 7:35 ` [PATCH v2 3/5] mm/memcontrol: Reorder memcg_stock_pcp members to avoid holes Leonardo Bras
2023-01-25 7:35 ` [PATCH v2 5/5] mm/memcontrol: Remove flags from memcg_stock_pcp Leonardo Bras
2023-01-25 8:33 ` [PATCH v2 0/5] Introduce memcg_stock_pcp remote draining Michal Hocko
[not found] ` <Y9DpbVF+JR/G+5Or-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-25 11:06 ` Leonardo Brás
[not found] ` <9e61ab53e1419a144f774b95230b789244895424.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-01-25 11:39 ` Michal Hocko
2023-01-25 18:22 ` Marcelo Tosatti
2023-01-25 23:14 ` Roman Gushchin
[not found] ` <Y9G36AiqPPFDlax3-+xijCwNIfdoLQcUKs7qKB+WAnPUfkyWGUBSOeVevoDU@public.gmane.org>
2023-01-26 7:41 ` Michal Hocko
2023-01-26 18:03 ` Marcelo Tosatti
2023-01-26 19:20 ` Michal Hocko
[not found] ` <Y9LSjnNEEUiF/70R-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-27 0:32 ` Marcelo Tosatti
2023-01-27 6:58 ` Michal Hocko
2023-02-01 18:31 ` Roman Gushchin
2023-01-26 23:12 ` Roman Gushchin
[not found] ` <Y9MI42NSLooyVZNu-+xijCwNIfdoLQcUKs7qKB+WAnPUfkyWGUBSOeVevoDU@public.gmane.org>
2023-01-27 7:11 ` Michal Hocko
2023-01-27 7:22 ` Leonardo Brás
[not found] ` <52a0f1e593b1ec0ca7e417ba37680d65df22de82.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-01-27 8:12 ` Leonardo Brás
[not found] ` <601fc35a8cc2167e53e45c636fccb2d899fd7c50.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-01-27 9:23 ` Michal Hocko
2023-01-27 13:03 ` Frederic Weisbecker
[not found] ` <Y9N5CI8PpsfiaY9c-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-27 13:58 ` Michal Hocko
[not found] ` <Y9PYe1X7dRQOcahg-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-27 18:18 ` Roman Gushchin
[not found] ` <Y9QVWwAreTlDVdZ0-+xijCwNIfdoLQcUKs7qKB+WAnPUfkyWGUBSOeVevoDU@public.gmane.org>
2023-02-03 15:21 ` Michal Hocko
[not found] ` <Y90mZQhW89HtYfT9-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-02-03 19:25 ` Roman Gushchin
[not found] ` <Y91fnF5uEcSA0/99-+xijCwNIfdoLQcUKs7qKB+WAnPUfkyWGUBSOeVevoDU@public.gmane.org>
2023-02-13 13:36 ` Michal Hocko
2023-01-27 7:14 ` Leonardo Brás
[not found] ` <55ac6e3cbb97c7d13c49c3125c1455d8a2c785c3.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-01-27 7:20 ` Michal Hocko
2023-01-27 7:35 ` Leonardo Brás
2023-01-27 9:29 ` Michal Hocko
[not found] ` <Y9OZezjUPITtEvTx-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-27 19:29 ` Leonardo Brás
2023-01-27 23:50 ` Roman Gushchin
2023-01-26 18:19 ` Marcelo Tosatti
2023-01-27 5:40 ` Leonardo Brás
2023-01-26 7:45 ` Michal Hocko
[not found] ` <Y9IvoDJbLbFcitTc-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-26 18:14 ` Marcelo Tosatti
2023-01-26 19:13 ` Michal Hocko
[not found] ` <Y9LQ615H13RmG7wL-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-01-27 6:55 ` Leonardo Brás
[not found] ` <0122005439ffb7895efda7a1a67992cbe41392fe.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-01-31 11:35 ` Marcelo Tosatti
2023-02-01 4:36 ` Leonardo Brás
[not found] ` <5ba79c4feb829ed75cfd98cf5c8042dcb2ddea91.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-02-01 12:52 ` Michal Hocko
2023-02-01 12:41 ` Michal Hocko
[not found] ` <Y9pd7AxAILUSHrpe-2MMpYkNvuYDjFM9bn6wA6Q@public.gmane.org>
2023-02-04 4:55 ` Leonardo Brás
2023-02-05 19:49 ` Roman Gushchin [this message]
2023-02-07 3:18 ` Leonardo Brás
[not found] ` <4b232f47e038ab6fcaa0114f73c28d4bf8799f84.camel-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>
2023-02-08 19:23 ` Roman Gushchin
2023-01-25 7:35 ` [PATCH v2 4/5] mm/memcontrol: Perform all stock drain in current CPU Leonardo Bras
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Y+AIOQy0HdVXCw8m@P9FQF9L96D \
--to=roman.gushchin@linux.dev \
--cc=akpm@linux-foundation.org \
--cc=cgroups@vger.kernel.org \
--cc=hannes@cmpxchg.org \
--cc=leobras@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@suse.com \
--cc=mtosatti@redhat.com \
--cc=muchun.song@linux.dev \
--cc=shakeelb@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox