From: Frederic Weisbecker <frederic@kernel.org>
To: Aaron Tomlin <atomlin@redhat.com>,
Marcelo Tosatti <mtosatti@redhat.com>,
Christoph Lameter <cl@linux.com>
Cc: tglx@linutronix.de, mingo@kernel.org, atomlin@atomlin.com,
linux-kernel@vger.kernel.org, linux-mm@kvack.org,
Phil Auld <pauld@redhat.com>
Subject: Re: [RFC PATCH] tick/sched: Ensure quiet_vmstat() is called when the idle tick was stopped too
Date: Thu, 17 Feb 2022 13:47:29 +0100 [thread overview]
Message-ID: <20220217124729.GA743618@lothringen> (raw)
In-Reply-To: <20220203214339.1889971-1-atomlin@redhat.com>
Hi Aaron,
I fear my blood-brain barrier doesn't let much of mm/ code in, so I'm adding a
few interested people in Cc. Meanwhile a few comments below:
On Thu, Feb 03, 2022 at 09:43:39PM +0000, Aaron Tomlin wrote:
> Hi Frederic,
>
> If I understand correctly, in the context of the idle task and a nohz_full
> CPU, quiet_vmstat() can be called: before stopping the idle tick, entering
> an idle state and on exit. In particular, for the latter case, when the
> idle task is required to reschedule, the idle tick can remain stopped and
> the timer expiration time endless i.e., KTIME_MAX. Now, indeed before a
> nohz_full CPU enters an idle state, CPU-specific vmstat counters should
> be processed to ensure the respective values have been reset and folded
> into the zone specific vm_stat[]. That being said, it can only occur when:
> the idle tick was previously stopped, and reprogramming of the timer is not
> required.
So, to make sure I understand, the issue is that with nohz_full, we may
well enter into the idle loop with the tick already stopped. We may also
exit from idle without restarting the tick (again only with nohz_full). And
so this can cause the vmstat to not be flushed upon idle entry. Right?
>
> A customer provided some evidence which indicates that the idle tick was
> stopped; albeit, CPU-specific vmstat counters still remained populated.
> Thus one can only assume quiet_vmstat() was not invoked on return to the
> idle loop.
>
> Unfortunately, I suspect this divergence might erroneously prevent a
> reclaim attempt by kswapd. If the number of zone specific free pages are
> below their per-cpu drift value then zone_page_state_snapshot() is used to
> compute a more accurate view of the aforementioned statistic.
> Thus any task blocked on the NUMA node specific pfmemalloc_wait queue will
> be unable to make significant progress via direct reclaim unless it is
> killed after being woken up by kswapd (see throttle_direct_reclaim()).
> That being said, eventually reclaim should give up if the conditions are
> correct, no?
Now if quiet_vmstat() isn't called, the vmstat_work should fix this later,
right? Or does that happen too late perhaps?
Thanks!
next prev parent reply other threads:[~2022-02-17 12:47 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-02-03 21:43 [RFC PATCH] tick/sched: Ensure quiet_vmstat() is called when the idle tick was stopped too Aaron Tomlin
2022-02-03 22:22 ` Phil Auld
2022-02-16 14:34 ` Aaron Tomlin
2022-02-16 21:20 ` Phil Auld
2022-02-17 12:57 ` Frederic Weisbecker
2022-02-17 14:45 ` Aaron Tomlin
2022-02-17 12:47 ` Frederic Weisbecker [this message]
2022-02-17 14:26 ` Aaron Tomlin
2022-02-17 16:32 ` Frederic Weisbecker
2022-02-18 12:54 ` Aaron Tomlin
2022-02-19 15:46 ` Aaron Tomlin
2022-02-24 12:27 ` Marcelo Tosatti
2022-02-24 12:30 ` Marcelo Tosatti
2022-02-24 13:01 ` Aaron Tomlin
2022-02-24 12:37 ` Marcelo Tosatti
2022-02-24 13:00 ` Aaron Tomlin
2022-02-24 13:14 ` Marcelo Tosatti
2022-02-24 13:28 ` Aaron Tomlin
2022-02-24 13:40 ` Marcelo Tosatti
2022-02-24 13:44 ` Aaron Tomlin
2022-03-31 14:33 ` Aaron Tomlin
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20220217124729.GA743618@lothringen \
--to=frederic@kernel.org \
--cc=atomlin@atomlin.com \
--cc=atomlin@redhat.com \
--cc=cl@linux.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mingo@kernel.org \
--cc=mtosatti@redhat.com \
--cc=pauld@redhat.com \
--cc=tglx@linutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.