From: Marcelo Tosatti <mtosatti@redhat.com>
To: Frederic Weisbecker <frederic@kernel.org>
Cc: linux-kernel@vger.kernel.org, Nitesh Lal <nilal@redhat.com>,
Nicolas Saenz Julienne <nsaenzju@redhat.com>,
Christoph Lameter <cl@linux.com>,
Juri Lelli <juri.lelli@redhat.com>,
Peter Zijlstra <peterz@infradead.org>,
Alex Belits <abelits@belits.com>, Peter Xu <peterx@redhat.com>,
Thomas Gleixner <tglx@linutronix.de>,
Daniel Bristot de Oliveira <bristot@redhat.com>,
Oscar Shiang <oscar0225@livemail.tw>
Subject: Re: [patch v11 05/13] task isolation: sync vmstats on return to userspace
Date: Mon, 7 Feb 2022 12:16:33 -0300 [thread overview]
Message-ID: <YgE30X7z8htKWzbS@fuller.cnet> (raw)
In-Reply-To: <20220207145718.GA523931@lothringen>
On Mon, Feb 07, 2022 at 03:57:18PM +0100, Frederic Weisbecker wrote:
> On Fri, Feb 04, 2022 at 02:35:42PM -0300, Marcelo Tosatti wrote:
> > The logic to disable vmstat worker thread, when entering
> > nohz full, does not cover all scenarios. For example, it is possible
> > for the following to happen:
> >
> > 1) enter nohz_full, which calls refresh_cpu_vm_stats, syncing the stats.
> > 2) app runs mlock, which increases counters for mlock'ed pages.
> > 3) start -RT loop
> >
> > Since refresh_cpu_vm_stats from nohz_full logic can happen _before_
> > the mlock, vmstat shepherd can restart vmstat worker thread on
> > the CPU in question.
> >
> > To fix this, use the task isolation prctl interface to quiesce
> > deferred actions when returning to userspace.
> >
> > This patch adds hooks to fork and exit code paths.
> >
> > Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
> >
> > ---
> > v11: fold patch to add task_isol_exit hooks (Frederic)
> > Use _TIF_TASK_ISOL bit on thread flags (Frederic)
> >
> > v6: modify exit_to_user_mode_loop to cover exceptions and interrupts
> > v5: no changes
> > v4: add oneshot mode support
> >
> > include/linux/task_isolation.h | 16 ++++++++++++++++
> > include/linux/vmstat.h | 8 ++++++++
> > kernel/entry/common.c | 15 +++++++++++----
> > kernel/task_isolation.c | 21 +++++++++++++++++++++
> > mm/vmstat.c | 21 +++++++++++++++++++++
> > 5 files changed, 77 insertions(+), 4 deletions(-)
> >
> > Index: linux-2.6/include/linux/task_isolation.h
> > ===================================================================
> > --- linux-2.6.orig/include/linux/task_isolation.h
> > +++ linux-2.6/include/linux/task_isolation.h
> > @@ -27,6 +27,13 @@ static inline void task_isol_free(struct
> > __task_isol_free(tsk);
> > }
> >
> > +void __task_isol_exit(struct task_struct *tsk);
> > +static inline void task_isol_exit(struct task_struct *tsk)
> > +{
> > + if (tsk->task_isol_info)
> > + __task_isol_exit(tsk);
> > +}
> > +
> > int prctl_task_isol_feat_get(unsigned long arg2, unsigned long arg3,
> > unsigned long arg4, unsigned long arg5);
> > int prctl_task_isol_cfg_get(unsigned long arg2, unsigned long arg3,
> > @@ -40,12 +47,22 @@ int prctl_task_isol_activate_set(unsigne
> >
> > int __copy_task_isol(struct task_struct *tsk);
> >
> > +void task_isol_exit_to_user_mode(void);
> > +
> > #else
> >
> > +static inline void task_isol_exit_to_user_mode(void)
> > +{
> > +}
> > +
> > static inline void task_isol_free(struct task_struct *tsk)
> > {
> > }
> >
> > +static inline void task_isol_exit(struct task_struct *tsk)
> > +{
> > +}
> > +
> > static inline int prctl_task_isol_feat_get(unsigned long arg2,
> > unsigned long arg3,
> > unsigned long arg4,
> > Index: linux-2.6/include/linux/vmstat.h
> > ===================================================================
> > --- linux-2.6.orig/include/linux/vmstat.h
> > +++ linux-2.6/include/linux/vmstat.h
> > @@ -21,6 +21,14 @@ int sysctl_vm_numa_stat_handler(struct c
> > void *buffer, size_t *length, loff_t *ppos);
> > #endif
> >
> > +#if defined(CONFIG_SMP) && defined(CONFIG_TASK_ISOLATION)
> > +void sync_vmstat(void);
> > +#else
> > +static inline void sync_vmstat(void)
> > +{
> > +}
> > +#endif
> > +
> > struct reclaim_stat {
> > unsigned nr_dirty;
> > unsigned nr_unqueued_dirty;
> > Index: linux-2.6/kernel/entry/common.c
> > ===================================================================
> > --- linux-2.6.orig/kernel/entry/common.c
> > +++ linux-2.6/kernel/entry/common.c
> > @@ -6,6 +6,7 @@
> > #include <linux/livepatch.h>
> > #include <linux/audit.h>
> > #include <linux/tick.h>
> > +#include <linux/task_isolation.h>
> >
> > #include "common.h"
> >
> > @@ -174,6 +175,9 @@ static unsigned long exit_to_user_mode_l
> > if (ti_work & _TIF_NOTIFY_RESUME)
> > tracehook_notify_resume(regs);
> >
> > + if (ti_work & _TIF_TASK_ISOL)
> > + task_isol_exit_to_user_mode();
> > +
> > /* Architecture specific TIF work */
> > arch_exit_to_user_mode_work(regs, ti_work);
> >
> > Index: linux-2.6/kernel/task_isolation.c
> > ===================================================================
> > --- linux-2.6.orig/kernel/task_isolation.c
> > +++ linux-2.6/kernel/task_isolation.c
> > @@ -18,6 +18,12 @@
> > #include <linux/sysfs.h>
> > #include <linux/init.h>
> > #include <linux/sched/task.h>
> > +#include <linux/mm.h>
> > +#include <linux/vmstat.h>
> > +
> > +void __task_isol_exit(struct task_struct *tsk)
> > +{
> > +}
> >
> > void __task_isol_free(struct task_struct *tsk)
> > {
> > @@ -251,6 +257,9 @@ static int cfg_feat_quiesce_set(unsigned
> > task_isol_info->quiesce_mask = i_qctrl->quiesce_mask;
> > task_isol_info->oneshot_mask = i_qctrl->quiesce_oneshot_mask;
> > task_isol_info->conf_mask |= ISOL_F_QUIESCE;
> > + if (task_isol_info->quiesce_mask & ISOL_F_QUIESCE_VMSTATS)
> > + set_thread_flag(TIF_TASK_ISOL);
>
> Should you check if (i->active_mask == ISOL_F_QUIESCE) before setting the
> flag?
>
> > +
> > ret = 0;
> >
> > out_free:
> > @@ -303,6 +312,7 @@ int __copy_task_isol(struct task_struct
> > new_info->active_mask = info->active_mask;
> >
> > tsk->task_isol_info = new_info;
> > + set_ti_thread_flag(task_thread_info(tsk), TIF_TASK_ISOL);
>
> Same here?
Yes, should fix that.
Will wait for more comments before resending -v12.
next prev parent reply other threads:[~2022-02-08 16:32 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-02-04 17:35 [patch v11 00/13] extensible prctl task isolation interface and vmstat sync Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 01/13] s390: add support for TIF_TASK_ISOL Marcelo Tosatti
2022-02-07 11:40 ` Sven Schnelle
2022-02-04 17:35 ` [patch v11 02/13] x86: " Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 03/13] add basic task isolation prctl interface Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 04/13] add prctl task isolation prctl docs and samples Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 05/13] task isolation: sync vmstats on return to userspace Marcelo Tosatti
2022-02-07 14:57 ` Frederic Weisbecker
2022-02-07 15:16 ` Marcelo Tosatti [this message]
2022-02-04 17:35 ` [patch v11 06/13] procfs: add per-pid task isolation state Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 07/13] task isolation: sync vmstats conditional on changes Marcelo Tosatti
2022-02-07 15:19 ` Frederic Weisbecker
2022-03-10 16:30 ` Marcelo Tosatti
2022-02-07 15:29 ` Frederic Weisbecker
2022-02-04 17:35 ` [patch v11 08/13] task isolation: enable return to userspace processing Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 09/13] task isolation: add preempt notifier to sync per-CPU vmstat dirty info to thread info Marcelo Tosatti
2022-02-07 15:36 ` Frederic Weisbecker
2022-02-04 17:35 ` [patch v11 10/13] KVM: x86: process isolation work from VM-entry code path Marcelo Tosatti
2022-02-07 15:47 ` Frederic Weisbecker
2022-03-10 16:35 ` Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 11/13] mm: vmstat: move need_update Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 12/13] mm: vmstat_refresh: avoid queueing work item if cpu stats are clean Marcelo Tosatti
2022-02-04 17:35 ` [patch v11 13/13] task isolation: only TIF_TASK_ISOL if task isolation is enabled Marcelo Tosatti
2022-02-07 15:38 ` Frederic Weisbecker
2022-02-19 8:02 ` [patch v11 00/13] extensible prctl task isolation interface and vmstat sync Oscar Shiang
2022-02-23 17:31 ` Marcelo Tosatti
2022-03-08 6:32 ` Oscar Shiang
2022-03-08 13:12 ` Marcelo Tosatti
2022-03-09 15:31 ` Oscar Shiang
2022-03-08 7:20 ` Oscar Shiang
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YgE30X7z8htKWzbS@fuller.cnet \
--to=mtosatti@redhat.com \
--cc=abelits@belits.com \
--cc=bristot@redhat.com \
--cc=cl@linux.com \
--cc=frederic@kernel.org \
--cc=juri.lelli@redhat.com \
--cc=linux-kernel@vger.kernel.org \
--cc=nilal@redhat.com \
--cc=nsaenzju@redhat.com \
--cc=oscar0225@livemail.tw \
--cc=peterx@redhat.com \
--cc=peterz@infradead.org \
--cc=tglx@linutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox