* [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed
@ 2026-03-19 5:21 SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 06/10] mm/damon/tests/core-kunit: test pause commitment SeongJae Park
` (4 more replies)
0 siblings, 5 replies; 10+ messages in thread
From: SeongJae Park @ 2026-03-19 5:21 UTC (permalink / raw)
Cc: SeongJae Park, Liam R. Howlett, Andrew Morton, Brendan Higgins,
David Gow, David Hildenbrand, Jonathan Corbet, Lorenzo Stoakes,
Michal Hocko, Mike Rapoport, Shuah Khan, Shuah Khan,
Suren Baghdasaryan, Vlastimil Babka, damon, kunit-dev, linux-doc,
linux-kernel, linux-kselftest, linux-mm
DAMON utilizes a few mechanisms that enhance itself over time. Adaptive
regions adjustment, goal-based DAMOS quota auto-tuning and monitoring
intervals auto-tuning like self-training mechanisms are such examples.
It also adds access frequency stability information (age) to the
monitoring results, which makes it enhanced over time.
Sometimes users have to stop DAMON. In this case, DAMON internal state
that enhanced over the time of the last execution simply goes away.
Restarted DAMON have to train itself and enhance its output from the
scratch. This makes DAMON less useful in such cases. Introducing three
such use cases below.
Investigation of DAMON. It is best to do the investigation online,
especially when it is a production environment. DAMON therefore
provides features for such online investigations, including DAMOS stats,
monitoring result snapshot exposure, and multiple tracepoints. When
those are insufficient, and there are additional clues that could be
interfered by DAMON, users have to temporarily stop DAMON to collect the
additional clues. It is not very useful since many of DAMON internal
clues are gone when DAMON is stopped. The loss of the monitoring
results that improved over time is also problematic, especially in
production environments.
Monitoring of workloads that have different user-known phases. For
example, in Android, applications are known to have very different
access patterns and behaviors when they are running on the foreground
and the background. It can therefore be useful to separate monitoring
of apps based on whether they are running on the foreground and on the
background. Having two DAMON threads per application that paused and
resumed for the apps foreground/background switches can be useful for
the purpose. But such pause/resume of the execution is not supported.
Tests of DAMON. A few DAMON selftests are using drgn to dump the
internal DAMON status. The tests show if the dumped status is the same
as what the test code expected. Because DAMON keeps running and
modifying its internal status, there are chances of data races that can
cause false test results. Stopping DAMON can avoid the race. But,
since the internal state of DAMON is dropped, the test coverage will be
limited.
Let DAMON execution be paused and resumed without loss of the internal
state, to overhaul the limitations. For this, introduce a new DAMON
context parameter, namely 'pause'. API callers can update it while the
context is running, using the online parameters update functions
(damon_commit_ctx() and damon_call()). Once it is set, kdamond_fn()
main loop will do only limited works excluding the monitoring and DAMOS
works, while sleeping sampling intervals per the work. The limited
works include handling of the online parameters update. Hence users can
unset the 'pause' parameter again. Once it is unset, kdamond_fn() main
loop will do all the work again (resumed). Under the paused state, it
also does stop condition checks and handling of it, so that paused DAMON
can also be stopped if needed. Expose the feature to the user space via
DAMON sysfs interface. Also, update existing drgn-based tests to test
and use the feature.
Tests
=====
I confirmed the feature functionality using real time tracing ('perf
trace' or 'trace-cmd stream') of damon:damon_aggregated DAMON
tracepoint. By pausing and resuming the DAMON execution, I was able to
see the trace stops and continued as expected. Note that the pause
feature support is added to DAMON user-space tool (damo) after v3.1.9.
Users can use '--pause_ctx' command line option of damo for that, and I
actually used it for my test. The extended drgn-based selftests are
also testing a part of the functionality.
Patches Sequence
================
Patch 1 introduces the new core API for the pause feature. Patch 2
extend DAMON sysfs interface for the new parameter. Patches 3-5 update
design, usage and ABI documents for the new sysfs file, respectively.
The following five patches are for tests. Patch 6 implements a new
kunit test for the pause parameter online commitment. Patches 7 and 8
extend DAMON selftest helpers to support the new feature. Patch 9
extends selftest to test the commitment of the feature. Finally, patch
10 updates existing selftest to be safe from the race condition using
the pause/resume feature.
Changelog
=========
Changes from RFC v1
(https://lore.kernel.org/20260315210012.94846-1-sj@kernel.org)
- Continuously cancel new damos_walk() requests when paused.
- Initialize damon_sysfs_context->pause.
- Make sysfs.py dump-purpose pausing to work for all contexts.
SeongJae Park (10):
mm/damon/core: introduce damon_ctx->paused
mm/damon/sysfs: add pause file under context dir
Docs/mm/damon/design: update for context pause/resume feature
Docs/admin-guide/mm/damon/usage: update for pause file
Docs/ABI/damon: update for pause sysfs file
mm/damon/tests/core-kunit: test pause commitment
selftests/damon/_damon_sysfs: support pause file staging
selftests/damon/drgn_dump_damon_status: dump pause
selftests/damon/sysfs.py: check pause on assert_ctx_committed()
selftets/damon/sysfs.py: pause DAMON before dumping status
.../ABI/testing/sysfs-kernel-mm-damon | 7 ++++
Documentation/admin-guide/mm/damon/usage.rst | 12 ++++---
Documentation/mm/damon/design.rst | 7 ++++
include/linux/damon.h | 2 ++
mm/damon/core.c | 9 ++++++
mm/damon/sysfs.c | 31 ++++++++++++++++++
mm/damon/tests/core-kunit.h | 4 +++
tools/testing/selftests/damon/_damon_sysfs.py | 10 +++++-
.../selftests/damon/drgn_dump_damon_status.py | 1 +
tools/testing/selftests/damon/sysfs.py | 32 +++++++++++++++++++
10 files changed, 110 insertions(+), 5 deletions(-)
base-commit: 89fea69e3a636d7f4c7a0dee9c25e2b417a74c7a
--
2.47.3
^ permalink raw reply [flat|nested] 10+ messages in thread* [RFC PATCH v2 06/10] mm/damon/tests/core-kunit: test pause commitment
2026-03-19 5:21 [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed SeongJae Park
@ 2026-03-19 5:21 ` SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 07/10] selftests/damon/_damon_sysfs: support pause file staging SeongJae Park
` (3 subsequent siblings)
4 siblings, 0 replies; 10+ messages in thread
From: SeongJae Park @ 2026-03-19 5:21 UTC (permalink / raw)
Cc: SeongJae Park, Andrew Morton, Brendan Higgins, David Gow, damon,
kunit-dev, linux-kernel, linux-kselftest, linux-mm
Add a kunit test for commitment of damon_ctx->pause parameter that can
be done using damon_commit_ctx().
Signed-off-by: SeongJae Park <sj@kernel.org>
---
mm/damon/tests/core-kunit.h | 4 ++++
1 file changed, 4 insertions(+)
diff --git a/mm/damon/tests/core-kunit.h b/mm/damon/tests/core-kunit.h
index 9e5904c2beeb2..0030f682b23b7 100644
--- a/mm/damon/tests/core-kunit.h
+++ b/mm/damon/tests/core-kunit.h
@@ -1077,6 +1077,10 @@ static void damon_test_commit_ctx(struct kunit *test)
KUNIT_EXPECT_EQ(test, damon_commit_ctx(dst, src), 0);
src->min_region_sz = 4095;
KUNIT_EXPECT_EQ(test, damon_commit_ctx(dst, src), -EINVAL);
+ src->min_region_sz = 4096;
+ src->pause = true;
+ KUNIT_EXPECT_EQ(test, damon_commit_ctx(dst, src), 0);
+ KUNIT_EXPECT_TRUE(test, dst->pause);
damon_destroy_ctx(src);
damon_destroy_ctx(dst);
}
--
2.47.3
^ permalink raw reply related [flat|nested] 10+ messages in thread* [RFC PATCH v2 07/10] selftests/damon/_damon_sysfs: support pause file staging
2026-03-19 5:21 [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 06/10] mm/damon/tests/core-kunit: test pause commitment SeongJae Park
@ 2026-03-19 5:21 ` SeongJae Park
2026-03-20 15:22 ` SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 08/10] selftests/damon/drgn_dump_damon_status: dump pause SeongJae Park
` (2 subsequent siblings)
4 siblings, 1 reply; 10+ messages in thread
From: SeongJae Park @ 2026-03-19 5:21 UTC (permalink / raw)
Cc: SeongJae Park, Shuah Khan, damon, linux-kernel, linux-kselftest,
linux-mm
DAMON test-purpose sysfs interface control Python module, _damon_sysfs,
is not supporting the newly added pause file. Add the support of the
file, for future test and use of the feature.
Signed-off-by: SeongJae Park <sj@kernel.org>
---
tools/testing/selftests/damon/_damon_sysfs.py | 10 +++++++++-
1 file changed, 9 insertions(+), 1 deletion(-)
diff --git a/tools/testing/selftests/damon/_damon_sysfs.py b/tools/testing/selftests/damon/_damon_sysfs.py
index 2b4df655d9fd0..120b96ecbd741 100644
--- a/tools/testing/selftests/damon/_damon_sysfs.py
+++ b/tools/testing/selftests/damon/_damon_sysfs.py
@@ -604,10 +604,11 @@ class DamonCtx:
targets = None
schemes = None
kdamond = None
+ pause = None
idx = None
def __init__(self, ops='paddr', monitoring_attrs=DamonAttrs(), targets=[],
- schemes=[]):
+ schemes=[], pause=False):
self.ops = ops
self.monitoring_attrs = monitoring_attrs
self.monitoring_attrs.context = self
@@ -622,6 +623,8 @@ class DamonCtx:
scheme.idx = idx
scheme.context = self
+ self.pause=pause
+
def sysfs_dir(self):
return os.path.join(self.kdamond.sysfs_dir(), 'contexts',
'%d' % self.idx)
@@ -662,6 +665,11 @@ class DamonCtx:
err = scheme.stage()
if err is not None:
return err
+
+ err = write_file(os.path.join(self.sysfs_dir(), 'pause'), self.pause)
+ if err is not None:
+ return err
+
return None
class Kdamond:
--
2.47.3
^ permalink raw reply related [flat|nested] 10+ messages in thread* Re: [RFC PATCH v2 07/10] selftests/damon/_damon_sysfs: support pause file staging
2026-03-19 5:21 ` [RFC PATCH v2 07/10] selftests/damon/_damon_sysfs: support pause file staging SeongJae Park
@ 2026-03-20 15:22 ` SeongJae Park
0 siblings, 0 replies; 10+ messages in thread
From: SeongJae Park @ 2026-03-20 15:22 UTC (permalink / raw)
To: SeongJae Park; +Cc: Shuah Khan, damon, linux-kernel, linux-kselftest, linux-mm
On Wed, 18 Mar 2026 22:21:50 -0700 SeongJae Park <sj@kernel.org> wrote:
> DAMON test-purpose sysfs interface control Python module, _damon_sysfs,
> is not supporting the newly added pause file. Add the support of the
> file, for future test and use of the feature.
>
> Signed-off-by: SeongJae Park <sj@kernel.org>
> ---
> tools/testing/selftests/damon/_damon_sysfs.py | 10 +++++++++-
> 1 file changed, 9 insertions(+), 1 deletion(-)
>
> diff --git a/tools/testing/selftests/damon/_damon_sysfs.py b/tools/testing/selftests/damon/_damon_sysfs.py
> index 2b4df655d9fd0..120b96ecbd741 100644
> --- a/tools/testing/selftests/damon/_damon_sysfs.py
> +++ b/tools/testing/selftests/damon/_damon_sysfs.py
> @@ -604,10 +604,11 @@ class DamonCtx:
> targets = None
> schemes = None
> kdamond = None
> + pause = None
> idx = None
>
> def __init__(self, ops='paddr', monitoring_attrs=DamonAttrs(), targets=[],
> - schemes=[]):
> + schemes=[], pause=False):
Sashiko comment
(https://sashiko.dev/#/patchset/20260319052157.99433-8-sj@kernel.org) and my
reply.
: Since this line is being modified, could the use of mutable default
: arguments for monitoring_attrs, targets, and schemes cause unintended
: state sharing between instances?
:
: In Python, default arguments are evaluated exactly once at function
: definition time. If multiple DamonCtx objects are instantiated without
: explicitly providing these parameters, they will share the exact same
: object instances in memory.
:
: When the constructor later executes self.monitoring_attrs.context = self,
: it overwrites the context reference of the shared object to point to the
: newest DamonCtx instance. If multiple contexts are staged concurrently
: relying on the default arguments, earlier contexts might inadvertently
: write their sysfs files into the directory of the last created context.
:
: Would it be better to use None for the defaults and instantiate new objects
: inside the constructor?
Good points. Orthogonal to this patch, though. Also, there is no such use
case of this class to my best knowledge. I will consider such change later,
though.
Thanks,
SJ
[...]
^ permalink raw reply [flat|nested] 10+ messages in thread
* [RFC PATCH v2 08/10] selftests/damon/drgn_dump_damon_status: dump pause
2026-03-19 5:21 [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 06/10] mm/damon/tests/core-kunit: test pause commitment SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 07/10] selftests/damon/_damon_sysfs: support pause file staging SeongJae Park
@ 2026-03-19 5:21 ` SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 09/10] selftests/damon/sysfs.py: check pause on assert_ctx_committed() SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status SeongJae Park
4 siblings, 0 replies; 10+ messages in thread
From: SeongJae Park @ 2026-03-19 5:21 UTC (permalink / raw)
Cc: SeongJae Park, Shuah Khan, damon, linux-kernel, linux-kselftest,
linux-mm
drgn_dump_damon_status is not dumping the damon_ctx->pause parameter
value, so it cannot be tested. Dump it for future tests.
Signed-off-by: SeongJae Park <sj@kernel.org>
---
tools/testing/selftests/damon/drgn_dump_damon_status.py | 1 +
1 file changed, 1 insertion(+)
diff --git a/tools/testing/selftests/damon/drgn_dump_damon_status.py b/tools/testing/selftests/damon/drgn_dump_damon_status.py
index af99b07a4f565..5b90eb8e7ef88 100755
--- a/tools/testing/selftests/damon/drgn_dump_damon_status.py
+++ b/tools/testing/selftests/damon/drgn_dump_damon_status.py
@@ -200,6 +200,7 @@ def damon_ctx_to_dict(ctx):
['attrs', attrs_to_dict],
['adaptive_targets', targets_to_list],
['schemes', schemes_to_list],
+ ['pause', bool],
])
def main():
--
2.47.3
^ permalink raw reply related [flat|nested] 10+ messages in thread* [RFC PATCH v2 09/10] selftests/damon/sysfs.py: check pause on assert_ctx_committed()
2026-03-19 5:21 [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed SeongJae Park
` (2 preceding siblings ...)
2026-03-19 5:21 ` [RFC PATCH v2 08/10] selftests/damon/drgn_dump_damon_status: dump pause SeongJae Park
@ 2026-03-19 5:21 ` SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status SeongJae Park
4 siblings, 0 replies; 10+ messages in thread
From: SeongJae Park @ 2026-03-19 5:21 UTC (permalink / raw)
Cc: SeongJae Park, Shuah Khan, damon, linux-kernel, linux-kselftest,
linux-mm
Extend sysfs.py tests to confirm damon_ctx->pause can be set using the
pause sysfs file.
Signed-off-by: SeongJae Park <sj@kernel.org>
---
tools/testing/selftests/damon/sysfs.py | 1 +
1 file changed, 1 insertion(+)
diff --git a/tools/testing/selftests/damon/sysfs.py b/tools/testing/selftests/damon/sysfs.py
index 3aa5c91548a53..e6d34ba05893f 100755
--- a/tools/testing/selftests/damon/sysfs.py
+++ b/tools/testing/selftests/damon/sysfs.py
@@ -190,6 +190,7 @@ def assert_ctx_committed(ctx, dump):
assert_monitoring_attrs_committed(ctx.monitoring_attrs, dump['attrs'])
assert_monitoring_targets_committed(ctx.targets, dump['adaptive_targets'])
assert_schemes_committed(ctx.schemes, dump['schemes'])
+ assert_true(dump['pause'] == ctx.pause, 'pause', dump)
def assert_ctxs_committed(kdamonds):
status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
--
2.47.3
^ permalink raw reply related [flat|nested] 10+ messages in thread* [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status
2026-03-19 5:21 [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed SeongJae Park
` (3 preceding siblings ...)
2026-03-19 5:21 ` [RFC PATCH v2 09/10] selftests/damon/sysfs.py: check pause on assert_ctx_committed() SeongJae Park
@ 2026-03-19 5:21 ` SeongJae Park
2026-03-20 15:29 ` SeongJae Park
4 siblings, 1 reply; 10+ messages in thread
From: SeongJae Park @ 2026-03-19 5:21 UTC (permalink / raw)
Cc: SeongJae Park, Shuah Khan, damon, linux-kernel, linux-kselftest,
linux-mm
The sysfs.py test commits DAMON parameters, dump the internal DAMON
state, and show if the parameters are committed as expected using the
dumped state. While the dumping is ongoing, DAMON is alive. It can
make internal changes including addition and removal of regions. It can
therefore make a race that can result in false test results. Pause
DAMON execution during the state dumping to avoid such races.
Signed-off-by: SeongJae Park <sj@kernel.org>
---
tools/testing/selftests/damon/sysfs.py | 31 ++++++++++++++++++++++++++
1 file changed, 31 insertions(+)
diff --git a/tools/testing/selftests/damon/sysfs.py b/tools/testing/selftests/damon/sysfs.py
index e6d34ba05893f..704729c7a318e 100755
--- a/tools/testing/selftests/damon/sysfs.py
+++ b/tools/testing/selftests/damon/sysfs.py
@@ -193,18 +193,48 @@ def assert_ctx_committed(ctx, dump):
assert_true(dump['pause'] == ctx.pause, 'pause', dump)
def assert_ctxs_committed(kdamonds):
+ ctxs_paused_for_dump = []
+ for kd in kdamonds.kdamonds:
+ for ctx in kd.contexts:
+ if ctx.pause is False:
+ ctx.pause = True
+ err = kd.commit()
+ if err is not None:
+ print('pause fail (%s)' % err)
+ kdamonds.stop()
+ exit(1)
+ ctxs_paused_for_dump.append(ctx)
+
status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
if err is not None:
print(err)
kdamonds.stop()
exit(1)
+ for kd in kdamonds.kdamonds:
+ for ctx in kd.contexts:
+ if ctx in ctxs_paused_for_dump:
+ ctx.pause = False
+ err = kd.commit()
+ if err is not None:
+ print('resume fail (%s)' % err)
+ kdamonds.stop()
+ exit(1)
+ # restore for comparison
+ ctx.pause = True
+
ctxs = kdamonds.kdamonds[0].contexts
dump = status['contexts']
assert_true(len(ctxs) == len(dump), 'ctxs length', dump)
for idx, ctx in enumerate(ctxs):
assert_ctx_committed(ctx, dump[idx])
+ # restore for the caller
+ for kd in kdamonds.kdamonds:
+ for ctx in kd.contexts:
+ if ctx in ctxs_paused_for_dump:
+ ctx.pause = False
+
def main():
kdamonds = _damon_sysfs.Kdamonds(
[_damon_sysfs.Kdamond(
@@ -302,6 +332,7 @@ def main():
print('kdamond start failed: %s' % err)
exit(1)
kdamonds.kdamonds[0].contexts[0].targets[1].obsolete = True
+ kdamonds.kdamonds[0].contexts[0].pause = True
kdamonds.kdamonds[0].commit()
del kdamonds.kdamonds[0].contexts[0].targets[1]
assert_ctxs_committed(kdamonds)
--
2.47.3
^ permalink raw reply related [flat|nested] 10+ messages in thread* Re: [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status
2026-03-19 5:21 ` [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status SeongJae Park
@ 2026-03-20 15:29 ` SeongJae Park
2026-03-20 15:41 ` SeongJae Park
0 siblings, 1 reply; 10+ messages in thread
From: SeongJae Park @ 2026-03-20 15:29 UTC (permalink / raw)
To: SeongJae Park; +Cc: Shuah Khan, damon, linux-kernel, linux-kselftest, linux-mm
Adding Sashiko comment
(https://lore.kernel.org/https://sashiko.dev/#/patchset/20260319052157.99433-11-sj@kernel.org)
with ': ' line prefix, and my replies in line.
On Wed, 18 Mar 2026 22:21:53 -0700 SeongJae Park <sj@kernel.org> wrote:
> The sysfs.py test commits DAMON parameters, dump the internal DAMON
> state, and show if the parameters are committed as expected using the
> dumped state. While the dumping is ongoing, DAMON is alive. It can
> make internal changes including addition and removal of regions. It can
> therefore make a race that can result in false test results. Pause
> DAMON execution during the state dumping to avoid such races.
>
> Signed-off-by: SeongJae Park <sj@kernel.org>
> ---
> tools/testing/selftests/damon/sysfs.py | 31 ++++++++++++++++++++++++++
> 1 file changed, 31 insertions(+)
>
> diff --git a/tools/testing/selftests/damon/sysfs.py b/tools/testing/selftests/damon/sysfs.py
> index e6d34ba05893f..704729c7a318e 100755
> --- a/tools/testing/selftests/damon/sysfs.py
> +++ b/tools/testing/selftests/damon/sysfs.py
> @@ -193,18 +193,48 @@ def assert_ctx_committed(ctx, dump):
> assert_true(dump['pause'] == ctx.pause, 'pause', dump)
>
> def assert_ctxs_committed(kdamonds):
> + ctxs_paused_for_dump = []
> + for kd in kdamonds.kdamonds:
> + for ctx in kd.contexts:
> + if ctx.pause is False:
> + ctx.pause = True
> + err = kd.commit()
> + if err is not None:
> + print('pause fail (%s)' % err)
> + kdamonds.stop()
> + exit(1)
> + ctxs_paused_for_dump.append(ctx)
> +
> status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
> if err is not None:
> print(err)
> kdamonds.stop()
> exit(1)
>
> + for kd in kdamonds.kdamonds:
> + for ctx in kd.contexts:
> + if ctx in ctxs_paused_for_dump:
> + ctx.pause = False
> + err = kd.commit()
> + if err is not None:
> + print('resume fail (%s)' % err)
> + kdamonds.stop()
> + exit(1)
> + # restore for comparison
> + ctx.pause = True
: If a kdamond contains multiple contexts, does this loop leave earlier contexts
: paused in the kernel?
:
: Since kd.commit() stages and commits the state of all contexts associated with
: the kdamond, when kd.commit() is called for the second context, the first
: context's local pause attribute is already back to True.
:
: This would cause kd.commit() to write to sysfs and instruct the kernel to
: pause the first context again. By the end of this loop, only the last context
: in the kdamond would remain unpaused in the kernel.
No. The pause field of the earlier context is set to False, so later
kd.commit() will commit the False 'pause' again. But this finds a good point.
There is no reason to call kd.commit() for each context. It is more efficient
to be called for each kdamond., thouth currently we support only one context
per kdamond. I will update the code so, in the next spin.
> +
> ctxs = kdamonds.kdamonds[0].contexts
> dump = status['contexts']
> assert_true(len(ctxs) == len(dump), 'ctxs length', dump)
> for idx, ctx in enumerate(ctxs):
> assert_ctx_committed(ctx, dump[idx])
>
> + # restore for the caller
> + for kd in kdamonds.kdamonds:
> + for ctx in kd.contexts:
> + if ctx in ctxs_paused_for_dump:
> + ctx.pause = False
: Since kd.commit() is not called after restoring the Python objects here, does
: this leave the previous contexts permanently paused in the kernel while their
: Python state reflects them as running?
No, we already unpaused the unpause-required contexts above.
Thanks,
SJ
[...]
^ permalink raw reply [flat|nested] 10+ messages in thread* Re: [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status
2026-03-20 15:29 ` SeongJae Park
@ 2026-03-20 15:41 ` SeongJae Park
2026-03-21 1:02 ` SeongJae Park
0 siblings, 1 reply; 10+ messages in thread
From: SeongJae Park @ 2026-03-20 15:41 UTC (permalink / raw)
To: SeongJae Park; +Cc: Shuah Khan, damon, linux-kernel, linux-kselftest, linux-mm
On Fri, 20 Mar 2026 08:29:39 -0700 SeongJae Park <sj@kernel.org> wrote:
> Adding Sashiko comment
> (https://lore.kernel.org/https://sashiko.dev/#/patchset/20260319052157.99433-11-sj@kernel.org)
> with ': ' line prefix, and my replies in line.
>
> On Wed, 18 Mar 2026 22:21:53 -0700 SeongJae Park <sj@kernel.org> wrote:
>
> > The sysfs.py test commits DAMON parameters, dump the internal DAMON
> > state, and show if the parameters are committed as expected using the
> > dumped state. While the dumping is ongoing, DAMON is alive. It can
> > make internal changes including addition and removal of regions. It can
> > therefore make a race that can result in false test results. Pause
> > DAMON execution during the state dumping to avoid such races.
> >
> > Signed-off-by: SeongJae Park <sj@kernel.org>
> > ---
> > tools/testing/selftests/damon/sysfs.py | 31 ++++++++++++++++++++++++++
> > 1 file changed, 31 insertions(+)
> >
> > diff --git a/tools/testing/selftests/damon/sysfs.py b/tools/testing/selftests/damon/sysfs.py
> > index e6d34ba05893f..704729c7a318e 100755
> > --- a/tools/testing/selftests/damon/sysfs.py
> > +++ b/tools/testing/selftests/damon/sysfs.py
> > @@ -193,18 +193,48 @@ def assert_ctx_committed(ctx, dump):
> > assert_true(dump['pause'] == ctx.pause, 'pause', dump)
> >
> > def assert_ctxs_committed(kdamonds):
> > + ctxs_paused_for_dump = []
> > + for kd in kdamonds.kdamonds:
> > + for ctx in kd.contexts:
> > + if ctx.pause is False:
> > + ctx.pause = True
> > + err = kd.commit()
> > + if err is not None:
> > + print('pause fail (%s)' % err)
> > + kdamonds.stop()
> > + exit(1)
> > + ctxs_paused_for_dump.append(ctx)
> > +
> > status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
> > if err is not None:
> > print(err)
> > kdamonds.stop()
> > exit(1)
> >
> > + for kd in kdamonds.kdamonds:
> > + for ctx in kd.contexts:
> > + if ctx in ctxs_paused_for_dump:
> > + ctx.pause = False
> > + err = kd.commit()
> > + if err is not None:
> > + print('resume fail (%s)' % err)
> > + kdamonds.stop()
> > + exit(1)
> > + # restore for comparison
> > + ctx.pause = True
>
> : If a kdamond contains multiple contexts, does this loop leave earlier contexts
> : paused in the kernel?
> :
> : Since kd.commit() stages and commits the state of all contexts associated with
> : the kdamond, when kd.commit() is called for the second context, the first
> : context's local pause attribute is already back to True.
> :
> : This would cause kd.commit() to write to sysfs and instruct the kernel to
> : pause the first context again. By the end of this loop, only the last context
> : in the kdamond would remain unpaused in the kernel.
>
> No. The pause field of the earlier context is set to False, so later
> kd.commit() will commit the False 'pause' again. But this finds a good point.
> There is no reason to call kd.commit() for each context. It is more efficient
> to be called for each kdamond., thouth currently we support only one context
> per kdamond. I will update the code so, in the next spin.
I'm wrong, Sashiko is correct. I missed the last 'ctx.pause = True' in the
loop. I will fix this together, like below:
'''
--- a/tools/testing/selftests/damon/sysfs.py
+++ b/tools/testing/selftests/damon/sysfs.py
@@ -201,16 +201,17 @@ def assert_ctx_committed(ctx, dump):
def assert_ctxs_committed(kdamonds):
ctxs_paused_for_dump = []
+ # pause for safe state dumping
for kd in kdamonds.kdamonds:
for ctx in kd.contexts:
if ctx.pause is False:
ctx.pause = True
- err = kd.commit()
- if err is not None:
- print('pause fail (%s)' % err)
- kdamonds.stop()
- exit(1)
ctxs_paused_for_dump.append(ctx)
+ err = kd.commit()
+ if err is not None:
+ print('pause fail (%s)' % err)
+ kdamonds.stop()
+ exit(1)
status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
if err is not None:
@@ -218,17 +219,19 @@ def assert_ctxs_committed(kdamonds):
kdamonds.stop()
exit(1)
+ # resume contexts paused for safe state dumping
for kd in kdamonds.kdamonds:
- for ctx in kd.contexts:
- if ctx in ctxs_paused_for_dump:
- ctx.pause = False
- err = kd.commit()
- if err is not None:
- print('resume fail (%s)' % err)
- kdamonds.stop()
- exit(1)
- # restore for comparison
- ctx.pause = True
+ for ctx in ctxs_paused_for_dump:
+ ctx.pause = False
+ err = kd.commit()
+ if err is not None:
+ print('resume fail (%s)' % err)
+ kdamonds.stop()
+ exit(1)
+
+ # restore for comparison
+ for ctx in ctxs_paused_for_dump:
+ ctx.pause = True
ctxs = kdamonds.kdamonds[0].contexts
dump = status['contexts']
'''
Thanks,
SJ
[...]
^ permalink raw reply [flat|nested] 10+ messages in thread* Re: [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status
2026-03-20 15:41 ` SeongJae Park
@ 2026-03-21 1:02 ` SeongJae Park
0 siblings, 0 replies; 10+ messages in thread
From: SeongJae Park @ 2026-03-21 1:02 UTC (permalink / raw)
To: SeongJae Park; +Cc: Shuah Khan, damon, linux-kernel, linux-kselftest, linux-mm
On Fri, 20 Mar 2026 08:41:33 -0700 SeongJae Park <sj@kernel.org> wrote:
> On Fri, 20 Mar 2026 08:29:39 -0700 SeongJae Park <sj@kernel.org> wrote:
>
> > Adding Sashiko comment
> > (https://lore.kernel.org/https://sashiko.dev/#/patchset/20260319052157.99433-11-sj@kernel.org)
> > with ': ' line prefix, and my replies in line.
> >
> > On Wed, 18 Mar 2026 22:21:53 -0700 SeongJae Park <sj@kernel.org> wrote:
> >
> > > The sysfs.py test commits DAMON parameters, dump the internal DAMON
> > > state, and show if the parameters are committed as expected using the
> > > dumped state. While the dumping is ongoing, DAMON is alive. It can
> > > make internal changes including addition and removal of regions. It can
> > > therefore make a race that can result in false test results. Pause
> > > DAMON execution during the state dumping to avoid such races.
> > >
> > > Signed-off-by: SeongJae Park <sj@kernel.org>
> > > ---
> > > tools/testing/selftests/damon/sysfs.py | 31 ++++++++++++++++++++++++++
> > > 1 file changed, 31 insertions(+)
> > >
> > > diff --git a/tools/testing/selftests/damon/sysfs.py b/tools/testing/selftests/damon/sysfs.py
> > > index e6d34ba05893f..704729c7a318e 100755
> > > --- a/tools/testing/selftests/damon/sysfs.py
> > > +++ b/tools/testing/selftests/damon/sysfs.py
> > > @@ -193,18 +193,48 @@ def assert_ctx_committed(ctx, dump):
> > > assert_true(dump['pause'] == ctx.pause, 'pause', dump)
> > >
> > > def assert_ctxs_committed(kdamonds):
> > > + ctxs_paused_for_dump = []
> > > + for kd in kdamonds.kdamonds:
> > > + for ctx in kd.contexts:
> > > + if ctx.pause is False:
> > > + ctx.pause = True
> > > + err = kd.commit()
> > > + if err is not None:
> > > + print('pause fail (%s)' % err)
> > > + kdamonds.stop()
> > > + exit(1)
> > > + ctxs_paused_for_dump.append(ctx)
> > > +
> > > status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
> > > if err is not None:
> > > print(err)
> > > kdamonds.stop()
> > > exit(1)
> > >
> > > + for kd in kdamonds.kdamonds:
> > > + for ctx in kd.contexts:
> > > + if ctx in ctxs_paused_for_dump:
> > > + ctx.pause = False
> > > + err = kd.commit()
> > > + if err is not None:
> > > + print('resume fail (%s)' % err)
> > > + kdamonds.stop()
> > > + exit(1)
> > > + # restore for comparison
> > > + ctx.pause = True
> >
> > : If a kdamond contains multiple contexts, does this loop leave earlier contexts
> > : paused in the kernel?
> > :
> > : Since kd.commit() stages and commits the state of all contexts associated with
> > : the kdamond, when kd.commit() is called for the second context, the first
> > : context's local pause attribute is already back to True.
> > :
> > : This would cause kd.commit() to write to sysfs and instruct the kernel to
> > : pause the first context again. By the end of this loop, only the last context
> > : in the kdamond would remain unpaused in the kernel.
> >
> > No. The pause field of the earlier context is set to False, so later
> > kd.commit() will commit the False 'pause' again. But this finds a good point.
> > There is no reason to call kd.commit() for each context. It is more efficient
> > to be called for each kdamond., thouth currently we support only one context
> > per kdamond. I will update the code so, in the next spin.
>
> I'm wrong, Sashiko is correct. I missed the last 'ctx.pause = True' in the
> loop. I will fix this together, like below:
>
> '''
> --- a/tools/testing/selftests/damon/sysfs.py
> +++ b/tools/testing/selftests/damon/sysfs.py
> @@ -201,16 +201,17 @@ def assert_ctx_committed(ctx, dump):
>
> def assert_ctxs_committed(kdamonds):
> ctxs_paused_for_dump = []
> + # pause for safe state dumping
> for kd in kdamonds.kdamonds:
> for ctx in kd.contexts:
> if ctx.pause is False:
> ctx.pause = True
> - err = kd.commit()
> - if err is not None:
> - print('pause fail (%s)' % err)
> - kdamonds.stop()
> - exit(1)
> ctxs_paused_for_dump.append(ctx)
> + err = kd.commit()
> + if err is not None:
> + print('pause fail (%s)' % err)
> + kdamonds.stop()
> + exit(1)
>
> status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
> if err is not None:
> @@ -218,17 +219,19 @@ def assert_ctxs_committed(kdamonds):
> kdamonds.stop()
> exit(1)
>
> + # resume contexts paused for safe state dumping
> for kd in kdamonds.kdamonds:
> - for ctx in kd.contexts:
> - if ctx in ctxs_paused_for_dump:
> - ctx.pause = False
> - err = kd.commit()
> - if err is not None:
> - print('resume fail (%s)' % err)
> - kdamonds.stop()
> - exit(1)
> - # restore for comparison
> - ctx.pause = True
> + for ctx in ctxs_paused_for_dump:
> + ctx.pause = False
> + err = kd.commit()
> + if err is not None:
> + print('resume fail (%s)' % err)
> + kdamonds.stop()
> + exit(1)
> +
> + # restore for comparison
> + for ctx in ctxs_paused_for_dump:
> + ctx.pause = True
>
> ctxs = kdamonds.kdamonds[0].contexts
> dump = status['contexts']
> '''
The above fixup makes the test fails, for the obsolete targets. The fixup
needs another fixup, like below. I will add that to the next spin.
'''
--- a/tools/testing/selftests/damon/sysfs.py
+++ b/tools/testing/selftests/damon/sysfs.py
@@ -200,11 +200,12 @@ def assert_ctxs_committed(kdamonds):
if ctx.pause is False:
ctx.pause = True
ctxs_paused_for_dump.append(ctx)
- err = kd.commit()
- if err is not None:
- print('pause fail (%s)' % err)
- kdamonds.stop()
- exit(1)
+ if len(ctxs_paused_for_dump) > 0:
+ err = kd.commit()
+ if err is not None:
+ print('pause fail (%s)' % err)
+ kdamonds.stop()
+ exit(1)
status, err = dump_damon_status_dict(kdamonds.kdamonds[0].pid)
if err is not None:
@@ -216,11 +217,12 @@ def assert_ctxs_committed(kdamonds):
for kd in kdamonds.kdamonds:
for ctx in ctxs_paused_for_dump:
ctx.pause = False
- err = kd.commit()
- if err is not None:
- print('resume fail (%s)' % err)
- kdamonds.stop()
- exit(1)
+ if len(ctxs_paused_for_dump) > 0:
+ err = kd.commit()
+ if err is not None:
+ print('resume fail (%s)' % err)
+ kdamonds.stop()
+ exit(1)
# restore for comparison
for ctx in ctxs_paused_for_dump:
'''
Thanks,
SJ
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2026-03-21 1:02 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2026-03-19 5:21 [RFC PATCH v2 00/10] mm/damon: let DAMON be paused and resumed SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 06/10] mm/damon/tests/core-kunit: test pause commitment SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 07/10] selftests/damon/_damon_sysfs: support pause file staging SeongJae Park
2026-03-20 15:22 ` SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 08/10] selftests/damon/drgn_dump_damon_status: dump pause SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 09/10] selftests/damon/sysfs.py: check pause on assert_ctx_committed() SeongJae Park
2026-03-19 5:21 ` [RFC PATCH v2 10/10] selftets/damon/sysfs.py: pause DAMON before dumping status SeongJae Park
2026-03-20 15:29 ` SeongJae Park
2026-03-20 15:41 ` SeongJae Park
2026-03-21 1:02 ` SeongJae Park
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox