DAMON development mailing list
 help / color / mirror / Atom feed
From: SeongJae Park <sj@kernel.org>
To: sashiko-bot@kernel.org
Cc: SeongJae Park <sj@kernel.org>, damon@lists.linux.dev
Subject: Re: [RFC PATCH v3 07/28] mm/damon/core: do data attributes monitoring
Date: Sat, 16 May 2026 14:33:27 -0700	[thread overview]
Message-ID: <20260516213329.3083-1-sj@kernel.org> (raw)
In-Reply-To: <20260516190011.77604C19425@smtp.kernel.org>

On Sat, 16 May 2026 19:00:10 +0000 sashiko-bot@kernel.org wrote:

> Thank you for your contribution! Sashiko AI review found 4 potential issue(s) to consider:
> - [High] The `probe_hits` array counters can silently overflow, yielding corrupted monitoring results.
> - [High] DAMON dynamically merges regions based strictly on `nr_accesses`, which corrupts the spatial granularity of `probe_hits`.
> - [High] If DAMON is run without `check_accesses`, all regions will be aggressively merged up to `sz_limit`, neutralizing data attribute monitoring.
> - [High] Size-weighted average calculations for `probe_hits` overflow `unsigned long` on 32-bit systems.
> --
> 
> commit 55dcc5294b4ee96c9d7e5a24e9dcdb0d798c401d
> Author: SeongJae Park <sj@kernel.org>
> 
> mm/damon/core: do data attributes monitoring
> 
> Implement the data attributes monitoring execution.  Update kdamond to
> invoke the probes application callback, and reset the aggregated number
> of per-region per-probe positive samples for every aggregation interval.
> 
> > diff --git a/mm/damon/core.c b/mm/damon/core.c
> > index e5667660b4cd9..dde3c8d8fef89 100644
> > --- a/mm/damon/core.c
> > +++ b/mm/damon/core.c
> > @@ -1886,10 +1886,14 @@ static void kdamond_reset_aggregated(struct damon_ctx *c)
> >  		struct damon_region *r;
> >  
> >  		damon_for_each_region(r, t) {
> > +			int i;
> > +
> >  			trace_damon_aggregated(ti, r, damon_nr_regions(t));
> >  			damon_warn_fix_nr_accesses_corruption(r);
> >  			r->last_nr_accesses = r->nr_accesses;
> >  			r->nr_accesses = 0;
> > +			for (i = 0; i < DAMON_MAX_PROBES; i++)
> > +				r->probe_hits[i] = 0;
> 
> Can probe_hits overflow here?
> 
> The probe_hits array is defined as unsigned char, which has a maximum value
> of 255.
> 
> The number of samples per interval is user-configurable and can legitimately
> exceed 255 (e.g., a 1000ms aggregation with a 1ms sample interval yields 1000
> samples).
> 
> If a probe matches continuously under such configurations, could the counter
> silently overflow and wrap around, corrupting the monitoring results?

No blocker, as I previously explained [1].

> 
> >  			damon_verify_reset_aggregated(r, c);
> >  		}
> >  		ti++;
> > @@ -3179,6 +3183,8 @@ static int kdamond_fn(void *data)
> >  
> >  		if (ctx->ops.check_accesses)
> >  			max_nr_accesses = ctx->ops.check_accesses(ctx);
> > +		if (ctx->ops.apply_probes)
> > +			ctx->ops.apply_probes(ctx);
> 
> If DAMON is configured strictly for data attribute monitoring where
> check_accesses is NULL but apply_probes is set, max_nr_accesses and all
> regional nr_accesses persistently evaluate to 0.
> 
> Will this cause all adjacent regions to have a calculated difference
> threshold of 0, resulting in all regions merging indiscriminately up to
> sz_limit during the subsequent kdamond_merge_regions() call?
> 
> Additionally, damon_merge_regions_of() merges adjacent regions by evaluating
> only nr_accesses:
> 
> abs(prev->nr_accesses - r->nr_accesses) <= thres
> 
> Since this merge condition ignores probe_hits, could two contiguous regions
> with identical access frequencies but vastly different data attributes be
> incorrectly merged?

Correct, and that's the intended behavior.  Data attributes are just additional
information.  Data access pattern, specifically nr_accesses, is the primary
information that drives region splits and merges.  I'm planning to let users
set the primary information in future, though.

> 
> Does calculating the size-weighted average for probe_hits risk overflowing on
> 32-bit systems?
> 
> When damon_merge_two_regions() calculates averages using:
> 
> (metric * sz_l + metric * sz_r) / (sz_l + sz_r)
> 
> Both sz_l and sz_r are evaluated in bytes as unsigned long. On 32-bit
> systems where ULONG_MAX is roughly 4 GB, a 200 MB region with 25 probe_hits
> yields an intermediate product of 5,000,000,000.
> 
> Does this silently overflow the 32-bit unsigned long and corrupt the region
> metrics?

I previously answered [1] this.  32-bit arch users could use addr_unit to avoid
this issue.

> 
> >  
> >  		if (time_after_eq(ctx->passed_sample_intervals,
> >  					next_aggregation_sis)) {
> 
> -- 
> Sashiko AI review · https://sashiko.dev/#/patchset/20260516183712.81393-1-sj@kernel.org?part=7

[1] https://lore.kernel.org/20260516003559.141476-1-sj@kernel.org


Thanks,
SJ

  reply	other threads:[~2026-05-16 21:33 UTC|newest]

Thread overview: 43+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-05-16 18:36 [RFC PATCH v3 00/28] mm/damon: introduce data attributes monitoring SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 01/28] mm/damon/core: introduce struct damon_probe SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 02/28] mm/damon/core: embed damon_probe objects in damon_ctx SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 03/28] mm/damon/core: introduce damon_filter SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 04/28] mm/damon/core: commit probes SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 05/28] mm/damon/core: introduce damon_region->probe_hits SeongJae Park
2026-05-16 18:54   ` sashiko-bot
2026-05-16 21:26     ` SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 06/28] mm/damon/core: introduce damon_ops->apply_probes SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 07/28] mm/damon/core: do data attributes monitoring SeongJae Park
2026-05-16 19:00   ` sashiko-bot
2026-05-16 21:33     ` SeongJae Park [this message]
2026-05-16 18:36 ` [RFC PATCH v3 08/28] mm/damon/paddr: support " SeongJae Park
2026-05-16 19:05   ` sashiko-bot
2026-05-16 21:46     ` SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 09/28] mm/damon/sysfs: implement probes dir SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 10/28] mm/damon/sysfs: implement probe dir SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 11/28] mm/damon/sysfs: implement filters directory SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 12/28] mm/damon/sysfs: implement filter dir SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 13/28] mm/damon/sysfs: implement filter dir files SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 14/28] mm/damon/sysfs: setup probes on DAMON core API parameters SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 15/28] mm/damon/sysfs-schemes: implement tried_regions/<r>/probes/ SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 16/28] mm/damon/sysfs-schemes: implement probe dir SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 17/28] mm/damon/sysfs-schemes: implement probe/hits file SeongJae Park
2026-05-16 18:36 ` [RFC PATCH v3 18/28] mm/damon: trace probe_hits SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 19/28] selftests/damon/sysfs.sh: test probes dir SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 20/28] Docs/mm/damon/design: document data attributes monitoring SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 21/28] Docs/admin-guide/mm/damon/usage: " SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 22/28] mm/damon/core: introduce DAMON_FILTER_TYPE_MEMCG SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 23/28] mm/damon/paddr: support DAMON_FILTER_TYPE_MEMCG SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 24/28] mm/damon/sysfs: add filters/<F>/path file SeongJae Park
2026-05-16 19:29   ` sashiko-bot
2026-05-16 21:51     ` SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 25/28] mm/damon/sysfs-schemes: move memcg_path_to_id() to sysfs-common SeongJae Park
2026-05-16 19:16   ` sashiko-bot
2026-05-16 21:54     ` SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 26/28] mm/damon/sysfs: setup damon_filter->memcg_id from path SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 27/28] Docs/mm/damon/design: update for memcg damon filter SeongJae Park
2026-05-16 18:37 ` [RFC PATCH v3 28/28] Docs/admin-guide/mm/damon/usage: " SeongJae Park
2026-05-16 19:09   ` sashiko-bot
2026-05-16 21:57     ` SeongJae Park
2026-05-16 18:50 ` [RFC PATCH v3 00/28] mm/damon: introduce data attributes monitoring SeongJae Park
2026-05-16 22:03 ` SeongJae Park

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260516213329.3083-1-sj@kernel.org \
    --to=sj@kernel.org \
    --cc=damon@lists.linux.dev \
    --cc=sashiko-bot@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox