From: Wei Yang <richard.weiyang@gmail.com>
To: Wei Yang <richard.weiyang@gmail.com>
Cc: David Hildenbrand <david@redhat.com>,
akpm@linux-foundation.org, linux-mm@kvack.org,
Lorenzo Stoakes <lorenzo.stoakes@oracle.com>,
Rik van Riel <riel@surriel.com>,
"Liam R . Howlett" <Liam.Howlett@oracle.com>,
Vlastimil Babka <vbabka@suse.cz>,
Harry Yoo <harry.yoo@oracle.com>
Subject: Re: [PATCH 3/3] selftests/mm: assert rmap behave as expected
Date: Fri, 25 Jul 2025 02:16:49 +0000 [thread overview]
Message-ID: <20250725021649.eetoa4sev7r4uweh@master> (raw)
In-Reply-To: <20250717031748.27eic7qxotft6uko@master>
On Thu, Jul 17, 2025 at 03:17:48AM +0000, Wei Yang wrote:
>On Wed, Jul 16, 2025 at 03:34:11PM +0200, David Hildenbrand wrote:
>>On 16.07.25 10:27, Wei Yang wrote:
>[..]
>>> +
>>> +FIXTURE(migrate)
>>> +{
>>> + struct global_data data;
>>> +};
>>> +
>>> +FIXTURE_SETUP(migrate)
>>> +{
>>> + struct global_data *data = &self->data;
>>> +
>>> + ASSERT_EQ(numa_available(), 0);
>>
>>if (numa_available() < 0)
>> SKIP(return, "NUMA not available");
>>
>>Should that be a skip instead?
>>
>
>You are right, will fix it.
>
>>> + if (numa_bitmask_weight(numa_all_nodes_ptr) <= 1)
>>> + SKIP(return, "Not enough NUMA nodes available");
>>> +
>>> + data->mapsize = getpagesize();> +
>>> + /* Prepare semaphore */
>>> + data->semid = semget(IPC_PRIVATE, 1, 0666 | IPC_CREAT);
>>> + ASSERT_NE(data->semid, -1);
>>> + ASSERT_NE(semctl(data->semid, 0, SETVAL, 0), -1);
>>> +
>>> + /* Prepare pipe */
>>> + ASSERT_NE(pipe(data->pipefd), -1);
>>> +
>>> + data->rand_seed = time(NULL);
>>> + srand(data->rand_seed);
>>> +
>>> + data->worker_level = rand() % TOTAL_LEVEL + 1;
>>> +
>>> + data->do_prepare = NULL;
>>> + data->do_work = NULL;
>>> + data->do_check = NULL;
>>> +
>>> + data->backend = ANON;
>>> +};
>>> +
>>> +FIXTURE_TEARDOWN(migrate)
>>> +{
>>> + struct global_data *data = &self->data;
>>> +
>>> + if (data->region != MAP_FAILED)
>>> + munmap(data->region, data->mapsize);
>>> + data->region = MAP_FAILED;
>>> + if (data->expected_pfn != MAP_FAILED)
>>> + munmap(data->expected_pfn, sizeof(unsigned long));
>>> + data->expected_pfn = MAP_FAILED;
>>> + semctl(data->semid, 0, IPC_RMID);
>>> + data->semid = -1;
>>> +
>>> + close(data->pipefd[0]);
>>> +
>>> + data->do_work = NULL;
>>> + data->do_check = NULL;
>>> +
>>> + switch (data->backend) {
>>> + case ANON:
>>> + break;
>>> + case SHM:
>>> + shm_unlink(data->filename);
>>> + break;
>>> + case NORM_FILE:
>>> + unlink(data->filename);
>>> + break;
>>> + }
>>> +}
>>> +
>>> +int try_to_move_page(char *region)
>>> +{
>>> + int ret;
>>> + int node;
>>> + int status = 0;
>>> + volatile unsigned long dummy = 0;
>>> +
>>> + /*
>>> + * Fault in page in case it is not, otherwise move_pages() would
>>> + * return -ENOENT.
>>> + */
>>> + dummy = *((unsigned long *)region);
>>
>>Use FORCE_READ() here
>>
>>https://lkml.kernel.org/r/20250716123126.3851-1-lianux.mm@gmail.com
>>
>>But, this really must happen in all children before actually performing the
>>move in the worker. Otherwise the other processes don't map the page and will
>>just ... fault it in later.
>>
>
>Ok, will access the region in all child before migrate.
>
>>> + /* Prevent the compiler from optimizing out the entire loop: */
>>> + asm volatile("" : "+r" (dummy));
>>> +
>>> + ret = move_pages(0, 1, (void **)®ion, NULL, &status, MPOL_MF_MOVE_ALL);
>>> + if (ret != 0)
>>> + return FAIL_ON_WORK;
>>> +
>>> + /* Pick up a different target node */
>>> + for (node = 0; node <= numa_max_node(); node++) {
>>> + if (numa_bitmask_isbitset(numa_all_nodes_ptr, node) && node != status)
>>> + break;
>>> + }
>>> +
>>> + if (node > numa_max_node()) {
>>> + ksft_print_msg("Couldn't find available numa node for testing\n");
>>> + return FAIL_ON_WORK;
>>> + }
>>> +
>>> + ret = move_pages(0, 1, (void **)®ion, &node, &status, MPOL_MF_MOVE_ALL);
>>> + if (ret != 0)
>>> + return FAIL_ON_WORK;
>>
>>Probably, if we don't manage to migrate, we should retry a couple of times
>>and then SKIP.
>>
>>Point is, migration might fail for various reasons (e.g., 2 NUMA nodes but
>>one of them doesn't even have memory) etc.
>>
>>Migration failures might indicate other problems, yes, but false failures
>>from the test are suboptimal.
>>
>
>Will add retry logic.
>
>>> +
>>> + return 0;
>>> +}
>>> +
>>> +int move_and_update(struct global_data *data)
>>> +{
>>> + int ret;
>>> +
>>> + ret = try_to_move_page(data->region);
>>> + if (ret != 0)
>>> + return ret;
>>> +
>>> + /* Change the content */
>>> + strcpy(data->region, updated_data);
>>> +
>>> + return ret;
>>> +}
>>> +
>>> +int data_updated(struct global_data *data)
>>> +{
>>> + if (data->region == MAP_FAILED)
>>> + return 0;
>>> +
>>> + if (strncmp((char *)data->region, updated_data, strlen(updated_data)))
>>> + return FAIL_ON_CHECK;
>>> + return 0;
>>> +}
>>
>>I assume checking the PFN is sufficient. No need for the additional data
>>content. In particular, with proper anon pages (CoW, see below) that doesn't
>>work either way.
>>
>>> +
>>> +TEST_F(migrate, anon)
>>> +{
>>> + pid_t root_pid;
>>> + int ret;
>>> + struct global_data *data = &self->data;
>>> +
>>> + /* Map a shared area and fault in */
>>> + data->region = mmap(0, data->mapsize, PROT_READ | PROT_WRITE,
>>> + MAP_SHARED | MAP_ANONYMOUS, -1, 0);
>>
>>That is anon_shmem. We should test proper anon memory (MAP_PRIVATE), whereby
>>pages are shared using CoW.
>>
>
>So the case should be
>
> * mapping (MAP_PRIVATE | MAP_ANONYMOUS)
> * write some content in root parent
> * fault in for each child
> * do migration and record pfn
> * then check pfn is the same in each child
>
Hi, David
Is my understanding correct? If so, may I send a new version?
--
Wei Yang
Help you, Help me
next prev parent reply other threads:[~2025-07-25 2:16 UTC|newest]
Thread overview: 16+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-07-16 8:27 [PATCH 0/3] selftests/mm: assert rmap behave as expected Wei Yang
2025-07-16 8:27 ` [PATCH 1/3] selftests/mm: check a valid fd with negative value Wei Yang
2025-07-16 8:52 ` David Hildenbrand
2025-07-17 1:18 ` Wei Yang
2025-07-17 8:27 ` David Hildenbrand
2025-07-16 8:27 ` [PATCH 2/3] selftests/mm: put general ksm operation into vm_util Wei Yang
2025-07-16 9:20 ` David Hildenbrand
2025-07-17 2:45 ` Wei Yang
2025-07-17 3:30 ` Wei Yang
2025-07-17 11:57 ` David Hildenbrand
2025-07-17 11:56 ` David Hildenbrand
2025-07-16 8:27 ` [PATCH 3/3] selftests/mm: assert rmap behave as expected Wei Yang
2025-07-16 13:34 ` David Hildenbrand
2025-07-17 3:17 ` Wei Yang
2025-07-25 2:16 ` Wei Yang [this message]
2025-07-25 13:34 ` David Hildenbrand
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250725021649.eetoa4sev7r4uweh@master \
--to=richard.weiyang@gmail.com \
--cc=Liam.Howlett@oracle.com \
--cc=akpm@linux-foundation.org \
--cc=david@redhat.com \
--cc=harry.yoo@oracle.com \
--cc=linux-mm@kvack.org \
--cc=lorenzo.stoakes@oracle.com \
--cc=riel@surriel.com \
--cc=vbabka@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).