From mboxrd@z Thu Jan 1 00:00:00 1970 From: Joerg Vehlow Date: Fri, 27 Nov 2020 08:14:19 +0100 Subject: [LTP] [PATCH v3] overcommit_memory: Fix unstable subtest Message-ID: <20201127071419.20370-1-lkml@jv-coder.de> List-Id: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit To: ltp@lists.linux.it From: Joerg Vehlow Changes to v2: - Removed useless call to get_total_batch_size_bytes - Calculate batch size before reading /proc/meminfo This should reduce memory allocations after meminfo was read The test sets overcommit policy to never overcommit and then tries to allocate the commit limit reported by /proc/meminfo. This value is an exact value (at least at that point in time) of memory, that can be allocated according to the policy and ratio settings. This should fail, since there is already some memory allocated for running the test programm, but due to inaccurate memory accounting in mm/util.c __vm_enough_memory(), the allocation can still succeed. The commited memory is stored in a percpu counter, that counts in 1 + ncpu variables. For small allocations and deallocations, the memory is counted in a counter per cpu, without locking. If this counter reaches a threshold, the value is committed to a global counter. Due to this the global counter can become negative. This global counter is the only thing taken into account in __vm_enough_memory, propably for performance reasons, because otherwise a lock is required. Because of the inaccuracy the system can overcommit a bit by number of cpus times threshold value. By adding this value to the exact commit limit reported by /proc/meminfo, we can be sure, that we really always hit the memory limit. Signed-off-by: Joerg Vehlow --- .../kernel/mem/tunable/overcommit_memory.c | 56 +++++++++++++------ 1 file changed, 39 insertions(+), 17 deletions(-) diff --git a/testcases/kernel/mem/tunable/overcommit_memory.c b/testcases/kernel/mem/tunable/overcommit_memory.c index f77939908..918b4e68e 100644 --- a/testcases/kernel/mem/tunable/overcommit_memory.c +++ b/testcases/kernel/mem/tunable/overcommit_memory.c @@ -1,18 +1,7 @@ +// SPDX-License-Identifier: GPL-2.0-or-later /* - * Copyright (c) Linux Test Project, 2012-2020 - * Copyright (C) 2012-2017 Red Hat, Inc. - * - * This program is free software; you can redistribute it and/or modify - * it under the terms of the GNU General Public License as published by - * the Free Software Foundation; either version 2 of the License, or - * (at your option) any later version. - * - * This program is distributed in the hope that it will be useful, - * but WITHOUT ANY WARRANTY; without even the implied warranty of - * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See - * the GNU General Public License for more details. - * - * Descriptions: + * Copyright (c) 2012-2020 Linux Test Project + * Copyright (c) 2012-2017 Red Hat, Inc. * * There are two tunables overcommit_memory and overcommit_ratio under * /proc/sys/vm/, which can control memory overcommitment. @@ -53,12 +42,16 @@ * the system is limited to CommitLimit(Swap+RAM*overcommit_ratio) * commit_left(allocatable memory) = CommitLimit - Committed_AS * a. less than commit_left: commit_left / 2, alloc should pass - * b. greater than commit_left: commit_left * 2, alloc should fail - * c. overcommit limit: CommitLimit, alloc should fail + * b. overcommit limit: CommitLimit + TotalBatchSize, should fail + * c. greater than commit_left: commit_left * 2, alloc should fail * *note: CommitLimit is the current overcommit limit. * Committed_AS is the amount of memory that system has used. * it couldn't choose 'equal to commit_left' as a case, because * commit_left rely on Committed_AS, but the Committed_AS is not stable. + * *note2: TotalBatchSize is the total number of bytes, that can be + * accounted for in the per cpu counters for the vm_committed_as + * counter. Since the check used by malloc only looks at the + * global counter of vm_committed_as, it can overallocate a bit. * * References: * - Documentation/sysctl/vm.txt @@ -89,11 +82,13 @@ static long sum_total; static long free_total; static long commit_limit; static long commit_left; +static long total_batch_size; static int heavy_malloc(long size); static void alloc_and_check(long size, int expect_result); static void update_mem(void); static void update_mem_commit(void); +static long get_total_batch_size_bytes(void); static void setup(void) { @@ -154,7 +149,7 @@ static void overcommit_memory_test(void) update_mem_commit(); alloc_and_check(commit_left * 2, EXPECT_FAIL); - alloc_and_check(commit_limit, EXPECT_FAIL); + alloc_and_check(commit_limit + total_batch_size, EXPECT_FAIL); update_mem_commit(); alloc_and_check(commit_left / 2, EXPECT_PASS); @@ -228,6 +223,8 @@ static void update_mem_commit(void) { long committed; + total_batch_size = get_total_batch_size_bytes(); + commit_limit = SAFE_READ_MEMINFO("CommitLimit:"); committed = SAFE_READ_MEMINFO("Committed_AS:"); commit_left = commit_limit - committed; @@ -247,6 +244,31 @@ static void update_mem_commit(void) } } +static long get_total_batch_size_bytes(void) +{ + struct sysinfo info; + long ncpus = tst_ncpus_conf(); + long pagesize = getpagesize(); + SAFE_SYSINFO(&info); + + /* see linux source mm/mm_init.c mm_compute_batch() (This is in pages) */ + long batch_size = MAX( + ncpus * 2, + MAX( + 32, + MIN( + INT32_MAX, + (long)(info.totalram / pagesize) / ncpus / 256 + ) + ) + ); + + /* there are ncpu separate counters, that can all grow up to + * batch_size. So the maximum error for __vm_enough_memory is + * batch_size * ncpus. */ + return batch_size * ncpus * pagesize; +} + static struct tst_test test = { .needs_root = 1, .options = options, -- 2.25.1