From: kernel test robot <lkp@intel.com>
To: Mina Almasry <almasrymina@google.com>
Cc: llvm@lists.linux.dev, oe-kbuild-all@lists.linux.dev
Subject: Re: [PATCH RFC net-next v1] page_pool: import Jesper's page_pool benchmark
Date: Mon, 10 Mar 2025 16:08:15 +0800 [thread overview]
Message-ID: <202503101512.rKBZoYNW-lkp@intel.com> (raw)
In-Reply-To: <20250309084118.3080950-1-almasrymina@google.com>
Hi Mina,
[This is a private test report for your RFC patch.]
kernel test robot noticed the following build errors:
[auto build test ERROR on 8ef890df4031121a94407c84659125cbccd3fdbe]
url: https://github.com/intel-lab-lkp/linux/commits/Mina-Almasry/page_pool-import-Jesper-s-page_pool-benchmark/20250309-164215
base: 8ef890df4031121a94407c84659125cbccd3fdbe
patch link: https://lore.kernel.org/r/20250309084118.3080950-1-almasrymina%40google.com
patch subject: [PATCH RFC net-next v1] page_pool: import Jesper's page_pool benchmark
config: s390-allmodconfig (https://download.01.org/0day-ci/archive/20250310/202503101512.rKBZoYNW-lkp@intel.com/config)
compiler: clang version 19.1.7 (https://github.com/llvm/llvm-project cd708029e0b2869e80abe31ddb175f7c35361f90)
reproduce (this is a W=1 build): (https://download.01.org/0day-ci/archive/20250310/202503101512.rKBZoYNW-lkp@intel.com/reproduce)
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add following tags
| Reported-by: kernel test robot <lkp@intel.com>
| Closes: https://lore.kernel.org/oe-kbuild-all/202503101512.rKBZoYNW-lkp@intel.com/
All errors (new ones prefixed by >>):
In file included from lib/bench/bench_page_pool_simple.c:10:
In file included from include/linux/module.h:19:
In file included from include/linux/elf.h:6:
In file included from arch/s390/include/asm/elf.h:181:
In file included from arch/s390/include/asm/mmu_context.h:11:
In file included from arch/s390/include/asm/pgalloc.h:18:
In file included from include/linux/mm.h:2224:
include/linux/vmstat.h:504:43: warning: arithmetic between different enumeration types ('enum zone_stat_item' and 'enum numa_stat_item') [-Wenum-enum-conversion]
504 | return vmstat_text[NR_VM_ZONE_STAT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~ ^
505 | item];
| ~~~~
include/linux/vmstat.h:511:43: warning: arithmetic between different enumeration types ('enum zone_stat_item' and 'enum numa_stat_item') [-Wenum-enum-conversion]
511 | return vmstat_text[NR_VM_ZONE_STAT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~ ^
512 | NR_VM_NUMA_EVENT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~~
include/linux/vmstat.h:524:43: warning: arithmetic between different enumeration types ('enum zone_stat_item' and 'enum numa_stat_item') [-Wenum-enum-conversion]
524 | return vmstat_text[NR_VM_ZONE_STAT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~ ^
525 | NR_VM_NUMA_EVENT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~~
In file included from lib/bench/bench_page_pool_simple.c:19:
>> lib/bench/time_bench.h:109:30: error: unknown register name '%rax' in asm
109 | : "=r"(hi), "=r"(lo)::"%rax", "%rbx", "%rcx", "%rdx");
| ^
lib/bench/time_bench.h:122:30: error: unknown register name '%rax' in asm
122 | : "=r"(hi), "=r"(lo)::"%rax", "%rbx", "%rcx", "%rdx");
| ^
>> lib/bench/time_bench.h:188:46: error: invalid input constraint 'c' in asm
188 | asm volatile("rdpmc" : "=d" (d), "=a" (a) : "c" (in) : "memory");
| ^
>> lib/bench/time_bench.h:216:9: error: call to undeclared function 'rdmsrl_safe'; ISO C99 and later do not support implicit function declarations [-Wimplicit-function-declaration]
216 | return rdmsrl_safe(MSR_IA32_PCM0, msr_result);
| ^
3 warnings and 4 errors generated.
--
In file included from lib/bench/time_bench.c:9:
In file included from include/linux/module.h:19:
In file included from include/linux/elf.h:6:
In file included from arch/s390/include/asm/elf.h:181:
In file included from arch/s390/include/asm/mmu_context.h:11:
In file included from arch/s390/include/asm/pgalloc.h:18:
In file included from include/linux/mm.h:2224:
include/linux/vmstat.h:504:43: warning: arithmetic between different enumeration types ('enum zone_stat_item' and 'enum numa_stat_item') [-Wenum-enum-conversion]
504 | return vmstat_text[NR_VM_ZONE_STAT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~ ^
505 | item];
| ~~~~
include/linux/vmstat.h:511:43: warning: arithmetic between different enumeration types ('enum zone_stat_item' and 'enum numa_stat_item') [-Wenum-enum-conversion]
511 | return vmstat_text[NR_VM_ZONE_STAT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~ ^
512 | NR_VM_NUMA_EVENT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~~
include/linux/vmstat.h:524:43: warning: arithmetic between different enumeration types ('enum zone_stat_item' and 'enum numa_stat_item') [-Wenum-enum-conversion]
524 | return vmstat_text[NR_VM_ZONE_STAT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~ ^
525 | NR_VM_NUMA_EVENT_ITEMS +
| ~~~~~~~~~~~~~~~~~~~~~~
>> lib/bench/time_bench.c:11:10: fatal error: 'linux/time_bench.h' file not found
11 | #include <linux/time_bench.h>
| ^~~~~~~~~~~~~~~~~~~~
3 warnings and 1 error generated.
vim +109 lib/bench/time_bench.h
74
75 /*
76 * Below TSC assembler code is not compatible with other archs, and
77 * can also fail on guests if cpu-flags are not correct.
78 *
79 * The way TSC reading is used, many iterations, does not require as
80 * high accuracy as described below (in Intel Doc #324264).
81 *
82 * Considering changing to use get_cycles() (#include <asm/timex.h>).
83 */
84
85 /** TSC (Time-Stamp Counter) based **
86 * Recommend reading, to understand details of reading TSC accurately:
87 * Intel Doc #324264, "How to Benchmark Code Execution Times on Intel"
88 *
89 * Consider getting exclusive ownership of CPU by using:
90 * unsigned long flags;
91 * preempt_disable();
92 * raw_local_irq_save(flags);
93 * _your_code_
94 * raw_local_irq_restore(flags);
95 * preempt_enable();
96 *
97 * Clobbered registers: "%rax", "%rbx", "%rcx", "%rdx"
98 * RDTSC only change "%rax" and "%rdx" but
99 * CPUID clears the high 32-bits of all (rax/rbx/rcx/rdx)
100 */
101 static __always_inline uint64_t tsc_start_clock(void)
102 {
103 /* See: Intel Doc #324264 */
104 unsigned hi, lo;
105 asm volatile("CPUID\n\t"
106 "RDTSC\n\t"
107 "mov %%edx, %0\n\t"
108 "mov %%eax, %1\n\t"
> 109 : "=r"(hi), "=r"(lo)::"%rax", "%rbx", "%rcx", "%rdx");
110 //FIXME: on 32bit use clobbered %eax + %edx
111 return ((uint64_t)lo) | (((uint64_t)hi) << 32);
112 }
113
114 static __always_inline uint64_t tsc_stop_clock(void)
115 {
116 /* See: Intel Doc #324264 */
117 unsigned hi, lo;
118 asm volatile("RDTSCP\n\t"
119 "mov %%edx, %0\n\t"
120 "mov %%eax, %1\n\t"
121 "CPUID\n\t"
122 : "=r"(hi), "=r"(lo)::"%rax", "%rbx", "%rcx", "%rdx");
123 return ((uint64_t)lo) | (((uint64_t)hi) << 32);
124 }
125
126 /* Notes for RDTSC and RDTSCP
127 *
128 * Hannes found out that __builtin_ia32_rdtsc and
129 * __builtin_ia32_rdtscp are undocumented available in gcc, so there
130 * is no need to write inline assembler functions for them any more.
131 *
132 * unsigned long long __builtin_ia32_rdtscp(unsigned int *foo);
133 * (where foo is set to: numa_node << 12 | cpu)
134 * and
135 * unsigned long long __builtin_ia32_rdtsc(void);
136 *
137 * Above we combine the calls with CPUID, thus I don't see how this is
138 * directly appreciable.
139 */
140
141 /*
142 inline uint64_t rdtsc(void)
143 {
144 uint32_t low, high;
145 asm volatile("rdtsc" : "=a" (low), "=d" (high));
146 return low | (((uint64_t )high ) << 32);
147 }
148 */
149
150 /** Wall-clock based **
151 *
152 * use: getnstimeofday()
153 * getnstimeofday(&rec->ts_start);
154 * getnstimeofday(&rec->ts_stop);
155 *
156 * API changed see: Documentation/core-api/timekeeping.rst
157 * https://www.kernel.org/doc/html/latest/core-api/timekeeping.html#c.getnstimeofday
158 *
159 * We should instead use: ktime_get_real_ts64() is a direct
160 * replacement, but consider using monotonic time (ktime_get_ts64())
161 * and/or a ktime_t based interface (ktime_get()/ktime_get_real()).
162 */
163
164 /** PMU (Performance Monitor Unit) based **
165 *
166 * Needed for calculating: Instructions Per Cycle (IPC)
167 * - The IPC number tell how efficient the CPU pipelining were
168 */
169 //lookup: perf_event_create_kernel_counter()
170
171 bool time_bench_PMU_config(bool enable);
172
173 /* Raw reading via rdpmc() using fixed counters
174 *
175 * From: https://github.com/andikleen/simple-pmu
176 */
177 enum {
178 FIXED_SELECT = (1U << 30), /* == 0x40000000 */
179 FIXED_INST_RETIRED_ANY = 0,
180 FIXED_CPU_CLK_UNHALTED_CORE = 1,
181 FIXED_CPU_CLK_UNHALTED_REF = 2,
182 };
183
184 static __always_inline unsigned long long p_rdpmc(unsigned in)
185 {
186 unsigned d, a;
187
> 188 asm volatile("rdpmc" : "=d" (d), "=a" (a) : "c" (in) : "memory");
189 return ((unsigned long long)d << 32) | a;
190 }
191
192 /* These PMU counter needs to be enabled, but I don't have the
193 * configure code implemented. My current hack is running:
194 * sudo perf stat -e cycles:k -e instructions:k insmod lib/ring_queue_test.ko
195 */
196 /* Reading all pipelined instruction */
197 static __always_inline unsigned long long pmc_inst(void)
198 {
199 return p_rdpmc(FIXED_SELECT | FIXED_INST_RETIRED_ANY);
200 }
201
202 /* Reading CPU clock cycles */
203 static __always_inline unsigned long long pmc_clk(void)
204 {
205 return p_rdpmc(FIXED_SELECT | FIXED_CPU_CLK_UNHALTED_CORE);
206 }
207
208 /* Raw reading via MSR rdmsr() is likely wrong
209 * FIXME: How can I know which raw MSR registers are conf for what?
210 */
211 #define MSR_IA32_PCM0 0x400000C1 /* PERFCTR0 */
212 #define MSR_IA32_PCM1 0x400000C2 /* PERFCTR1 */
213 #define MSR_IA32_PCM2 0x400000C3
214 static inline uint64_t msr_inst(unsigned long long *msr_result)
215 {
> 216 return rdmsrl_safe(MSR_IA32_PCM0, msr_result);
217 }
218
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki
next prev parent reply other threads:[~2025-03-10 8:08 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-03-09 8:41 [PATCH RFC net-next v1] page_pool: import Jesper's page_pool benchmark Mina Almasry
2025-03-09 10:34 ` kernel test robot
2025-03-10 6:22 ` kernel test robot
2025-03-10 8:08 ` kernel test robot [this message]
2025-03-10 9:15 ` Toke Høiland-Jørgensen
2025-03-24 20:23 ` Mina Almasry
2025-03-24 22:11 ` Jesper Dangaard Brouer
2025-03-24 13:55 ` Jakub Kicinski
2025-03-24 20:21 ` Mina Almasry
2025-03-25 22:38 ` Jakub Kicinski
2025-03-24 22:34 ` Jesper Dangaard Brouer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=202503101512.rKBZoYNW-lkp@intel.com \
--to=lkp@intel.com \
--cc=almasrymina@google.com \
--cc=llvm@lists.linux.dev \
--cc=oe-kbuild-all@lists.linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.