public inbox for bpf@vger.kernel.org
* [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18
@ 2023-11-10 19:36 Yonghong Song
  2023-11-10 19:45 ` Andrii Nakryiko
                   ` (2 more replies)
  0 siblings, 3 replies; 5+ messages in thread
From: Yonghong Song @ 2023-11-10 19:36 UTC (permalink / raw)
  To: bpf
  Cc: Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann, kernel-team,
	Martin KaFai Lau

With latest clang18 (main branch of llvm-project repo), when building bpf selftests,
    [~/work/bpf-next (master)]$ make -C tools/testing/selftests/bpf LLVM=1 -j

The following compilation error happens:
    fatal error: error in backend: Branch target out of insn range
    ...
    Stack dump:
    0.      Program arguments: clang -g -Wall -Werror -D__TARGET_ARCH_x86 -mlittle-endian
      -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf/tools/include
      -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf -I/home/yhs/work/bpf-next/tools/include/uapi
      -I/home/yhs/work/bpf-next/tools/testing/selftests/usr/include -idirafter
      /home/yhs/work/llvm-project/llvm/build.18/install/lib/clang/18/include -idirafter /usr/local/include
      -idirafter /usr/include -Wno-compare-distinct-pointer-types -DENABLE_ATOMICS_TESTS -O2 --target=bpf
      -c progs/pyperf180.c -mcpu=v3 -o /home/yhs/work/bpf-next/tools/testing/selftests/bpf/pyperf180.bpf.o
    1.      <eof> parser at end of file
    2.      Code generation
    ...

The compilation failure only happens with cpu=v2 and cpu=v3. cpu=v4 is okay
since cpu=v4 supports 32-bit branch target offsets.
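
To illustrate the range limit described above, here is a minimal sketch in C.
The helper name branch_offset_fits() is hypothetical and is not part of the
patch or of LLVM; it only assumes that cpu versions below v4 encode a branch
displacement as a signed 16-bit value and that cpu=v4 allows a signed 32-bit
one, as stated in this commit message.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper (illustration only): report whether a branch
 * displacement, counted in instructions, is representable for the
 * given BPF cpu version.
 */
static bool branch_offset_fits(int64_t off, int cpu_version)
{
	assert(cpu_version >= 1);

	/* cpu=v4: signed 32-bit branch target offset */
	if (cpu_version >= 4)
		return off >= INT32_MIN && off <= INT32_MAX;

	/* cpu v1/v2/v3: signed 16-bit branch target offset */
	return off >= INT16_MIN && off <= INT16_MAX;
}
```

Under these assumptions, a displacement of 32768 instructions is rejected for
cpu=v3 but accepted for cpu=v4, which matches the behavior reported above.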

The above failure is due to upstream llvm patch [1], which changed some
inlining behavior in clang18.

Previously, all 180 loop iterations were fully unrolled. To work around the
issue, the unroll count is reduced for cpu versions below v4. The bpf macro
__BPF_CPU_VERSION__ (implemented in clang18 recently) is used to leave
unrolling unchanged if cpu=v4. If __BPF_CPU_VERSION__ is not available and the
compiler is clang18, the unroll count is reduced unconditionally.

  [1] https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e

Signed-off-by: Yonghong Song <yonghong.song@linux.dev>
---
 tools/testing/selftests/bpf/progs/pyperf180.c | 22 +++++++++++++++++++
 1 file changed, 22 insertions(+)

diff --git a/tools/testing/selftests/bpf/progs/pyperf180.c b/tools/testing/selftests/bpf/progs/pyperf180.c
index c39f559d3100..42c4a8b62e36 100644
--- a/tools/testing/selftests/bpf/progs/pyperf180.c
+++ b/tools/testing/selftests/bpf/progs/pyperf180.c
@@ -1,4 +1,26 @@
 // SPDX-License-Identifier: GPL-2.0
 // Copyright (c) 2019 Facebook
 #define STACK_MAX_LEN 180
+
+/* llvm upstream commit at clang18
+ *   https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e
+ * changed inlining behavior and caused compilation failure as some branch
+ * target distance exceeded 16bit representation which is the maximum for
+ * cpu v1/v2/v3. Macro __BPF_CPU_VERSION__ is later implemented in clang18
+ * to specify which cpu version is used for compilation. So a smaller
+ * unroll_count can be set if __BPF_CPU_VERSION__ is less than 4, which
+ * reduced some branch target distances and resolved the compilation failure.
+ *
+ * To capture the case where a developer/ci uses clang18 but the corresponding
+ * repo checkpoint does not have __BPF_CPU_VERSION__, a smaller unroll_count
+ * will be set as well to prevent potential compilation failures.
+ */
+#ifdef __BPF_CPU_VERSION__
+#if __BPF_CPU_VERSION__ < 4
+#define UNROLL_COUNT 90
+#endif
+#elif __clang_major__ == 18
+#define UNROLL_COUNT 90
+#endif
+
 #include "pyperf.h"
-- 
2.34.1



* Re: [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18
  2023-11-10 19:36 [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18 Yonghong Song
@ 2023-11-10 19:45 ` Andrii Nakryiko
  2023-11-10 21:03   ` Yonghong Song
  2023-11-10 19:59 ` Alan Maguire
  2023-11-11 20:21 ` patchwork-bot+netdevbpf
  2 siblings, 1 reply; 5+ messages in thread
From: Andrii Nakryiko @ 2023-11-10 19:45 UTC (permalink / raw)
  To: Yonghong Song
  Cc: bpf, Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann,
	kernel-team, Martin KaFai Lau

On Fri, Nov 10, 2023 at 11:37 AM Yonghong Song <yonghong.song@linux.dev> wrote:
>
> With latest clang18 (main branch of llvm-project repo), when building bpf selftests,
>     [~/work/bpf-next (master)]$ make -C tools/testing/selftests/bpf LLVM=1 -j
>
> The following compilation error happens:
>     fatal error: error in backend: Branch target out of insn range
>     ...
>     Stack dump:
>     0.      Program arguments: clang -g -Wall -Werror -D__TARGET_ARCH_x86 -mlittle-endian
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf/tools/include
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf -I/home/yhs/work/bpf-next/tools/include/uapi
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/usr/include -idirafter
>       /home/yhs/work/llvm-project/llvm/build.18/install/lib/clang/18/include -idirafter /usr/local/include
>       -idirafter /usr/include -Wno-compare-distinct-pointer-types -DENABLE_ATOMICS_TESTS -O2 --target=bpf
>       -c progs/pyperf180.c -mcpu=v3 -o /home/yhs/work/bpf-next/tools/testing/selftests/bpf/pyperf180.bpf.o
>     1.      <eof> parser at end of file
>     2.      Code generation
>     ...
>
> The compilation failure only happens with cpu=v2 and cpu=v3. cpu=v4 is okay
> since cpu=v4 supports 32-bit branch target offsets.
>
> The above failure is due to upstream llvm patch [1], which changed some
> inlining behavior in clang18.
>
> Previously, all 180 loop iterations were fully unrolled. To work around the
> issue, the unroll count is reduced for cpu versions below v4. The bpf macro
> __BPF_CPU_VERSION__ (implemented in clang18 recently) is used to leave
> unrolling unchanged if cpu=v4. If __BPF_CPU_VERSION__ is not available and the
> compiler is clang18, the unroll count is reduced unconditionally.
>
>   [1] https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e
>
> Signed-off-by: Yonghong Song <yonghong.song@linux.dev>
> ---
>  tools/testing/selftests/bpf/progs/pyperf180.c | 22 +++++++++++++++++++
>  1 file changed, 22 insertions(+)
>
> diff --git a/tools/testing/selftests/bpf/progs/pyperf180.c b/tools/testing/selftests/bpf/progs/pyperf180.c
> index c39f559d3100..42c4a8b62e36 100644
> --- a/tools/testing/selftests/bpf/progs/pyperf180.c
> +++ b/tools/testing/selftests/bpf/progs/pyperf180.c
> @@ -1,4 +1,26 @@
>  // SPDX-License-Identifier: GPL-2.0
>  // Copyright (c) 2019 Facebook
>  #define STACK_MAX_LEN 180
> +
> +/* llvm upstream commit at clang18
> + *   https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e
> + * changed inlining behavior and caused compilation failure as some branch
> + * target distance exceeded 16bit representation which is the maximum for
> + * cpu v1/v2/v3. Macro __BPF_CPU_VERSION__ is later implemented in clang18
> + * to specify which cpu version is used for compilation. So a smaller
> + * unroll_count can be set if __BPF_CPU_VERSION__ is less than 4, which
> + * reduced some branch target distances and resolved the compilation failure.
> + *
> + * To capture the case where a developer/ci uses clang18 but the corresponding
> + * repo checkpoint does not have __BPF_CPU_VERSION__, a smaller unroll_count
> + * will be set as well to prevent potential compilation failures.
> + */
> +#ifdef __BPF_CPU_VERSION__
> +#if __BPF_CPU_VERSION__ < 4
> +#define UNROLL_COUNT 90
> +#endif
> +#elif __clang_major__ == 18
> +#define UNROLL_COUNT 90
> +#endif
> +

Can it be written as one #if?

#if (defined(__BPF_CPU_VERSION__) && __BPF_CPU_VERSION__ < 4) || \
    __clang_major__ >= 18

?

>  #include "pyperf.h"
> --
> 2.34.1
>
>


* Re: [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18
  2023-11-10 19:36 [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18 Yonghong Song
  2023-11-10 19:45 ` Andrii Nakryiko
@ 2023-11-10 19:59 ` Alan Maguire
  2023-11-11 20:21 ` patchwork-bot+netdevbpf
  2 siblings, 0 replies; 5+ messages in thread
From: Alan Maguire @ 2023-11-10 19:59 UTC (permalink / raw)
  To: Yonghong Song, bpf
  Cc: Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann, kernel-team,
	Martin KaFai Lau

On 10/11/2023 19:36, Yonghong Song wrote:
> With latest clang18 (main branch of llvm-project repo), when building bpf selftests,
>     [~/work/bpf-next (master)]$ make -C tools/testing/selftests/bpf LLVM=1 -j
> 
> The following compilation error happens:
>     fatal error: error in backend: Branch target out of insn range
>     ...
>     Stack dump:
>     0.      Program arguments: clang -g -Wall -Werror -D__TARGET_ARCH_x86 -mlittle-endian
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf/tools/include
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf -I/home/yhs/work/bpf-next/tools/include/uapi
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/usr/include -idirafter
>       /home/yhs/work/llvm-project/llvm/build.18/install/lib/clang/18/include -idirafter /usr/local/include
>       -idirafter /usr/include -Wno-compare-distinct-pointer-types -DENABLE_ATOMICS_TESTS -O2 --target=bpf
>       -c progs/pyperf180.c -mcpu=v3 -o /home/yhs/work/bpf-next/tools/testing/selftests/bpf/pyperf180.bpf.o
>     1.      <eof> parser at end of file
>     2.      Code generation
>     ...
> 
> The compilation failure only happens with cpu=v2 and cpu=v3. cpu=v4 is okay
> since cpu=v4 supports 32-bit branch target offsets.
> 
> The above failure is due to upstream llvm patch [1], which changed some
> inlining behavior in clang18.
> 
> Previously, all 180 loop iterations were fully unrolled. To work around the
> issue, the unroll count is reduced for cpu versions below v4. The bpf macro
> __BPF_CPU_VERSION__ (implemented in clang18 recently) is used to leave
> unrolling unchanged if cpu=v4. If __BPF_CPU_VERSION__ is not available and the
> compiler is clang18, the unroll count is reduced unconditionally.
> 
>   [1] https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e
> 
> Signed-off-by: Yonghong Song <yonghong.song@linux.dev>

Fixes the issue for me;

Tested-by: Alan Maguire <alan.maguire@oracle.com>


* Re: [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18
  2023-11-10 19:45 ` Andrii Nakryiko
@ 2023-11-10 21:03   ` Yonghong Song
  0 siblings, 0 replies; 5+ messages in thread
From: Yonghong Song @ 2023-11-10 21:03 UTC (permalink / raw)
  To: Andrii Nakryiko
  Cc: bpf, Alexei Starovoitov, Andrii Nakryiko, Daniel Borkmann,
	kernel-team, Martin KaFai Lau


On 11/10/23 11:45 AM, Andrii Nakryiko wrote:
> On Fri, Nov 10, 2023 at 11:37 AM Yonghong Song <yonghong.song@linux.dev> wrote:
>> With latest clang18 (main branch of llvm-project repo), when building bpf selftests,
>>      [~/work/bpf-next (master)]$ make -C tools/testing/selftests/bpf LLVM=1 -j
>>
>> The following compilation error happens:
>>      fatal error: error in backend: Branch target out of insn range
>>      ...
>>      Stack dump:
>>      0.      Program arguments: clang -g -Wall -Werror -D__TARGET_ARCH_x86 -mlittle-endian
>>        -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf/tools/include
>>        -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf -I/home/yhs/work/bpf-next/tools/include/uapi
>>        -I/home/yhs/work/bpf-next/tools/testing/selftests/usr/include -idirafter
>>        /home/yhs/work/llvm-project/llvm/build.18/install/lib/clang/18/include -idirafter /usr/local/include
>>        -idirafter /usr/include -Wno-compare-distinct-pointer-types -DENABLE_ATOMICS_TESTS -O2 --target=bpf
>>        -c progs/pyperf180.c -mcpu=v3 -o /home/yhs/work/bpf-next/tools/testing/selftests/bpf/pyperf180.bpf.o
>>      1.      <eof> parser at end of file
>>      2.      Code generation
>>      ...
>>
>> The compilation failure only happens with cpu=v2 and cpu=v3. cpu=v4 is okay
>> since cpu=v4 supports 32-bit branch target offsets.
>>
>> The above failure is due to upstream llvm patch [1], which changed some
>> inlining behavior in clang18.
>>
>> Previously, all 180 loop iterations were fully unrolled. To work around the
>> issue, the unroll count is reduced for cpu versions below v4. The bpf macro
>> __BPF_CPU_VERSION__ (implemented in clang18 recently) is used to leave
>> unrolling unchanged if cpu=v4. If __BPF_CPU_VERSION__ is not available and the
>> compiler is clang18, the unroll count is reduced unconditionally.
>>
>>    [1] https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e
>>
>> Signed-off-by: Yonghong Song <yonghong.song@linux.dev>
>> ---
>>   tools/testing/selftests/bpf/progs/pyperf180.c | 22 +++++++++++++++++++
>>   1 file changed, 22 insertions(+)
>>
>> diff --git a/tools/testing/selftests/bpf/progs/pyperf180.c b/tools/testing/selftests/bpf/progs/pyperf180.c
>> index c39f559d3100..42c4a8b62e36 100644
>> --- a/tools/testing/selftests/bpf/progs/pyperf180.c
>> +++ b/tools/testing/selftests/bpf/progs/pyperf180.c
>> @@ -1,4 +1,26 @@
>>   // SPDX-License-Identifier: GPL-2.0
>>   // Copyright (c) 2019 Facebook
>>   #define STACK_MAX_LEN 180
>> +
>> +/* llvm upstream commit at clang18
>> + *   https://github.com/llvm/llvm-project/commit/1a2e77cf9e11dbf56b5720c607313a566eebb16e
>> + * changed inlining behavior and caused compilation failure as some branch
>> + * target distance exceeded 16bit representation which is the maximum for
>> + * cpu v1/v2/v3. Macro __BPF_CPU_VERSION__ is later implemented in clang18
>> + * to specify which cpu version is used for compilation. So a smaller
>> + * unroll_count can be set if __BPF_CPU_VERSION__ is less than 4, which
>> + * reduced some branch target distances and resolved the compilation failure.
>> + *
>> + * To capture the case where a developer/ci uses clang18 but the corresponding
>> + * repo checkpoint does not have __BPF_CPU_VERSION__, a smaller unroll_count
>> + * will be set as well to prevent potential compilation failures.
>> + */
>> +#ifdef __BPF_CPU_VERSION__
>> +#if __BPF_CPU_VERSION__ < 4
>> +#define UNROLL_COUNT 90
>> +#endif
>> +#elif __clang_major__ == 18
>> +#define UNROLL_COUNT 90
>> +#endif
>> +
> Can it be written as one #if?
>
> #if (defined(__BPF_CPU_VERSION__) && __BPF_CPU_VERSION__ < 4) || \
>     __clang_major__ >= 18
>
> ?

This won't work. For example, with the latest upstream clang18,
__BPF_CPU_VERSION__ is defined. If the user builds with cpu v4, we do not want
to reduce the unroll count, but with the condition above:

(defined(__BPF_CPU_VERSION__) && __BPF_CPU_VERSION__ < 4) is false
__clang_major__ >= 18 is true

so the unroll count would be reduced even though that is not needed for cpu v4.
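
The difference between the two forms can be sketched in plain C. This is only
an illustration under stated assumptions: the function names are hypothetical,
the preprocessor conditions are modeled as ordinary run-time checks, and a
return value of 0 stands for "UNROLL_COUNT is left undefined".

```c
#include <assert.h>
#include <stdbool.h>

/* Models the nested #ifdef / #elif form used in the patch. */
static int unroll_nested(bool has_cpu_macro, int cpu_version, int clang_major)
{
	assert(clang_major > 0);

	if (has_cpu_macro) {
		/* __BPF_CPU_VERSION__ is defined: only the cpu version decides */
		return cpu_version < 4 ? 90 : 0;
	}
	/* Macro not available: fall back to the compiler version check */
	return clang_major == 18 ? 90 : 0;
}

/* Models the proposed single-condition form. */
static int unroll_single_if(bool has_cpu_macro, int cpu_version, int clang_major)
{
	assert(clang_major > 0);

	if ((has_cpu_macro && cpu_version < 4) || clang_major >= 18)
		return 90;
	return 0;
}
```

For clang18 with the macro defined and cpu v4, unroll_nested(true, 4, 18)
leaves the count undefined, while unroll_single_if(true, 4, 18) still selects
90. That is the case described above.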



>
>>   #include "pyperf.h"
>> --
>> 2.34.1
>>
>>


* Re: [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18
  2023-11-10 19:36 [PATCH bpf-next v2] selftests/bpf: Fix pyperf180 compilation failure with clang18 Yonghong Song
  2023-11-10 19:45 ` Andrii Nakryiko
  2023-11-10 19:59 ` Alan Maguire
@ 2023-11-11 20:21 ` patchwork-bot+netdevbpf
  2 siblings, 0 replies; 5+ messages in thread
From: patchwork-bot+netdevbpf @ 2023-11-11 20:21 UTC (permalink / raw)
  To: Yonghong Song; +Cc: bpf, ast, andrii, daniel, kernel-team, martin.lau

Hello:

This patch was applied to bpf/bpf-next.git (master)
by Andrii Nakryiko <andrii@kernel.org>:

On Fri, 10 Nov 2023 11:36:44 -0800 you wrote:
> With latest clang18 (main branch of llvm-project repo), when building bpf selftests,
>     [~/work/bpf-next (master)]$ make -C tools/testing/selftests/bpf LLVM=1 -j
> 
> The following compilation error happens:
>     fatal error: error in backend: Branch target out of insn range
>     ...
>     Stack dump:
>     0.      Program arguments: clang -g -Wall -Werror -D__TARGET_ARCH_x86 -mlittle-endian
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf/tools/include
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/bpf -I/home/yhs/work/bpf-next/tools/include/uapi
>       -I/home/yhs/work/bpf-next/tools/testing/selftests/usr/include -idirafter
>       /home/yhs/work/llvm-project/llvm/build.18/install/lib/clang/18/include -idirafter /usr/local/include
>       -idirafter /usr/include -Wno-compare-distinct-pointer-types -DENABLE_ATOMICS_TESTS -O2 --target=bpf
>       -c progs/pyperf180.c -mcpu=v3 -o /home/yhs/work/bpf-next/tools/testing/selftests/bpf/pyperf180.bpf.o
>     1.      <eof> parser at end of file
>     2.      Code generation
>     ...
> 
> [...]

Here is the summary with links:
  - [bpf-next,v2] selftests/bpf: Fix pyperf180 compilation failure with clang18
    https://git.kernel.org/bpf/bpf-next/c/100888fb6d8a

You are awesome, thank you!
-- 
Deet-doot-dot, I am a bot.
https://korg.docs.kernel.org/patchwork/pwbot.html


