From: Richard Henderson <richard.henderson@linaro.org>
To: Anton Johansson <anjo@rev.ng>, qemu-devel@nongnu.org
Cc: ale@rev.ng, ltaylorsimpson@gmail.com, bcain@quicinc.com,
philmd@linaro.org, alex.bennee@linaro.org
Subject: Re: [RFC PATCH v1 09/43] helper-to-tcg: Introduce get-llvm-ir.py
Date: Fri, 22 Nov 2024 12:14:51 -0600 [thread overview]
Message-ID: <6b087061-fb11-4ac5-aecc-43f3324060df@linaro.org> (raw)
In-Reply-To: <20241121014947.18666-10-anjo@rev.ng>
On 11/20/24 19:49, Anton Johansson wrote:
> Introduces a new python helper script to convert a set of QEMU .c files to
> LLVM IR .ll using clang. Compile flags are found by looking at
> compile_commands.json, and llvm-link is used to link together all LLVM
> modules into a single module.
>
> Signed-off-by: Anton Johansson <anjo@rev.ng>
> ---
> subprojects/helper-to-tcg/get-llvm-ir.py | 143 +++++++++++++++++++++++
> 1 file changed, 143 insertions(+)
> create mode 100755 subprojects/helper-to-tcg/get-llvm-ir.py
Is this not something that can be done in meson?
r~
>
> diff --git a/subprojects/helper-to-tcg/get-llvm-ir.py b/subprojects/helper-to-tcg/get-llvm-ir.py
> new file mode 100755
> index 0000000000..9ee5d0e136
> --- /dev/null
> +++ b/subprojects/helper-to-tcg/get-llvm-ir.py
> @@ -0,0 +1,143 @@
> +#!/usr/bin/env python3
> +
> +##
> +## Copyright(c) 2024 rev.ng Labs Srl. All Rights Reserved.
> +##
> +## This program is free software; you can redistribute it and/or modify
> +## it under the terms of the GNU General Public License as published by
> +## the Free Software Foundation; either version 2 of the License, or
> +## (at your option) any later version.
> +##
> +## This program is distributed in the hope that it will be useful,
> +## but WITHOUT ANY WARRANTY; without even the implied warranty of
> +## MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
> +## GNU General Public License for more details.
> +##
> +## You should have received a copy of the GNU General Public License
> +## along with this program; if not, see <http://www.gnu.org/licenses/>.
> +##
> +
> +import argparse
> +import json
> +import os
> +import shlex
> +import sys
> +import subprocess
> +
> +
> +def log(msg):
> + print(msg, file=sys.stderr)
> +
> +
> +def run_command(command):
> + proc = subprocess.Popen(command, stdout=subprocess.PIPE, stderr=subprocess.STDOUT)
> + out = proc.communicate()
> + if proc.wait() != 0:
> + log(f"Command: {' '.join(command)} exited with {proc.returncode}\n")
> + log(f"output:\n{out}\n")
> +
> +
> +def find_compile_commands(compile_commands_path, clang_path, input_path, target):
> + with open(compile_commands_path, "r") as f:
> + compile_commands = json.load(f)
> + for compile_command in compile_commands:
> + path = compile_command["file"]
> + if os.path.basename(path) != os.path.basename(input_path):
> + continue
> +
> + os.chdir(compile_command["directory"])
> + command = compile_command["command"]
> +
> + # If building multiple targets there's a chance
> + # input files share the same path and name.
> + # This could cause us to find the wrong compile
> + # command, we use the target path to distinguish
> + # between these.
> + if not target in command:
> + continue
> +
> + argv = shlex.split(command)
> + argv[0] = clang_path
> +
> + return argv
> +
> + raise ValueError(f"Unable to find compile command for {input_path}")
> +
> +
> +def generate_llvm_ir(
> + compile_commands_path, clang_path, output_path, input_path, target
> +):
> + command = find_compile_commands(
> + compile_commands_path, clang_path, input_path, target
> + )
> +
> + flags_to_remove = {
> + "-ftrivial-auto-var-init=zero",
> + "-fzero-call-used-regs=used-gpr",
> + "-Wimplicit-fallthrough=2",
> + "-Wold-style-declaration",
> + "-Wno-psabi",
> + "-Wshadow=local",
> + }
> +
> + # Remove
> + # - output of makefile rules (-MQ,-MF target);
> + # - output of object files (-o target);
> + # - excessive zero-initialization of block-scope variables
> + # (-ftrivial-auto-var-init=zero);
> + # - and any optimization flags (-O).
> + for i, arg in reversed(list(enumerate(command))):
> + if arg in {"-MQ", "-o", "-MF"}:
> + del command[i : i + 2]
> + elif arg.startswith("-O") or arg in flags_to_remove:
> + del command[i]
> +
> + # Define a HELPER_TO_TCG macro for translation units wanting to
> + # conditionally include or exclude code during translation to TCG.
> + # Disable optimization (-O0) and make sure clang doesn't emit optnone
> + # attributes (-disable-O0-optnone) which inhibit further optimization.
> + # Optimization will be performed at a later stage in the helper-to-tcg
> + # pipeline.
> + command += [
> + "-S",
> + "-emit-llvm",
> + "-DHELPER_TO_TCG",
> + "-O0",
> + "-Xclang",
> + "-disable-O0-optnone",
> + ]
> + if output_path:
> + command += ["-o", output_path]
> +
> + run_command(command)
> +
> +
> +def main():
> + parser = argparse.ArgumentParser(
> + description="Produce the LLVM IR of a given .c file."
> + )
> + parser.add_argument(
> + "--compile-commands", required=True, help="Path to compile_commands.json"
> + )
> + parser.add_argument("--clang", default="clang", help="Path to clang.")
> + parser.add_argument("--llvm-link", default="llvm-link", help="Path to llvm-link.")
> + parser.add_argument("-o", "--output", required=True, help="Output .ll file path")
> + parser.add_argument(
> + "--target-path", help="Path to QEMU target dir. (e.q. target/i386)"
> + )
> + parser.add_argument("inputs", nargs="+", help=".c file inputs")
> + args = parser.parse_args()
> +
> + outputs = []
> + for input in args.inputs:
> + output = os.path.basename(input) + ".ll"
> + generate_llvm_ir(
> + args.compile_commands, args.clang, output, input, args.target_path
> + )
> + outputs.append(output)
> +
> + run_command([args.llvm_link] + outputs + ["-S", "-o", args.output])
> +
> +
> +if __name__ == "__main__":
> + sys.exit(main())
next prev parent reply other threads:[~2024-11-22 18:15 UTC|newest]
Thread overview: 81+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-11-21 1:49 [RFC PATCH v1 00/43] Introduce helper-to-tcg Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 01/43] Add option to enable/disable helper-to-tcg Anton Johansson via
2024-11-22 17:30 ` Richard Henderson
2024-11-22 18:23 ` Paolo Bonzini
2024-12-03 19:05 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 02/43] accel/tcg: Add bitreverse and funnel-shift runtime helper functions Anton Johansson via
2024-11-22 17:35 ` Richard Henderson
2024-12-03 17:50 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 03/43] accel/tcg: Add gvec size changing operations Anton Johansson via
2024-11-22 17:50 ` Richard Henderson
2024-12-03 18:08 ` Anton Johansson via
2024-12-03 18:57 ` Richard Henderson
2024-12-03 20:15 ` Anton Johansson via
2024-12-03 21:14 ` Richard Henderson
2024-11-21 1:49 ` [RFC PATCH v1 04/43] tcg: Add gvec functions for creating consant vectors Anton Johansson via
2024-11-22 18:00 ` Richard Henderson
2024-12-03 18:19 ` Anton Johansson via
2024-12-03 19:03 ` Richard Henderson
2024-11-21 1:49 ` [RFC PATCH v1 05/43] tcg: Add helper function dispatcher and hook tcg_gen_callN Anton Johansson via
2024-11-22 18:04 ` Richard Henderson
2024-12-03 18:45 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 06/43] tcg: Introduce tcg-global-mappings Anton Johansson via
2024-11-22 19:14 ` Richard Henderson
2024-11-21 1:49 ` [RFC PATCH v1 07/43] tcg: Increase maximum TB size and maximum temporaries Anton Johansson via
2024-11-22 18:11 ` Richard Henderson
2024-11-21 1:49 ` [RFC PATCH v1 08/43] include/helper-to-tcg: Introduce annotate.h Anton Johansson via
2024-11-22 18:12 ` Richard Henderson
2024-11-25 11:27 ` Philippe Mathieu-Daudé
2024-12-03 19:00 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 09/43] helper-to-tcg: Introduce get-llvm-ir.py Anton Johansson via
2024-11-22 18:14 ` Richard Henderson [this message]
2024-12-03 18:49 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 10/43] helper-to-tcg: Add meson.build Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 11/43] helper-to-tcg: Introduce llvm-compat Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 12/43] helper-to-tcg: Introduce custom LLVM pipeline Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 13/43] helper-to-tcg: Introduce Error.h Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 14/43] helper-to-tcg: Introduce PrepareForOptPass Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 15/43] helper-to-tcg: PrepareForOptPass, map annotations Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 16/43] helper-to-tcg: PrepareForOptPass, Cull unused functions Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 17/43] helper-to-tcg: PrepareForOptPass, undef llvm.returnaddress Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 18/43] helper-to-tcg: PrepareForOptPass, Remove noinline attribute Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 19/43] helper-to-tcg: Pipeline, run optimization pass Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 20/43] helper-to-tcg: Introduce pseudo instructions Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 21/43] helper-to-tcg: Introduce PrepareForTcgPass Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 22/43] helper-to-tcg: PrepareForTcgPass, remove functions w. cycles Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 23/43] helper-to-tcg: PrepareForTcgPass, demote phi nodes Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 24/43] helper-to-tcg: PrepareForTcgPass, map TCG globals Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 25/43] helper-to-tcg: PrepareForTcgPass, transform GEPs Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 26/43] helper-to-tcg: PrepareForTcgPass, canonicalize IR Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 27/43] helper-to-tcg: PrepareForTcgPass, identity map trivial expressions Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 28/43] helper-to-tcg: Introduce TcgType.h Anton Johansson via
2024-11-22 18:26 ` Richard Henderson
2024-12-03 18:50 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 29/43] helper-to-tcg: Introduce TCG register allocation Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 30/43] helper-to-tcg: TcgGenPass, introduce TcgEmit.[cpp|h] Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 31/43] helper-to-tcg: Introduce TcgGenPass Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 32/43] helper-to-tcg: Add README Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 33/43] helper-to-tcg: Add end-to-end tests Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 34/43] target/hexagon: Add get_tb_mmu_index() Anton Johansson via
2024-11-22 18:34 ` Richard Henderson
2024-12-03 18:50 ` Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 35/43] target/hexagon: Use argparse in all python scripts Anton Johansson via
2024-12-05 15:23 ` Brian Cain
2024-11-21 1:49 ` [RFC PATCH v1 36/43] target/hexagon: Add temporary vector storage Anton Johansson via
2024-11-22 18:35 ` Richard Henderson
2024-12-03 18:56 ` Anton Johansson via
2024-12-03 20:28 ` Brian Cain
2024-12-04 0:37 ` ltaylorsimpson
2024-11-21 1:49 ` [RFC PATCH v1 37/43] target/hexagon: Make HVX vector args. restrict * Anton Johansson via
2024-11-25 11:36 ` Philippe Mathieu-Daudé
2024-11-25 12:00 ` Paolo Bonzini
2024-12-03 18:57 ` Anton Johansson via
2024-12-03 18:58 ` Brian Cain
2024-11-21 1:49 ` [RFC PATCH v1 38/43] target/hexagon: Use cpu_mapping to map env -> TCG Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 39/43] target/hexagon: Keep gen_slotval/check_noshuf for helper-to-tcg Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 40/43] target/hexagon: Emit annotations for helpers Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 41/43] target/hexagon: Manually call generated HVX instructions Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 42/43] target/hexagon: Only translate w. idef-parser if helper-to-tcg failed Anton Johansson via
2024-11-21 1:49 ` [RFC PATCH v1 43/43] target/hexagon: Use helper-to-tcg Anton Johansson via
2024-11-25 11:34 ` [RFC PATCH v1 00/43] Introduce helper-to-tcg Philippe Mathieu-Daudé
2024-12-03 18:58 ` Anton Johansson via
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=6b087061-fb11-4ac5-aecc-43f3324060df@linaro.org \
--to=richard.henderson@linaro.org \
--cc=ale@rev.ng \
--cc=alex.bennee@linaro.org \
--cc=anjo@rev.ng \
--cc=bcain@quicinc.com \
--cc=ltaylorsimpson@gmail.com \
--cc=philmd@linaro.org \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).