From: Arnaldo Carvalho de Melo <acme@kernel.org>
To: Alan Maguire <alan.maguire@oracle.com>
Cc: ast@kernel.org, andrii@kernel.org, daniel@iogearbox.net,
eddyz87@gmail.com, haoluo@google.com, jolsa@kernel.org,
john.fastabend@gmail.com, kpsingh@chromium.org,
sinquersw@gmail.com, martin.lau@kernel.org,
songliubraving@fb.com, sdf@google.com, timo@incline.eu,
yhs@fb.com, bpf@vger.kernel.org
Subject: Re: [PATCH v3 dwarves 0/8] dwarves: support encoding of optimized-out parameters, removal of inconsistent static functions
Date: Wed, 8 Feb 2023 13:20:39 -0300 [thread overview]
Message-ID: <Y+PL18hvJ7WwncGR@kernel.org> (raw)
In-Reply-To: <1675790102-23037-1-git-send-email-alan.maguire@oracle.com>
Em Tue, Feb 07, 2023 at 05:14:54PM +0000, Alan Maguire escreveu:
> At optimization level -O2 or higher in gcc, static functions may be
> optimized such that they have suffixes like .isra.0, .constprop.0 etc.
> These represent
>
> - constant propagation (.constprop.0);
> - interprocedural scalar replacement of aggregates, removal of
> unused parameters and replacement of parameters passed by
> reference by parameters passed by value (.isra.0)
Initial test, without using the new options:
[acme@pumpkin ~]$ pfunct /sys/kernel/btf/vmlinux | sort | uniq -c | sort -n | tail
3 start_show
3 timeout_show
3 uuid_show
4 m_next
4 parse_options
4 sk_diag_fill
4 state_show
4 state_store
5 status_show
6 type_show
[acme@pumpkin ~]$
Now I'll use --skip_encoding_btf_inconsistent_proto and --btf_gen_optimized
- Arnaldo
> See [1] for details.
>
> Currently BTF encoding does not handle such optimized functions
> that get renamed with a "." suffix such as ".isra.0", ".constprop.0".
> This is safer because such suffixes can often indicate parameters have
> been optimized out. This series addresses this by matching a
> function to a suffixed version ("foo" matching "foo.isra.0") while
> ensuring that the function signature does not contain optimized-out
> parameters. Note that if the function is found ("foo") it will
> be preferred, only falling back to "foo.isra.0" if lookup of the
> function fails. Addition to BTF is skipped if the function has
> optimized-out parameters, since the expected function signature
> will not match. BTF encoding does not include the "."-suffix to
> be consistent with DWARF. In addition, the kernel currently does
> not allow a "." suffix in a BTF function name.
>
> A problem with this approach however is that BTF carries out the
> encoding process in parallel across multiple CUs, and sometimes
> a function has optimized-out parameters in one CU but not others;
> we see this for NF_HOOK.constprop.0 for example. So in order to
> determine if the function has optimized-out parameters in any
> CU, its addition is not carried out until we have processed all
> CUs and are about to merge BTF. At this point we know if any
> such optimizations have occurred. Patches 1-5 handle the
> optimized-out parameter identification and matching "."-suffixed
> functions with the original function to facilitate BTF
> encoding. This feature can be enabled via the
> "--btf_gen_optimized" option.
>
> Patch 6 addresses a related problem - it is entirely possible
> for a static function of the same name to exist in different
> CUs with different function signatures. Because BTF does not
> currently encode any information that would help disambiguate
> which BTF function specification matches which static function
> (in the case of multiple different function signatures), it is
> best to eliminate such functions from BTF for now. The same
> mechanism that is used to compare static "."-suffixed functions
> is re-used for the static function comparison. A superficial
> comparison of number of parameters/parameter names is done to
> see if such representations are consistent, and if inconsistent
> prototypes are observed, the function is flagged for exclusion
> from BTF.
>
> When these methods are combined - the additive encoding of
> "."-suffixed functions and the subtractive elimination of
> functions with inconsistent parameters - we see an overall
> drop in the number of functions in vmlinux BTF, from
> 51529 to 50246. Skipping inconsistent functions is enabled
> via "--skip_encoding_btf_inconsistent_proto".
>
> Changes since v2 [2]
> - Arnaldo incorporated some of the suggestions in the v2 thread;
> these patches are based on those; the relevant changes are
> noted as committer changes.
> - Patch 1 is unchanged from v2, but the rest of the patches
> have been updated:
> - Patch 2 separates out the changes to the struct btf_encoder
> that better support later addition of functions.
> - Patch 3 then is changed insofar as these changes are no
> longer needed for the function addition refactoring.
> - Patch 4 has a small change; we need to verify that an
> encoder has actually been added to the encoders list
> prior to removal
> - Patch 5 changed significantly; when attempting to measure
> performance the relatively good numbers attained when using
> delayed function addition were not reproducible.
> Further analysis revealed that the large number of lookups
> caused by the presence of the separate function tree was
> a major cause of performance degradation in the multi
> threaded case. So instead of maintaining a separate tree,
> we use the ELF function list which we already need to look
> up to match ELF -> DWARF function descriptions to store
> the function representation. This has 2 benefits; firstly
> as mentioned, we already look up the ELF function so no
> additional lookup is required to save the function.
> Secondly, the ELF representation is identical for each
> encoder, so we can index the same function across multiple
> encoder function arrays - this greatly speeds up the
> processing of comparing function representations across
> encoders. There is still a performance cost in this
> approach however; more details are provided in patch 6.
> An option specific to adding functions with "." suffixes
> is added "--btf_gen_optimized"
> - Patch 6 builds on patch 5 in applying the save/merge/add
> approach for all functions using the same mechanisms.
> In addition the "--skip_encoding_btf_inconsistent_proto"
> option is introduced.
> - Patches 7/8 document the new options in the pahole manual
> page.
>
> Changes since v1 [3]
>
> - Eduard noted that a DW_AT_const_value attribute can signal
> an optimized-out parameter, and that the lack of a location
> attribute signals optimization; ensure we handle those cases
> also (Eduard, patch 1).
> - Jiri noted we can have inconsistencies between a static
> and non-static function; apply the comparison process to
> all functions (Jiri, patch 5)
> - segmentation fault was observed when handling functions with
> > 10 parameters; needed parameter comparison loop to exit
> at BTF_ENCODER_MAX_PARAMETERS (patch 5)
> - Kui-Feng Lee pointed out that having a global shared function
> tree would lead to a lot of contention; here a per-encoder
> tree is used, and once the threads are collected the trees
> are merged. Performance numbers are provided in patch 5
> (Kui-Feng Lee, patches 4/5)
>
> [1] https://gcc.gnu.org/onlinedocs/gcc/Optimize-Options.html
> [2] https://lore.kernel.org/bpf/1675088985-20300-1-git-send-email-alan.maguire@oracle.com/
> [3] https://lore.kernel.org/bpf/1674567931-26458-1-git-send-email-alan.maguire@oracle.com/
>
> Alan Maguire (8):
> dwarf_loader: Help spotting functions with optimized-out parameters
> btf_encoder: store type_id_off, unspecified type in encoder
> btf_encoder: Refactor function addition into dedicated
> btf_encoder__add_func
> btf_encoder: Rework btf_encoders__*() API to allow traversal of
> encoders
> btf_encoder: Represent "."-suffixed functions (".isra.0") in BTF
> btf_encoder: support delaying function addition to check for function
> prototype inconsistencies
> dwarves: document --btf_gen_optimized option
> dwarves: document --skip_encoding_btf_inconsistent_proto option
>
> btf_encoder.c | 360 +++++++++++++++++++++++++++++++++++++--------
> btf_encoder.h | 6 -
> dwarf_loader.c | 130 +++++++++++++++-
> dwarves.h | 11 +-
> man-pages/pahole.1 | 10 ++
> pahole.c | 30 +++-
> 6 files changed, 468 insertions(+), 79 deletions(-)
>
> --
> 2.31.1
>
--
- Arnaldo
next prev parent reply other threads:[~2023-02-08 16:20 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-02-07 17:14 [PATCH v3 dwarves 0/8] dwarves: support encoding of optimized-out parameters, removal of inconsistent static functions Alan Maguire
2023-02-07 17:14 ` [PATCH v3 dwarves 1/8] dwarf_loader: Help spotting functions with optimized-out parameters Alan Maguire
2023-02-07 17:14 ` [PATCH v3 dwarves 2/8] btf_encoder: store type_id_off, unspecified type in encoder Alan Maguire
2023-02-07 17:14 ` [PATCH v3 dwarves 3/8] btf_encoder: Refactor function addition into dedicated btf_encoder__add_func Alan Maguire
2023-02-07 17:14 ` [PATCH v3 dwarves 4/8] btf_encoder: Rework btf_encoders__*() API to allow traversal of encoders Alan Maguire
2023-02-07 17:14 ` [PATCH v3 dwarves 5/8] btf_encoder: Represent "."-suffixed functions (".isra.0") in BTF Alan Maguire
2023-02-08 13:19 ` Jiri Olsa
2023-02-08 14:43 ` Arnaldo Carvalho de Melo
2023-02-08 20:51 ` Jiri Olsa
2023-02-08 22:57 ` Alan Maguire
2023-02-07 17:15 ` [PATCH v3 dwarves 6/8] btf_encoder: support delaying function addition to check for function prototype inconsistencies Alan Maguire
2023-02-07 17:15 ` [PATCH v3 dwarves 7/8] dwarves: document --btf_gen_optimized option Alan Maguire
2023-02-07 17:15 ` [PATCH v3 dwarves 8/8] dwarves: document --skip_encoding_btf_inconsistent_proto option Alan Maguire
2023-02-08 13:20 ` [PATCH v3 dwarves 0/8] dwarves: support encoding of optimized-out parameters, removal of inconsistent static functions Jiri Olsa
2023-02-08 15:25 ` Alan Maguire
2023-02-08 16:20 ` Arnaldo Carvalho de Melo [this message]
2023-02-08 16:50 ` Arnaldo Carvalho de Melo
2023-02-09 9:36 ` Jiri Olsa
2023-02-09 12:22 ` Arnaldo Carvalho de Melo
[not found] ` <3c021d56-8818-2464-f7e0-889e769c0311@oracle.com>
2023-02-09 13:09 ` [PATCH bpf-next] bpf: add --skip_encoding_btf_inconsistent_proto, --btf_gen_optimized to pahole flags for v1.25 Arnaldo Carvalho de Melo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Y+PL18hvJ7WwncGR@kernel.org \
--to=acme@kernel.org \
--cc=alan.maguire@oracle.com \
--cc=andrii@kernel.org \
--cc=ast@kernel.org \
--cc=bpf@vger.kernel.org \
--cc=daniel@iogearbox.net \
--cc=eddyz87@gmail.com \
--cc=haoluo@google.com \
--cc=john.fastabend@gmail.com \
--cc=jolsa@kernel.org \
--cc=kpsingh@chromium.org \
--cc=martin.lau@kernel.org \
--cc=sdf@google.com \
--cc=sinquersw@gmail.com \
--cc=songliubraving@fb.com \
--cc=timo@incline.eu \
--cc=yhs@fb.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox