From: Yonghong Song <yhs@fb.com>
To: "Jose E. Marchesi" <jose.marchesi@oracle.com>, Lorenz Bauer <oss@lmb.io>
Cc: andrii@kernel.org, bpf@vger.kernel.org
Subject: Re: Signedness of char in BTF
Date: Thu, 21 Jul 2022 11:44:33 -0700
Message-ID: <e636b480-8d53-a628-bacf-bac2b1506a47@fb.com>
In-Reply-To: <87wnc6bjny.fsf@oracle.com>

On 7/21/22 7:54 AM, Jose E. Marchesi wrote:
>
>> Hi Yonghong and Andrii,
>>
>> I have some questions re: signedness of chars in BTF. According to [1]
>> BTF_INT_ENCODING() may be one of SIGNED, CHAR or BOOL.
>
> I have always assumed that the bits in `encoding' are non-exclusive,
> i.e. it is a bitmap, not an enumeration.
Based on the current BTF design, it is an enumeration, not a bitmap.
So 'signed char' is encoded as a signed 1-byte int and 'unsigned char'
as an unsigned 1-byte int. Plain 'char' could be encoded as
BTF_INT_CHAR, but since debuginfo records a signedness for every
'char', it is folded into either the signed or the unsigned 1-byte int.
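To make the "enumeration, not bitmap" reading concrete, here is a small user-space sketch (not kernel code). The macros mirror the ones in include/uapi/linux/btf.h and are redefined locally so that it builds without kernel headers. make_int_data() and int_encoding_is_valid() are hypothetical helper names used only for illustration, and the check assumes that at most one of SIGNED, CHAR or BOOL may be set:

```c
#include <assert.h>
#include <stdint.h>

/* Local copies of the BTF_KIND_INT layout from include/uapi/linux/btf.h. */
#define BTF_INT_ENCODING(VAL)	(((VAL) & 0x0f000000) >> 24)
#define BTF_INT_OFFSET(VAL)	(((VAL) & 0x00ff0000) >> 16)
#define BTF_INT_BITS(VAL)	((VAL) & 0x000000ff)
#define BTF_INT_SIGNED	(1 << 0)
#define BTF_INT_CHAR	(1 << 1)
#define BTF_INT_BOOL	(1 << 2)

/* Hypothetical helper: pack the u32 that follows a BTF_KIND_INT type. */
static uint32_t make_int_data(uint32_t encoding, uint32_t offset, uint32_t bits)
{
	/* Each field must fit the width the macros above extract. */
	assert(encoding <= 0xf && offset <= 0xff && bits <= 0xff);
	return (encoding << 24) | (offset << 16) | bits;
}

/*
 * Hypothetical helper: returns 1 if the encoding is a single enumerated
 * value (none, SIGNED, CHAR or BOOL), 0 if several bits are combined.
 * Assumption: this matches the "at most one encoding" rule.
 */
static int int_encoding_is_valid(uint32_t int_data)
{
	uint32_t encoding = BTF_INT_ENCODING(int_data);

	return encoding == 0 ||
	       encoding == BTF_INT_SIGNED ||
	       encoding == BTF_INT_CHAR ||
	       encoding == BTF_INT_BOOL;
}
```

Under this sketch, make_int_data(BTF_INT_SIGNED, 0, 8), which corresponds to the clang output for 'char' quoted below, passes the check, while BTF_INT_SIGNED | BTF_INT_CHAR, as in the GCC output quoted below, does not.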
>
>> If I read [2] correctly the signedness of char is implementation
>> defined. Does this mean that I need to know which implementation
>> generated the BTF to interpret CHAR correctly?
>>
>> Somewhat related, how do I make clang emit BTF_INT_CHAR in the first
>> place? I've tried with clang-14, but only ever get
>>
>> [6] INT 'unsigned char' size=1 bits_offset=0 nr_bits=8 encoding=(none)
>> [6] INT 'char' size=1 bits_offset=0 nr_bits=8 encoding=SIGNED
>
> Hm, in GCC we currently generate:
>
> [1] int 'unsigned char'(0x00000001U#B) size=0x00000001U#B offset=0x00UB#b bits=0x08UB#b CHAR
> [2] int 'char'(0x00000001U#B) size=0x00000001U#B offset=0x00UB#b bits=0x08UB#b SIGNED CHAR
>
> Which turns out is not correct?
>
> We used a signed type for `char' because that was what the LLVM BPF
> toolchain uses, but then we assumed we had to emit the CHAR bit as
> well... wrong assumption apparently (I just tried with clang 15 and it
> sets the CHAR bit for neither `char' nor `unsigned char').
>
> But then what is the CHAR bit for?
BTF_INT_CHAR is not generated by llvm or pahole, but apparently it can
still matter when printing a value: a 'char c' may be dumped as 'c'
instead of 0x63. In kernel/bpf/btf.c, we have

	/*
	 * BTF_INT_CHAR encoding never seems to be set for
	 * char arrays, so if size is 1 and element is
	 * printable as a char, we'll do that.
	 */
	if (elem_size == 1)
		encoding = BTF_INT_CHAR;
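As a rough user-space illustration of that display choice (a sketch only, not the kernel's actual printing code), a 1-byte value can be shown as a character when it is printable and as hex otherwise. format_byte() is a hypothetical helper name, and using isprint() as the "printable" test is an assumption of this sketch:

```c
#include <assert.h>
#include <ctype.h>
#include <stdio.h>

/*
 * Hypothetical helper: format a 1-byte value the way a CHAR-aware dumper
 * might, as 'c' when printable and as 0x.. otherwise.
 * Returns the snprintf() result (length of the formatted text).
 */
static int format_byte(char *buf, size_t len, unsigned char v)
{
	assert(buf != NULL);

	if (isprint(v))
		return snprintf(buf, len, "'%c'", v);

	return snprintf(buf, len, "0x%02x", v);
}
```

Calling format_byte(buf, sizeof(buf), 0x63) yields 'c' in the default C locale, while a non-printable value such as 0x01 is kept as hex.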
>
>> The kernel seems to agree that CHAR isn't a thing [3].
>>
>> Thanks!
>> Lorenz
>>
>> 1: https://www.kernel.org/doc/html/latest/bpf/btf.html#btf-kind-int
>> 2: https://stackoverflow.com/a/2054941/19544965
>> 3: https://sourcegraph.com/github.com/torvalds/linux@353f7988dd8413c47718f7ca79c030b6fb62cfe5/-/blob/kernel/bpf/btf.c?L2928-2934