linux-arch.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Andy Lutomirski <luto@amacapital.net>
To: Catalin Marinas <catalin.marinas@arm.com>
Cc: Eugene Syromiatnikov <esyr@redhat.com>,
	linux-doc@vger.kernel.org, szabolcs.nagy@arm.com,
	palmer@sifive.com, Heiko Carstens <heiko.carstens@de.ibm.com>,
	ynorov@caviumnetworks.com, Pavel Machek <pavel@ucw.cz>,
	philipp.tomsich@theobroma-systems.com,
	"Joseph S. Myers" <joseph@codesourcery.com>,
	linux-arch <linux-arch@vger.kernel.org>,
	sellcey@caviumnetworks.com, Prasun.Kapoor@caviumnetworks.com,
	schwab@suse.de, Alexander Graf <agraf@suse.de>,
	bamv2005@gmail.com, Geert Uytterhoeven <geert@linux-m68k.org>,
	Dave Martin <Dave.Martin@arm.com>,
	Adam Borowski <kilobyte@angband.pl>,
	manuel.montezelo@gmail.com, James Hogan <james.hogan@imgtec.com>,
	Chris Metcalf <cmetcalf@mellanox.com>,
	Arnd Bergmann <arnd@arndb.de>, Andrew Pinski <pinskia@gmail.com>,
	linyongting@huawei.com
Subject: Re: [PATCH v9 00/24] ILP32 for ARM64
Date: Sat, 13 Oct 2018 09:54:22 -0700	[thread overview]
Message-ID: <CALCETrW0sFoOknkLGFSiobQLg9O4qzD9f-9ws8uk69J1X866cg@mail.gmail.com> (raw)
In-Reply-To: <20181013093411.o3id6yzkspsxr5jt@mbp>

> On Oct 13, 2018, at 2:34 AM, Catalin Marinas <catalin.marinas@arm.com> wrote:
>
>> On Sat, Oct 13, 2018 at 04:14:16AM +0200, Eugene Syromiatnikov wrote:
>>> On Wed, Oct 10, 2018 at 04:36:56PM +0100, Catalin Marinas wrote:
>>>> On Wed, Oct 10, 2018 at 04:10:21PM +0200, Eugene Syromiatnikov wrote:
>>>> I have some questions regarding AArch64 ILP32 implementation for which I
>>>> failed to find an answer myself:
>>>> * How ptrace() tracer is supposed to distinguish between ILP32 and LP64
>>>>   tracees? For MIPS N32 and x32 this is possible based on syscall
>>>>   number, but for AArch64 ILP32 I do not see such a sign. There's also
>>>>   ARM_ip is employed for signalling entering/exiting, I wonder whether
>>>>   it's possible to employ it also for signalling tracee's personality.
>>>
>>> With the current implementation, I don't think you can distinguish. From
>>> the kernel perspective, the register set is the same. What is the
>>> use-case for this?
>>
>> Err, a ptrace()-based tracer trying to trace a process, for example?
>
> I first thought it wouldn't matter for ptrace-based tracers since the
> syscall numbers are (mostly) the same. But the arguments layout in
> register is indeed different, so I see your point now about having to
> distinguish.
>
>>> We could add a new regset to expose the ILP32 state (NT_ARM_..., I can't
>>> think of a name now but probably not PER* as this implies PER_LINUX_...
>>> which is independent from TIF_32BIT_*).
>>
>> So that would require an additional ptrace() call on each syscall stop,
>> is that correct?
>
> The ILP32 state does not change at run-time, so it could only do a
> ptrace() call once and save the information. No need to re-read it on
> each syscall stop.
>

Please solve this in an arch independent way.  This situation is
basically unusably broken on x86 right now.  Please solve it for real,
by, for example, adding a new ptrace operation that returns something
like this:

enum ptrace_syscall_state {
  NO_SYSCALL,
  SYSCALL_ENTRY,
  SYSCALL_EXIT,
  /* other values may be defined in the future. */
};

struct ptrace_syscall_info {
  enum ptrace_syscall_state state;
  unsigned long arch;
  union {
    struct {
      unsigned long nr;
      unsigned long args[6];
   } entry;
   struct {
      unsigned long ret;
   } exit;
};

where arch is an AUDIT_ARCH_XYZ constant.

On x86, it's currently essentially impossible for tools like strace to
correctly decode syscalls.

> We could set a high bit in the syscall number reported to the ptrace
> caller (though not changing the syscall ABI) but I haven't thought of
> other consequences. For example, can the ptrace caller change the
> syscall number?

Yes it can.

>
>>>> * What's the reasoning behind capping syscall arguments to 32 bit? x32
>>>>   and MIPS N32 do not have such a restriction (and do not need special
>>>>   wrappers for syscalls that pass 64-bit values as a result, except
>>>>   when they do,  as it is the case for preadv2 on x32); moreover, that
>>>>   would lead to insurmountable difficulties for AArch64 ILP32 tracers
>>>>   that try to trace LP64 tracees, as it would be impossible to pass
>>>>   64-bit addresses to process_vm_{read,write} or ptrace PEEK/POKE.
>>>
>>> We've attempted in earlier versions to allow a mix of 32 and 64-bit
>>> register values from ILP32 but it got pretty complicated. The entry code
>>> would need to know which registers need zeroing of the top 32-bit
>>
>> If kernel specifies 64-bit wide registers for syscalls, then it's the
>> caller's (libc's) responsibility to properly sign-extend arguments when
>> needed, and glibc, for example, already has proper type definitions that
>> aimed to handle this.
>
> We tried, see my other reply.
>
> --
> Catalin

WARNING: multiple messages have this Message-ID (diff)
From: Andy Lutomirski <luto@amacapital.net>
To: Catalin Marinas <catalin.marinas@arm.com>
Cc: Eugene Syromiatnikov <esyr@redhat.com>,
	linux-doc@vger.kernel.org, szabolcs.nagy@arm.com,
	palmer@sifive.com, Heiko Carstens <heiko.carstens@de.ibm.com>,
	ynorov@caviumnetworks.com, Pavel Machek <pavel@ucw.cz>,
	philipp.tomsich@theobroma-systems.com,
	"Joseph S. Myers" <joseph@codesourcery.com>,
	linux-arch <linux-arch@vger.kernel.org>,
	sellcey@caviumnetworks.com, Prasun.Kapoor@caviumnetworks.com,
	schwab@suse.de, Alexander Graf <agraf@suse.de>,
	bamv2005@gmail.com, Geert Uytterhoeven <geert@linux-m68k.org>,
	Dave Martin <Dave.Martin@arm.com>,
	Adam Borowski <kilobyte@angband.pl>,
	manuel.montezelo@gmail.com, James Hogan <james.hogan@imgtec.com>,
	Chris Metcalf <cmetcalf@mellanox.com>,
	Arnd Bergmann <arnd@arndb.de>, Andrew Pinski <pinskia@gmail.com>,
	linyongting@huawei.com, klimov.linux@gmail.com,
	wookey@wookware.org, Mark Brown <broonie@kernel.org>,
	linux-arm-kernel <linux-arm-kernel@lists.infradead.org>,
	Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>,
	Florian Weimer <fweimer@redhat.com>,
	Linux API <linux-api@vger.kernel.org>,
	Nathan Lynch <Nathan_Lynch@mentor.com>,
	LKML <linux-kernel@vger.kernel.org>,
	James Morse <james.morse@arm.com>,
	ramana.gcc@googlemail.com,
	Martin Schwidefsky <schwidefsky@de.ibm.com>,
	"David S. Miller" <davem@davemloft.net>,
	christoph.muellner@theobroma-systems.com
Subject: Re: [PATCH v9 00/24] ILP32 for ARM64
Date: Sat, 13 Oct 2018 09:54:22 -0700	[thread overview]
Message-ID: <CALCETrW0sFoOknkLGFSiobQLg9O4qzD9f-9ws8uk69J1X866cg@mail.gmail.com> (raw)
Message-ID: <20181013165422.R7cPFFuv2GnWXACVBAA2L9Btss2HA2sz1Zi-XULhlT0@z> (raw)
In-Reply-To: <20181013093411.o3id6yzkspsxr5jt@mbp>

> On Oct 13, 2018, at 2:34 AM, Catalin Marinas <catalin.marinas@arm.com> wrote:
>
>> On Sat, Oct 13, 2018 at 04:14:16AM +0200, Eugene Syromiatnikov wrote:
>>> On Wed, Oct 10, 2018 at 04:36:56PM +0100, Catalin Marinas wrote:
>>>> On Wed, Oct 10, 2018 at 04:10:21PM +0200, Eugene Syromiatnikov wrote:
>>>> I have some questions regarding AArch64 ILP32 implementation for which I
>>>> failed to find an answer myself:
>>>> * How ptrace() tracer is supposed to distinguish between ILP32 and LP64
>>>>   tracees? For MIPS N32 and x32 this is possible based on syscall
>>>>   number, but for AArch64 ILP32 I do not see such a sign. There's also
>>>>   ARM_ip is employed for signalling entering/exiting, I wonder whether
>>>>   it's possible to employ it also for signalling tracee's personality.
>>>
>>> With the current implementation, I don't think you can distinguish. From
>>> the kernel perspective, the register set is the same. What is the
>>> use-case for this?
>>
>> Err, a ptrace()-based tracer trying to trace a process, for example?
>
> I first thought it wouldn't matter for ptrace-based tracers since the
> syscall numbers are (mostly) the same. But the arguments layout in
> register is indeed different, so I see your point now about having to
> distinguish.
>
>>> We could add a new regset to expose the ILP32 state (NT_ARM_..., I can't
>>> think of a name now but probably not PER* as this implies PER_LINUX_...
>>> which is independent from TIF_32BIT_*).
>>
>> So that would require an additional ptrace() call on each syscall stop,
>> is that correct?
>
> The ILP32 state does not change at run-time, so it could only do a
> ptrace() call once and save the information. No need to re-read it on
> each syscall stop.
>

Please solve this in an arch independent way.  This situation is
basically unusably broken on x86 right now.  Please solve it for real,
by, for example, adding a new ptrace operation that returns something
like this:

enum ptrace_syscall_state {
  NO_SYSCALL,
  SYSCALL_ENTRY,
  SYSCALL_EXIT,
  /* other values may be defined in the future. */
};

struct ptrace_syscall_info {
  enum ptrace_syscall_state state;
  unsigned long arch;
  union {
    struct {
      unsigned long nr;
      unsigned long args[6];
   } entry;
   struct {
      unsigned long ret;
   } exit;
};

where arch is an AUDIT_ARCH_XYZ constant.

On x86, it's currently essentially impossible for tools like strace to
correctly decode syscalls.

> We could set a high bit in the syscall number reported to the ptrace
> caller (though not changing the syscall ABI) but I haven't thought of
> other consequences. For example, can the ptrace caller change the
> syscall number?

Yes it can.

>
>>>> * What's the reasoning behind capping syscall arguments to 32 bit? x32
>>>>   and MIPS N32 do not have such a restriction (and do not need special
>>>>   wrappers for syscalls that pass 64-bit values as a result, except
>>>>   when they do,  as it is the case for preadv2 on x32); moreover, that
>>>>   would lead to insurmountable difficulties for AArch64 ILP32 tracers
>>>>   that try to trace LP64 tracees, as it would be impossible to pass
>>>>   64-bit addresses to process_vm_{read,write} or ptrace PEEK/POKE.
>>>
>>> We've attempted in earlier versions to allow a mix of 32 and 64-bit
>>> register values from ILP32 but it got pretty complicated. The entry code
>>> would need to know which registers need zeroing of the top 32-bit
>>
>> If kernel specifies 64-bit wide registers for syscalls, then it's the
>> caller's (libc's) responsibility to properly sign-extend arguments when
>> needed, and glibc, for example, already has proper type definitions that
>> aimed to handle this.
>
> We tried, see my other reply.
>
> --
> Catalin

  parent reply	other threads:[~2018-10-13 16:54 UTC|newest]

Thread overview: 135+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-05-16  8:18 [PATCH v9 00/24] ILP32 for ARM64 Yury Norov
2018-05-16  8:18 ` Yury Norov
2018-05-16  8:18 ` [PATCH 01/24] arm64: signal: Make parse_user_sigframe() independent of rt_sigframe layout Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 02/24] ptrace: Add compat PTRACE_{G,S}ETSIGMASK handlers Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 03/24] compat ABI: use non-compat openat and open_by_handle_at variants Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 04/24] 32-bit userspace ABI: introduce ARCH_32BIT_OFF_T config option Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-06-08 17:32   ` Catalin Marinas
2018-06-08 17:32     ` Catalin Marinas
2018-06-08 22:33     ` Palmer Dabbelt
2018-06-08 22:33       ` Palmer Dabbelt
2018-06-09  7:43       ` Yury Norov
2018-06-09  7:43         ` Yury Norov
2018-06-09 21:13       ` Adam Borowski
2018-06-09 21:13         ` Adam Borowski
2018-06-09  7:42     ` Yury Norov
2018-06-09  7:42       ` Yury Norov
2018-06-11  7:48       ` Arnd Bergmann
2018-06-11  7:48         ` Arnd Bergmann
2018-06-11 11:27         ` Yury Norov
2018-06-11 11:27           ` Yury Norov
2018-06-25  6:19           ` Yury Norov
2018-06-25  6:19             ` Yury Norov
2018-08-02 18:30             ` Palmer Dabbelt
2018-08-02 18:30               ` Palmer Dabbelt
2018-05-16  8:18 ` [PATCH 05/24] asm-generic: Drop getrlimit and setrlimit syscalls from default list Yury Norov
2018-05-16  8:18 ` [PATCH 06/24] thread: move thread bits accessors to separated file Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 07/24] arm64: ilp32: add documentation on the ILP32 ABI for ARM64 Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-23 14:06   ` Pavel Machek
2018-05-23 14:06     ` Pavel Machek
2018-05-24 12:15     ` Yury Norov
2018-05-24 12:15       ` Yury Norov
2018-05-24 12:24       ` Dr. Philipp Tomsich
2018-05-24 12:24         ` Dr. Philipp Tomsich
2018-05-16  8:18 ` [PATCH 08/24] arm64: rename COMPAT to AARCH32_EL0 in Kconfig Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 09/24] arm64: rename functions that reference compat term Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 10/24] arm64: uapi: set __BITS_PER_LONG correctly for ILP32 and LP64 Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 11/24] arm64: introduce is_a32_task and is_a32_thread (for AArch32 compat) Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 12/24] arm64: ilp32: add is_ilp32_compat_{task,thread} and TIF_32BIT_AARCH64 Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 13/24] arm64: introduce binfmt_elf32.c Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:18 ` [PATCH 14/24] arm64: change compat_elf_hwcap and compat_elf_hwcap2 prefix to a32 Yury Norov
2018-05-16  8:18   ` Yury Norov
2018-05-16  8:19 ` [PATCH 15/24] arm64: ilp32: introduce binfmt_ilp32.c Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 16/24] arm64: ilp32: share aarch32 syscall handlers Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 17/24] arm64: ilp32: add sys_ilp32.c and a separate table (in entry.S) to use it Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 18/24] arm64: signal: share lp64 signal structures and routines to ilp32 Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 19/24] arm64: signal32: move ilp32 and aarch32 common code to separated file Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 20/24] arm64: ilp32: introduce ilp32-specific sigframe and ucontext Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 21/24] arm64: ptrace: handle ptrace_request differently for aarch32 and ilp32 Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 22/24] arm64:ilp32: add vdso-ilp32 and use for signal return Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 23/24] arm64:ilp32: add ARM64_ILP32 to Kconfig Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-05-16  8:19 ` [PATCH 24/24] arm64: ilp32: Make the Kconfig option default y Yury Norov
2018-05-16  8:19   ` Yury Norov
2018-07-24 17:39 ` [PATCH v9 00/24] ILP32 for ARM64 Yury Norov
2018-07-24 17:39   ` Yury Norov
2018-07-25  9:48   ` Andreas Schwab
2018-07-25  9:48     ` Andreas Schwab
2018-10-10 14:10   ` Eugene Syromiatnikov
2018-10-10 14:10     ` Eugene Syromiatnikov
2018-10-10 14:18     ` Arnd Bergmann
2018-10-10 14:18       ` Arnd Bergmann
2018-10-10 14:39     ` Szabolcs Nagy
2018-10-10 14:39       ` Szabolcs Nagy
2018-10-13  2:07       ` Eugene Syromiatnikov
2018-10-13  2:07         ` Eugene Syromiatnikov
2018-10-13  9:20         ` Catalin Marinas
2018-10-13  9:20           ` Catalin Marinas
2018-10-14 19:53         ` Arnd Bergmann
2018-10-14 19:53           ` Arnd Bergmann
2018-10-10 15:36     ` Catalin Marinas
2018-10-10 15:36       ` Catalin Marinas
2018-10-13  2:14       ` Eugene Syromiatnikov
2018-10-13  2:14         ` Eugene Syromiatnikov
2018-10-13  9:34         ` Catalin Marinas
2018-10-13  9:34           ` Catalin Marinas
2018-10-13 13:43           ` Yury Norov
2018-10-13 13:43             ` Yury Norov
2018-10-13 16:54           ` Andy Lutomirski [this message]
2018-10-13 16:54             ` Andy Lutomirski
2018-10-13 19:36 ` Andy Lutomirski
2018-10-13 19:36   ` Andy Lutomirski
2018-10-14 19:49   ` Arnd Bergmann
2018-10-14 19:49     ` Arnd Bergmann
2018-10-18 11:14     ` Catalin Marinas
2018-10-18 11:14       ` Catalin Marinas
2018-11-19 21:29 ` Yury Norov
2018-11-19 21:29   ` Yury Norov
2019-01-07 15:50 ` Yuri Norov
2019-01-07 15:50   ` Yuri Norov
     [not found]   ` <DC9A951E-B638-4820-8499-02D5322E7DF7@amacapital.net>
2019-01-07 20:43     ` Yuri Norov
2019-01-07 20:43       ` Yuri Norov
2019-01-08 21:18   ` [PATCH] arm64: introduce AUDIT_ARCH_AARCH64ILP32 for ilp32 Yuri Norov
2019-01-08 21:18     ` Yuri Norov
2019-03-05 20:56 ` [PATCH v9 00/24] ILP32 for ARM64 Yury Norov
2019-03-05 20:56   ` Yury Norov
2019-05-08 22:59 ` Yury Norov
2019-05-08 22:59   ` Yury Norov
2019-05-08 23:10   ` Yury Norov
2019-05-08 23:10     ` Yury Norov
2019-05-13  8:48   ` Andreas Schwab
2019-05-13  8:48     ` Andreas Schwab
2019-05-13 20:16     ` [EXT] " Yuri Norov
2019-05-13 20:16       ` Yuri Norov
2019-05-14 10:43       ` [LTP] " Cyril Hrubis
2019-05-14 10:43         ` Cyril Hrubis
2019-05-14 23:01         ` Yury Norov
2019-05-14 23:01           ` Yury Norov
2019-05-14 23:41         ` Yury Norov
2019-05-14 23:41           ` Yury Norov
2019-07-09 22:42 ` Yury Norov
2019-07-09 22:42   ` Yury Norov
  -- strict thread matches above, loose matches on Subject: below --
2018-05-15 19:11 Yury Norov
2018-05-15 19:11 ` Yury Norov
2018-05-15 19:40 ` Yury Norov
2018-05-15 19:40   ` Yury Norov

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=CALCETrW0sFoOknkLGFSiobQLg9O4qzD9f-9ws8uk69J1X866cg@mail.gmail.com \
    --to=luto@amacapital.net \
    --cc=Dave.Martin@arm.com \
    --cc=Prasun.Kapoor@caviumnetworks.com \
    --cc=agraf@suse.de \
    --cc=arnd@arndb.de \
    --cc=bamv2005@gmail.com \
    --cc=catalin.marinas@arm.com \
    --cc=cmetcalf@mellanox.com \
    --cc=esyr@redhat.com \
    --cc=geert@linux-m68k.org \
    --cc=heiko.carstens@de.ibm.com \
    --cc=james.hogan@imgtec.com \
    --cc=joseph@codesourcery.com \
    --cc=kilobyte@angband.pl \
    --cc=linux-arch@vger.kernel.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linyongting@huawei.com \
    --cc=manuel.montezelo@gmail.com \
    --cc=palmer@sifive.com \
    --cc=pavel@ucw.cz \
    --cc=philipp.tomsich@theobroma-systems.com \
    --cc=pinskia@gmail.com \
    --cc=schwab@suse.de \
    --cc=sellcey@caviumnetworks.com \
    --cc=szabolcs.nagy@arm.com \
    --cc=ynorov@caviumnetworks.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).