From: Szabolcs Nagy <Szabolcs.Nagy@arm.com>
To: Mark Brown <broonie@kernel.org>
Cc: Marc Zyngier <maz@kernel.org>,
Basant KumarDwivedi <Basant.KumarDwivedi@arm.com>,
Will Deacon <will@kernel.org>,
Luis Machado <Luis.Machado@arm.com>,
Catalin Marinas <Catalin.Marinas@arm.com>,
Alan Hayward <Alan.Hayward@arm.com>,
"linux-arm-kernel@lists.infradead.org"
<linux-arm-kernel@lists.infradead.org>,
"linux-kselftest@vger.kernel.org"
<linux-kselftest@vger.kernel.org>,
Shuah Khan <skhan@linuxfoundation.org>,
Shuah Khan <shuah@kernel.org>,
"kvmarm@lists.cs.columbia.edu" <kvmarm@lists.cs.columbia.edu>,
Salil Akerkar <Salil.Akerkar@arm.com>
Subject: Re: [PATCH v12 06/40] arm64/sme: Provide ABI documentation for SME
Date: Thu, 31 Mar 2022 16:05:38 +0000 [thread overview]
Message-ID: <YkXRUlaoyDKQqndc@arm.com> (raw)
In-Reply-To: <YiuYMcR8zk73eBLo@sirena.org.uk>
The 03/11/2022 18:42, Mark Brown wrote:
> On Fri, Mar 11, 2022 at 05:21:21PM +0000, Szabolcs Nagy wrote:
> > The 02/25/2022 16:58, Mark Brown wrote:
> > > +* On creation fork() or clone() the newly created process will have PSTATE.SM
> > > + and PSTATE.ZA cleared.
>
> > is there a reason why fork() clears ZA?
>
> > i think this is a minor issue, but the usual expectation is that
> > on thread creation thread local state is reset in the child, but
> > in a forked child the state is the same as in the parent (where
> > ZA is preserved according to the first rule).
>
> It was partly consistency with SM and the SVE state (though that is also
> covered by just being in a system call unlike ZA) and partly concerns
> about what happens if the fork() happens in library code which isn't SME
> aware - it would end up carrying around a copy of ZA with associated
> power and performance impacts if it doesn't exec(). Overall it seemed
> like there would to be less potential for unpleasant surprises if we
> consistently discard the data.
>
> That's not a *super* strongly held opinion though, we could switch to
> preserving whenever we preserve TPIDR2.
i think it's slightly better to treat ZA like TPIDR2,
so only clear if CLONE_SETTLS is set.
otherwise in principle the child can return to the frame
where ZA was used and expect it to work (it's hard to
come up with a reason why would some code do that, but
this is valid in a single-threaded fork child).
sorry for not deciding this earlier.
_______________________________________________
kvmarm mailing list
kvmarm@lists.cs.columbia.edu
https://lists.cs.columbia.edu/mailman/listinfo/kvmarm
WARNING: multiple messages have this Message-ID (diff)
From: Szabolcs Nagy <Szabolcs.Nagy@arm.com>
To: Mark Brown <broonie@kernel.org>
Cc: Catalin Marinas <Catalin.Marinas@arm.com>,
Will Deacon <will@kernel.org>, Marc Zyngier <maz@kernel.org>,
Shuah Khan <skhan@linuxfoundation.org>,
Shuah Khan <shuah@kernel.org>,
Alan Hayward <Alan.Hayward@arm.com>,
Luis Machado <Luis.Machado@arm.com>,
Salil Akerkar <Salil.Akerkar@arm.com>,
Basant KumarDwivedi <Basant.KumarDwivedi@arm.com>,
James Morse <James.Morse@arm.com>,
Alexandru Elisei <Alexandru.Elisei@arm.com>,
Suzuki Poulose <Suzuki.Poulose@arm.com>,
"linux-arm-kernel@lists.infradead.org"
<linux-arm-kernel@lists.infradead.org>,
"linux-kselftest@vger.kernel.org"
<linux-kselftest@vger.kernel.org>,
"kvmarm@lists.cs.columbia.edu" <kvmarm@lists.cs.columbia.edu>
Subject: Re: [PATCH v12 06/40] arm64/sme: Provide ABI documentation for SME
Date: Thu, 31 Mar 2022 16:05:38 +0000 [thread overview]
Message-ID: <YkXRUlaoyDKQqndc@arm.com> (raw)
In-Reply-To: <YiuYMcR8zk73eBLo@sirena.org.uk>
The 03/11/2022 18:42, Mark Brown wrote:
> On Fri, Mar 11, 2022 at 05:21:21PM +0000, Szabolcs Nagy wrote:
> > The 02/25/2022 16:58, Mark Brown wrote:
> > > +* On creation fork() or clone() the newly created process will have PSTATE.SM
> > > + and PSTATE.ZA cleared.
>
> > is there a reason why fork() clears ZA?
>
> > i think this is a minor issue, but the usual expectation is that
> > on thread creation thread local state is reset in the child, but
> > in a forked child the state is the same as in the parent (where
> > ZA is preserved according to the first rule).
>
> It was partly consistency with SM and the SVE state (though that is also
> covered by just being in a system call unlike ZA) and partly concerns
> about what happens if the fork() happens in library code which isn't SME
> aware - it would end up carrying around a copy of ZA with associated
> power and performance impacts if it doesn't exec(). Overall it seemed
> like there would to be less potential for unpleasant surprises if we
> consistently discard the data.
>
> That's not a *super* strongly held opinion though, we could switch to
> preserving whenever we preserve TPIDR2.
i think it's slightly better to treat ZA like TPIDR2,
so only clear if CLONE_SETTLS is set.
otherwise in principle the child can return to the frame
where ZA was used and expect it to work (it's hard to
come up with a reason why would some code do that, but
this is valid in a single-threaded fork child).
sorry for not deciding this earlier.
WARNING: multiple messages have this Message-ID (diff)
From: Szabolcs Nagy <Szabolcs.Nagy@arm.com>
To: Mark Brown <broonie@kernel.org>
Cc: Catalin Marinas <Catalin.Marinas@arm.com>,
Will Deacon <will@kernel.org>, Marc Zyngier <maz@kernel.org>,
Shuah Khan <skhan@linuxfoundation.org>,
Shuah Khan <shuah@kernel.org>,
Alan Hayward <Alan.Hayward@arm.com>,
Luis Machado <Luis.Machado@arm.com>,
Salil Akerkar <Salil.Akerkar@arm.com>,
Basant KumarDwivedi <Basant.KumarDwivedi@arm.com>,
James Morse <James.Morse@arm.com>,
Alexandru Elisei <Alexandru.Elisei@arm.com>,
Suzuki Poulose <Suzuki.Poulose@arm.com>,
"linux-arm-kernel@lists.infradead.org"
<linux-arm-kernel@lists.infradead.org>,
"linux-kselftest@vger.kernel.org"
<linux-kselftest@vger.kernel.org>,
"kvmarm@lists.cs.columbia.edu" <kvmarm@lists.cs.columbia.edu>
Subject: Re: [PATCH v12 06/40] arm64/sme: Provide ABI documentation for SME
Date: Thu, 31 Mar 2022 16:05:38 +0000 [thread overview]
Message-ID: <YkXRUlaoyDKQqndc@arm.com> (raw)
In-Reply-To: <YiuYMcR8zk73eBLo@sirena.org.uk>
The 03/11/2022 18:42, Mark Brown wrote:
> On Fri, Mar 11, 2022 at 05:21:21PM +0000, Szabolcs Nagy wrote:
> > The 02/25/2022 16:58, Mark Brown wrote:
> > > +* On creation fork() or clone() the newly created process will have PSTATE.SM
> > > + and PSTATE.ZA cleared.
>
> > is there a reason why fork() clears ZA?
>
> > i think this is a minor issue, but the usual expectation is that
> > on thread creation thread local state is reset in the child, but
> > in a forked child the state is the same as in the parent (where
> > ZA is preserved according to the first rule).
>
> It was partly consistency with SM and the SVE state (though that is also
> covered by just being in a system call unlike ZA) and partly concerns
> about what happens if the fork() happens in library code which isn't SME
> aware - it would end up carrying around a copy of ZA with associated
> power and performance impacts if it doesn't exec(). Overall it seemed
> like there would to be less potential for unpleasant surprises if we
> consistently discard the data.
>
> That's not a *super* strongly held opinion though, we could switch to
> preserving whenever we preserve TPIDR2.
i think it's slightly better to treat ZA like TPIDR2,
so only clear if CLONE_SETTLS is set.
otherwise in principle the child can return to the frame
where ZA was used and expect it to work (it's hard to
come up with a reason why would some code do that, but
this is valid in a single-threaded fork child).
sorry for not deciding this earlier.
_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next prev parent reply other threads:[~2022-03-31 16:06 UTC|newest]
Thread overview: 171+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-02-25 16:58 [PATCH v12 00/40] arm64/sme: Initial support for the Scalable Matrix Extension Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 01/40] arm64: Define CPACR_EL1_FPEN similarly to other floating point controls Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 02/40] arm64: Always use individual bits in CPACR floating point enables Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 03/40] arm64: cpufeature: Always specify and use a field width for capabilities Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 04/40] kselftest/arm64: Remove local ARRAY_SIZE() definitions Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 05/40] kselftest/arm64: signal: Allow tests to be incompatible with features Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 06/40] arm64/sme: Provide ABI documentation for SME Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-03-02 17:23 ` Catalin Marinas
2022-03-02 17:23 ` Catalin Marinas
2022-03-02 17:23 ` Catalin Marinas
2022-03-11 17:21 ` Szabolcs Nagy
2022-03-11 17:21 ` Szabolcs Nagy
2022-03-11 17:21 ` Szabolcs Nagy
2022-03-11 18:42 ` Mark Brown
2022-03-11 18:42 ` Mark Brown
2022-03-11 18:42 ` Mark Brown
2022-03-31 16:05 ` Szabolcs Nagy [this message]
2022-03-31 16:05 ` Szabolcs Nagy
2022-03-31 16:05 ` Szabolcs Nagy
2022-04-06 18:50 ` Mark Brown
2022-04-06 18:50 ` Mark Brown
2022-04-06 18:50 ` Mark Brown
2022-04-07 15:26 ` Szabolcs Nagy
2022-04-07 15:26 ` Szabolcs Nagy
2022-04-07 15:26 ` Szabolcs Nagy
2022-06-06 10:35 ` Luis Machado
2022-06-06 10:35 ` Luis Machado
2022-06-06 10:35 ` Luis Machado
2022-02-25 16:58 ` [PATCH v12 07/40] arm64/sme: System register and exception syndrome definitions Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 08/40] arm64/sme: Manually encode SME instructions Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-03-02 14:35 ` Catalin Marinas
2022-03-02 14:35 ` Catalin Marinas
2022-03-02 14:35 ` Catalin Marinas
2022-02-25 16:58 ` [PATCH v12 09/40] arm64/sme: Early CPU setup for SME Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 10/40] arm64/sme: Basic enumeration support Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-03-02 16:29 ` Catalin Marinas
2022-03-02 16:29 ` Catalin Marinas
2022-03-02 16:29 ` Catalin Marinas
2022-02-25 16:58 ` [PATCH v12 11/40] arm64/sme: Identify supported SME vector lengths at boot Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-03-02 16:41 ` Catalin Marinas
2022-03-02 16:41 ` Catalin Marinas
2022-03-02 16:41 ` Catalin Marinas
2022-03-16 21:32 ` Thiago Jung Bauermann
2022-03-16 21:32 ` Thiago Jung Bauermann
2022-03-16 21:32 ` Thiago Jung Bauermann
2022-02-25 16:58 ` [PATCH v12 12/40] arm64/sme: Implement sysctl to set the default vector length Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 13/40] arm64/sme: Implement vector length configuration prctl()s Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 14/40] arm64/sme: Implement support for TPIDR2 Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 15/40] arm64/sme: Implement SVCR context switching Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` [PATCH v12 16/40] arm64/sme: Implement streaming SVE " Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:58 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 17/40] arm64/sme: Implement ZA " Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 18/40] arm64/sme: Implement traps and syscall handling for SME Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-03-02 17:07 ` Catalin Marinas
2022-03-02 17:07 ` Catalin Marinas
2022-03-02 17:07 ` Catalin Marinas
2022-02-25 16:59 ` [PATCH v12 19/40] arm64/sme: Disable ZA and streaming mode when handling signals Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 20/40] arm64/sme: Implement streaming SVE signal handling Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-03-02 17:09 ` Catalin Marinas
2022-03-02 17:09 ` Catalin Marinas
2022-03-02 17:09 ` Catalin Marinas
2022-03-16 22:38 ` Thiago Jung Bauermann
2022-03-16 22:38 ` Thiago Jung Bauermann
2022-03-16 22:38 ` Thiago Jung Bauermann
2022-02-25 16:59 ` [PATCH v12 21/40] arm64/sme: Implement ZA " Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 22/40] arm64/sme: Implement ptrace support for streaming mode SVE registers Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-03-02 17:11 ` Catalin Marinas
2022-03-02 17:11 ` Catalin Marinas
2022-03-02 17:11 ` Catalin Marinas
2022-02-25 16:59 ` [PATCH v12 23/40] arm64/sme: Add ptrace support for ZA Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-03-15 21:51 ` Thiago Jung Bauermann
2022-03-15 21:51 ` Thiago Jung Bauermann
2022-03-15 21:51 ` Thiago Jung Bauermann
2022-02-25 16:59 ` [PATCH v12 24/40] arm64/sme: Disable streaming mode and ZA when flushing CPU state Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 25/40] arm64/sme: Save and restore streaming mode over EFI runtime calls Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 26/40] KVM: arm64: Hide SME system registers from guests Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 27/40] KVM: arm64: Trap SME usage in guest Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 28/40] KVM: arm64: Handle SME host state when running guests Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 29/40] arm64/sme: Provide Kconfig for SME Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 30/40] kselftest/arm64: Add manual encodings for SME instructions Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 31/40] kselftest/arm64: sme: Add SME support to vlset Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 32/40] kselftest/arm64: Add tests for TPIDR2 Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 33/40] kselftest/arm64: Extend vector configuration API tests to cover SME Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 34/40] kselftest/arm64: sme: Provide streaming mode SVE stress test Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 35/40] kselftest/arm64: signal: Handle ZA signal context in core code Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 36/40] kselftest/arm64: Add stress test for SME ZA context switching Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 37/40] kselftest/arm64: signal: Add SME signal handling tests Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 38/40] kselftest/arm64: Add streaming SVE to SVE ptrace tests Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 39/40] kselftest/arm64: Add coverage for the ZA ptrace interface Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` [PATCH v12 40/40] kselftest/arm64: Add SME support to syscall ABI test Mark Brown
2022-02-25 16:59 ` Mark Brown
2022-02-25 16:59 ` Mark Brown
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YkXRUlaoyDKQqndc@arm.com \
--to=szabolcs.nagy@arm.com \
--cc=Alan.Hayward@arm.com \
--cc=Basant.KumarDwivedi@arm.com \
--cc=Catalin.Marinas@arm.com \
--cc=Luis.Machado@arm.com \
--cc=Salil.Akerkar@arm.com \
--cc=broonie@kernel.org \
--cc=kvmarm@lists.cs.columbia.edu \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kselftest@vger.kernel.org \
--cc=maz@kernel.org \
--cc=shuah@kernel.org \
--cc=skhan@linuxfoundation.org \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.