From: Catalin Marinas <catalin.marinas@arm.com>
To: Lorenzo Pieralisi <lorenzo.pieralisi@arm.com>
Cc: Ard Biesheuvel <ardb@kernel.org>,
Linux ARM <linux-arm-kernel@lists.infradead.org>,
ACPI Devel Maling List <linux-acpi@vger.kernel.org>,
Will Deacon <will@kernel.org>,
Jeremy Linton <jeremy.linton@arm.com>,
Nicolas Saenz Julienne <nsaenzjulienne@suse.de>,
Rob Herring <robh+dt@kernel.org>, Christoph Hellwig <hch@lst.de>,
Robin Murphy <robin.murphy@arm.com>,
Hanjun Guo <guohanjun@huawei.com>,
Sudeep Holla <sudeep.holla@arm.com>,
Anshuman Khandual <anshuman.khandual@arm.com>
Subject: Re: [PATCH] arm64: mm: set ZONE_DMA size based on early IORT scan
Date: Wed, 14 Oct 2020 17:18:18 +0100 [thread overview]
Message-ID: <20201014161818.GF3589@gaia> (raw)
In-Reply-To: <20201013154100.GA22293@e121166-lin.cambridge.arm.com>
Hi Lorenzo,
On Tue, Oct 13, 2020 at 04:41:00PM +0100, Lorenzo Pieralisi wrote:
> On Tue, Oct 13, 2020 at 03:42:07PM +0200, Ard Biesheuvel wrote:
> > On Tue, 13 Oct 2020 at 15:13, Lorenzo Pieralisi
> > <lorenzo.pieralisi@arm.com> wrote:
> > > On Tue, Oct 13, 2020 at 01:22:32PM +0200, Ard Biesheuvel wrote:
> > > > > > diff --git a/Documentation/arm64/arm-acpi.rst b/Documentation/arm64/arm-acpi.rst
> > > > > > index 47ecb9930dde..947f5b5c45ef 100644
> > > > > > --- a/Documentation/arm64/arm-acpi.rst
> > > > > > +++ b/Documentation/arm64/arm-acpi.rst
> > > > > > @@ -205,6 +205,13 @@ devices available. This list of tables is not meant to be all inclusive;
> > > > > > in some environments other tables may be needed (e.g., any of the APEI
> > > > > > tables from section 18) to support specific functionality.
> > > > > >
> > > > > > +It is assumed that all DMA capable devices in the system are able to
> > > > > > +access the lowest 4 GB of system memory. If this is not the case, an
> > > > > > +IORT describing those limitations is mandatory, even if an IORT is not
> > > > > > +otherwise necessary to describe the I/O topology, and regardless of
> > > > > > +whether _DMA methods are used to describe the DMA limitations more
> > > > > > +precisely. Once the system has booted, _DMA methods will take precedence
> > > > > > +over DMA addressing limits described in the IORT.
> > > > >
> > > > > If this is a boot requirement it must be in ARM's official documentation,
> > > > > first, not the kernel one.
> > > > >
> > > > > I understand this is an urgent (well - no comments on why bootstrapping
> > > > > ACPI on Raspberry PI4 is causing all this fuss, honestly) fix but that's
> > > > > not a reason to rush through these guidelines.
> > > > >
> > > > > I would not add this paragraph to arm-acpi.rst, yet.
> > > >
> > > > Which documentation? ACPI compliance by itself is not sufficient for a
> > > > system to be able to boot Linux/arm64, which is why we documented the
> > > > requirements for ACPI boot on Linux/arm64 in this file. I don't think
> > > > we need endorsement from ARM to decide that odd platforms like this
> > > > need to abide by some additional rules if they want to boot in ACPI
> > > > mode.
> > >
> > > I think we do - if we don't we should not add this documentation either.
> > >
> > > ACPI on ARM64 software stack is based on standardized HW requirements.
> > > The sheer fact that we need to work around a HW deficiency shows that
> > > either this platform should have never been booted with ACPI or the _HW_
> > > design guidelines (BSA) are not tight enough.
> > >
> > > Please note that as you may have understood I asked if we can implement
> > > a workaround in IORT because that's information that must be there
> > > regardless (and an OEM ID match in arch code - though pragmatic -
> > > defeats the whole purpose), I don't think we should tell Linux kernel
> > > developers how firmware must be written to work around blatantly
> > > non-compliant systems.
> >
> > This is not about systems being compliant or not, unless there is a
> > requirement somewhere that I missed that all masters in the system
> > must be able to access at least 32 bits of DMA.
>
> I think there is in the SBSA (4.1.3 Memory Map) but regardless, this
> is clearly a design bug, that's not a feature.
I think in revision D of the SBSA 6.0 this was moved to 3.1.3. Anyway, I
guess you are referring to:
All Non-secure on-chip masters in a base server system that are
expected to be under the control of the operating system or hypervisor
must be capable of addressing all of the Non-secure address space.
IIUC, this rules out 32-bit devices as well. Even if we look at the less
strict BSA (https://developer.arm.com/documentation/den0094/latest), the
requirements are the same.
So we can consider the bug either in the *BSA specs or in hardware. If
at least the 32-bit devices are acceptable, the specs should probably be
updated. Otherwise we just state that Linux has slightly different
requirements than *BSA (more relaxed or tightened in various areas) and
we describe them somewhere under Documentation/arm64/.
> > The problem here is that Linux/arm64 cannot deal with fully compliant
> > systems that communicate their [permitted] DMA limitations via a _DMA
> > method if this limitation happens to be that the address limit < 32
> > bits. The DMA subsystem can deal with this fine, only the default DMA
> > zone sizing policy creates an internal issue where the DMA subsystem
> > is not able to allocate memory that matches the DMA constraints.
> >
> > So the 'correct' fix here would be to rework the memory allocator so
> > it can deal with arbitrary DMA limits at allocation time, so that any
> > limit returned by a _DMA method can be adhered to on the fly.
> >
> > However, we all agree that the Raspberry Pi4 is not worth that effort,
> > and that in the general case, SoCs with such limitations, even if they
> > are compliant per the spec, are not worth the trouble of complicating
> > this even more. So as a compromise, I think it is perfectly reasonable
> > to require that systems that have such limitations communicate them
> > via the IORT, which we can parse early, regardless of whether _DMA
> > methods exist as well, and whether they return the same information.
> >
> > So this is not a requirement on arm64 ACPI systems in general. It is a
> > requirement that expresses that we, as arm64
> > contributors/[co-]maintainers, are willing to cater for such systems
> > if they implement their firmware in a particular way.
>
> I don't think they should implement their firmware in any particular
> way, that's my point, I don't want them to in the first place.
I haven't checked the *BBR specs, do they say anything about _DMA
methods or IORT tables for devices that can't access the full memory?
> To start with there is no spec I am aware of that defines when/how to
> use _DMA vs IORT address limits, maybe we should spell that out better
> somewhere and that's useful regardless.
I guess this answers my question above.
> My point is: either this workaround works with firmware written with
> guidelines valid for all arm64 systems (not as a special case: add an
> IORT table because we can't parse _DMA to workaround DMA address range
> shenanigans) or I am not willing to merge it - I prefer to add an OEM ID
> quirk and show what we are forced to do to make this work.
If the guidelines don't say anything about <=32-bit device masks, we can
make up our requirements for Linux. Currently it happens to work on some
kernel versions because we have a ZONE_DMA of 1GB but prior to this
change it wouldn't have worked. We might as well disable ZONE_DMA and
ZONE_DMA32 altogether because the BSA specs state that there are no such
limited devices, but some SoCs would no longer boot.
I think all this depends on whether 32-bit devices are also special and
need quirks. If they do, can we add both 32-bit and <32-bit devices in
the same "special" pot and require IORT tables? I agree it's not nice to
ignore the _DMA method alternative during zone setup but we could still
check the latter at run-time at warn if smaller than the minimum
ZONE_DMA available.
--
Catalin
WARNING: multiple messages have this Message-ID (diff)
From: Catalin Marinas <catalin.marinas@arm.com>
To: Lorenzo Pieralisi <lorenzo.pieralisi@arm.com>
Cc: Anshuman Khandual <anshuman.khandual@arm.com>,
Robin Murphy <robin.murphy@arm.com>,
Sudeep Holla <sudeep.holla@arm.com>,
Jeremy Linton <jeremy.linton@arm.com>,
Christoph Hellwig <hch@lst.de>,
ACPI Devel Maling List <linux-acpi@vger.kernel.org>,
Rob Herring <robh+dt@kernel.org>,
Linux ARM <linux-arm-kernel@lists.infradead.org>,
Hanjun Guo <guohanjun@huawei.com>, Will Deacon <will@kernel.org>,
Ard Biesheuvel <ardb@kernel.org>,
Nicolas Saenz Julienne <nsaenzjulienne@suse.de>
Subject: Re: [PATCH] arm64: mm: set ZONE_DMA size based on early IORT scan
Date: Wed, 14 Oct 2020 17:18:18 +0100 [thread overview]
Message-ID: <20201014161818.GF3589@gaia> (raw)
In-Reply-To: <20201013154100.GA22293@e121166-lin.cambridge.arm.com>
Hi Lorenzo,
On Tue, Oct 13, 2020 at 04:41:00PM +0100, Lorenzo Pieralisi wrote:
> On Tue, Oct 13, 2020 at 03:42:07PM +0200, Ard Biesheuvel wrote:
> > On Tue, 13 Oct 2020 at 15:13, Lorenzo Pieralisi
> > <lorenzo.pieralisi@arm.com> wrote:
> > > On Tue, Oct 13, 2020 at 01:22:32PM +0200, Ard Biesheuvel wrote:
> > > > > > diff --git a/Documentation/arm64/arm-acpi.rst b/Documentation/arm64/arm-acpi.rst
> > > > > > index 47ecb9930dde..947f5b5c45ef 100644
> > > > > > --- a/Documentation/arm64/arm-acpi.rst
> > > > > > +++ b/Documentation/arm64/arm-acpi.rst
> > > > > > @@ -205,6 +205,13 @@ devices available. This list of tables is not meant to be all inclusive;
> > > > > > in some environments other tables may be needed (e.g., any of the APEI
> > > > > > tables from section 18) to support specific functionality.
> > > > > >
> > > > > > +It is assumed that all DMA capable devices in the system are able to
> > > > > > +access the lowest 4 GB of system memory. If this is not the case, an
> > > > > > +IORT describing those limitations is mandatory, even if an IORT is not
> > > > > > +otherwise necessary to describe the I/O topology, and regardless of
> > > > > > +whether _DMA methods are used to describe the DMA limitations more
> > > > > > +precisely. Once the system has booted, _DMA methods will take precedence
> > > > > > +over DMA addressing limits described in the IORT.
> > > > >
> > > > > If this is a boot requirement it must be in ARM's official documentation,
> > > > > first, not the kernel one.
> > > > >
> > > > > I understand this is an urgent (well - no comments on why bootstrapping
> > > > > ACPI on Raspberry PI4 is causing all this fuss, honestly) fix but that's
> > > > > not a reason to rush through these guidelines.
> > > > >
> > > > > I would not add this paragraph to arm-acpi.rst, yet.
> > > >
> > > > Which documentation? ACPI compliance by itself is not sufficient for a
> > > > system to be able to boot Linux/arm64, which is why we documented the
> > > > requirements for ACPI boot on Linux/arm64 in this file. I don't think
> > > > we need endorsement from ARM to decide that odd platforms like this
> > > > need to abide by some additional rules if they want to boot in ACPI
> > > > mode.
> > >
> > > I think we do - if we don't we should not add this documentation either.
> > >
> > > ACPI on ARM64 software stack is based on standardized HW requirements.
> > > The sheer fact that we need to work around a HW deficiency shows that
> > > either this platform should have never been booted with ACPI or the _HW_
> > > design guidelines (BSA) are not tight enough.
> > >
> > > Please note that as you may have understood I asked if we can implement
> > > a workaround in IORT because that's information that must be there
> > > regardless (and an OEM ID match in arch code - though pragmatic -
> > > defeats the whole purpose), I don't think we should tell Linux kernel
> > > developers how firmware must be written to work around blatantly
> > > non-compliant systems.
> >
> > This is not about systems being compliant or not, unless there is a
> > requirement somewhere that I missed that all masters in the system
> > must be able to access at least 32 bits of DMA.
>
> I think there is in the SBSA (4.1.3 Memory Map) but regardless, this
> is clearly a design bug, that's not a feature.
I think in revision D of the SBSA 6.0 this was moved to 3.1.3. Anyway, I
guess you are referring to:
All Non-secure on-chip masters in a base server system that are
expected to be under the control of the operating system or hypervisor
must be capable of addressing all of the Non-secure address space.
IIUC, this rules out 32-bit devices as well. Even if we look at the less
strict BSA (https://developer.arm.com/documentation/den0094/latest), the
requirements are the same.
So we can consider the bug either in the *BSA specs or in hardware. If
at least the 32-bit devices are acceptable, the specs should probably be
updated. Otherwise we just state that Linux has slightly different
requirements than *BSA (more relaxed or tightened in various areas) and
we describe them somewhere under Documentation/arm64/.
> > The problem here is that Linux/arm64 cannot deal with fully compliant
> > systems that communicate their [permitted] DMA limitations via a _DMA
> > method if this limitation happens to be that the address limit < 32
> > bits. The DMA subsystem can deal with this fine, only the default DMA
> > zone sizing policy creates an internal issue where the DMA subsystem
> > is not able to allocate memory that matches the DMA constraints.
> >
> > So the 'correct' fix here would be to rework the memory allocator so
> > it can deal with arbitrary DMA limits at allocation time, so that any
> > limit returned by a _DMA method can be adhered to on the fly.
> >
> > However, we all agree that the Raspberry Pi4 is not worth that effort,
> > and that in the general case, SoCs with such limitations, even if they
> > are compliant per the spec, are not worth the trouble of complicating
> > this even more. So as a compromise, I think it is perfectly reasonable
> > to require that systems that have such limitations communicate them
> > via the IORT, which we can parse early, regardless of whether _DMA
> > methods exist as well, and whether they return the same information.
> >
> > So this is not a requirement on arm64 ACPI systems in general. It is a
> > requirement that expresses that we, as arm64
> > contributors/[co-]maintainers, are willing to cater for such systems
> > if they implement their firmware in a particular way.
>
> I don't think they should implement their firmware in any particular
> way, that's my point, I don't want them to in the first place.
I haven't checked the *BBR specs, do they say anything about _DMA
methods or IORT tables for devices that can't access the full memory?
> To start with there is no spec I am aware of that defines when/how to
> use _DMA vs IORT address limits, maybe we should spell that out better
> somewhere and that's useful regardless.
I guess this answers my question above.
> My point is: either this workaround works with firmware written with
> guidelines valid for all arm64 systems (not as a special case: add an
> IORT table because we can't parse _DMA to workaround DMA address range
> shenanigans) or I am not willing to merge it - I prefer to add an OEM ID
> quirk and show what we are forced to do to make this work.
If the guidelines don't say anything about <=32-bit device masks, we can
make up our requirements for Linux. Currently it happens to work on some
kernel versions because we have a ZONE_DMA of 1GB but prior to this
change it wouldn't have worked. We might as well disable ZONE_DMA and
ZONE_DMA32 altogether because the BSA specs state that there are no such
limited devices, but some SoCs would no longer boot.
I think all this depends on whether 32-bit devices are also special and
need quirks. If they do, can we add both 32-bit and <32-bit devices in
the same "special" pot and require IORT tables? I agree it's not nice to
ignore the _DMA method alternative during zone setup but we could still
check the latter at run-time at warn if smaller than the minimum
ZONE_DMA available.
--
Catalin
_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next prev parent reply other threads:[~2020-10-14 16:18 UTC|newest]
Thread overview: 53+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-10 9:31 [PATCH] arm64: mm: set ZONE_DMA size based on early IORT scan Ard Biesheuvel
2020-10-10 9:31 ` Ard Biesheuvel
2020-10-12 9:28 ` Catalin Marinas
2020-10-12 9:28 ` Catalin Marinas
2020-10-12 9:30 ` Ard Biesheuvel
2020-10-12 9:30 ` Ard Biesheuvel
2020-10-12 10:43 ` Ard Biesheuvel
2020-10-12 10:43 ` Ard Biesheuvel
2020-10-12 11:24 ` Catalin Marinas
2020-10-12 11:24 ` Catalin Marinas
2020-10-12 14:19 ` Ard Biesheuvel
2020-10-12 14:19 ` Ard Biesheuvel
2020-10-12 15:49 ` Catalin Marinas
2020-10-12 15:49 ` Catalin Marinas
2020-10-12 15:55 ` Ard Biesheuvel
2020-10-12 15:55 ` Ard Biesheuvel
2020-10-12 16:22 ` Catalin Marinas
2020-10-12 16:22 ` Catalin Marinas
2020-10-12 16:35 ` Ard Biesheuvel
2020-10-12 16:35 ` Ard Biesheuvel
2020-10-12 16:59 ` Catalin Marinas
2020-10-12 16:59 ` Catalin Marinas
2020-10-13 14:42 ` Nicolas Saenz Julienne
2020-10-13 14:42 ` Nicolas Saenz Julienne
2020-10-13 15:45 ` Catalin Marinas
2020-10-13 15:45 ` Catalin Marinas
2020-10-14 12:44 ` Ard Biesheuvel
2020-10-14 12:44 ` Ard Biesheuvel
2020-10-14 12:54 ` Nicolas Saenz Julienne
2020-10-14 12:54 ` Nicolas Saenz Julienne
2020-10-12 12:16 ` kernel test robot
2020-10-12 12:16 ` kernel test robot
2020-10-12 12:16 ` kernel test robot
2020-10-13 11:09 ` Lorenzo Pieralisi
2020-10-13 11:09 ` Lorenzo Pieralisi
2020-10-13 11:22 ` Ard Biesheuvel
2020-10-13 11:22 ` Ard Biesheuvel
2020-10-13 11:38 ` Ard Biesheuvel
2020-10-13 11:38 ` Ard Biesheuvel
2020-10-13 11:43 ` Ard Biesheuvel
2020-10-13 11:43 ` Ard Biesheuvel
2020-10-13 13:13 ` Lorenzo Pieralisi
2020-10-13 13:13 ` Lorenzo Pieralisi
2020-10-13 13:42 ` Ard Biesheuvel
2020-10-13 13:42 ` Ard Biesheuvel
2020-10-13 15:11 ` Robin Murphy
2020-10-13 15:11 ` Robin Murphy
2020-10-13 15:41 ` Lorenzo Pieralisi
2020-10-13 15:41 ` Lorenzo Pieralisi
2020-10-14 16:18 ` Catalin Marinas [this message]
2020-10-14 16:18 ` Catalin Marinas
2020-10-14 17:23 ` Lorenzo Pieralisi
2020-10-14 17:23 ` Lorenzo Pieralisi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20201014161818.GF3589@gaia \
--to=catalin.marinas@arm.com \
--cc=anshuman.khandual@arm.com \
--cc=ardb@kernel.org \
--cc=guohanjun@huawei.com \
--cc=hch@lst.de \
--cc=jeremy.linton@arm.com \
--cc=linux-acpi@vger.kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=lorenzo.pieralisi@arm.com \
--cc=nsaenzjulienne@suse.de \
--cc=robh+dt@kernel.org \
--cc=robin.murphy@arm.com \
--cc=sudeep.holla@arm.com \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.