From: Bjorn Helgaas <helgaas@kernel.org>
To: Nicolas Chauvet <kwizart@gmail.com>
Cc: Thierry Reding <thierry.reding@gmail.com>,
Jonathan Hunter <jonathanh@nvidia.com>,
Manikanta Maddireddy <mmaddireddy@nvidia.com>,
Lorenzo Pieralisi <lorenzo.pieralisi@arm.com>,
linux-tegra@vger.kernel.org, linux-pci@vger.kernel.org,
stable@vger.kernel.org
Subject: Re: [PATCH] pci: tegra: Revert raw_violation_fixup for tegra124
Date: Fri, 17 Jul 2020 16:53:04 -0500 [thread overview]
Message-ID: <20200717215304.GA775582@bjorn-Precision-5520> (raw)
In-Reply-To: <20200717213510.171726-1-kwizart@gmail.com>
Please update subject to follow the convention ("git log --online
drivers/pci/controller/pci-tegra.c") to see it:
PCI: tegra: Revert tegra124 raw_violation_fixup
On Fri, Jul 17, 2020 at 11:35:10PM +0200, Nicolas Chauvet wrote:
> As reported in https://bugzilla.kernel.org/206217 , raw_violation_fixup
> is causing more harm than good in some common use-cases.
>
> This patch is a partial revert of the 191cd6fb5 commit:
> "PCI: tegra: Add SW fixup for RAW violations"
Usual style is:
191cd6fb5d2c ("PCI: tegra: Add SW fixup for RAW violations")
> that was first introduced in 5.3-rc1 kernel.
> This fix the following regression since then.
>
> * Description:
> When both the NIC and MMC are used one can see the following message:
>
> NETDEV WATCHDOG: enp1s0 (r8169): transmit queue 0 timed out
>
> and
>
> pcieport 0000:00:02.0: AER: Uncorrected (Non-Fatal) error received: 0000:01:00.0
> r8169 0000:01:00.0: AER: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
> r8169 0000:01:00.0: AER: device [10ec:8168] error status/mask=00004000/00400000
> r8169 0000:01:00.0: AER: [14] CmpltTO (First)
> r8169 0000:01:00.0: AER: can't recover (no error_detected callback)
> pcieport 0000:00:02.0: AER: device recovery failed
Indent the quoted text (messages) two spaces so it's distinct from the
prose.
> After that, the ethernet NIC isn't functional anymore even after reloading
> the r8169 module.
> After a reboot, this is reproducible by copying a large file over the
> NIC to the MMC.
This looks like two paragraphs; if so, put a blank line between them.
Otherwise wrap them so they fill the line. It's hard to read when
there are line breaks that look unnecessary.
> For some reasons this cannot be reproduced when the same file is copied
> to a tmpfs.
>
> * Little background on the fixup, by Manikanta Maddireddy:
> "In the internal testing with dGPU on Tegra124, CmplTO is reported by
> dGPU. This happened because FIFO queue in AFI(AXI to PCIe) module
> get full by upstream posted writes. Back to back upstream writes
> interleaved with infrequent reads, triggers RAW violation and CmpltTO.
> This is fixed by reducing the posted write credits and by changing
> updateFC timer frequency. These settings are fixed after stress test.
>
> In the current case, RTL NIC is also reporting CmplTO. These settings
> seems to be aggravating the issue instead of fixing it."
>
> v1: first non-RFC version
> - Disable raw_violation_fixup and fully remove unused code and macros
This version history can go after the "---" so it doesn't get included
in the final commit log. It's nice if your subject line includes
"[PATCH v2]" or whatever is appropriate.
Add this just before your Signed-off-by:
Fixes: 191cd6fb5d2c ("PCI: tegra: Add SW fixup for RAW violations")
> Signed-off-by: Nicolas Chauvet <kwizart@gmail.com>
> Reviewed-by: Manikanta Maddireddy <mmaddireddy@nvidia.com>
> Cc: <stable@vger.kernel.org> # 5.4.x
No "<>" needed around stable@vger.kernel.org
You need not (and shouldn't) cc: stable@vger.kernel.org when you post
this to the list. The stable tag here in the commit log is
sufficient. Documentation/process/stable-kernel-rules.rst for more
details.
Is v5.4.x really the oldest kernel that should get this fix? It looks
like 191cd6fb5d2c appeared in v5.3.
next prev parent reply other threads:[~2020-07-17 21:53 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-07-17 21:35 [PATCH] pci: tegra: Revert raw_violation_fixup for tegra124 Nicolas Chauvet
2020-07-17 21:53 ` Bjorn Helgaas [this message]
2020-07-20 7:02 ` Nicolas Chauvet
[not found] ` <CABr+WTkJ8jZDkM_=-LYxpbrqrsPEb96YBRJvBjR5u+0Ck9R4CQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2020-07-20 11:30 ` Lorenzo Pieralisi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200717215304.GA775582@bjorn-Precision-5520 \
--to=helgaas@kernel.org \
--cc=jonathanh@nvidia.com \
--cc=kwizart@gmail.com \
--cc=linux-pci@vger.kernel.org \
--cc=linux-tegra@vger.kernel.org \
--cc=lorenzo.pieralisi@arm.com \
--cc=mmaddireddy@nvidia.com \
--cc=stable@vger.kernel.org \
--cc=thierry.reding@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox