From: Jonathan Cameron <Jonathan.Cameron@huawei.com>
To: Jon Pan-Doh <pandoh@google.com>
Cc: Bjorn Helgaas <bhelgaas@google.com>,
Karolina Stolarek <karolina.stolarek@oracle.com>,
<linux-pci@vger.kernel.org>,
Martin Petersen <martin.petersen@oracle.com>,
Ben Fuller <ben.fuller@oracle.com>,
"Drew Walton" <drewwalton@microsoft.com>,
Anil Agrawal <anilagrawal@meta.com>,
Tony Luck <tony.luck@intel.com>
Subject: Re: [PATCH 5/8] PCI/AER: Introduce ratelimit for AER IRQs
Date: Fri, 31 Jan 2025 14:55:15 +0000 [thread overview]
Message-ID: <20250131145515.000049f9@huawei.com> (raw)
In-Reply-To: <20250115074301.3514927-6-pandoh@google.com>
On Tue, 14 Jan 2025 23:42:57 -0800
Jon Pan-Doh <pandoh@google.com> wrote:
> After ratelimiting logs, spammy devices can still slow execution by
> continued AER IRQ servicing.
>
> Add higher per-device ratelimits for AER errors to mask out those IRQs.
> Set the default rate to 3x default AER ratelimit (30 per 5s).
>
> Tested using aer-inject[1] tool. Injected 32 AER errors. Observed IRQ
> masked via lspci and sysfs counters record 31 errors (1 masked).
>
> Before: CEMsk: BadTLP-
> After: CEMsk: BadTLP+
>
> [1] https://git.kernel.org/pub/scm/linux/kernel/git/gong.chen/aer-inject.git
>
> Signed-off-by: Jon Pan-Doh <pandoh@google.com>
Hi Jon,
Comment inline.
> ---
> Documentation/PCI/pcieaer-howto.rst | 4 +-
> drivers/pci/pcie/aer.c | 64 +++++++++++++++++++++++++----
> 2 files changed, 57 insertions(+), 11 deletions(-)
>
> diff --git a/Documentation/PCI/pcieaer-howto.rst b/Documentation/PCI/pcieaer-howto.rst
> index 5546de60f184..d41272504b18 100644
> --- a/Documentation/PCI/pcieaer-howto.rst
> +++ b/Documentation/PCI/pcieaer-howto.rst
> @@ -88,8 +88,8 @@ fields.
> AER Ratelimits
> -------------------------
>
> -Error messages are ratelimited per device and error type. This prevents spammy
> -devices from flooding the console.
> +Errors, both at log and IRQ level, are ratelimited per device and error type.
> +This prevents spammy devices from stalling execution.
>
> AER Statistics / Counters
> -------------------------
> diff --git a/drivers/pci/pcie/aer.c b/drivers/pci/pcie/aer.c
> index 025c50b0f293..1db70ae87f52 100644
> --- a/drivers/pci/pcie/aer.c
> +++ b/drivers/pci/pcie/aer.c
> @@ -87,6 +87,8 @@ struct aer_info {
> u64 rootport_total_nonfatal_errs;
>
> /* Ratelimits for errors */
> + struct ratelimit_state cor_irq_ratelimit;
> + struct ratelimit_state uncor_irq_ratelimit;
> struct ratelimit_state cor_log_ratelimit;
> struct ratelimit_state uncor_log_ratelimit;
> };
> @@ -379,6 +381,10 @@ void pci_aer_init(struct pci_dev *dev)
> return;
>
> dev->aer_info = kzalloc(sizeof(struct aer_info), GFP_KERNEL);
> + ratelimit_state_init(&dev->aer_info->cor_irq_ratelimit,
> + DEFAULT_RATELIMIT_INTERVAL, DEFAULT_RATELIMIT_BURST*3);
> + ratelimit_state_init(&dev->aer_info->uncor_irq_ratelimit,
> + DEFAULT_RATELIMIT_INTERVAL, DEFAULT_RATELIMIT_BURST*3);
> ratelimit_state_init(&dev->aer_info->cor_log_ratelimit,
> DEFAULT_RATELIMIT_INTERVAL, DEFAULT_RATELIMIT_BURST);
> ratelimit_state_init(&dev->aer_info->uncor_log_ratelimit,
> @@ -676,6 +682,39 @@ static void pci_rootport_aer_stats_incr(struct pci_dev *pdev,
> }
> }
>
> +static void mask_reported_error(struct pci_dev *dev, struct aer_err_info *info)
> +{
> + const char **strings;
> + const char *errmsg;
> + u16 aer_offset = dev->aer_cap;
> + u16 mask_reg_offset;
> + u32 mask;
> + unsigned long status = info->status;
> + int i;
> +
> + if (info->severity == AER_CORRECTABLE) {
> + strings = aer_correctable_error_string;
> + mask_reg_offset = PCI_ERR_COR_MASK;
> + } else {
> + strings = aer_uncorrectable_error_string;
> + mask_reg_offset = PCI_ERR_UNCOR_MASK;
> + }
> +
> + pci_read_config_dword(dev, aer_offset + mask_reg_offset, &mask);
> + mask |= status;
> + pci_write_config_dword(dev, aer_offset + mask_reg_offset, mask);
> +
> + pci_warn(dev, "%s error(s) masked due to rate-limiting:",
> + aer_error_severity_string[info->severity]);
> + for_each_set_bit(i, &status, 32) {
> + errmsg = strings[i];
> + if (!errmsg)
> + errmsg = "Unknown Error Bit";
> +
> + pci_warn(dev, " [%2d] %-22s\n", i, errmsg);
> + }
> +}
> +
> static void __print_tlp_header(struct pci_dev *dev, struct pcie_tlp_log *t)
> {
> pci_err(dev, " TLP Header: %08x %08x %08x %08x\n",
> @@ -713,7 +752,8 @@ void aer_print_error(struct pci_dev *dev, struct aer_err_info *info)
> int layer, agent;
> int id = pci_dev_id(dev);
> const char *level;
> - struct ratelimit_state *ratelimit;
> + struct ratelimit_state *irq_ratelimit;
> + struct ratelimit_state *log_ratelimit;
>
> if (!info->status) {
> pci_err(dev, "PCIe Bus Error: severity=%s, type=Inaccessible, (Unregistered Agent ID)\n",
> @@ -722,14 +762,20 @@ void aer_print_error(struct pci_dev *dev, struct aer_err_info *info)
> }
>
> if (info->severity == AER_CORRECTABLE) {
> - ratelimit = &dev->aer_info->cor_log_ratelimit;
> + irq_ratelimit = &dev->aer_info->cor_irq_ratelimit;
> + log_ratelimit = &dev->aer_info->cor_log_ratelimit;
> level = KERN_WARNING;
> } else {
> - ratelimit = &dev->aer_info->uncor_log_ratelimit;
> + irq_ratelimit = &dev->aer_info->uncor_irq_ratelimit;
> + log_ratelimit = &dev->aer_info->uncor_log_ratelimit;
> level = KERN_ERR;
> }
>
> - if (!__ratelimit(ratelimit))
> + if (!__ratelimit(irq_ratelimit)) {
> + mask_reported_error(dev, info);
> + return;
So if I follow correctly. We count irqs for any error type and
then mask whatever was set on one that triggered this rate_limit check?
That last one isn't reported other than via a log message.
Imagine that is a totally unrelated error to the earlier ones,
now RASDaemon has no info on it at all as the tracepoint never
fired. To me that's a very different situation to it knowing there
were 10 errors of the type vs more.
I'd like to see that final trace point and also to see a tracepoint
that lets rasdaemon etc know you cut off errors after this point
+ rasdaemon support for using that.
Terry can address if we need to do anything different for CXL given
he is updating this handling for that. Superficially I think we
need to driver the masking down into the CXL RAS capability registers.
Internal error in the AER capabilities is far to big a hammer to apply.
> + }
> + if (!__ratelimit(log_ratelimit))
> return;
>
> layer = AER_GET_LAYER_ERROR(info->severity, info->status);
> @@ -776,14 +822,14 @@ void pci_print_aer(struct pci_dev *dev, int aer_severity,
> int layer, agent, tlp_header_valid = 0;
> u32 status, mask;
> struct aer_err_info info;
> - struct ratelimit_state *ratelimit;
> + struct ratelimit_state *log_ratelimit;
>
> if (aer_severity == AER_CORRECTABLE) {
> - ratelimit = &dev->aer_info->cor_log_ratelimit;
> + log_ratelimit = &dev->aer_info->cor_log_ratelimit;
> status = aer->cor_status;
> mask = aer->cor_mask;
> } else {
> - ratelimit = &dev->aer_info->uncor_log_ratelimit;
> + log_ratelimit = &dev->aer_info->uncor_log_ratelimit;
> status = aer->uncor_status;
> mask = aer->uncor_mask;
> tlp_header_valid = status & AER_LOG_TLP_MASKS;
> @@ -799,8 +845,8 @@ void pci_print_aer(struct pci_dev *dev, int aer_severity,
> info.first_error = PCI_ERR_CAP_FEP(aer->cap_control);
>
> pci_dev_aer_stats_incr(dev, &info);
> -
> - if (!__ratelimit(ratelimit))
> + /* Only ratelimit logs (no IRQ) as AERs reported via GHES/CXL (caller). */
> + if (!__ratelimit(log_ratelimit))
> return;
>
> pci_err(dev, "aer_status: 0x%08x, aer_mask: 0x%08x\n", status, mask);
next prev parent reply other threads:[~2025-01-31 14:55 UTC|newest]
Thread overview: 51+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-01-15 7:42 [PATCH 0/8] Rate limit AER logs/IRQs Jon Pan-Doh
2025-01-15 7:42 ` [PATCH 1/8] PCI/AER: Remove aer_print_port_info Jon Pan-Doh
2025-01-16 14:27 ` Karolina Stolarek
2025-01-18 1:57 ` Jon Pan-Doh
2025-01-20 9:25 ` Karolina Stolarek
2025-02-12 23:20 ` Jon Pan-Doh
2025-01-21 14:18 ` Ilpo Järvinen
2025-02-12 23:20 ` Jon Pan-Doh
2025-01-25 4:15 ` Sathyanarayanan Kuppuswamy
2025-02-12 23:20 ` Jon Pan-Doh
2025-01-15 7:42 ` [PATCH 2/8] PCI/AER: Move AER stat collection out of __aer_print_error Jon Pan-Doh
2025-01-16 14:47 ` Karolina Stolarek
2025-01-18 1:57 ` Jon Pan-Doh
2025-01-25 4:37 ` Sathyanarayanan Kuppuswamy
2025-02-12 23:20 ` Jon Pan-Doh
2025-01-15 7:42 ` [PATCH 3/8] PCI/AER: Rename struct aer_stats to aer_info Jon Pan-Doh
2025-01-16 10:11 ` Karolina Stolarek
2025-01-18 1:59 ` Jon Pan-Doh
2025-01-20 10:13 ` Karolina Stolarek
2025-02-12 23:20 ` Jon Pan-Doh
2025-01-15 7:42 ` [PATCH 4/8] PCI/AER: Introduce ratelimit for error logs Jon Pan-Doh
2025-01-16 11:11 ` Karolina Stolarek
2025-01-18 1:59 ` Jon Pan-Doh
2025-01-20 10:25 ` Karolina Stolarek
2025-01-15 7:42 ` [PATCH 5/8] PCI/AER: Introduce ratelimit for AER IRQs Jon Pan-Doh
2025-01-16 12:02 ` Karolina Stolarek
2025-01-18 1:58 ` Jon Pan-Doh
2025-01-20 10:38 ` Karolina Stolarek
2025-01-25 7:39 ` Lukas Wunner
2025-01-31 14:43 ` Jonathan Cameron
2025-03-04 23:42 ` Jon Pan-Doh
2025-02-06 13:56 ` Karolina Stolarek
2025-02-06 20:25 ` Lukas Wunner
2025-01-31 14:55 ` Jonathan Cameron [this message]
2025-03-04 23:42 ` Jon Pan-Doh
2025-01-15 7:42 ` [PATCH 6/8] PCI/AER: Add AER sysfs attributes for ratelimits Jon Pan-Doh
2025-01-31 14:32 ` Jonathan Cameron
2025-02-28 23:11 ` Jon Pan-Doh
2025-01-15 7:42 ` [PATCH 7/8] PCI/AER: Update AER sysfs ABI filename Jon Pan-Doh
2025-01-15 7:43 ` [PATCH 8/8] PCI/AER: Move AER sysfs attributes into separate directory Jon Pan-Doh
2025-01-16 10:26 ` Karolina Stolarek
2025-01-16 17:18 ` Rajat Jain
2025-01-31 14:36 ` Jonathan Cameron
2025-02-12 23:19 ` Jon Pan-Doh
2025-01-23 15:18 ` [PATCH 0/8] Rate limit AER logs/IRQs Bowman, Terry
2025-01-24 6:46 ` Jon Pan-Doh
2025-01-25 7:59 ` Lukas Wunner
2025-02-06 13:32 ` Karolina Stolarek
2025-02-12 23:19 ` Jon Pan-Doh
2025-02-13 16:00 ` Karolina Stolarek
2025-02-14 2:49 ` Jon Pan-Doh
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250131145515.000049f9@huawei.com \
--to=jonathan.cameron@huawei.com \
--cc=anilagrawal@meta.com \
--cc=ben.fuller@oracle.com \
--cc=bhelgaas@google.com \
--cc=drewwalton@microsoft.com \
--cc=karolina.stolarek@oracle.com \
--cc=linux-pci@vger.kernel.org \
--cc=martin.petersen@oracle.com \
--cc=pandoh@google.com \
--cc=tony.luck@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.