From: Sumit Garg <sumit.garg@linaro.org>
To: Stephen Boyd <swboyd@chromium.org>
Cc: Mark Rutland <mark.rutland@arm.com>,
Daniel Thompson <daniel.thompson@linaro.org>,
Will Deacon <will@kernel.org>,
Peter Zijlstra <peterz@infradead.org>,
Catalin Marinas <catalin.marinas@arm.com>,
Alexandru Elisei <alexandru.elisei@arm.com>,
Douglas Anderson <dianders@chromium.org>,
acme@kernel.org, alexander.shishkin@linux.intel.com,
mingo@redhat.com, julien.thierry.kdev@gmail.com,
namhyung@kernel.org, Thomas Gleixner <tglx@linutronix.de>,
jolsa@redhat.com,
linux-arm-kernel <linux-arm-kernel@lists.infradead.org>
Subject: Re: [PATCH v2] arm64: Enable perf events based hard lockup detector
Date: Wed, 10 Jun 2020 10:32:49 +0530 [thread overview]
Message-ID: <CAFA6WYNnx+aEfMKmWxwesj2JQga-6H3pOOdBpspkBkLhTdpihQ@mail.gmail.com> (raw)
In-Reply-To: <159175395730.242598.3354488505981092317@swboyd.mtv.corp.google.com>
On Wed, 10 Jun 2020 at 07:22, Stephen Boyd <swboyd@chromium.org> wrote:
>
> Quoting Sumit Garg (2020-06-04 05:09:11)
> > With the recent feature added to enable perf events to use pseudo NMIs
> > as interrupts on platforms which support GICv3 or later, its now been
> > possible to enable hard lockup detector (or NMI watchdog) on arm64
> > platforms. So enable corresponding support.
> >
> > One thing to note here is that normally lockup detector is initialized
> > just after the early initcalls but PMU on arm64 comes up much later as
> > device_initcall(). So we need to re-initialize lockup detection once
> > PMU has been initialized.
> >
> > Signed-off-by: Sumit Garg <sumit.garg@linaro.org>
> > ---
> >
> > This patch is dependent on perf NMI patch-set [1].
> >
> > [1] https://patchwork.kernel.org/cover/11047407/
>
> That patch series is from last year. Any progress on it?
>
Alexandru (in Cc) has been working on rebasing and reworking this
patch-set with WIP-pmu-nmi branch [1]. For more information refer to
this thread [2].
[1] http://www.linux-arm.org/git?p=linux-ae.git;a=shortlog;h=refs/heads/WIP-pmu-nmi
[2] https://lkml.org/lkml/2020/5/20/431
> >
> > Changes since RFC:
> > - Rebased on top of Alex's WIP-pmu-nmi branch.
> > - Add comment for safe max. CPU frequency.
> > - Misc. cleanup.
> >
> > arch/arm64/Kconfig | 2 ++
> > arch/arm64/kernel/perf_event.c | 41 +++++++++++++++++++++++++++++++++++++++--
> > drivers/perf/arm_pmu.c | 9 +++++++++
> > include/linux/perf/arm_pmu.h | 2 ++
> > 4 files changed, 52 insertions(+), 2 deletions(-)
> >
> > diff --git a/arch/arm64/kernel/perf_event.c b/arch/arm64/kernel/perf_event.c
> > index e109aa5..a37f018 100644
> > --- a/arch/arm64/kernel/perf_event.c
> > +++ b/arch/arm64/kernel/perf_event.c
> > #define ARMV8_A53_PERFCTR_PREF_LINEFILL 0xC2
> > @@ -1226,3 +1239,27 @@ void arch_perf_update_userpage(struct perf_event *event,
> > userpg->time_shift = (u16)shift;
> > userpg->time_offset = -now;
> > }
> > +
> > +#ifdef CONFIG_HARDLOCKUP_DETECTOR_PERF
> > +/*
> > + * Safe maximum CPU frequency in case a particular platform doesn't implement
> > + * cpufreq driver. Although, architecture doesn't put any restrictions on
> > + * maximum frequency but 5 GHz seems to be safe maximum given the available
> > + * Arm CPUs in the market which are clocked much less than 5 GHz. On the other
> > + * hand, we can't make it much higher as it would lead to a large hard-lockup
> > + * detection timeout on parts which are running slower (eg. 1GHz on
> > + * Developerbox) and doesn't possess a cpufreq driver.
> > + */
> > +#define SAFE_MAX_CPU_FREQ 5000000000UL // 5 GHz
> > +u64 hw_nmi_get_sample_period(int watchdog_thresh)
>
> Is it ever negative? Odd that this API uses a signed integer.
>
No it isn't expected to be negative.
> > +{
> > + unsigned int cpu = smp_processor_id();
> > + unsigned int max_cpu_freq;
> > +
> > + max_cpu_freq = cpufreq_get_hw_max_freq(cpu);
> > + if (max_cpu_freq)
> > + return (u64)max_cpu_freq * 1000 * watchdog_thresh;
> > + else
> > + return (u64)SAFE_MAX_CPU_FREQ * watchdog_thresh;
>
> I'd rather see the return unindented and the else dropped.
Okay.
>
> > +}
> > +#endif
> > diff --git a/drivers/perf/arm_pmu.c b/drivers/perf/arm_pmu.c
> > index f96cfc4..6c25c0d1 100644
> > --- a/drivers/perf/arm_pmu.c
> > +++ b/drivers/perf/arm_pmu.c
> > @@ -718,6 +718,15 @@ static int armpmu_get_cpu_irq(struct arm_pmu *pmu, int cpu)
> > return per_cpu(hw_events->irq, cpu);
> > }
> >
> > +bool arm_pmu_irq_is_nmi(void)
> > +{
> > + const struct pmu_irq_ops *irq_ops;
> > +
> > + irq_ops = per_cpu(cpu_irq_ops, smp_processor_id());
>
> Can we use this_cpu_ptr()?
this_cpu_ptr() sounds more apt here, will use it instead.
-Sumit
>
> > +
> > + return irq_ops == &pmunmi_ops || irq_ops == &percpu_pmunmi_ops;
> > +}
> > +
> > /*
> > * PMU hardware loses all context when a CPU goes offline.
> > * When a CPU is hotplugged back in, since some hardware registers are
_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next prev parent reply other threads:[~2020-06-10 5:03 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-06-04 12:09 [PATCH v2] arm64: Enable perf events based hard lockup detector Sumit Garg
2020-06-10 1:52 ` Stephen Boyd
2020-06-10 5:02 ` Sumit Garg [this message]
2020-06-10 10:45 ` Alexandru Elisei
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=CAFA6WYNnx+aEfMKmWxwesj2JQga-6H3pOOdBpspkBkLhTdpihQ@mail.gmail.com \
--to=sumit.garg@linaro.org \
--cc=acme@kernel.org \
--cc=alexander.shishkin@linux.intel.com \
--cc=alexandru.elisei@arm.com \
--cc=catalin.marinas@arm.com \
--cc=daniel.thompson@linaro.org \
--cc=dianders@chromium.org \
--cc=jolsa@redhat.com \
--cc=julien.thierry.kdev@gmail.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=mark.rutland@arm.com \
--cc=mingo@redhat.com \
--cc=namhyung@kernel.org \
--cc=peterz@infradead.org \
--cc=swboyd@chromium.org \
--cc=tglx@linutronix.de \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).