From: Pranjal Shrivastava <praan@google.com>
To: Will Deacon <will@kernel.org>
Cc: Joerg Roedel <joro@8bytes.org>,
Robin Murphy <robin.murphy@arm.com>,
Mostafa Saleh <smostafa@google.com>,
Nicolin Chen <nicolinc@nvidia.com>,
iommu@lists.linux.dev, Jason Gunthorpe <jgg@nvidia.com>
Subject: Re: [PATCH v4 2/3] iommu/arm-smmu-v3: Log better event records
Date: Tue, 29 Oct 2024 19:59:53 +0000 [thread overview]
Message-ID: <ZyE-uSGEQGFDcmc2@google.com> (raw)
In-Reply-To: <20241029185345.GB5454@willie-the-truck>
On Tue, Oct 29, 2024 at 06:53:46PM +0000, Will Deacon wrote:
> On Thu, Oct 24, 2024 at 02:14:42PM +0000, Pranjal Shrivastava wrote:
> > On Thu, Oct 24, 2024 at 02:15:12PM +0100, Will Deacon wrote:
> > > On Fri, Oct 18, 2024 at 06:00:21PM +0000, Pranjal Shrivastava wrote:
> > > > Currently, the driver dumps the raw hex for a received event record.
> > > > Improve this by leveraging `struct arm_smmu_event` for event fields
> > > > and log human-readable event records with meaningful information.
> > > >
> > > > Signed-off-by: Pranjal Shrivastava <praan@google.com>
> > > > ---
> > > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 109 ++++++++++++++++++--
> > > > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.h | 13 +++
> > > > 2 files changed, 113 insertions(+), 9 deletions(-)
> > > >
> > > > diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
> > > > index 2f1108e5de51..4477cf86cb8e 100644
> > > > --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
> > > > +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c
> > > > @@ -83,6 +83,34 @@ static struct arm_smmu_option_prop arm_smmu_options[] = {
> > > > { 0, NULL},
> > > > };
> > > >
> > > > +static const char * const event_str[] = {
> > > > + /* Bad config events */
> > > > + [EVT_ID_BAD_SID_CONFIG] = "C_BAD_STREAMID",
> > > > + [EVT_ID_BAD_STE_CONFIG] = "C_BAD_STE",
> > > > + [EVT_ID_BAD_CD_CONFIG] = "C_BAD_CD",
> > > > + [EVT_ID_BAD_SSID_CONFIG] = "C_BAD_SUBSTREAMID",
> > > > + [EVT_ID_STREAM_DISABLED] = "F_STREAM_DISABLED",
> > > > +
> > > > + /* Bad translation events */
> > > > + [EVT_ID_TRANSLATION_FAULT] = "F_TRANSLATION",
> > > > + [EVT_ID_ADDR_SIZE_FAULT] = "F_ADDR_SIZE",
> > > > + [EVT_ID_ACCESS_FAULT] = "F_ACCESS",
> > > > + [EVT_ID_PERMISSION_FAULT] = "F_PERMISSION",
> > > > +
> > > > + /* Bad fetch events */
> > > > + [EVT_ID_STE_FETCH_FAULT] = "F_STE_FETCH",
> > > > + [EVT_ID_CD_FETCH_FAULT] = "F_CD_FETCH",
> > > > + [EVT_ID_VMS_FETCH_FAULT] = "F_VMS_FAULT",
> > > > + [EVT_ID_MAX] = NULL,
> > > > +};
> > > > +
> > > > +static const char * const event_class_str[] = {
> > > > + [0] = "CD fetch",
> > > > + [1] = "Stage 1 translation table fetch",
> > > > + [2] = "Input address caused fault ",
> > > > + [3] = "Reserved",
> > > > +};
> > > > +
> > > > static int arm_smmu_domain_finalise(struct arm_smmu_domain *smmu_domain,
> > > > struct arm_smmu_device *smmu, u32 flags);
> > > > static int arm_smmu_alloc_cd_tables(struct arm_smmu_master *master);
> > > > @@ -1756,6 +1784,60 @@ arm_smmu_find_master(struct arm_smmu_device *smmu, u32 sid)
> > > > return rb_entry(node, struct arm_smmu_stream, node)->master;
> > > > }
> > > >
> > > > +static void arm_smmu_dump_raw_event(struct arm_smmu_event *event)
> > > > +{
> > > > + int i;
> > > > + struct arm_smmu_device *smmu = event->smmu;
> > > > +
> > > > + dev_err(smmu->dev, "event 0x%02x received: master %s:\n",
> > > > + event->id, event->master_name);
> > > > +
> > > > + for (i = 0; i < EVTQ_ENT_DWORDS; ++i)
> > > > + dev_err(smmu->dev, "\t0x%016llx\n", event->raw[i]);
> > > > +}
> > > > +
> > > > +static void arm_smmu_dump_event(struct arm_smmu_event *evt, struct ratelimit_state *rs)
> > > > +{
> > > > + struct arm_smmu_device *smmu = evt->smmu;
> > > > + char title[100] = {0};
> > > > + char mastr[100] = {0};
> > > > + char addrs[100] = {0};
> > > > + char flags[100] = {0};
> > > > + char other[50] = {0};
> > >
> > > I haven't followed previous versions of this series in detail, but
> > > allocating 450 bytes on the stack for this seems excessive to me.
> >
> > Hmm, I think we can reduce the title, mastr, addrs string to 50 chars
> > and the `other` can be reduced to 30? Or maybe we can avoid printing the
> > `stag` and `stall` fields altogether which should reduce it by 200 bytes
> > We can save another 20 bytes by reducing the `flags` len to 80 bytes,
> > overall saving us 220 bytes of stack space.
> >
> > I introduced this in v3 [1] based on suggestions in v2 reviews [2].
> > I can revert to the approach followed in v2 [3] as well.
> >
> > LMK what's your vote?
>
> Keeping the strings concise is one thing and we should do that regardless.
Hmm.. the longest string is the one printing all the flags, an example:
[ 85.410290] Unpriv | Data | Write | S2 | Stage 1 translation table fetch | TTD Write
One idea to reduce this is, maybe just print a letter for abbreviate and
trim the event class string? Something like:
[ 85.410290] Un | D | W | S2 | S1 TT fetch | TTDW
[ 85.410290] Prv | I | R | S1 | S1 TT fetch | TTDR
[ 85.410290] Prv | D | R | S1 | Input addr fault
> However, I'm questioning (a) why we need so many arrays rather than just
> e.g. char linebuf[...] and (b) why it needs to be on the stack at all.
>
a) The arrays were for breaking up the log in multiple parts and print
only the relevant parts for a particular type of event. Earlier we had
something similar to the linebuf (basically not needing the linebuf):
dev_err(smmu->dev, "Fault: %s client %s sid 0x%08x.0x%05x:\n\tiova = %#llx ipa = %#llx\n (%s%s%s%s%s%s)\n",
evts[event->id], event->master_name, event->sid, event->ssid,
event->iova, event->ipa,
event->privileged ? "Priv " : "Unpriv ",
event->instruction ? "Inst " : "Data ",
event->read ? "Read " : "Write ",
event->stage ? "S2 " : "S1 ", class_str[event->class],
((event->id == EVT_ID_PERMISSION_FAULT) &&
eevent->class == EVTQ_1_CLASS_TT)) ?
(FIELD_GET(EVTQ_1_TT_READ, event->raw[1]) ?
" TTDRead" : " TTD Write") : "");
Since I had 3 different logs for 3 different categories of events, it
was decided to break up the logs into smaller parts and print only the
relevant ones during the review of the v2 patch[1].
b) Hmm, I was thinking around the re-entrancy of `arm_smmu_dump_event`?
Since all events are queued up and handled later.. I was thinking of a
situation where the 2 instances of the evtq_thread would be running on 2
cores simultaneously handling different events?
I haven't looked into the core interrupt handling but I'm assuming the
following situation:
1. Interrupt Occurs: When the hardware interrupt triggers, the CPU takes
the interrupt and executes the first half of the interrupt handler.
2 Thread is "Woken Up": After Acking the interrupt, he associated irq
thread will "wake up" i.e. the evtq_thread is marked as "ready" to run.
3. Now, the scheduler sees the thread as "ready" and schedules it on a
free cpu say, on CPU0.
4. Second Interrupt (Before First Finishes): If another interrupt
for an event occurs before the first instance of the evtq_thread
has finished executing, it will again mark the thread as "ready".
5. The scheduler might now see evtq_thread is ready again and
schedule it on a different CPU (say CPU1). Now you have two
instances of the same thread running concurrently..
I'll take a look into the core interrupt handling code to verify if the
above is even a possibility or some hallucination of mine.
But in case we are sure that the above wouldn't happen, I think we
can consider moving these buffers to global scope. I'd just like to
avoid the race conditions and locking around the log buffers.
> Will
Thanks,
Praan
[1] https://lore.kernel.org/linux-iommu/ZtAW0hVPFD6JbLTL@Asurada-Nvidia/
next prev parent reply other threads:[~2024-10-29 20:00 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-10-18 18:00 [PATCH v4 0/3] iommu/arm-smmu-v3: Parse out event records Pranjal Shrivastava
2024-10-18 18:00 ` [PATCH v4 1/3] iommu/arm-smmu-v3: Introduce struct arm_smmu_event Pranjal Shrivastava
2024-10-19 1:56 ` Nicolin Chen
2024-10-21 6:20 ` Pranjal Shrivastava
2024-10-24 13:11 ` Will Deacon
2024-10-24 14:20 ` Pranjal Shrivastava
2024-10-24 17:02 ` Pranjal Shrivastava
2024-10-24 17:03 ` Jason Gunthorpe
2024-10-24 17:37 ` Pranjal Shrivastava
2024-10-28 12:23 ` Jason Gunthorpe
2024-10-28 14:46 ` Pranjal Shrivastava
2024-11-04 17:23 ` Daniel Mentz
2024-11-04 18:16 ` Pranjal Shrivastava
2024-11-04 18:19 ` Pranjal Shrivastava
2024-11-01 14:41 ` Robin Murphy
2024-11-01 15:08 ` Pranjal Shrivastava
2024-11-04 5:25 ` Daniel Mentz
2024-11-04 8:31 ` Pranjal Shrivastava
2024-11-07 0:10 ` Daniel Mentz
2024-11-07 14:33 ` Pranjal Shrivastava
2024-11-07 0:16 ` Daniel Mentz
2024-11-07 14:57 ` Pranjal Shrivastava
2024-11-11 22:20 ` Daniel Mentz
2024-11-12 0:52 ` Pranjal Shrivastava
2024-11-12 4:01 ` Daniel Mentz
2024-11-12 8:12 ` Pranjal Shrivastava
2024-10-18 18:00 ` [PATCH v4 2/3] iommu/arm-smmu-v3: Log better event records Pranjal Shrivastava
2024-10-19 2:06 ` Nicolin Chen
2024-10-19 4:51 ` Nicolin Chen
2024-10-21 6:29 ` Pranjal Shrivastava
2024-10-21 6:26 ` Pranjal Shrivastava
2024-10-21 22:53 ` Nicolin Chen
2024-10-24 13:15 ` Will Deacon
2024-10-24 14:14 ` Pranjal Shrivastava
2024-10-29 18:53 ` Will Deacon
2024-10-29 19:59 ` Pranjal Shrivastava [this message]
2024-10-24 19:00 ` Nicolin Chen
2024-10-29 18:49 ` Will Deacon
2024-11-01 15:05 ` Robin Murphy
2024-11-01 16:06 ` Pranjal Shrivastava
2024-11-04 6:36 ` Daniel Mentz
2024-11-04 10:51 ` Pranjal Shrivastava
2024-10-18 18:00 ` [PATCH v4 3/3] iommu/arm-smmu-v3: Avoid redundant master lookup in events Pranjal Shrivastava
2024-10-19 2:08 ` Nicolin Chen
2024-10-19 1:45 ` [PATCH v4 0/3] iommu/arm-smmu-v3: Parse out event records Nicolin Chen
2024-10-21 6:33 ` Pranjal Shrivastava
2024-10-21 22:51 ` Nicolin Chen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=ZyE-uSGEQGFDcmc2@google.com \
--to=praan@google.com \
--cc=iommu@lists.linux.dev \
--cc=jgg@nvidia.com \
--cc=joro@8bytes.org \
--cc=nicolinc@nvidia.com \
--cc=robin.murphy@arm.com \
--cc=smostafa@google.com \
--cc=will@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox