Issue in inserting a kernel module for counting performance counters on Jetson Nano

linux-perf-users.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Issue in inserting a kernel module for counting performance counters on Jetson Nano
@ 2023-05-03 13:56 Danny Pereira
  0 siblings, 0 replies; only message in thread
From: Danny Pereira @ 2023-05-03 13:56 UTC (permalink / raw)
  To: linux-perf-users

Hello All,

I am testing a kernel module which creates perf events on every core
and counts the total number of L2D_CACHE_REFILL.

System details are :

NVIDIA Jetson nano,
OS: Ubuntu  18.04.6 LTS,
Kernel: 4.9.255, L4T: 32.7.3,
Jetpack: 4.6.3

insmod is inserting module in kernel space. But throws WARNING message
in kernel log file. The log obtained using  dmesg command is :

[  977.838359] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G           O
4.9.299-BWv1.0 #1
[  977.838361] Hardware name: NVIDIA Jetson Nano Developer Kit (DT)
[  977.838364] task: ffffffc0fa7bb900 task.stack: ffffffc0fa7cc000
[  977.838368] PC is at armpmu_start+0x70/0x78
[  977.838372] LR is at cpu_pm_pmu_setup.isra.2+0x68/0xd8
[  977.838375] pc : [<ffffff8008ce1b48>] lr : [<ffffff8008ce1e30>]
pstate: 604000c5
[  977.838377] sp : ffffffc0fa7cfd20
[  977.838380] x29: ffffffc0fa7cfd20 x28: 0000000000000001
[  977.838385] x27: ffffff8009ed6000 x26: ffffff8009892b00
[  977.838391] x25: ffffffc0f7f88000 x24: ffffffc0f1cd2000
[  977.838396] x23: ffffffc0fefdfe00 x22: 0000000000000002
[  977.838400] x21: 0000000000000002 x20: ffffffc0f7f88000
[  977.838405] x19: ffffffc0f1cd2000 x18: 0000007f94f20a70
[  977.838410] x17: 0000000000021a3d x16: 0000000000000000
[  977.838414] x15: 000000000000003a x14: 0000000000022400
[  977.838419] x13: 0000000000000959 x12: ffffffc0fa7bb900
[  977.838423] x11: 000000000000000b x10: 0101010101010101
[  977.838428] x9 : fffffffffffffff9 x8 : 7f7f7f7f7f7f7f7f
[  977.838433] x7 : fefefeff646c606d x6 : 00170401e9e1acf4
[  977.838437] x5 : ffffff8008002004 x4 : 00000040f5744000
[  977.838441] x3 : 0000000000000001 x2 : 0000000000000001
[  977.838446] x1 : ffffff800a10d000 x0 : ffffffc0f1cd2148

[  977.838452] ---[ end trace aa706cb5bd510b65 ]---
[  977.843057] Call trace:
[  977.843063] [<ffffff8008ce1b48>] armpmu_start+0x70/0x78
[  977.843067] [<ffffff8008ce1e30>] cpu_pm_pmu_setup.isra.2+0x68/0xd8
[  977.843071] [<ffffff8008ce1fa0>] cpu_pm_pmu_notify+0x100/0x138
[  977.843076] [<ffffff80080dcbac>] notifier_call_chain+0x5c/0xa0
[  977.843080] [<ffffff80080dcd98>] __raw_notifier_call_chain+0x48/0x60
[  977.843085] [<ffffff80081acf84>] cpu_pm_exit+0x44/0x78
[  977.843089] [<ffffff8008ba6be0>] arm_enter_idle_state+0x80/0xc0
[  977.843092] [<ffffff8008ba45b4>] cpuidle_enter_state+0x84/0x380
[  977.843095] [<ffffff8008ba4924>] cpuidle_enter+0x34/0x48
[  977.843099] [<ffffff8008111664>] call_cpuidle+0x44/0x70
[  977.843102] [<ffffff80081119e0>] cpu_startup_entry+0x1b0/0x200
[  977.843106] [<ffffff8008091cc8>] secondary_start_kernel+0x190/0x1f8
[  977.843109] [<0000000084f841a8>] 0x84f841a8

This warning states that at line no. 199, the function armpmu_start()
is throwing a warning.

I tried checking the kernel source code. The kernel code snippet for
armpmu_start function from drivers/perf/arm_pmu file is as below:

 189 static void armpmu_start(struct perf_event *event, int flags)
 190 {
 191     struct arm_pmu *armpmu = to_arm_pmu(event->pmu);
 192     struct hw_perf_event *hwc = &event->hw;
 193
 194     /*
 195      * ARM pmu always has to reprogram the period, so ignore
 196      * PERF_EF_RELOAD, see the comment below.
 197      */
 198     if (flags & PERF_EF_RELOAD)
 199         WARN_ON_ONCE(!(hwc->state & PERF_HES_UPTODATE));
 200
 201     hwc->state = 0;
 202     /*
 203      * Set the period again. Some counters can't be stopped, so when we
 204      * were stopped we simply disabled the IRQ source and the counter
 205      * may have been left counting. If we don't do this step then we may
 206      * get an interrupt too soon or *way* too late if the overflow has
 207      * happened since disabling.
 208      */
 209     armpmu_event_set_period(event);
 210     armpmu->enable(event);
 211 }

Also when I try to remove the module with rmmod command my jetson nano
board is restarting.

Here are the few perf API's the kernel module is using:

Creating attributes structure of the perf event

 struct perf_event_attr sched_perf_hw_attr = {
        .type = PERF_TYPE_RAW,
        .config = 0x17,
        .size = sizeof (struct perf_event_attr),
        .pinned = 1,
        .disabled = 1,
        .exclude_kernel = 1,
        .sample_period = budget
        };

Creating perf event with event_overflow_callback function
     event = perf_event_create_kernel_counter (&sched_perf_hw_attr,
                                                                            cpu,

     NULL,

     event_overflow_callback,
                                                                            NULL
                                                                            );

Code to start the counter
     perf_event_enable (event);
     event->pmu->add (event, PERF_EF_START);

Code to stop the counter
     event->pmu->stop (event, PERF_EF_UPDATE);
     event->pmu->del(event,0);

In the event_overflow_callback function following activities are performed.
  *Stopping counter
          event->pmu->stop (event, PERF_EF_UPDATE);
 *Enable performance counter
          event->pmu->start (event, PERF_EF_RELOAD);

The same kernel module is working fine on Jetson Nvidia Xavier NX
without any warning messages.
Any help in diagnosing/resolving  the issue is highly appreciated.
Also, kindly provide learning resources on perf event counter programming.

Thank in advance
Danny

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2023-05-03 13:56 UTC | newest]

Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-05-03 13:56 Issue in inserting a kernel module for counting performance counters on Jetson Nano Danny Pereira

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).