From: Petr Tesarik <ptesarik@suse.com>
To: Steven Rostedt <rostedt@goodmis.org>
Cc: Masami Hiramatsu <mhiramat@kernel.org>,
Mathieu Desnoyers <mathieu.desnoyers@efficios.com>,
Sebastian Andrzej Siewior <bigeasy@linutronix.de>,
Clark Williams <clrkwllms@kernel.org>,
linux-kernel@vger.kernel.org, linux-trace-kernel@vger.kernel.org,
linux-rt-devel@lists.linux.dev
Subject: Re: [PATCH] ring-buffer: Use a housekeeping CPU to wake up waiters
Date: Wed, 7 Jan 2026 10:51:37 +0100
Message-ID: <20260107105137.4cf9a67e@mordecai>
In-Reply-To: <20260107085009.58fcffd4@mordecai>
On Wed, 7 Jan 2026 08:50:09 +0100
Petr Tesarik <ptesarik@suse.com> wrote:
> On Tue, 6 Jan 2026 17:04:05 -0500
> Steven Rostedt <rostedt@goodmis.org> wrote:
>
> > On Tue, 6 Jan 2026 10:10:39 +0100
> > Petr Tesarik <ptesarik@suse.com> wrote:
> >
> > > Avoid running the wakeup irq_work on an isolated CPU. Since the wakeup can
> > > run on any CPU, let's pick a housekeeping CPU to do the job.
> > >
> > > This change reduces additional noise when tracing isolated CPUs. For
> > > example, the following ipi_send_cpu stack trace was captured with
> > > nohz_full=2 on the isolated CPU:
> > >
> > > <idle>-0 [002] d.h4. 1255.379293: ipi_send_cpu: cpu=2 callsite=irq_work_queue+0x2d/0x50 callback=rb_wake_up_waiters+0x0/0x80
> > > <idle>-0 [002] d.h4. 1255.379329: <stack trace>
> > > => trace_event_raw_event_ipi_send_cpu
> > > => __irq_work_queue_local
> > > => irq_work_queue
> > > => ring_buffer_unlock_commit
> > > => trace_buffer_unlock_commit_regs
> > > => trace_event_buffer_commit
> > > => trace_event_raw_event_x86_irq_vector
> > > => __sysvec_apic_timer_interrupt
> > > => sysvec_apic_timer_interrupt
> > > => asm_sysvec_apic_timer_interrupt
> > > => pv_native_safe_halt
> > > => default_idle
> > > => default_idle_call
> > > => do_idle
> > > => cpu_startup_entry
> > > => start_secondary
> > > => common_startup_64
> >
> > I take it that even with this patch you would still get the above events.
> > The only difference would be the "cpu=" in the event info will not be the
> > same as the CPU it executed on, right?
>
> Yes, this is a trace of a similar event after applying the patch:
>
> <idle>-0 [002] d.h4. 313.334367: ipi_send_cpu: cpu=1 callsite=irq_work_queue_on+0x55/0x90 callback=generic_smp_call_function_single_interrupt+0x0/0x20
> <idle>-0 [002] d.h4. 313.334390: <stack trace>
> => trace_event_raw_event_ipi_send_cpu
> => __smp_call_single_queue
> => irq_work_queue_on
> => ring_buffer_unlock_commit
> => trace_buffer_unlock_commit_regs
> => trace_event_buffer_commit
> => trace_event_raw_event_x86_irq_vector
> => __sysvec_apic_timer_interrupt
> => sysvec_apic_timer_interrupt
> => asm_sysvec_apic_timer_interrupt
> => pv_native_safe_halt
> => default_idle
> => default_idle_call
> => do_idle
> => cpu_startup_entry
> => start_secondary
> => common_startup_64
>
> The callback function in the trace event is different. That's because
> send_call_function_single_ipi() always uses this value. Maybe it can be
> improved, and I can look into it, but that's clearly a very separate
> issue.
Erm. It's actually good I had a look. :-(
A helpful comment in irq_work_queue_on() explains that "arch remote IPI
send/receive backend aren't NMI safe". That's something I wasn't aware
of, and I'm afraid it's the end of the story. The comment is followed by a
WARN_ON_ONCE(in_nmi()), and I can easily trigger it with "perf top"
while nmi:nmi_handler is traced.
Please remove the patch again. I'm sorry.
Petr T
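
Editorial sketch (not part of the patch or of the kernel sources): the
constraint described above can be modelled in plain userspace C. The
struct cpu_model type and the functions first_housekeeping_cpu() and
pick_wakeup_cpu() are hypothetical names invented for illustration. The
model assumes that falling back to the local CPU in NMI context is
acceptable, which is only one possible way to handle the problem and is
not necessarily what the thread settled on. In the kernel, the local
case would correspond to irq_work_queue() and the remote case to
irq_work_queue_on().

```c
#include <assert.h>
#include <stdbool.h>

#define MODEL_NR_CPUS 4

/* Hypothetical model: housekeeping[i] is true if CPU i is not isolated. */
struct cpu_model {
	bool housekeeping[MODEL_NR_CPUS];
};

/* Return the lowest-numbered housekeeping CPU, or -1 if there is none. */
int first_housekeeping_cpu(const struct cpu_model *m)
{
	for (int cpu = 0; cpu < MODEL_NR_CPUS; cpu++)
		if (m->housekeeping[cpu])
			return cpu;
	return -1;
}

/*
 * Choose the CPU that should run the waiter wakeup (assumed policy):
 *  - in NMI context, stay on the local CPU, because a remote IPI
 *    cannot be sent safely from there;
 *  - on a housekeeping CPU, stay local, since no IPI is needed;
 *  - otherwise hand the work to a housekeeping CPU, falling back to
 *    the local CPU if the model has none.
 */
int pick_wakeup_cpu(const struct cpu_model *m, int this_cpu, bool in_nmi)
{
	assert(this_cpu >= 0 && this_cpu < MODEL_NR_CPUS);

	if (in_nmi)
		return this_cpu;

	if (m->housekeeping[this_cpu])
		return this_cpu;

	int target = first_housekeeping_cpu(m);
	return target >= 0 ? target : this_cpu;
}
```

With CPU 2 isolated (housekeeping = { true, true, false, true }), the
model sends the wakeup from CPU 2 to CPU 0 in ordinary interrupt context
but keeps it on CPU 2 when called from an NMI, so the isolated CPU would
still see the noise in that case. The real kernel may pick a different
housekeeping CPU than the lowest-numbered one.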