From: William Cohen <wcohen@redhat.com>
To: "linux-perf-use." <linux-perf-users@vger.kernel.org>
Cc: 大平怜 <rei.odaira@gmail.com>
Subject: Issue perf attaching to processes creating many short-live threads
Date: Fri, 23 Oct 2015 14:52:29 -0400 [thread overview]
Message-ID: <562A81ED.70900@redhat.com> (raw)
[-- Attachment #1: Type: text/plain, Size: 2799 bytes --]
Earlier this month Rei Odeira found that the oprofile tool operf would
have problems attaching and monitoring a process that created many
very short-lived threads. It looks like the kernel's perf tool also
has issues when attempting to attach and monitor a process that is
creating many short-lived threads. Attached is the source code used to
reproduce the problem. The code is compiled and run with the
following commands. The arguments to the reproducer are the total
number of threads to spawn, the number of concurrent threads, and the
number of times each threads loops. When run with "-1" as first argument
it will need to be stopped with a cntl-c.
$ gcc -o oprofile_multithread_test oprofile_multithread_test.c -lpthread
$ ./oprofile_multithread_test
Usage: oprofile_multithread_test <number of spawns> <number of threads> <number of operations per thread>
$ ./oprofile_multithread_test -1 16 100000
Having the reproducer run as a child of perf works fine.
$ perf --version
perf version 4.2.3-200.fc22.x86_64
$ perf stat ./oprofile_multithread_test -1 16 1000000
^C./oprofile_multithread_test: Interrupt
failed to read counter stalled-cycles-backend
Performance counter stats for './oprofile_multithread_test -1 16 1000000':
54632.571382 task-clock (msec) # 5.764 CPUs utilized
23,447 context-switches # 0.429 K/sec
16,153 cpu-migrations # 0.296 K/sec
86 page-faults # 0.002 K/sec
168,749,585,390 cycles # 3.089 GHz
136,160,264,023 stalled-cycles-frontend # 80.69% frontend cycles idle
<not supported> stalled-cycles-backend
95,947,021,711 instructions # 0.57 insns per cycle
# 1.42 stalled cycles per insn
16,018,454,088 branches # 293.203 M/sec
6,990,932 branch-misses # 0.04% of all branches
9.477617613 seconds time elapsed
However, when the starting the reproducer program and then attaching
to pid using '-p' one sometimes gets the following failure:
$ ./oprofile_multithread_test -1 16 100000 &
$ perf stat -p `pgrep oprofile_mul`
Error:
The sys_perf_event_open() syscall returned with 3 (No such process) for event (instructions).
/bin/dmesg may provide additional information.
No CONFIG_PERF_EVENTS=y kernel support configured?
I got the same results using perf built from a check out of the
current mainline linux kernel. Shouldn't perf be able to attach to a
process regardless of how quickly it is creating threads?
-Will
[-- Attachment #2: oprofile_multithread_test.c --]
[-- Type: text/x-csrc, Size: 1785 bytes --]
#include <stdio.h>
#include <stdlib.h>
#include <stdint.h>
#include <string.h>
#include <pthread.h>
static int num_ops;
static pthread_t *thr_array;
static void *
thr_main(void *arg)
{
int i;
int sum = 0;
for (i = 0; i < num_ops; i++) {
sum += i;
}
return (void *)(intptr_t)sum;
}
static void
spawn_thread(int thr)
{
int ret;
ret = pthread_create(&thr_array[thr], NULL, thr_main, NULL);
if (ret != 0) {
fprintf(stderr, "pthread_create: %s\n", strerror(ret));
exit(1);
}
}
static void
join_thread(int thr)
{
int ret;
ret = pthread_join(thr_array[thr], NULL);
if (ret != 0) {
fprintf(stderr, "pthread_join: %s\n", strerror(ret));
exit(1);
}
}
int
main(int argc, char *argv[])
{
int num_spawns;
int num_threads;
int thr;
int thr_saved;
int ret;
int spawn_count;
if (argc != 4) {
fprintf(stderr, "Usage: oprofile_multithread_test <number of spawns> <number of threads> <number of operations per thread>\n");
exit(1);
}
num_spawns = atoi(argv[1]);
num_threads = atoi(argv[2]);
num_ops = atoi(argv[3]);
if (num_threads < 1) {
fprintf(stderr, "Number of threads must be positive.\n");
exit(1);
}
thr_array = malloc(sizeof(pthread_t) * num_threads);
if (thr_array == NULL) {
fprintf(stderr, "Cannot allocate thr_array\n");
exit(1);
}
spawn_count = 0;
for (thr = 0; thr < num_threads; thr++) {
spawn_thread(thr);
spawn_count++;
}
thr = 0;
while (num_spawns < 0 ? 1 /* infinite loop */ : spawn_count < num_spawns) {
join_thread(thr);
spawn_thread(thr);
thr = (thr + 1) % num_threads;
spawn_count++;
}
thr_saved = thr;
do {
join_thread(thr);
thr = (thr + 1) % num_threads;
} while (thr != thr_saved);
free(thr_array);
}
next reply other threads:[~2015-10-23 18:52 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-10-23 18:52 William Cohen [this message]
2015-10-23 18:56 ` Issue perf attaching to processes creating many short-live threads David Ahern
2015-10-23 19:27 ` William Cohen
2015-10-23 19:35 ` David Ahern
2015-10-26 19:49 ` Arnaldo Carvalho de Melo
2015-10-26 21:42 ` David Ahern
2015-10-27 12:33 ` Arnaldo Carvalho de Melo
2015-10-27 14:15 ` David Ahern
2015-10-27 14:47 ` Arnaldo Carvalho de Melo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=562A81ED.70900@redhat.com \
--to=wcohen@redhat.com \
--cc=linux-perf-users@vger.kernel.org \
--cc=rei.odaira@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).