linux-perf-users.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v2 0/2] perf test: Display remaining tests while waiting
@ 2024-07-01  4:42 Ian Rogers
  2024-07-01  4:42 ` [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished() Ian Rogers
  2024-07-01  4:42 ` [PATCH v2 2/2] perf test: Display number of remaining tests Ian Rogers
  0 siblings, 2 replies; 12+ messages in thread
From: Ian Rogers @ 2024-07-01  4:42 UTC (permalink / raw)
  To: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Namhyung Kim, Mark Rutland, Alexander Shishkin, Jiri Olsa,
	Ian Rogers, Adrian Hunter, Kan Liang, James Clark, linux-kernel,
	linux-perf-users

The v1 patch series:
https://lore.kernel.org/lkml/20240405070931.1231245-1-irogers@google.com/
was partially merged. The check_if_command_finished wasn't working as
intended as stdout/stderr could be lost due to waitpid being
called. Modify this function to not use waitpid and use procfs
instead.

Keeping the output test display logic in 1 place should simplify
improving parallel and sequential output. Namhyung was experiencing
issues with this in:
https://lore.kernel.org/lkml/20240628215751.1512250-1-namhyung@kernel.org/

Ian Rogers (2):
  tools subcmd: Add non-waitpid check_if_command_finished()
  perf test: Display number of remaining tests

 tools/lib/subcmd/run-command.c  | 33 ++++++++++++++
 tools/perf/tests/builtin-test.c | 77 ++++++++++++++++++++++-----------
 tools/perf/util/color.h         |  1 +
 3 files changed, 86 insertions(+), 25 deletions(-)

-- 
2.45.2.803.g4e1b14247a-goog


^ permalink raw reply	[flat|nested] 12+ messages in thread

* [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-01  4:42 [PATCH v2 0/2] perf test: Display remaining tests while waiting Ian Rogers
@ 2024-07-01  4:42 ` Ian Rogers
  2024-07-03  3:23   ` Namhyung Kim
  2024-07-01  4:42 ` [PATCH v2 2/2] perf test: Display number of remaining tests Ian Rogers
  1 sibling, 1 reply; 12+ messages in thread
From: Ian Rogers @ 2024-07-01  4:42 UTC (permalink / raw)
  To: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Namhyung Kim, Mark Rutland, Alexander Shishkin, Jiri Olsa,
	Ian Rogers, Adrian Hunter, Kan Liang, James Clark, linux-kernel,
	linux-perf-users

Using waitpid can cause stdout/stderr of the child process to be
lost. Use Linux's /prod/<pid>/status file to determine if the process
has reached the zombie state. Use the 'status' file rather than 'stat'
to avoid issues around skipping the process name.

Signed-off-by: Ian Rogers <irogers@google.com>
---
 tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
 1 file changed, 33 insertions(+)

diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
index 4e3a557a2f37..ec06683e77a0 100644
--- a/tools/lib/subcmd/run-command.c
+++ b/tools/lib/subcmd/run-command.c
@@ -2,6 +2,7 @@
 #include <unistd.h>
 #include <sys/types.h>
 #include <sys/stat.h>
+#include <ctype.h>
 #include <fcntl.h>
 #include <string.h>
 #include <linux/string.h>
@@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
 
 int check_if_command_finished(struct child_process *cmd)
 {
+#ifdef __linux__
+	char filename[FILENAME_MAX + 12];
+	char status_line[256];
+	FILE *status_file;
+
+	/*
+	 * Check by reading /proc/<pid>/status as calling waitpid causes
+	 * stdout/stderr to be closed and data lost.
+	 */
+	sprintf(filename, "/proc/%d/status", cmd->pid);
+	status_file = fopen(filename, "r");
+	if (status_file == NULL) {
+		/* Open failed assume finish_command was called. */
+		return true;
+	}
+	while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
+		char *p;
+
+		if (strncmp(status_line, "State:", 6))
+			continue;
+
+		fclose(status_file);
+		p = status_line + 6;
+		while (isspace(*p))
+			p++;
+		return *p == 'Z';
+	}
+	/* Read failed assume finish_command was called. */
+	fclose(status_file);
+	return true;
+#else
 	wait_or_whine(cmd, /*block=*/false);
 	return cmd->finished;
+#endif
 }
 
 int finish_command(struct child_process *cmd)
-- 
2.45.2.803.g4e1b14247a-goog


^ permalink raw reply related	[flat|nested] 12+ messages in thread

* [PATCH v2 2/2] perf test: Display number of remaining tests
  2024-07-01  4:42 [PATCH v2 0/2] perf test: Display remaining tests while waiting Ian Rogers
  2024-07-01  4:42 ` [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished() Ian Rogers
@ 2024-07-01  4:42 ` Ian Rogers
  2024-07-03  3:39   ` Namhyung Kim
  1 sibling, 1 reply; 12+ messages in thread
From: Ian Rogers @ 2024-07-01  4:42 UTC (permalink / raw)
  To: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Namhyung Kim, Mark Rutland, Alexander Shishkin, Jiri Olsa,
	Ian Rogers, Adrian Hunter, Kan Liang, James Clark, linux-kernel,
	linux-perf-users

Before polling or sleeping to wait for a test to complete, print out
": Running (<num> remaining)" where the number of remaining tests is
determined by iterating over the remaining tests and seeing which
return true for check_if_command_finished. After the delay, erase the
line and either update it with the new number of remaining tests, or
print the test's result. This allows a user to know a test is running
and in parallel mode (default) how many of the tests are waiting to
complete. If color mode is disabled then avoid displaying the
"Running" message.

Signed-off-by: Ian Rogers <irogers@google.com>
---
 tools/perf/tests/builtin-test.c | 77 ++++++++++++++++++++++-----------
 tools/perf/util/color.h         |  1 +
 2 files changed, 53 insertions(+), 25 deletions(-)

diff --git a/tools/perf/tests/builtin-test.c b/tools/perf/tests/builtin-test.c
index c3d84b67ca8e..23be9139f229 100644
--- a/tools/perf/tests/builtin-test.c
+++ b/tools/perf/tests/builtin-test.c
@@ -241,7 +241,10 @@ static int run_test_child(struct child_process *process)
 	return -err;
 }
 
-static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width)
+#define TEST_RUNNING -3
+
+static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width,
+			     int remaining)
 {
 	if (has_subtests(t)) {
 		int subw = width > 2 ? width - 2 : width;
@@ -251,6 +254,9 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
 		pr_info("%3d: %-*s:", i + 1, width, test_description(t, subtest));
 
 	switch (result) {
+	case TEST_RUNNING:
+		color_fprintf(stderr, PERF_COLOR_YELLOW, " Running (%d remaining)\n", remaining);
+		break;
 	case TEST_OK:
 		pr_info(" Ok\n");
 		break;
@@ -272,13 +278,15 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
 	return 0;
 }
 
-static int finish_test(struct child_test *child_test, int width)
+static int finish_test(struct child_test **child_tests, int running_test, int child_test_num,
+		       int width)
 {
+	struct child_test *child_test = child_tests[running_test];
 	struct test_suite *t = child_test->test;
 	int i = child_test->test_num;
 	int subi = child_test->subtest;
 	int err = child_test->process.err;
-	bool err_done = err <= 0;
+	bool err_done = false;
 	struct strbuf err_output = STRBUF_INIT;
 	int ret;
 
@@ -293,7 +301,7 @@ static int finish_test(struct child_test *child_test, int width)
 	 * Busy loop reading from the child's stdout/stderr that are set to be
 	 * non-blocking until EOF.
 	 */
-	if (!err_done)
+	if (err > 0)
 		fcntl(err, F_SETFL, O_NONBLOCK);
 	if (verbose > 1) {
 		if (has_subtests(t))
@@ -307,29 +315,48 @@ static int finish_test(struct child_test *child_test, int width)
 			  .events = POLLIN | POLLERR | POLLHUP | POLLNVAL,
 			},
 		};
-		char buf[512];
-		ssize_t len;
-
-		/* Poll to avoid excessive spinning, timeout set for 100ms. */
-		poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
-		if (!err_done && pfds[0].revents) {
-			errno = 0;
-			len = read(err, buf, sizeof(buf) - 1);
-
-			if (len <= 0) {
-				err_done = errno != EAGAIN;
-			} else {
-				buf[len] = '\0';
-				if (verbose > 1)
-					fprintf(stdout, "%s", buf);
-				else
+		if (perf_use_color_default) {
+			int tests_in_progress = running_test;
+
+			for (int y = running_test; y < child_test_num; y++) {
+				if (check_if_command_finished(&child_tests[y]->process))
+					tests_in_progress++;
+			}
+			print_test_result(t, i, subi, TEST_RUNNING, width,
+					  child_test_num - tests_in_progress);
+		}
+
+		err_done = true;
+		if (err <= 0) {
+			/* No child stderr to poll, sleep for 10ms for child to complete. */
+			usleep(10 * 1000);
+		} else {
+			/* Poll to avoid excessive spinning, timeout set for 100ms. */
+			poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
+			if (pfds[0].revents) {
+				char buf[512];
+				ssize_t len;
+
+				len = read(err, buf, sizeof(buf) - 1);
+
+				if (len > 0) {
+					err_done = false;
+					buf[len] = '\0';
 					strbuf_addstr(&err_output, buf);
+				}
 			}
 		}
+		if (err_done)
+			err_done = check_if_command_finished(&child_test->process);
+
+		if (perf_use_color_default) {
+			/* Erase "Running (.. remaining)" line printed before poll/sleep. */
+			fprintf(debug_file(), PERF_COLOR_DELETE_LINE);
+		}
 	}
 	/* Clean up child process. */
 	ret = finish_command(&child_test->process);
-	if (verbose == 1 && ret == TEST_FAIL) {
+	if (verbose > 1 || (verbose == 1 && ret == TEST_FAIL)) {
 		/* Add header for test that was skipped above. */
 		if (has_subtests(t))
 			pr_info("%3d.%1d: %s:\n", i + 1, subi + 1, test_description(t, subi));
@@ -338,7 +365,7 @@ static int finish_test(struct child_test *child_test, int width)
 		fprintf(stderr, "%s", err_output.buf);
 	}
 	strbuf_release(&err_output);
-	print_test_result(t, i, subi, ret, width);
+	print_test_result(t, i, subi, ret, width, /*remaining=*/0);
 	if (err > 0)
 		close(err);
 	return 0;
@@ -354,7 +381,7 @@ static int start_test(struct test_suite *test, int i, int subi, struct child_tes
 		pr_debug("--- start ---\n");
 		err = test_function(test, subi)(test, subi);
 		pr_debug("---- end ----\n");
-		print_test_result(test, i, subi, err, width);
+		print_test_result(test, i, subi, err, width, /*remaining=*/0);
 		return 0;
 	}
 
@@ -379,7 +406,7 @@ static int start_test(struct test_suite *test, int i, int subi, struct child_tes
 	err = start_command(&(*child)->process);
 	if (err || !sequential)
 		return  err;
-	return finish_test(*child, width);
+	return finish_test(child, /*running_test=*/0, /*child_test_num=*/1, width);
 }
 
 #define for_each_test(j, k, t)					\
@@ -464,7 +491,7 @@ static int __cmd_test(int argc, const char *argv[], struct intlist *skiplist)
 	}
 	for (i = 0; i < child_test_num; i++) {
 		if (!sequential) {
-			int ret  = finish_test(child_tests[i], width);
+			int ret  = finish_test(child_tests, i, child_test_num, width);
 
 			if (ret)
 				return ret;
diff --git a/tools/perf/util/color.h b/tools/perf/util/color.h
index 01f7bed21c9b..4b9f8d5d4439 100644
--- a/tools/perf/util/color.h
+++ b/tools/perf/util/color.h
@@ -22,6 +22,7 @@
 #define MIN_GREEN	0.5
 #define MIN_RED		5.0
 
+#define PERF_COLOR_DELETE_LINE	"\033[A\33[2K\r"
 /*
  * This variable stores the value of color.ui
  */
-- 
2.45.2.803.g4e1b14247a-goog


^ permalink raw reply related	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-01  4:42 ` [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished() Ian Rogers
@ 2024-07-03  3:23   ` Namhyung Kim
  2024-07-03  4:24     ` Ian Rogers
  0 siblings, 1 reply; 12+ messages in thread
From: Namhyung Kim @ 2024-07-03  3:23 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

Hi Ian,

On Sun, Jun 30, 2024 at 09:42:35PM -0700, Ian Rogers wrote:
> Using waitpid can cause stdout/stderr of the child process to be
> lost. Use Linux's /prod/<pid>/status file to determine if the process
> has reached the zombie state. Use the 'status' file rather than 'stat'
> to avoid issues around skipping the process name.
> 
> Signed-off-by: Ian Rogers <irogers@google.com>
> ---
>  tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
>  1 file changed, 33 insertions(+)
> 
> diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
> index 4e3a557a2f37..ec06683e77a0 100644
> --- a/tools/lib/subcmd/run-command.c
> +++ b/tools/lib/subcmd/run-command.c
> @@ -2,6 +2,7 @@
>  #include <unistd.h>
>  #include <sys/types.h>
>  #include <sys/stat.h>
> +#include <ctype.h>
>  #include <fcntl.h>
>  #include <string.h>
>  #include <linux/string.h>
> @@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
>  
>  int check_if_command_finished(struct child_process *cmd)
>  {
> +#ifdef __linux__

Is this really necessary?  I don't think we plan to support other OS..

Thanks,
Namhyung


> +	char filename[FILENAME_MAX + 12];
> +	char status_line[256];
> +	FILE *status_file;
> +
> +	/*
> +	 * Check by reading /proc/<pid>/status as calling waitpid causes
> +	 * stdout/stderr to be closed and data lost.
> +	 */
> +	sprintf(filename, "/proc/%d/status", cmd->pid);
> +	status_file = fopen(filename, "r");
> +	if (status_file == NULL) {
> +		/* Open failed assume finish_command was called. */
> +		return true;
> +	}
> +	while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
> +		char *p;
> +
> +		if (strncmp(status_line, "State:", 6))
> +			continue;
> +
> +		fclose(status_file);
> +		p = status_line + 6;
> +		while (isspace(*p))
> +			p++;
> +		return *p == 'Z';
> +	}
> +	/* Read failed assume finish_command was called. */
> +	fclose(status_file);
> +	return true;
> +#else
>  	wait_or_whine(cmd, /*block=*/false);
>  	return cmd->finished;
> +#endif
>  }
>  
>  int finish_command(struct child_process *cmd)
> -- 
> 2.45.2.803.g4e1b14247a-goog
> 

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 2/2] perf test: Display number of remaining tests
  2024-07-01  4:42 ` [PATCH v2 2/2] perf test: Display number of remaining tests Ian Rogers
@ 2024-07-03  3:39   ` Namhyung Kim
  2024-07-03  4:30     ` Ian Rogers
  0 siblings, 1 reply; 12+ messages in thread
From: Namhyung Kim @ 2024-07-03  3:39 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Sun, Jun 30, 2024 at 09:42:36PM -0700, Ian Rogers wrote:
> Before polling or sleeping to wait for a test to complete, print out
> ": Running (<num> remaining)" where the number of remaining tests is
> determined by iterating over the remaining tests and seeing which
> return true for check_if_command_finished. After the delay, erase the
> line and either update it with the new number of remaining tests, or
> print the test's result. This allows a user to know a test is running
> and in parallel mode (default) how many of the tests are waiting to

It's not default anymore. :)


> complete. If color mode is disabled then avoid displaying the
> "Running" message.
> 
> Signed-off-by: Ian Rogers <irogers@google.com>
> ---
>  tools/perf/tests/builtin-test.c | 77 ++++++++++++++++++++++-----------
>  tools/perf/util/color.h         |  1 +
>  2 files changed, 53 insertions(+), 25 deletions(-)
> 
> diff --git a/tools/perf/tests/builtin-test.c b/tools/perf/tests/builtin-test.c
> index c3d84b67ca8e..23be9139f229 100644
> --- a/tools/perf/tests/builtin-test.c
> +++ b/tools/perf/tests/builtin-test.c
> @@ -241,7 +241,10 @@ static int run_test_child(struct child_process *process)
>  	return -err;
>  }
>  
> -static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width)
> +#define TEST_RUNNING -3
> +
> +static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width,
> +			     int remaining)
>  {
>  	if (has_subtests(t)) {
>  		int subw = width > 2 ? width - 2 : width;
> @@ -251,6 +254,9 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
>  		pr_info("%3d: %-*s:", i + 1, width, test_description(t, subtest));
>  
>  	switch (result) {
> +	case TEST_RUNNING:
> +		color_fprintf(stderr, PERF_COLOR_YELLOW, " Running (%d remaining)\n", remaining);
> +		break;
>  	case TEST_OK:
>  		pr_info(" Ok\n");
>  		break;
> @@ -272,13 +278,15 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
>  	return 0;
>  }
>  
> -static int finish_test(struct child_test *child_test, int width)
> +static int finish_test(struct child_test **child_tests, int running_test, int child_test_num,
> +		       int width)
>  {
> +	struct child_test *child_test = child_tests[running_test];
>  	struct test_suite *t = child_test->test;
>  	int i = child_test->test_num;
>  	int subi = child_test->subtest;
>  	int err = child_test->process.err;
> -	bool err_done = err <= 0;
> +	bool err_done = false;
>  	struct strbuf err_output = STRBUF_INIT;
>  	int ret;
>  
> @@ -293,7 +301,7 @@ static int finish_test(struct child_test *child_test, int width)
>  	 * Busy loop reading from the child's stdout/stderr that are set to be
>  	 * non-blocking until EOF.
>  	 */
> -	if (!err_done)
> +	if (err > 0)
>  		fcntl(err, F_SETFL, O_NONBLOCK);
>  	if (verbose > 1) {
>  		if (has_subtests(t))
> @@ -307,29 +315,48 @@ static int finish_test(struct child_test *child_test, int width)
>  			  .events = POLLIN | POLLERR | POLLHUP | POLLNVAL,
>  			},
>  		};
> -		char buf[512];
> -		ssize_t len;
> -
> -		/* Poll to avoid excessive spinning, timeout set for 100ms. */
> -		poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
> -		if (!err_done && pfds[0].revents) {
> -			errno = 0;
> -			len = read(err, buf, sizeof(buf) - 1);
> -
> -			if (len <= 0) {
> -				err_done = errno != EAGAIN;
> -			} else {
> -				buf[len] = '\0';
> -				if (verbose > 1)
> -					fprintf(stdout, "%s", buf);
> -				else
> +		if (perf_use_color_default) {
> +			int tests_in_progress = running_test;
> +
> +			for (int y = running_test; y < child_test_num; y++) {
> +				if (check_if_command_finished(&child_tests[y]->process))
> +					tests_in_progress++;
> +			}
> +			print_test_result(t, i, subi, TEST_RUNNING, width,
> +					  child_test_num - tests_in_progress);
> +		}
> +
> +		err_done = true;
> +		if (err <= 0) {
> +			/* No child stderr to poll, sleep for 10ms for child to complete. */
> +			usleep(10 * 1000);
> +		} else {
> +			/* Poll to avoid excessive spinning, timeout set for 100ms. */
> +			poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);

When I tested this patch, I saw it refreshes too often in parallel mode.
Maybe 100ms is too short?  I don't know if it's from usleep (10ms) or
here.

Thanks,
Namhyung


> +			if (pfds[0].revents) {
> +				char buf[512];
> +				ssize_t len;
> +
> +				len = read(err, buf, sizeof(buf) - 1);
> +
> +				if (len > 0) {
> +					err_done = false;
> +					buf[len] = '\0';
>  					strbuf_addstr(&err_output, buf);
> +				}
>  			}
>  		}
> +		if (err_done)
> +			err_done = check_if_command_finished(&child_test->process);
> +
> +		if (perf_use_color_default) {
> +			/* Erase "Running (.. remaining)" line printed before poll/sleep. */
> +			fprintf(debug_file(), PERF_COLOR_DELETE_LINE);
> +		}
>  	}
>  	/* Clean up child process. */
>  	ret = finish_command(&child_test->process);
> -	if (verbose == 1 && ret == TEST_FAIL) {
> +	if (verbose > 1 || (verbose == 1 && ret == TEST_FAIL)) {
>  		/* Add header for test that was skipped above. */
>  		if (has_subtests(t))
>  			pr_info("%3d.%1d: %s:\n", i + 1, subi + 1, test_description(t, subi));
> @@ -338,7 +365,7 @@ static int finish_test(struct child_test *child_test, int width)
>  		fprintf(stderr, "%s", err_output.buf);
>  	}
>  	strbuf_release(&err_output);
> -	print_test_result(t, i, subi, ret, width);
> +	print_test_result(t, i, subi, ret, width, /*remaining=*/0);
>  	if (err > 0)
>  		close(err);
>  	return 0;
> @@ -354,7 +381,7 @@ static int start_test(struct test_suite *test, int i, int subi, struct child_tes
>  		pr_debug("--- start ---\n");
>  		err = test_function(test, subi)(test, subi);
>  		pr_debug("---- end ----\n");
> -		print_test_result(test, i, subi, err, width);
> +		print_test_result(test, i, subi, err, width, /*remaining=*/0);
>  		return 0;
>  	}
>  
> @@ -379,7 +406,7 @@ static int start_test(struct test_suite *test, int i, int subi, struct child_tes
>  	err = start_command(&(*child)->process);
>  	if (err || !sequential)
>  		return  err;
> -	return finish_test(*child, width);
> +	return finish_test(child, /*running_test=*/0, /*child_test_num=*/1, width);
>  }
>  
>  #define for_each_test(j, k, t)					\
> @@ -464,7 +491,7 @@ static int __cmd_test(int argc, const char *argv[], struct intlist *skiplist)
>  	}
>  	for (i = 0; i < child_test_num; i++) {
>  		if (!sequential) {
> -			int ret  = finish_test(child_tests[i], width);
> +			int ret  = finish_test(child_tests, i, child_test_num, width);
>  
>  			if (ret)
>  				return ret;
> diff --git a/tools/perf/util/color.h b/tools/perf/util/color.h
> index 01f7bed21c9b..4b9f8d5d4439 100644
> --- a/tools/perf/util/color.h
> +++ b/tools/perf/util/color.h
> @@ -22,6 +22,7 @@
>  #define MIN_GREEN	0.5
>  #define MIN_RED		5.0
>  
> +#define PERF_COLOR_DELETE_LINE	"\033[A\33[2K\r"
>  /*
>   * This variable stores the value of color.ui
>   */
> -- 
> 2.45.2.803.g4e1b14247a-goog
> 

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-03  3:23   ` Namhyung Kim
@ 2024-07-03  4:24     ` Ian Rogers
  2024-07-12 20:33       ` Namhyung Kim
  0 siblings, 1 reply; 12+ messages in thread
From: Ian Rogers @ 2024-07-03  4:24 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Tue, Jul 2, 2024 at 8:24 PM Namhyung Kim <namhyung@kernel.org> wrote:
>
> Hi Ian,
>
> On Sun, Jun 30, 2024 at 09:42:35PM -0700, Ian Rogers wrote:
> > Using waitpid can cause stdout/stderr of the child process to be
> > lost. Use Linux's /prod/<pid>/status file to determine if the process
> > has reached the zombie state. Use the 'status' file rather than 'stat'
> > to avoid issues around skipping the process name.
> >
> > Signed-off-by: Ian Rogers <irogers@google.com>
> > ---
> >  tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
> >  1 file changed, 33 insertions(+)
> >
> > diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
> > index 4e3a557a2f37..ec06683e77a0 100644
> > --- a/tools/lib/subcmd/run-command.c
> > +++ b/tools/lib/subcmd/run-command.c
> > @@ -2,6 +2,7 @@
> >  #include <unistd.h>
> >  #include <sys/types.h>
> >  #include <sys/stat.h>
> > +#include <ctype.h>
> >  #include <fcntl.h>
> >  #include <string.h>
> >  #include <linux/string.h>
> > @@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
> >
> >  int check_if_command_finished(struct child_process *cmd)
> >  {
> > +#ifdef __linux__
>
> Is this really necessary?  I don't think we plan to support other OS..

I don't think it'd be unreasonable to say run "perf report" on
Windows, or using wasm inside a web browser. Part of the reason for
doing things this way was to keep the WNOHANG logic although this
change no longer uses it for __linux__.

Thanks,
Ian

> Thanks,
> Namhyung
>
>
> > +     char filename[FILENAME_MAX + 12];
> > +     char status_line[256];
> > +     FILE *status_file;
> > +
> > +     /*
> > +      * Check by reading /proc/<pid>/status as calling waitpid causes
> > +      * stdout/stderr to be closed and data lost.
> > +      */
> > +     sprintf(filename, "/proc/%d/status", cmd->pid);
> > +     status_file = fopen(filename, "r");
> > +     if (status_file == NULL) {
> > +             /* Open failed assume finish_command was called. */
> > +             return true;
> > +     }
> > +     while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
> > +             char *p;
> > +
> > +             if (strncmp(status_line, "State:", 6))
> > +                     continue;
> > +
> > +             fclose(status_file);
> > +             p = status_line + 6;
> > +             while (isspace(*p))
> > +                     p++;
> > +             return *p == 'Z';
> > +     }
> > +     /* Read failed assume finish_command was called. */
> > +     fclose(status_file);
> > +     return true;
> > +#else
> >       wait_or_whine(cmd, /*block=*/false);
> >       return cmd->finished;
> > +#endif
> >  }
> >
> >  int finish_command(struct child_process *cmd)
> > --
> > 2.45.2.803.g4e1b14247a-goog
> >

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 2/2] perf test: Display number of remaining tests
  2024-07-03  3:39   ` Namhyung Kim
@ 2024-07-03  4:30     ` Ian Rogers
  2024-07-03 21:23       ` Namhyung Kim
  0 siblings, 1 reply; 12+ messages in thread
From: Ian Rogers @ 2024-07-03  4:30 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Tue, Jul 2, 2024 at 8:40 PM Namhyung Kim <namhyung@kernel.org> wrote:
>
> On Sun, Jun 30, 2024 at 09:42:36PM -0700, Ian Rogers wrote:
> > Before polling or sleeping to wait for a test to complete, print out
> > ": Running (<num> remaining)" where the number of remaining tests is
> > determined by iterating over the remaining tests and seeing which
> > return true for check_if_command_finished. After the delay, erase the
> > line and either update it with the new number of remaining tests, or
> > print the test's result. This allows a user to know a test is running
> > and in parallel mode (default) how many of the tests are waiting to
>
> It's not default anymore. :)
>
>
> > complete. If color mode is disabled then avoid displaying the
> > "Running" message.
> >
> > Signed-off-by: Ian Rogers <irogers@google.com>
> > ---
> >  tools/perf/tests/builtin-test.c | 77 ++++++++++++++++++++++-----------
> >  tools/perf/util/color.h         |  1 +
> >  2 files changed, 53 insertions(+), 25 deletions(-)
> >
> > diff --git a/tools/perf/tests/builtin-test.c b/tools/perf/tests/builtin-test.c
> > index c3d84b67ca8e..23be9139f229 100644
> > --- a/tools/perf/tests/builtin-test.c
> > +++ b/tools/perf/tests/builtin-test.c
> > @@ -241,7 +241,10 @@ static int run_test_child(struct child_process *process)
> >       return -err;
> >  }
> >
> > -static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width)
> > +#define TEST_RUNNING -3
> > +
> > +static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width,
> > +                          int remaining)
> >  {
> >       if (has_subtests(t)) {
> >               int subw = width > 2 ? width - 2 : width;
> > @@ -251,6 +254,9 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
> >               pr_info("%3d: %-*s:", i + 1, width, test_description(t, subtest));
> >
> >       switch (result) {
> > +     case TEST_RUNNING:
> > +             color_fprintf(stderr, PERF_COLOR_YELLOW, " Running (%d remaining)\n", remaining);
> > +             break;
> >       case TEST_OK:
> >               pr_info(" Ok\n");
> >               break;
> > @@ -272,13 +278,15 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
> >       return 0;
> >  }
> >
> > -static int finish_test(struct child_test *child_test, int width)
> > +static int finish_test(struct child_test **child_tests, int running_test, int child_test_num,
> > +                    int width)
> >  {
> > +     struct child_test *child_test = child_tests[running_test];
> >       struct test_suite *t = child_test->test;
> >       int i = child_test->test_num;
> >       int subi = child_test->subtest;
> >       int err = child_test->process.err;
> > -     bool err_done = err <= 0;
> > +     bool err_done = false;
> >       struct strbuf err_output = STRBUF_INIT;
> >       int ret;
> >
> > @@ -293,7 +301,7 @@ static int finish_test(struct child_test *child_test, int width)
> >        * Busy loop reading from the child's stdout/stderr that are set to be
> >        * non-blocking until EOF.
> >        */
> > -     if (!err_done)
> > +     if (err > 0)
> >               fcntl(err, F_SETFL, O_NONBLOCK);
> >       if (verbose > 1) {
> >               if (has_subtests(t))
> > @@ -307,29 +315,48 @@ static int finish_test(struct child_test *child_test, int width)
> >                         .events = POLLIN | POLLERR | POLLHUP | POLLNVAL,
> >                       },
> >               };
> > -             char buf[512];
> > -             ssize_t len;
> > -
> > -             /* Poll to avoid excessive spinning, timeout set for 100ms. */
> > -             poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
> > -             if (!err_done && pfds[0].revents) {
> > -                     errno = 0;
> > -                     len = read(err, buf, sizeof(buf) - 1);
> > -
> > -                     if (len <= 0) {
> > -                             err_done = errno != EAGAIN;
> > -                     } else {
> > -                             buf[len] = '\0';
> > -                             if (verbose > 1)
> > -                                     fprintf(stdout, "%s", buf);
> > -                             else
> > +             if (perf_use_color_default) {
> > +                     int tests_in_progress = running_test;
> > +
> > +                     for (int y = running_test; y < child_test_num; y++) {
> > +                             if (check_if_command_finished(&child_tests[y]->process))
> > +                                     tests_in_progress++;
> > +                     }
> > +                     print_test_result(t, i, subi, TEST_RUNNING, width,
> > +                                       child_test_num - tests_in_progress);
> > +             }
> > +
> > +             err_done = true;
> > +             if (err <= 0) {
> > +                     /* No child stderr to poll, sleep for 10ms for child to complete. */
> > +                     usleep(10 * 1000);
> > +             } else {
> > +                     /* Poll to avoid excessive spinning, timeout set for 100ms. */
> > +                     poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
>
> When I tested this patch, I saw it refreshes too often in parallel mode.
> Maybe 100ms is too short?  I don't know if it's from usleep (10ms) or
> here.

It's usually the poll and I suspect it is the test writing a lot of
output. I agree it can look a little flickery but it is also
responsive in terms of not waiting too long before moving to the next
test. I think it is possible to improve on the code here, the main
thing I was after was making the output writing self contained and not
split between start test and finish test, as that won't work well in
the parallel case.

Thanks,
Ian

> Thanks,
> Namhyung
>
>
> > +                     if (pfds[0].revents) {
> > +                             char buf[512];
> > +                             ssize_t len;
> > +
> > +                             len = read(err, buf, sizeof(buf) - 1);
> > +
> > +                             if (len > 0) {
> > +                                     err_done = false;
> > +                                     buf[len] = '\0';
> >                                       strbuf_addstr(&err_output, buf);
> > +                             }
> >                       }
> >               }
> > +             if (err_done)
> > +                     err_done = check_if_command_finished(&child_test->process);
> > +
> > +             if (perf_use_color_default) {
> > +                     /* Erase "Running (.. remaining)" line printed before poll/sleep. */
> > +                     fprintf(debug_file(), PERF_COLOR_DELETE_LINE);
> > +             }
> >       }
> >       /* Clean up child process. */
> >       ret = finish_command(&child_test->process);
> > -     if (verbose == 1 && ret == TEST_FAIL) {
> > +     if (verbose > 1 || (verbose == 1 && ret == TEST_FAIL)) {
> >               /* Add header for test that was skipped above. */
> >               if (has_subtests(t))
> >                       pr_info("%3d.%1d: %s:\n", i + 1, subi + 1, test_description(t, subi));
> > @@ -338,7 +365,7 @@ static int finish_test(struct child_test *child_test, int width)
> >               fprintf(stderr, "%s", err_output.buf);
> >       }
> >       strbuf_release(&err_output);
> > -     print_test_result(t, i, subi, ret, width);
> > +     print_test_result(t, i, subi, ret, width, /*remaining=*/0);
> >       if (err > 0)
> >               close(err);
> >       return 0;
> > @@ -354,7 +381,7 @@ static int start_test(struct test_suite *test, int i, int subi, struct child_tes
> >               pr_debug("--- start ---\n");
> >               err = test_function(test, subi)(test, subi);
> >               pr_debug("---- end ----\n");
> > -             print_test_result(test, i, subi, err, width);
> > +             print_test_result(test, i, subi, err, width, /*remaining=*/0);
> >               return 0;
> >       }
> >
> > @@ -379,7 +406,7 @@ static int start_test(struct test_suite *test, int i, int subi, struct child_tes
> >       err = start_command(&(*child)->process);
> >       if (err || !sequential)
> >               return  err;
> > -     return finish_test(*child, width);
> > +     return finish_test(child, /*running_test=*/0, /*child_test_num=*/1, width);
> >  }
> >
> >  #define for_each_test(j, k, t)                                       \
> > @@ -464,7 +491,7 @@ static int __cmd_test(int argc, const char *argv[], struct intlist *skiplist)
> >       }
> >       for (i = 0; i < child_test_num; i++) {
> >               if (!sequential) {
> > -                     int ret  = finish_test(child_tests[i], width);
> > +                     int ret  = finish_test(child_tests, i, child_test_num, width);
> >
> >                       if (ret)
> >                               return ret;
> > diff --git a/tools/perf/util/color.h b/tools/perf/util/color.h
> > index 01f7bed21c9b..4b9f8d5d4439 100644
> > --- a/tools/perf/util/color.h
> > +++ b/tools/perf/util/color.h
> > @@ -22,6 +22,7 @@
> >  #define MIN_GREEN    0.5
> >  #define MIN_RED              5.0
> >
> > +#define PERF_COLOR_DELETE_LINE       "\033[A\33[2K\r"
> >  /*
> >   * This variable stores the value of color.ui
> >   */
> > --
> > 2.45.2.803.g4e1b14247a-goog
> >

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 2/2] perf test: Display number of remaining tests
  2024-07-03  4:30     ` Ian Rogers
@ 2024-07-03 21:23       ` Namhyung Kim
  0 siblings, 0 replies; 12+ messages in thread
From: Namhyung Kim @ 2024-07-03 21:23 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Tue, Jul 02, 2024 at 09:30:44PM -0700, Ian Rogers wrote:
> On Tue, Jul 2, 2024 at 8:40 PM Namhyung Kim <namhyung@kernel.org> wrote:
> >
> > On Sun, Jun 30, 2024 at 09:42:36PM -0700, Ian Rogers wrote:
> > > Before polling or sleeping to wait for a test to complete, print out
> > > ": Running (<num> remaining)" where the number of remaining tests is
> > > determined by iterating over the remaining tests and seeing which
> > > return true for check_if_command_finished. After the delay, erase the
> > > line and either update it with the new number of remaining tests, or
> > > print the test's result. This allows a user to know a test is running
> > > and in parallel mode (default) how many of the tests are waiting to
> >
> > It's not default anymore. :)
> >
> >
> > > complete. If color mode is disabled then avoid displaying the
> > > "Running" message.
> > >
> > > Signed-off-by: Ian Rogers <irogers@google.com>
> > > ---
> > >  tools/perf/tests/builtin-test.c | 77 ++++++++++++++++++++++-----------
> > >  tools/perf/util/color.h         |  1 +
> > >  2 files changed, 53 insertions(+), 25 deletions(-)
> > >
> > > diff --git a/tools/perf/tests/builtin-test.c b/tools/perf/tests/builtin-test.c
> > > index c3d84b67ca8e..23be9139f229 100644
> > > --- a/tools/perf/tests/builtin-test.c
> > > +++ b/tools/perf/tests/builtin-test.c
> > > @@ -241,7 +241,10 @@ static int run_test_child(struct child_process *process)
> > >       return -err;
> > >  }
> > >
> > > -static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width)
> > > +#define TEST_RUNNING -3
> > > +
> > > +static int print_test_result(struct test_suite *t, int i, int subtest, int result, int width,
> > > +                          int remaining)
> > >  {
> > >       if (has_subtests(t)) {
> > >               int subw = width > 2 ? width - 2 : width;
> > > @@ -251,6 +254,9 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
> > >               pr_info("%3d: %-*s:", i + 1, width, test_description(t, subtest));
> > >
> > >       switch (result) {
> > > +     case TEST_RUNNING:
> > > +             color_fprintf(stderr, PERF_COLOR_YELLOW, " Running (%d remaining)\n", remaining);
> > > +             break;
> > >       case TEST_OK:
> > >               pr_info(" Ok\n");
> > >               break;
> > > @@ -272,13 +278,15 @@ static int print_test_result(struct test_suite *t, int i, int subtest, int resul
> > >       return 0;
> > >  }
> > >
> > > -static int finish_test(struct child_test *child_test, int width)
> > > +static int finish_test(struct child_test **child_tests, int running_test, int child_test_num,
> > > +                    int width)
> > >  {
> > > +     struct child_test *child_test = child_tests[running_test];
> > >       struct test_suite *t = child_test->test;
> > >       int i = child_test->test_num;
> > >       int subi = child_test->subtest;
> > >       int err = child_test->process.err;
> > > -     bool err_done = err <= 0;
> > > +     bool err_done = false;
> > >       struct strbuf err_output = STRBUF_INIT;
> > >       int ret;
> > >
> > > @@ -293,7 +301,7 @@ static int finish_test(struct child_test *child_test, int width)
> > >        * Busy loop reading from the child's stdout/stderr that are set to be
> > >        * non-blocking until EOF.
> > >        */
> > > -     if (!err_done)
> > > +     if (err > 0)
> > >               fcntl(err, F_SETFL, O_NONBLOCK);
> > >       if (verbose > 1) {
> > >               if (has_subtests(t))
> > > @@ -307,29 +315,48 @@ static int finish_test(struct child_test *child_test, int width)
> > >                         .events = POLLIN | POLLERR | POLLHUP | POLLNVAL,
> > >                       },
> > >               };
> > > -             char buf[512];
> > > -             ssize_t len;
> > > -
> > > -             /* Poll to avoid excessive spinning, timeout set for 100ms. */
> > > -             poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
> > > -             if (!err_done && pfds[0].revents) {
> > > -                     errno = 0;
> > > -                     len = read(err, buf, sizeof(buf) - 1);
> > > -
> > > -                     if (len <= 0) {
> > > -                             err_done = errno != EAGAIN;
> > > -                     } else {
> > > -                             buf[len] = '\0';
> > > -                             if (verbose > 1)
> > > -                                     fprintf(stdout, "%s", buf);
> > > -                             else
> > > +             if (perf_use_color_default) {
> > > +                     int tests_in_progress = running_test;
> > > +
> > > +                     for (int y = running_test; y < child_test_num; y++) {
> > > +                             if (check_if_command_finished(&child_tests[y]->process))
> > > +                                     tests_in_progress++;
> > > +                     }
> > > +                     print_test_result(t, i, subi, TEST_RUNNING, width,
> > > +                                       child_test_num - tests_in_progress);
> > > +             }
> > > +
> > > +             err_done = true;
> > > +             if (err <= 0) {
> > > +                     /* No child stderr to poll, sleep for 10ms for child to complete. */
> > > +                     usleep(10 * 1000);
> > > +             } else {
> > > +                     /* Poll to avoid excessive spinning, timeout set for 100ms. */
> > > +                     poll(pfds, ARRAY_SIZE(pfds), /*timeout=*/100);
> >
> > When I tested this patch, I saw it refreshes too often in parallel mode.
> > Maybe 100ms is too short?  I don't know if it's from usleep (10ms) or
> > here.
> 
> It's usually the poll and I suspect it is the test writing a lot of
> output. I agree it can look a little flickery but it is also
> responsive in terms of not waiting too long before moving to the next
> test. I think it is possible to improve on the code here, the main
> thing I was after was making the output writing self contained and not
> split between start test and finish test, as that won't work well in
> the parallel case.

Is it possible to skip the rewriting if nothing is changed?

Thanks,
Namhyung


^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-03  4:24     ` Ian Rogers
@ 2024-07-12 20:33       ` Namhyung Kim
  2024-07-12 21:19         ` Ian Rogers
  0 siblings, 1 reply; 12+ messages in thread
From: Namhyung Kim @ 2024-07-12 20:33 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

Hi Ian,

On Tue, Jul 02, 2024 at 09:24:50PM -0700, Ian Rogers wrote:
> On Tue, Jul 2, 2024 at 8:24 PM Namhyung Kim <namhyung@kernel.org> wrote:
> >
> > Hi Ian,
> >
> > On Sun, Jun 30, 2024 at 09:42:35PM -0700, Ian Rogers wrote:
> > > Using waitpid can cause stdout/stderr of the child process to be
> > > lost. Use Linux's /prod/<pid>/status file to determine if the process
> > > has reached the zombie state. Use the 'status' file rather than 'stat'
> > > to avoid issues around skipping the process name.
> > >
> > > Signed-off-by: Ian Rogers <irogers@google.com>
> > > ---
> > >  tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
> > >  1 file changed, 33 insertions(+)
> > >
> > > diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
> > > index 4e3a557a2f37..ec06683e77a0 100644
> > > --- a/tools/lib/subcmd/run-command.c
> > > +++ b/tools/lib/subcmd/run-command.c
> > > @@ -2,6 +2,7 @@
> > >  #include <unistd.h>
> > >  #include <sys/types.h>
> > >  #include <sys/stat.h>
> > > +#include <ctype.h>
> > >  #include <fcntl.h>
> > >  #include <string.h>
> > >  #include <linux/string.h>
> > > @@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
> > >
> > >  int check_if_command_finished(struct child_process *cmd)
> > >  {
> > > +#ifdef __linux__
> >
> > Is this really necessary?  I don't think we plan to support other OS..
> 
> I don't think it'd be unreasonable to say run "perf report" on
> Windows, or using wasm inside a web browser. Part of the reason for
> doing things this way was to keep the WNOHANG logic although this
> change no longer uses it for __linux__.

I'm not sure we are ready to run it on other platforms.  So I think
it's better simply remove it for now.

Thanks,
Namhyung


> > > +     char filename[FILENAME_MAX + 12];
> > > +     char status_line[256];
> > > +     FILE *status_file;
> > > +
> > > +     /*
> > > +      * Check by reading /proc/<pid>/status as calling waitpid causes
> > > +      * stdout/stderr to be closed and data lost.
> > > +      */
> > > +     sprintf(filename, "/proc/%d/status", cmd->pid);
> > > +     status_file = fopen(filename, "r");
> > > +     if (status_file == NULL) {
> > > +             /* Open failed assume finish_command was called. */
> > > +             return true;
> > > +     }
> > > +     while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
> > > +             char *p;
> > > +
> > > +             if (strncmp(status_line, "State:", 6))
> > > +                     continue;
> > > +
> > > +             fclose(status_file);
> > > +             p = status_line + 6;
> > > +             while (isspace(*p))
> > > +                     p++;
> > > +             return *p == 'Z';
> > > +     }
> > > +     /* Read failed assume finish_command was called. */
> > > +     fclose(status_file);
> > > +     return true;
> > > +#else
> > >       wait_or_whine(cmd, /*block=*/false);
> > >       return cmd->finished;
> > > +#endif
> > >  }
> > >
> > >  int finish_command(struct child_process *cmd)
> > > --
> > > 2.45.2.803.g4e1b14247a-goog
> > >

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-12 20:33       ` Namhyung Kim
@ 2024-07-12 21:19         ` Ian Rogers
  2024-07-13 14:59           ` Namhyung Kim
  0 siblings, 1 reply; 12+ messages in thread
From: Ian Rogers @ 2024-07-12 21:19 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Fri, Jul 12, 2024 at 1:33 PM Namhyung Kim <namhyung@kernel.org> wrote:
>
> Hi Ian,
>
> On Tue, Jul 02, 2024 at 09:24:50PM -0700, Ian Rogers wrote:
> > On Tue, Jul 2, 2024 at 8:24 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > >
> > > Hi Ian,
> > >
> > > On Sun, Jun 30, 2024 at 09:42:35PM -0700, Ian Rogers wrote:
> > > > Using waitpid can cause stdout/stderr of the child process to be
> > > > lost. Use Linux's /prod/<pid>/status file to determine if the process
> > > > has reached the zombie state. Use the 'status' file rather than 'stat'
> > > > to avoid issues around skipping the process name.
> > > >
> > > > Signed-off-by: Ian Rogers <irogers@google.com>
> > > > ---
> > > >  tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
> > > >  1 file changed, 33 insertions(+)
> > > >
> > > > diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
> > > > index 4e3a557a2f37..ec06683e77a0 100644
> > > > --- a/tools/lib/subcmd/run-command.c
> > > > +++ b/tools/lib/subcmd/run-command.c
> > > > @@ -2,6 +2,7 @@
> > > >  #include <unistd.h>
> > > >  #include <sys/types.h>
> > > >  #include <sys/stat.h>
> > > > +#include <ctype.h>
> > > >  #include <fcntl.h>
> > > >  #include <string.h>
> > > >  #include <linux/string.h>
> > > > @@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
> > > >
> > > >  int check_if_command_finished(struct child_process *cmd)
> > > >  {
> > > > +#ifdef __linux__
> > >
> > > Is this really necessary?  I don't think we plan to support other OS..
> >
> > I don't think it'd be unreasonable to say run "perf report" on
> > Windows, or using wasm inside a web browser. Part of the reason for
> > doing things this way was to keep the WNOHANG logic although this
> > change no longer uses it for __linux__.
>
> I'm not sure we are ready to run it on other platforms.  So I think
> it's better simply remove it for now.

So in the office hours there was some discussion with a potential new
contributor whose development platform is OS/X. It's fairly obvious
this code can't work on anything but Linux and using #error feels
annoying. The waitpid code is tested and has a known issue, but I
think it is better than just breaking anyone not on Linux.

Thanks,
Ian

> Thanks,
> Namhyung
>
>
> > > > +     char filename[FILENAME_MAX + 12];
> > > > +     char status_line[256];
> > > > +     FILE *status_file;
> > > > +
> > > > +     /*
> > > > +      * Check by reading /proc/<pid>/status as calling waitpid causes
> > > > +      * stdout/stderr to be closed and data lost.
> > > > +      */
> > > > +     sprintf(filename, "/proc/%d/status", cmd->pid);
> > > > +     status_file = fopen(filename, "r");
> > > > +     if (status_file == NULL) {
> > > > +             /* Open failed assume finish_command was called. */
> > > > +             return true;
> > > > +     }
> > > > +     while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
> > > > +             char *p;
> > > > +
> > > > +             if (strncmp(status_line, "State:", 6))
> > > > +                     continue;
> > > > +
> > > > +             fclose(status_file);
> > > > +             p = status_line + 6;
> > > > +             while (isspace(*p))
> > > > +                     p++;
> > > > +             return *p == 'Z';
> > > > +     }
> > > > +     /* Read failed assume finish_command was called. */
> > > > +     fclose(status_file);
> > > > +     return true;
> > > > +#else
> > > >       wait_or_whine(cmd, /*block=*/false);
> > > >       return cmd->finished;
> > > > +#endif
> > > >  }
> > > >
> > > >  int finish_command(struct child_process *cmd)
> > > > --
> > > > 2.45.2.803.g4e1b14247a-goog
> > > >

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-12 21:19         ` Ian Rogers
@ 2024-07-13 14:59           ` Namhyung Kim
  2024-07-14 18:13             ` Ian Rogers
  0 siblings, 1 reply; 12+ messages in thread
From: Namhyung Kim @ 2024-07-13 14:59 UTC (permalink / raw)
  To: Ian Rogers
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Fri, Jul 12, 2024 at 02:19:58PM -0700, Ian Rogers wrote:
> On Fri, Jul 12, 2024 at 1:33 PM Namhyung Kim <namhyung@kernel.org> wrote:
> >
> > Hi Ian,
> >
> > On Tue, Jul 02, 2024 at 09:24:50PM -0700, Ian Rogers wrote:
> > > On Tue, Jul 2, 2024 at 8:24 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > > >
> > > > Hi Ian,
> > > >
> > > > On Sun, Jun 30, 2024 at 09:42:35PM -0700, Ian Rogers wrote:
> > > > > Using waitpid can cause stdout/stderr of the child process to be
> > > > > lost. Use Linux's /prod/<pid>/status file to determine if the process
> > > > > has reached the zombie state. Use the 'status' file rather than 'stat'
> > > > > to avoid issues around skipping the process name.
> > > > >
> > > > > Signed-off-by: Ian Rogers <irogers@google.com>
> > > > > ---
> > > > >  tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
> > > > >  1 file changed, 33 insertions(+)
> > > > >
> > > > > diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
> > > > > index 4e3a557a2f37..ec06683e77a0 100644
> > > > > --- a/tools/lib/subcmd/run-command.c
> > > > > +++ b/tools/lib/subcmd/run-command.c
> > > > > @@ -2,6 +2,7 @@
> > > > >  #include <unistd.h>
> > > > >  #include <sys/types.h>
> > > > >  #include <sys/stat.h>
> > > > > +#include <ctype.h>
> > > > >  #include <fcntl.h>
> > > > >  #include <string.h>
> > > > >  #include <linux/string.h>
> > > > > @@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
> > > > >
> > > > >  int check_if_command_finished(struct child_process *cmd)
> > > > >  {
> > > > > +#ifdef __linux__
> > > >
> > > > Is this really necessary?  I don't think we plan to support other OS..
> > >
> > > I don't think it'd be unreasonable to say run "perf report" on
> > > Windows, or using wasm inside a web browser. Part of the reason for
> > > doing things this way was to keep the WNOHANG logic although this
> > > change no longer uses it for __linux__.
> >
> > I'm not sure we are ready to run it on other platforms.  So I think
> > it's better simply remove it for now.
> 
> So in the office hours there was some discussion with a potential new
> contributor whose development platform is OS/X. It's fairly obvious
> this code can't work on anything but Linux and using #error feels
> annoying. The waitpid code is tested and has a known issue, but I
> think it is better than just breaking anyone not on Linux.

I feel like it's a potential issue and should be handled by the
potentiall contributor.  Until that happens, we can assume Linux
and keep the code minimal.

Thanks,
Namhyung

> >
> > > > > +     char filename[FILENAME_MAX + 12];
> > > > > +     char status_line[256];
> > > > > +     FILE *status_file;
> > > > > +
> > > > > +     /*
> > > > > +      * Check by reading /proc/<pid>/status as calling waitpid causes
> > > > > +      * stdout/stderr to be closed and data lost.
> > > > > +      */
> > > > > +     sprintf(filename, "/proc/%d/status", cmd->pid);
> > > > > +     status_file = fopen(filename, "r");
> > > > > +     if (status_file == NULL) {
> > > > > +             /* Open failed assume finish_command was called. */
> > > > > +             return true;
> > > > > +     }
> > > > > +     while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
> > > > > +             char *p;
> > > > > +
> > > > > +             if (strncmp(status_line, "State:", 6))
> > > > > +                     continue;
> > > > > +
> > > > > +             fclose(status_file);
> > > > > +             p = status_line + 6;
> > > > > +             while (isspace(*p))
> > > > > +                     p++;
> > > > > +             return *p == 'Z';
> > > > > +     }
> > > > > +     /* Read failed assume finish_command was called. */
> > > > > +     fclose(status_file);
> > > > > +     return true;
> > > > > +#else
> > > > >       wait_or_whine(cmd, /*block=*/false);
> > > > >       return cmd->finished;
> > > > > +#endif
> > > > >  }
> > > > >
> > > > >  int finish_command(struct child_process *cmd)
> > > > > --
> > > > > 2.45.2.803.g4e1b14247a-goog
> > > > >

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished()
  2024-07-13 14:59           ` Namhyung Kim
@ 2024-07-14 18:13             ` Ian Rogers
  0 siblings, 0 replies; 12+ messages in thread
From: Ian Rogers @ 2024-07-14 18:13 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: Peter Zijlstra, Ingo Molnar, Arnaldo Carvalho de Melo,
	Mark Rutland, Alexander Shishkin, Jiri Olsa, Adrian Hunter,
	Kan Liang, James Clark, linux-kernel, linux-perf-users

On Sat, Jul 13, 2024 at 8:00 AM Namhyung Kim <namhyung@kernel.org> wrote:
>
> On Fri, Jul 12, 2024 at 02:19:58PM -0700, Ian Rogers wrote:
> > On Fri, Jul 12, 2024 at 1:33 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > >
> > > Hi Ian,
> > >
> > > On Tue, Jul 02, 2024 at 09:24:50PM -0700, Ian Rogers wrote:
> > > > On Tue, Jul 2, 2024 at 8:24 PM Namhyung Kim <namhyung@kernel.org> wrote:
> > > > >
> > > > > Hi Ian,
> > > > >
> > > > > On Sun, Jun 30, 2024 at 09:42:35PM -0700, Ian Rogers wrote:
> > > > > > Using waitpid can cause stdout/stderr of the child process to be
> > > > > > lost. Use Linux's /prod/<pid>/status file to determine if the process
> > > > > > has reached the zombie state. Use the 'status' file rather than 'stat'
> > > > > > to avoid issues around skipping the process name.
> > > > > >
> > > > > > Signed-off-by: Ian Rogers <irogers@google.com>
> > > > > > ---
> > > > > >  tools/lib/subcmd/run-command.c | 33 +++++++++++++++++++++++++++++++++
> > > > > >  1 file changed, 33 insertions(+)
> > > > > >
> > > > > > diff --git a/tools/lib/subcmd/run-command.c b/tools/lib/subcmd/run-command.c
> > > > > > index 4e3a557a2f37..ec06683e77a0 100644
> > > > > > --- a/tools/lib/subcmd/run-command.c
> > > > > > +++ b/tools/lib/subcmd/run-command.c
> > > > > > @@ -2,6 +2,7 @@
> > > > > >  #include <unistd.h>
> > > > > >  #include <sys/types.h>
> > > > > >  #include <sys/stat.h>
> > > > > > +#include <ctype.h>
> > > > > >  #include <fcntl.h>
> > > > > >  #include <string.h>
> > > > > >  #include <linux/string.h>
> > > > > > @@ -217,8 +218,40 @@ static int wait_or_whine(struct child_process *cmd, bool block)
> > > > > >
> > > > > >  int check_if_command_finished(struct child_process *cmd)
> > > > > >  {
> > > > > > +#ifdef __linux__
> > > > >
> > > > > Is this really necessary?  I don't think we plan to support other OS..
> > > >
> > > > I don't think it'd be unreasonable to say run "perf report" on
> > > > Windows, or using wasm inside a web browser. Part of the reason for
> > > > doing things this way was to keep the WNOHANG logic although this
> > > > change no longer uses it for __linux__.
> > >
> > > I'm not sure we are ready to run it on other platforms.  So I think
> > > it's better simply remove it for now.
> >
> > So in the office hours there was some discussion with a potential new
> > contributor whose development platform is OS/X. It's fairly obvious
> > this code can't work on anything but Linux and using #error feels
> > annoying. The waitpid code is tested and has a known issue, but I
> > think it is better than just breaking anyone not on Linux.
>
> I feel like it's a potential issue and should be handled by the
> potentiall contributor.  Until that happens, we can assume Linux
> and keep the code minimal.

I'm not clear what the issue is. Arnaldo took the WNOHANG waitpid
contribution but I asked him to drop it due to losing stdout/stderr in
parallel mode due to the waitpid closing these file descriptors early
- we're not running in parallel mode by default any more. Reading
procfs to determine zombie state is clearly a Linux only thing, hence
the ifdef. I'm keen to keep to any extent possible the perf tool
running on non-Linux platforms, for example, gathering a data file on
a server or embedded system then wanting to do perf report on a
different machine which may not be running Linux. libsubcmd is used
throughout the tool and we have many subprocesses for pagers, objdump,
addr2line, etc. I don't agree with making such a core library Linux
only and there's no obligation for the maintainers to take these
patches. I disagree with a minimal patch.

Thanks,
Ian

> Thanks,
> Namhyung
>
> > >
> > > > > > +     char filename[FILENAME_MAX + 12];
> > > > > > +     char status_line[256];
> > > > > > +     FILE *status_file;
> > > > > > +
> > > > > > +     /*
> > > > > > +      * Check by reading /proc/<pid>/status as calling waitpid causes
> > > > > > +      * stdout/stderr to be closed and data lost.
> > > > > > +      */
> > > > > > +     sprintf(filename, "/proc/%d/status", cmd->pid);
> > > > > > +     status_file = fopen(filename, "r");
> > > > > > +     if (status_file == NULL) {
> > > > > > +             /* Open failed assume finish_command was called. */
> > > > > > +             return true;
> > > > > > +     }
> > > > > > +     while (fgets(status_line, sizeof(status_line), status_file) != NULL) {
> > > > > > +             char *p;
> > > > > > +
> > > > > > +             if (strncmp(status_line, "State:", 6))
> > > > > > +                     continue;
> > > > > > +
> > > > > > +             fclose(status_file);
> > > > > > +             p = status_line + 6;
> > > > > > +             while (isspace(*p))
> > > > > > +                     p++;
> > > > > > +             return *p == 'Z';
> > > > > > +     }
> > > > > > +     /* Read failed assume finish_command was called. */
> > > > > > +     fclose(status_file);
> > > > > > +     return true;
> > > > > > +#else
> > > > > >       wait_or_whine(cmd, /*block=*/false);
> > > > > >       return cmd->finished;
> > > > > > +#endif
> > > > > >  }
> > > > > >
> > > > > >  int finish_command(struct child_process *cmd)
> > > > > > --
> > > > > > 2.45.2.803.g4e1b14247a-goog
> > > > > >

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2024-07-14 18:14 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-07-01  4:42 [PATCH v2 0/2] perf test: Display remaining tests while waiting Ian Rogers
2024-07-01  4:42 ` [PATCH v2 1/2] tools subcmd: Add non-waitpid check_if_command_finished() Ian Rogers
2024-07-03  3:23   ` Namhyung Kim
2024-07-03  4:24     ` Ian Rogers
2024-07-12 20:33       ` Namhyung Kim
2024-07-12 21:19         ` Ian Rogers
2024-07-13 14:59           ` Namhyung Kim
2024-07-14 18:13             ` Ian Rogers
2024-07-01  4:42 ` [PATCH v2 2/2] perf test: Display number of remaining tests Ian Rogers
2024-07-03  3:39   ` Namhyung Kim
2024-07-03  4:30     ` Ian Rogers
2024-07-03 21:23       ` Namhyung Kim

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).