linux-trace-devel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Steven Rostedt <rostedt@goodmis.org>
To: "Tzvetomir Stoyanov (VMware)" <tz.stoyanov@gmail.com>
Cc: linux-trace-devel@vger.kernel.org
Subject: Re: [PATCH 2/2] trace-cmd: Wait for first time sync before the trace
Date: Mon, 15 Mar 2021 14:32:44 -0400	[thread overview]
Message-ID: <20210315143244.51bb87c0@gandalf.local.home> (raw)
In-Reply-To: <20210315061819.168426-3-tz.stoyanov@gmail.com>

On Mon, 15 Mar 2021 08:18:19 +0200
"Tzvetomir Stoyanov (VMware)" <tz.stoyanov@gmail.com> wrote:

> Added a barrier in time synchronization threads to ensure the first time
> synchronization passed before to start the trace.
> 
> Signed-off-by: Tzvetomir Stoyanov (VMware) <tz.stoyanov@gmail.com>
> ---
>  lib/trace-cmd/trace-timesync.c | 15 ++++++++++++++-
>  1 file changed, 14 insertions(+), 1 deletion(-)
> 
> diff --git a/lib/trace-cmd/trace-timesync.c b/lib/trace-cmd/trace-timesync.c
> index 06853f9d..5995551e 100644
> --- a/lib/trace-cmd/trace-timesync.c
> +++ b/lib/trace-cmd/trace-timesync.c
> @@ -537,6 +537,7 @@ void tracecmd_tsync_free(struct tracecmd_time_sync *tsync)
>  	tsync_context->sync_size = 0;
>  	pthread_mutex_destroy(&tsync->lock);
>  	pthread_cond_destroy(&tsync->cond);
> +	pthread_barrier_destroy(&tsync->first_sync);
>  	free(tsync->clock_str);
>  	free(tsync->proto_name);
>  	free(tsync);
> @@ -648,6 +649,7 @@ static int tsync_with_guest(struct tracecmd_time_sync *tsync)
>  	int ts_array_size = CLOCK_TS_ARRAY;
>  	struct tsync_proto *proto;
>  	struct timespec timeout;
> +	bool first = true;
>  	bool end = false;
>  	int ret;
>  

This function should always release the barrier, and not depend on the
caller to do so on error. That is, have this:

	clock_context_init(tsync, &proto, false);
-	if (!tsync->context)
+	if (!tsync->context) {
+		pthread_barrier_wait(&tsync->first_sync);
		return -1;
+	}



> @@ -666,6 +668,10 @@ static int tsync_with_guest(struct tracecmd_time_sync *tsync)
>  						  TRACECMD_TIME_SYNC_CMD_PROBE,
>  						  0, NULL);
>  		ret = tsync_get_sample(tsync, proto, ts_array_size);
> +		if (first) {
> +			first = false;
> +			pthread_barrier_wait(&tsync->first_sync);
> +		}
>  		if (ret || end)
>  			break;

On error here, you will cause the caller to incorrectly call
pthread_barrier_wait() again and get stuck. That's why I stated above that
it this function must be responsible to release the barrier.

This is why barriers can be dangerous.

>  		if (tsync->loop_interval > 0) {
> @@ -693,12 +699,17 @@ static int tsync_with_guest(struct tracecmd_time_sync *tsync)
>  static void *tsync_host_thread(void *data)
>  {
>  	struct tracecmd_time_sync *tsync = NULL;
> +	int ret;
>  
>  	tsync = (struct tracecmd_time_sync *)data;
> -	tsync_with_guest(tsync);
> +	ret = tsync_with_guest(tsync);
>  	tracecmd_msg_handle_close(tsync->msg_handle);
>  	tsync->msg_handle = NULL;
>  
> +	/* tsync with guest failed, release the barrier */
> +	if (ret)
> +		pthread_barrier_wait(&tsync->first_sync);
> +

As stated above, do not do this here.

-- Steve

>  	pthread_exit(0);
>  }
>  
> @@ -757,6 +768,7 @@ tracecmd_tsync_with_guest(unsigned long long trace_id, int loop_interval,
>  		tsync->clock_str = strdup(clock);
>  	pthread_mutex_init(&tsync->lock, NULL);
>  	pthread_cond_init(&tsync->cond, NULL);
> +	pthread_barrier_init(&tsync->first_sync, NULL, 2);
>  	pthread_attr_init(&attrib);
>  	pthread_attr_setdetachstate(&attrib, PTHREAD_CREATE_JOINABLE);
>  
> @@ -767,6 +779,7 @@ tracecmd_tsync_with_guest(unsigned long long trace_id, int loop_interval,
>  
>  	if (!get_first_cpu(&pin_mask, &mask_size))
>  		pthread_setaffinity_np(tsync->thread, mask_size, pin_mask);
> +	pthread_barrier_wait(&tsync->first_sync);
>  
>  	if (pin_mask)
>  		CPU_FREE(pin_mask);


      reply	other threads:[~2021-03-15 18:33 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-03-15  6:18 [PATCH 0/2] Refactoring and improvements of time sync logic Tzvetomir Stoyanov (VMware)
2021-03-15  6:18 ` [PATCH 1/2] trace-cmd: Move time sync logic in the trace-cmd library Tzvetomir Stoyanov (VMware)
2021-03-15 17:49   ` Steven Rostedt
2021-03-17  3:40     ` Tzvetomir Stoyanov
2021-03-15  6:18 ` [PATCH 2/2] trace-cmd: Wait for first time sync before the trace Tzvetomir Stoyanov (VMware)
2021-03-15 18:32   ` Steven Rostedt [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20210315143244.51bb87c0@gandalf.local.home \
    --to=rostedt@goodmis.org \
    --cc=linux-trace-devel@vger.kernel.org \
    --cc=tz.stoyanov@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).