All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ingo Molnar <mingo@elte.hu>
To: Tom Zanussi <tzanussi@gmail.com>
Cc: LKML <linux-kernel@vger.kernel.org>,
	Steven Rostedt <rostedt@goodmis.org>,
	fweisbec@gmail.com, paulmck@linux.vnet.ibm.com
Subject: Re: [PATCH] tracing/filters: allow on-the-fly filter switching
Date: Mon, 13 Apr 2009 23:52:14 +0200	[thread overview]
Message-ID: <20090413215214.GD8514@elte.hu> (raw)
In-Reply-To: <1239610670.6660.49.camel@tropicana>


* Tom Zanussi <tzanussi@gmail.com> wrote:

> This patch allows event filters to be safely removed or switched 
> on-the-fly while avoiding the use of rcu or the suspension of 
> tracing of previous versions.
> 
> It does it by adding a new filter_pred_none() predicate function 
> which does nothing and by never deallocating either the predicates 
> or any of the filter_pred members used in matching; the predicate 
> lists are allocated and initialized during ftrace_event_calls 
> initialization.
> 
> Whenever a filter is removed or replaced, the filter_pred_* 
> functions currently in use by the affected ftrace_event_call are 
> immediately switched over to to the filter_pred_none() function, 
> while the rest of the filter_pred members are left intact, 
> allowing any currently executing filter_pred_* functions to finish 
> up, using the values they're currently using.
> 
> In the case of filter replacement, the new predicate values are 
> copied into the old predicates after the above step, and the 
> filter_pred_none() functions are replaced by the filter_pred_* 
> functions for the new filter.  In this case, it is possible though 
> very unlikely that a previous filter_pred_* is still running even 
> after the filter_pred_none() switch and the switch to the new 
> filter_pred_*.  In that case, however, because nothing has been 
> deallocated in the filter_pred, the worst that can happen is that 
> the old filter_pred_* function sees the new values and as a result 
> produces either a false positive or a false negative, depending on 
> the values it finds.
> 
> So one downside to this method is that rarely, it can produce a 
> bad match during the filter switch, but it should be possible to 
> live with that, IMHO.

Yeah.

It is really a strong thing to avoid RCU here. Instrumentation 
should be self-sufficient to a large degree, and it does not get any 
more lowlevel than filter expression evaluation engine. Forcing the 
use of rcu_read_lock() there would limit its utility.

> The other downside is that at least in this patch the predicate 
> lists are always pre-allocated, taking up memory from the start.  
> They could probably be allocated on first-use, and de-allocated 
> when tracing is completely stopped - if this patch makes sense, I 
> could create another one to do that later on.

That's not a big issue IMO.

> Oh, and it also places a restriction on the size of __arrays in 
> events, currently set to 128, since they can't be larger than the 
> now embedded str_val arrays in the filter_pred struct.

that's OK too - we really want pre-calculated filter expressions and 
as atomic evaluations as possible. So having the maximum width 
specified is no big deal.

The only exception would be if we ever do PATH_MAX type of field 
value comparisons - and i dont see any reason why not, once tracing 
is extended to the VFS or once the syscall tracer . That would 
increase it to 4096 bytes, making the max kzalloc larger than page 
size - still not outrageous so not a big problem. Just lets keep it 
in mind that 128 is a bit on the low side.

also:

> +     if (!val_str || !strlen(val_str)
> +         || strlen(val_str) >= MAX_FILTER_STR_VAL) {
>               pred->field_name = NULL;
>               return -EINVAL;
>	}

it might be quite cryptic to the user why a complex expression was 
not installed. I think a single-line KERN_INFO syslog entry would be 
most helpful.

	Ingo

  reply	other threads:[~2009-04-13 21:52 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-04-13  8:17 [PATCH] tracing/filters: allow on-the-fly filter switching Tom Zanussi
2009-04-13 21:52 ` Ingo Molnar [this message]
2009-04-14  0:33 ` [tip:tracing/core] " tip-bot for Tom Zanussi
2009-04-14 20:56 ` [PATCH] " Frederic Weisbecker
2009-04-15  1:12   ` Li Zefan
2009-04-15  1:55     ` Frederic Weisbecker
2009-04-15  4:17     ` Tom Zanussi
2009-04-15  4:32   ` Tom Zanussi
2009-04-15 16:21     ` Frederic Weisbecker
2009-04-16  0:54       ` Li Zefan
2009-04-16  5:34         ` Tom Zanussi
2009-04-16 15:29         ` Frederic Weisbecker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090413215214.GD8514@elte.hu \
    --to=mingo@elte.hu \
    --cc=fweisbec@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=rostedt@goodmis.org \
    --cc=tzanussi@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.