From: Steven Rostedt <rostedt@goodmis.org>
To: Slavomir Kaslev <kaslevs@vmware.com>
Cc: linux-trace-devel@vger.kernel.org, pauld@redhat.com,
ykaradzhov@vmware.com, jbacik@fb.com, tstoyanov@vmware.com,
slavomir.kaslev@gmail.com
Subject: Re: [PATCH v2 1/2] trace-cmd: Optimize how pid filters are expressed
Date: Mon, 15 Apr 2019 17:59:19 -0400 [thread overview]
Message-ID: <20190415175919.637f28f6@gandalf.local.home> (raw)
In-Reply-To: <20190415164739.17223-2-kaslevs@vmware.com>
On Mon, 15 Apr 2019 19:47:38 +0300
Slavomir Kaslev <kaslevs@vmware.com> wrote:
> Express pid filters as allowed/disallowed filter ranges
>
> (pid>=100&&pid<=103)
>
> instead of specifying them per pid
>
> (pid==100||pid==101||pid==102||pid==103)
>
> This makes the size of the resulting filter smaller (and faster) and avoids
> overflowing the filter size limit of one page which we can hit on bigger
> machines (say >160 CPUs).
Except it breaks if we have a split.
I ran this:
hackbench 10 &
tracecmd/trace-cmd record -e sched_switch cat /sys/kernel/debug/tracing/events/sched/sched_switch/filter
Time: 0.093
(common_pid<6959||common_pid>6969)||(common_pid<6945||common_pid>6957)||(next_pid<6959||next_pid>6969)||(next_pid<6945||next_pid>6957)
This was the output. Showing that we had common_pid from 6959 - 6969
and 6945 - 6957 (a 6958 was missing), and because of this, we now trace
all processes because (common_pid < 6959 || common_pid > 6957) is
always true.
We need an "&&" there somewhere. That should have been:
((common_pid<6959||common_pid>6969)&&(common_pid<6945||common_pid>6957))||((next_pid<6959||next_pid>6969)&&(next_pid<6945||next_pid>6957))
-- Steve
>
> Signed-off-by: Slavomir Kaslev <kaslevs@vmware.com>
> Reported-by: Phil Auld <pauld@redhat.com>
> Suggested-by: Steven Rostedt (VMware) <rostedt@goodmis.org>
> ---
> tracecmd/trace-record.c | 115 +++++++++++++++++++++++++++-------------
> 1 file changed, 78 insertions(+), 37 deletions(-)
>
> diff --git a/tracecmd/trace-record.c b/tracecmd/trace-record.c
> index 76ca92d..eeee5e9 100644
> --- a/tracecmd/trace-record.c
> +++ b/tracecmd/trace-record.c
> @@ -950,10 +950,61 @@ static void update_ftrace_pids(int reset)
> static void update_event_filters(struct buffer_instance *instance);
> static void update_pid_event_filters(struct buffer_instance *instance);
>
> +static void append_filter_pid_range(char **filter, int *curr_len,
> + const char *field,
> + int start_pid, int end_pid, bool exclude)
> +{
> + char *op, *op1, *op2, *op3;
> + int len;
> +
> + op = *filter && **filter ? "||" : "";
> +
> + // Handle thus case explicitly so that we get `pid==3` instead of
> + // `pid>=3&&pid<=3` for singleton ranges
> + if (start_pid == end_pid) {
> +#define FMT "%s(%s%s%d)"
> + len = snprintf(NULL, 0, FMT, op,
> + field, exclude ? "!=" : "==", start_pid);
> + *filter = realloc(*filter, *curr_len + len + 1);
> + if (!*filter)
> + die("realloc");
> +
> + len = snprintf(*filter + *curr_len, len + 1, FMT, op,
> + field, exclude ? "!=" : "==", start_pid);
> + *curr_len += len;
> +
> + return;
> +#undef FMT
> + }
> +
> + if (exclude) {
> + op1 = "<";
> + op2 = "||";
> + op3 = ">";
> + } else {
> + op1 = ">=";
> + op2 = "&&";
> + op3 = "<=";
> + }
> +
> +#define FMT "%s(%s%s%d%s%s%s%d)"
> + len = snprintf(NULL, 0, FMT, op,
> + field, op1, start_pid, op2,
> + field, op3, end_pid);
> + *filter = realloc(*filter, *curr_len + len + 1);
> + if (!*filter)
> + die("realloc");
> +
> + len = snprintf(*filter + *curr_len, len + 1, FMT, op,
> + field, op1, start_pid, op2,
> + field, op3, end_pid);
> + *curr_len += len;
> +}
> +
> /**
> * make_pid_filter - create a filter string to all pids against @field
> * @curr_filter: Append to a previous filter (may realloc). Can be NULL
> - * @field: The fild to compare the pids against
> + * @field: The field to compare the pids against
> *
> * Creates a new string or appends to an existing one if @curr_filter
> * is not NULL. The new string will contain a filter with all pids
> @@ -963,54 +1014,44 @@ static void update_pid_event_filters(struct buffer_instance *instance);
> */
> static char *make_pid_filter(char *curr_filter, const char *field)
> {
> + int curr_len = 0, last_exclude = -1;
> + int start_pid = -1, last_pid = -1;
> + char *filter = NULL, *save;
> struct filter_pids *p;
> - char *filter;
> - char *orit;
> - char *match;
> - char *str;
> - int curr_len = 0;
> - int len;
>
> /* Use the new method if possible */
> if (have_set_event_pid)
> return NULL;
>
> - len = len_filter_pids + (strlen(field) + strlen("(==)||")) * nr_filter_pids;
> -
> - if (curr_filter) {
> - curr_len = strlen(curr_filter);
> - filter = realloc(curr_filter, curr_len + len + strlen("(&&())"));
> - if (!filter)
> - die("realloc");
> - memmove(filter+1, curr_filter, curr_len);
> - filter[0] = '(';
> - strcat(filter, ")&&(");
> - curr_len = strlen(filter);
> - } else
> - filter = malloc(len);
> - if (!filter)
> - die("Failed to allocate pid filter");
> -
> - /* Last '||' that is not used will cover the \0 */
> - str = filter + curr_len;
> + if (!filter_pids)
> + return curr_filter;
>
> for (p = filter_pids; p; p = p->next) {
> - if (p->exclude) {
> - match = "!=";
> - orit = "&&";
> - } else {
> - match = "==";
> - orit = "||";
> + /* PIDs are inserted in `filter_pids` from the front and that's
> + * why we expect them in descending order here.
> + */
> + if (p->pid == last_pid - 1 && p->exclude == last_exclude) {
> + last_pid = p->pid;
> + continue;
> }
> - if (p == filter_pids)
> - orit = "";
>
> - len = sprintf(str, "%s(%s%s%d)", orit, field, match, p->pid);
> - str += len;
> + if (start_pid != -1)
> + append_filter_pid_range(&filter, &curr_len, field,
> + last_pid, start_pid,
> + last_exclude);
> +
> + start_pid = last_pid = p->pid;
> + last_exclude = p->exclude;
> +
> }
> + append_filter_pid_range(&filter, &curr_len, field,
> + last_pid, start_pid, last_exclude);
>
> - if (curr_len)
> - sprintf(str, ")");
> + if (curr_filter) {
> + save = filter;
> + asprintf(&filter, "(%s)&&(%s)", curr_filter, filter);
> + free(save);
> + }
>
> return filter;
> }
next prev parent reply other threads:[~2019-04-15 21:59 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-04-15 16:47 [PATCH v2 0/2] Optimize pid filters and add --no-filter option Slavomir Kaslev
2019-04-15 16:47 ` [PATCH v2 1/2] trace-cmd: Optimize how pid filters are expressed Slavomir Kaslev
2019-04-15 21:59 ` Steven Rostedt [this message]
2019-04-15 22:55 ` Slavomir Kaslev
2019-04-15 16:47 ` [PATCH v2 2/2] trace-cmd: Add --no-filter option to not filter recording processes Slavomir Kaslev
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190415175919.637f28f6@gandalf.local.home \
--to=rostedt@goodmis.org \
--cc=jbacik@fb.com \
--cc=kaslevs@vmware.com \
--cc=linux-trace-devel@vger.kernel.org \
--cc=pauld@redhat.com \
--cc=slavomir.kaslev@gmail.com \
--cc=tstoyanov@vmware.com \
--cc=ykaradzhov@vmware.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).