From: Paulo Miguel Almeida <paulo.miguel.almeida.rodenas@gmail.com>
To: Steven Rostedt <rostedt@goodmis.org>
Cc: linux-trace-devel@vger.kernel.org
Subject: Re: [PATCH] trace-cmd: document expected behaviour of execvp for record command
Date: Fri, 6 Jan 2023 15:09:58 +1300 [thread overview]
Message-ID: <Y7eC9o4JDO7XWjlw@mail.google.com> (raw)
In-Reply-To: <20230105181317.32dcafa7@gandalf.local.home>
On Thu, Jan 05, 2023 at 06:13:17PM -0500, Steven Rostedt wrote:
> On Fri, 6 Jan 2023 11:52:19 +1300
> Paulo Miguel Almeida <paulo.miguel.almeida.rodenas@gmail.com> wrote:
>
> > In tracecmd/trace-record.c:<run_cmd>, trace-cmd record -F <executable>
> > is launched via the libc's execvp() routine.
> >
> > If you run a command which is meant to be found in one of the entries
> > of the PATH env var then you should see <n> invocations of the
> > __x64_sys_execve() syscall where <n> will be equal to the number of
> > attempts that execvp() routine had done until it could find the
> > executable.
> >
> > While the routine works as expected, its behaviour can seem a bit
> > cryptic for untrained eyes and makes one wonder whether there is
> > something wrong in any of the parts involved in the tracing operation.
> > Some time after one digs into trace-cmd's and libc's code base, one
> > realises that everything is working as expected but documenting it could
> > save other people's time :-)
> >
> > Document the expected behaviour ftrace users should see depending on
> > the way -F <executable> is used.
> >
> > Signed-off-by: Paulo Miguel Almeida <paulo.miguel.almeida.rodenas@gmail.com>
> > ---
> > Additional context:
> >
> > - absolute path example:
> >
> > # trace-cmd record -p function_graph \
> > -g __x64_sys_execve -O nofuncgraph-irqs \
> > -n __cond_resched --max-graph-depth 1 \
> > -F /usr/bin/echo "ftrace" > /dev/null
> >
> > # trace-cmd report
> > echo-172994 [000] 185539.798539: funcgraph_entry: ! 803.376 us | __x64_sys_execve();
> >
> > - PATH-dependent path example:
> >
> > # trace-cmd record -p function_graph \
> > -g __x64_sys_execve -O nofuncgraph-irqs \
> > -n __cond_resched --max-graph-depth 1 \
> > -F echo "ftrace" > /dev/null
> >
> > # trace-cmd report
> > echo-172656 [002] 185009.671586: funcgraph_entry: ! 288.732 us | __x64_sys_execve();
> > echo-172656 [002] 185009.671879: funcgraph_entry: ! 158.337 us | __x64_sys_execve();
> > echo-172656 [002] 185009.672042: funcgraph_entry: ! 161.843 us | __x64_sys_execve();
> > echo-172656 [002] 185009.672207: funcgraph_entry: ! 157.656 us | __x64_sys_execve();
> > echo-172656 [002] 185009.672369: funcgraph_entry: ! 156.343 us | __x64_sys_execve();
> > echo-172656 [002] 185009.672529: funcgraph_entry: ! 863.629 us | __x64_sys_execve();
>
> Honestly, the above should be in the change log.
Fair enough. I will send a new patch shortly.
>
> >
> > ---
> > Documentation/trace-cmd/trace-cmd-record.1.txt | 4 ++++
> > 1 file changed, 4 insertions(+)
> >
> > diff --git a/Documentation/trace-cmd/trace-cmd-record.1.txt b/Documentation/trace-cmd/trace-cmd-record.1.txt
> > index b709e48..a000cbe 100644
> > --- a/Documentation/trace-cmd/trace-cmd-record.1.txt
> > +++ b/Documentation/trace-cmd/trace-cmd-record.1.txt
> > @@ -113,6 +113,10 @@ OPTIONS
> > Using *-F* will let you trace only events that are caused by the given
> > command.
> >
> > + Note: if the specified filename is neither absolute or relative then libc
> > + will invoke execve() syscall for every entry in the colon-separated list of
> > + directory pathnames specified in the PATH environment variable.
> > +
>
> Hmm, do you think it may be worth open-coding execvp() and looking for it
> from trace-cmd, and then only enabling tracing when it found the full path?
>
> -- Steve
That's a good point. I thought about it for a while and this is really a pickle.
I'll dump the things I'm thinking about and peharps you can help me
choosing between the approaches :-)
Launching executables via:
libc->execvp: It avoid some sorts of TOCTOU race conditions by trying to
execve executables and leave it up to the kernel to do a 'single'
file-exists validation (best case scenario) instead of validate whether
file exists twice, once in user-space and again in kernel (best case
scenario).
bash: As a counter point (and also because that's most
likely how trace-cmd will be executed). Bash actually does the
open-coding execvp() approach as shown below:
# trace-cmd record -p function_graph \
-g __x64_sys_execve -O nofuncgraph-irqs \
--max-graph-depth 1 \
-F /usr/bin/bash -c "ls" > /dev/null
ls-178525 [010] 197387.772776: funcgraph_entry: ! 775.074 us | __x64_sys_execve(); # for /usr/bin/bash
ls-178525 [010] 197387.775568: funcgraph_entry: ! 993.453 us | __x64_sys_execve(); # for /usr/bin/ls
I guess this will boil down to whether you want to make trace-cmd as
'transparent' as possible (in comparison to running the command without
prepending trace-cmd) OR if you actually care about/want to abide by
the TOCTOU 'guarantees' that libc implements for userspace applications
Let me know your thoughts. I'm flexible with either approach
- Paulo Almeida
>
>
> > *-P* 'pid'::
> > Similar to *-F* but lets you specify a process ID to trace.
> >
>
next prev parent reply other threads:[~2023-01-06 2:10 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-01-05 22:52 [PATCH] trace-cmd: document expected behaviour of execvp for record command Paulo Miguel Almeida
2023-01-05 23:13 ` Steven Rostedt
2023-01-06 2:09 ` Paulo Miguel Almeida [this message]
2023-01-06 3:07 ` Steven Rostedt
2023-01-13 22:58 ` [PATCH v2] trace-cmd: open code execvp routine to avoid multiple execve syscalls Paulo Miguel Almeida
2023-01-13 23:05 ` Paulo Miguel Almeida
2023-01-14 4:58 ` Paulo Miguel Almeida
2023-01-14 5:51 ` Steven Rostedt
2023-01-14 14:43 ` Steven Rostedt
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Y7eC9o4JDO7XWjlw@mail.google.com \
--to=paulo.miguel.almeida.rodenas@gmail.com \
--cc=linux-trace-devel@vger.kernel.org \
--cc=rostedt@goodmis.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.