linux-perf-users.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Question to perf annotate handling mov ...(%rip) instructions
@ 2017-11-28 10:42 Thomas-Mich Richter
  2017-11-28 14:50 ` Arnaldo Carvalho de Melo
  0 siblings, 1 reply; 5+ messages in thread
From: Thomas-Mich Richter @ 2017-11-28 10:42 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo, linux-perf-use.

I am confused by perf annotate internals.

Perf annotate examines a perf.data file and shows disassembler output.
However the output differs depending on the output option specified:

‑‑stdio: Output to stdout, also selected implicitly when output piped to another
process or redirected to a file. The function call sequence is 

symbol__tty_annotate() –> symbol__annotate_printf() –> disasm_line__print().

This output style does not annotate the branch instructions nor does it use special 
printing functions in the util/annotate.c, for example mov__scnprintf().

‑‑tui: Default. there are annotations to augment branches, jumps, fct returns 
with arrows for interactive usage. The function call stack starts with 
symbol__tui_annotate(). 
There is also special treatment for the Intel mov instructions of the form:

00000000000060b0 <_init@@Base>: 
.... 
60b4:       48 8b 05 35 cd 22 00    mov 0x22cd35(%rip),%rax   # 232df0 <__gmon_start__>     

Commit 6de783b6f50f7f1db18a3fda0aa34b2e84b5771d ("perf annotate: Resolve symbols 
using objdump comment") added this support.

Special code for Intel platform handles the mov  at address 60b4: 
This is dynamic linkage against the PLT. Function mov__parse() is always called 
to parse the objdump comment following the '#' character. 
However the function mov__scnprintf() to replace the text '0x22cd35(%rip)' by the 
target function name __gmon_start__ is only called in tui mode and not in stdio mode.

Now to the confusion:
Function mov__parse() calls comment__symbol() which contains:

   static int comment__symbol(char *raw, char *comment, u64 *addrp, char **namep)
   {
        char *endptr, *name, *t;

        if (strstr(raw, "(%rip)") == NULL)
                return 0;

This is architecture specific and does not work for non-Intel platforms.

I would like to fix perf annotate for s390x and above move instruction on s390x
is
     
    655a:       c0 10 00 01 9c eb       larl    %r1,39f30 <__gmon_start__>

There is a need to handle PLT resolution in an architecture independent way.

Ideas and suggestions?

-- 
Thomas Richter, Dept 3303, IBM LTC Boeblingen Germany
--
Vorsitzende des Aufsichtsrats: Martina Koederitz 
Geschäftsführung: Dirk Wittkopp
Sitz der Gesellschaft: Böblingen / Registergericht: Amtsgericht Stuttgart, HRB 243294

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Question to perf annotate handling mov ...(%rip) instructions
  2017-11-28 10:42 Question to perf annotate handling mov ...(%rip) instructions Thomas-Mich Richter
@ 2017-11-28 14:50 ` Arnaldo Carvalho de Melo
  2017-11-29  8:29   ` Jiri Olsa
  2017-11-29  9:14   ` Thomas-Mich Richter
  0 siblings, 2 replies; 5+ messages in thread
From: Arnaldo Carvalho de Melo @ 2017-11-28 14:50 UTC (permalink / raw)
  To: Thomas-Mich Richter; +Cc: Jiri Olsa, Linux Kernel Mailing List, linux-perf-use.

Em Tue, Nov 28, 2017 at 11:42:16AM +0100, Thomas-Mich Richter escreveu:
> I am confused by perf annotate internals.
> 
> Perf annotate examines a perf.data file and shows disassembler output.
> However the output differs depending on the output option specified:

Well, we started with the --stdio code, and then, for the TUI, went on
improving to be able to navigate, etc.

The --stdio code was then left for us to compare outputs and see if some
regression was being added.

I think that the right thing is to have the --stdio use what is in
--tui, modulo the interactive bits.

The --tui has knobs to disable its beautifications, see H in the tui
annotation browser to see the toggle hotkeys.
 
> ‑‑stdio: Output to stdout, also selected implicitly when output piped to another
> process or redirected to a file. The function call sequence is 
> 
> symbol__tty_annotate() –> symbol__annotate_printf() –> disasm_line__print().
> 
> This output style does not annotate the branch instructions nor does it use special 
> printing functions in the util/annotate.c, for example mov__scnprintf().
> 
> ‑‑tui: Default. there are annotations to augment branches, jumps, fct returns 
> with arrows for interactive usage. The function call stack starts with 
> symbol__tui_annotate(). 
> There is also special treatment for the Intel mov instructions of the form:

The cases where intel has special treatment are bugs, should be moved to
arch specific callbacks.
 
> 00000000000060b0 <_init@@Base>: 
> .... 
> 60b4:       48 8b 05 35 cd 22 00    mov 0x22cd35(%rip),%rax   # 232df0 <__gmon_start__>     
> 
> Commit 6de783b6f50f7f1db18a3fda0aa34b2e84b5771d ("perf annotate: Resolve symbols 
> using objdump comment") added this support.
> 
> Special code for Intel platform handles the mov  at address 60b4: 
> This is dynamic linkage against the PLT. Function mov__parse() is always called 
> to parse the objdump comment following the '#' character. 
> However the function mov__scnprintf() to replace the text '0x22cd35(%rip)' by the 
> target function name __gmon_start__ is only called in tui mode and not in stdio mode.
> 
> Now to the confusion:
> Function mov__parse() calls comment__symbol() which contains:
> 
>    static int comment__symbol(char *raw, char *comment, u64 *addrp, char **namep)
>    {
>         char *endptr, *name, *t;
> 
>         if (strstr(raw, "(%rip)") == NULL)
>                 return 0;
> 
> This is architecture specific and does not work for non-Intel platforms.
> 
> I would like to fix perf annotate for s390x and above move instruction on s390x
> is
>      
>     655a:       c0 10 00 01 9c eb       larl    %r1,39f30 <__gmon_start__>
> 
> There is a need to handle PLT resolution in an architecture independent way.
> 
> Ideas and suggestions?

Some historical background there, busy now, but you seem to be on the
right track and IIRC you already sent a patch for this, right? I'll try
to look at it.

Jiri may as well, since he worked a lot recently in this codebase, to
refactor it some more to make it useful for annotating python code, perl
next, other scripted languages should follow too.

- Arnaldo

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Question to perf annotate handling mov ...(%rip) instructions
  2017-11-28 14:50 ` Arnaldo Carvalho de Melo
@ 2017-11-29  8:29   ` Jiri Olsa
  2017-11-29  9:26     ` Thomas-Mich Richter
  2017-11-29  9:14   ` Thomas-Mich Richter
  1 sibling, 1 reply; 5+ messages in thread
From: Jiri Olsa @ 2017-11-29  8:29 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: Thomas-Mich Richter, Jiri Olsa, Linux Kernel Mailing List,
	linux-perf-use.

On Tue, Nov 28, 2017 at 11:50:30AM -0300, Arnaldo Carvalho de Melo wrote:

SNIP

> > Function mov__parse() calls comment__symbol() which contains:
> > 
> >    static int comment__symbol(char *raw, char *comment, u64 *addrp, char **namep)
> >    {
> >         char *endptr, *name, *t;
> > 
> >         if (strstr(raw, "(%rip)") == NULL)
> >                 return 0;
> > 
> > This is architecture specific and does not work for non-Intel platforms.
> > 
> > I would like to fix perf annotate for s390x and above move instruction on s390x
> > is
> >      
> >     655a:       c0 10 00 01 9c eb       larl    %r1,39f30 <__gmon_start__>
> > 
> > There is a need to handle PLT resolution in an architecture independent way.
> > 
> > Ideas and suggestions?
> 
> Some historical background there, busy now, but you seem to be on the
> right track and IIRC you already sent a patch for this, right? I'll try
> to look at it.
> 
> Jiri may as well, since he worked a lot recently in this codebase, to
> refactor it some more to make it useful for annotating python code, perl
> next, other scripted languages should follow too.

so we try to parse each line out of objdump using arch specific
ops callbacks, check:

disasm_line__new
  disasm_line__init_ins
    dl->ins.ops->parse(...

the ops stuff are defined in:

static struct arch architectures[] = { 

        {
                .name = "s390",
                .init = s390__annotate_init,
                .objdump =  {
                        .comment_char = '#',
                },
        },

I'd check s390__annotate_init, there's some mechanism
of associating ops for arch and try to fit in ;-)

jirka

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Question to perf annotate handling mov ...(%rip) instructions
  2017-11-28 14:50 ` Arnaldo Carvalho de Melo
  2017-11-29  8:29   ` Jiri Olsa
@ 2017-11-29  9:14   ` Thomas-Mich Richter
  1 sibling, 0 replies; 5+ messages in thread
From: Thomas-Mich Richter @ 2017-11-29  9:14 UTC (permalink / raw)
  To: Arnaldo Carvalho de Melo
  Cc: Jiri Olsa, Linux Kernel Mailing List, linux-perf-use.

On 11/28/2017 03:50 PM, Arnaldo Carvalho de Melo wrote:

.....

>> .... 
>> 60b4:       48 8b 05 35 cd 22 00    mov 0x22cd35(%rip),%rax   # 232df0 <__gmon_start__>     
>>
>> Commit 6de783b6f50f7f1db18a3fda0aa34b2e84b5771d ("perf annotate: Resolve symbols 
>> using objdump comment") added this support.
>>
>> Special code for Intel platform handles the mov  at address 60b4: 
>> This is dynamic linkage against the PLT. Function mov__parse() is always called 
>> to parse the objdump comment following the '#' character. 
>> However the function mov__scnprintf() to replace the text '0x22cd35(%rip)' by the 
>> target function name __gmon_start__ is only called in tui mode and not in stdio mode.
>>
>> Now to the confusion:
>> Function mov__parse() calls comment__symbol() which contains:
>>
>>    static int comment__symbol(char *raw, char *comment, u64 *addrp, char **namep)
>>    {
>>         char *endptr, *name, *t;
>>
>>         if (strstr(raw, "(%rip)") == NULL)
>>                 return 0;
>>
>> This is architecture specific and does not work for non-Intel platforms.
>>
>> I would like to fix perf annotate for s390x and above move instruction on s390x
>> is
>>      
>>     655a:       c0 10 00 01 9c eb       larl    %r1,39f30 <__gmon_start__>
>>
>> There is a need to handle PLT resolution in an architecture independent way.
>>
>> Ideas and suggestions?
> 
> Some historical background there, busy now, but you seem to be on the
> right track and IIRC you already sent a patch for this, right? I'll try
> to look at it.

The two patches I sent are unrelated to this issue.

> Jiri may as well, since he worked a lot recently in this codebase, to
> refactor it some more to make it useful for annotating python code, perl
> next, other scripted languages should follow too.

-- 
Thomas Richter, Dept 3303, IBM LTC Boeblingen Germany
--
Vorsitzende des Aufsichtsrats: Martina Koederitz 
Geschäftsführung: Dirk Wittkopp
Sitz der Gesellschaft: Böblingen / Registergericht: Amtsgericht Stuttgart, HRB 243294

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: Question to perf annotate handling mov ...(%rip) instructions
  2017-11-29  8:29   ` Jiri Olsa
@ 2017-11-29  9:26     ` Thomas-Mich Richter
  0 siblings, 0 replies; 5+ messages in thread
From: Thomas-Mich Richter @ 2017-11-29  9:26 UTC (permalink / raw)
  To: Jiri Olsa, Arnaldo Carvalho de Melo; +Cc: Hendrik Brueckner, linux-perf-use.

On 11/29/2017 09:29 AM, Jiri Olsa wrote:
> On Tue, Nov 28, 2017 at 11:50:30AM -0300, Arnaldo Carvalho de Melo wrote:
> 
> SNIP
> 
>>> Function mov__parse() calls comment__symbol() which contains:
>>>
>>>    static int comment__symbol(char *raw, char *comment, u64 *addrp, char **namep)
>>>    {
>>>         char *endptr, *name, *t;
>>>
>>>         if (strstr(raw, "(%rip)") == NULL)
>>>                 return 0;
>>>
>>> This is architecture specific and does not work for non-Intel platforms.
>>>
>>> I would like to fix perf annotate for s390x and above move instruction on s390x
>>> is
>>>      
>>>     655a:       c0 10 00 01 9c eb       larl    %r1,39f30 <__gmon_start__>
>>>
>>> There is a need to handle PLT resolution in an architecture independent way.
>>>
>>> Ideas and suggestions?
>>
>> Some historical background there, busy now, but you seem to be on the
>> right track and IIRC you already sent a patch for this, right? I'll try
>> to look at it.
>>
>> Jiri may as well, since he worked a lot recently in this codebase, to
>> refactor it some more to make it useful for annotating python code, perl
>> next, other scripted languages should follow too.
> 
> so we try to parse each line out of objdump using arch specific
> ops callbacks, check:
> 
> disasm_line__new
>   disasm_line__init_ins
>     dl->ins.ops->parse(...
> 
> the ops stuff are defined in:
> 
> static struct arch architectures[] = { 
> 
>         {
>                 .name = "s390",
>                 .init = s390__annotate_init,
>                 .objdump =  {
>                         .comment_char = '#',
>                 },
>         },
> 
> I'd check s390__annotate_init, there's some mechanism
> of associating ops for arch and try to fit in ;-)
> 
> jirka
> 

Ahhhh, I get it now. The code in util/annotate.c (for example mov__parse())
is in a common code file, but these functions are called via architecture 
depended triggers.

I'll go and try something for s390x to handle PLT resolution.

-- 
Thomas Richter, Dept 3303, IBM LTC Boeblingen Germany
--
Vorsitzende des Aufsichtsrats: Martina Koederitz 
Geschäftsführung: Dirk Wittkopp
Sitz der Gesellschaft: Böblingen / Registergericht: Amtsgericht Stuttgart, HRB 243294

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2017-11-29  9:27 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-11-28 10:42 Question to perf annotate handling mov ...(%rip) instructions Thomas-Mich Richter
2017-11-28 14:50 ` Arnaldo Carvalho de Melo
2017-11-29  8:29   ` Jiri Olsa
2017-11-29  9:26     ` Thomas-Mich Richter
2017-11-29  9:14   ` Thomas-Mich Richter

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).