linux-perf-users.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* perf sched timehist --state vs perf script
@ 2024-10-24 16:36 vmolnaro
  2024-10-30  0:34 ` Namhyung Kim
  0 siblings, 1 reply; 4+ messages in thread
From: vmolnaro @ 2024-10-24 16:36 UTC (permalink / raw)
  To: linux-perf-users
  Cc: mpetlan, kan.liang, irogers, acme, ak, alexander.shishkin,
	namhyung

From: Veronika Molnarova <vmolnaro@redhat.com>

Hello!

We encountered an issue with 'perf sched timehist --state' when
we expected for a test case that the process of a workload would 
eventually end in the dead state(marked by 'X') and be switched
out. This can be seen also by using 'perf script':

    # perf sched record -a sleep 1
    [ perf record: Woken up 1 times to write data ]
    [ perf record: Captured and wrote 0.167 MB perf.data (166 samples) ]
    # perf script | grep sleep
           sleep  151541 [000] 193461.410617: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=3132275 [ns]
           sleep  151541 [000] 193461.410624:       sched:sched_switch: sleep:151541 [120] S ==> swapper/0:0 [120]
         swapper       0 [000] 193462.410688:       sched:sched_waking: comm=sleep pid=151541 prio=120 target_cpu=000
         swapper       0 [000] 193462.410698:       sched:sched_switch: swapper/0:0 [120] R ==> sleep:151541 [120]
         :151541  151541 [000] 193462.411100: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=407210 [ns]
         :151541  151541 [000] 193462.411107:       sched:sched_switch: sleep:151541 [120] X ==> swapper/0:0 [120]

But for the 'perf sched timehist --state' the process is shown
in the zombie state when it is being switched out:

    # perf sched timehist --state | grep 151541
    Samples do not have callchains.
      193461.410624 [0000]  sleep[151541]                       0.000      0.054      3.129      S 
      193462.411107 [0000]  :151541[151541]                     0.000      0.000      0.408      Z 

Shouldn't both commands report the same result, or is there a reason
why 'perf script' uses the dead state while 'perf sched' uses dead?
It also seems that the 'perf sched' cannot read the name of the zombie
process, but that can be a different issue.

Thanks,

Veronika


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: perf sched timehist --state vs perf script
  2024-10-24 16:36 perf sched timehist --state vs perf script vmolnaro
@ 2024-10-30  0:34 ` Namhyung Kim
  2024-10-31 11:32   ` Michael Petlan
  0 siblings, 1 reply; 4+ messages in thread
From: Namhyung Kim @ 2024-10-30  0:34 UTC (permalink / raw)
  To: vmolnaro
  Cc: linux-perf-users, mpetlan, kan.liang, irogers, acme, ak,
	alexander.shishkin

On Thu, Oct 24, 2024 at 06:36:30PM +0200, vmolnaro@redhat.com wrote:
> From: Veronika Molnarova <vmolnaro@redhat.com>
> 
> Hello!
> 
> We encountered an issue with 'perf sched timehist --state' when
> we expected for a test case that the process of a workload would 
> eventually end in the dead state(marked by 'X') and be switched
> out. This can be seen also by using 'perf script':
> 
>     # perf sched record -a sleep 1
>     [ perf record: Woken up 1 times to write data ]
>     [ perf record: Captured and wrote 0.167 MB perf.data (166 samples) ]
>     # perf script | grep sleep
>            sleep  151541 [000] 193461.410617: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=3132275 [ns]
>            sleep  151541 [000] 193461.410624:       sched:sched_switch: sleep:151541 [120] S ==> swapper/0:0 [120]
>          swapper       0 [000] 193462.410688:       sched:sched_waking: comm=sleep pid=151541 prio=120 target_cpu=000
>          swapper       0 [000] 193462.410698:       sched:sched_switch: swapper/0:0 [120] R ==> sleep:151541 [120]
>          :151541  151541 [000] 193462.411100: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=407210 [ns]
>          :151541  151541 [000] 193462.411107:       sched:sched_switch: sleep:151541 [120] X ==> swapper/0:0 [120]
> 
> But for the 'perf sched timehist --state' the process is shown
> in the zombie state when it is being switched out:
> 
>     # perf sched timehist --state | grep 151541
>     Samples do not have callchains.
>       193461.410624 [0000]  sleep[151541]                       0.000      0.054      3.129      S 
>       193462.411107 [0000]  :151541[151541]                     0.000      0.000      0.408      Z 
> 
> Shouldn't both commands report the same result, or is there a reason
> why 'perf script' uses the dead state while 'perf sched' uses dead?

I remember there was a discussion related to this.

https://lore.kernel.org/all/20240122070859.1394479-2-zegao@tencent.com/

And it seems it's fixed both in perf and libtraceevent.  Maybe your
libtraceevent is somewhat outdated and doesn't have the update.

> It also seems that the 'perf sched' cannot read the name of the zombie
> process, but that can be a different issue.

Hmm.. right.  It should update thread->comm using the sched_switch event.

Thanks,
Namhyung


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: perf sched timehist --state vs perf script
  2024-10-30  0:34 ` Namhyung Kim
@ 2024-10-31 11:32   ` Michael Petlan
  2024-11-02  6:20     ` Namhyung Kim
  0 siblings, 1 reply; 4+ messages in thread
From: Michael Petlan @ 2024-10-31 11:32 UTC (permalink / raw)
  To: Namhyung Kim
  Cc: vmolnaro, linux-perf-users, mpetlan, kan.liang, irogers, acme, ak,
	alexander.shishkin

On Tue, 29 Oct 2024, Namhyung Kim wrote:
> On Thu, Oct 24, 2024 at 06:36:30PM +0200, vmolnaro@redhat.com wrote:
> > From: Veronika Molnarova <vmolnaro@redhat.com>
> > 
> > Hello!
> > 
> > We encountered an issue with 'perf sched timehist --state' when
> > we expected for a test case that the process of a workload would 
> > eventually end in the dead state(marked by 'X') and be switched
> > out. This can be seen also by using 'perf script':
> > 
> >     # perf sched record -a sleep 1
> >     [ perf record: Woken up 1 times to write data ]
> >     [ perf record: Captured and wrote 0.167 MB perf.data (166 samples) ]
> >     # perf script | grep sleep
> >            sleep  151541 [000] 193461.410617: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=3132275 [ns]
> >            sleep  151541 [000] 193461.410624:       sched:sched_switch: sleep:151541 [120] S ==> swapper/0:0 [120]
> >          swapper       0 [000] 193462.410688:       sched:sched_waking: comm=sleep pid=151541 prio=120 target_cpu=000
> >          swapper       0 [000] 193462.410698:       sched:sched_switch: swapper/0:0 [120] R ==> sleep:151541 [120]
> >          :151541  151541 [000] 193462.411100: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=407210 [ns]
> >          :151541  151541 [000] 193462.411107:       sched:sched_switch: sleep:151541 [120] X ==> swapper/0:0 [120]
> > 
> > But for the 'perf sched timehist --state' the process is shown
> > in the zombie state when it is being switched out:
> > 
> >     # perf sched timehist --state | grep 151541
> >     Samples do not have callchains.
> >       193461.410624 [0000]  sleep[151541]                       0.000      0.054      3.129      S 
> >       193462.411107 [0000]  :151541[151541]                     0.000      0.000      0.408      Z 
> > 
> > Shouldn't both commands report the same result, or is there a reason
> > why 'perf script' uses the dead state while 'perf sched' uses dead?

Hello! Thanks for reacting!
> 
> I remember there was a discussion related to this.
> 
> https://lore.kernel.org/all/20240122070859.1394479-2-zegao@tencent.com/
> 
> And it seems it's fixed both in perf and libtraceevent.  Maybe your
> libtraceevent is somewhat outdated and doesn't have the update.
>
Yes! When we've updated libtraceevent to 1.8.3, both show Z now, which is
probably correct.

Then it comes into my mind whether shouldn't perf-sched and perf-script
both take it from libtraceevent to unify the approaches, rather than
perf-sched using its own implementation and perf-script taking it from
libtraceevent... ?

> > It also seems that the 'perf sched' cannot read the name of the zombie
> > process, but that can be a different issue.
> 
> Hmm.. right.  It should update thread->comm using the sched_switch event.

According to the tests, the name seems that it used to be translated correctly
in past, so it'd be nice if it worked again, althoug it is pretty clear which
process it was, according to the PID. Thus, the test can be adjusted to search
for the combination "PID, status Z", instead of "command, status Z"... I.e. to
accept the current situation...

Thanks,

Michael

> 
> Thanks,
> Namhyung
> 
> 


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: perf sched timehist --state vs perf script
  2024-10-31 11:32   ` Michael Petlan
@ 2024-11-02  6:20     ` Namhyung Kim
  0 siblings, 0 replies; 4+ messages in thread
From: Namhyung Kim @ 2024-11-02  6:20 UTC (permalink / raw)
  To: Michael Petlan
  Cc: vmolnaro, linux-perf-users, kan.liang, irogers, acme, ak,
	alexander.shishkin

On Thu, Oct 31, 2024 at 12:32:31PM +0100, Michael Petlan wrote:
> On Tue, 29 Oct 2024, Namhyung Kim wrote:
> > On Thu, Oct 24, 2024 at 06:36:30PM +0200, vmolnaro@redhat.com wrote:
> > > From: Veronika Molnarova <vmolnaro@redhat.com>
> > > 
> > > Hello!
> > > 
> > > We encountered an issue with 'perf sched timehist --state' when
> > > we expected for a test case that the process of a workload would 
> > > eventually end in the dead state(marked by 'X') and be switched
> > > out. This can be seen also by using 'perf script':
> > > 
> > >     # perf sched record -a sleep 1
> > >     [ perf record: Woken up 1 times to write data ]
> > >     [ perf record: Captured and wrote 0.167 MB perf.data (166 samples) ]
> > >     # perf script | grep sleep
> > >            sleep  151541 [000] 193461.410617: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=3132275 [ns]
> > >            sleep  151541 [000] 193461.410624:       sched:sched_switch: sleep:151541 [120] S ==> swapper/0:0 [120]
> > >          swapper       0 [000] 193462.410688:       sched:sched_waking: comm=sleep pid=151541 prio=120 target_cpu=000
> > >          swapper       0 [000] 193462.410698:       sched:sched_switch: swapper/0:0 [120] R ==> sleep:151541 [120]
> > >          :151541  151541 [000] 193462.411100: sched:sched_stat_runtime: comm=sleep pid=151541 runtime=407210 [ns]
> > >          :151541  151541 [000] 193462.411107:       sched:sched_switch: sleep:151541 [120] X ==> swapper/0:0 [120]
> > > 
> > > But for the 'perf sched timehist --state' the process is shown
> > > in the zombie state when it is being switched out:
> > > 
> > >     # perf sched timehist --state | grep 151541
> > >     Samples do not have callchains.
> > >       193461.410624 [0000]  sleep[151541]                       0.000      0.054      3.129      S 
> > >       193462.411107 [0000]  :151541[151541]                     0.000      0.000      0.408      Z 
> > > 
> > > Shouldn't both commands report the same result, or is there a reason
> > > why 'perf script' uses the dead state while 'perf sched' uses dead?
> 
> Hello! Thanks for reacting!
> > 
> > I remember there was a discussion related to this.
> > 
> > https://lore.kernel.org/all/20240122070859.1394479-2-zegao@tencent.com/
> > 
> > And it seems it's fixed both in perf and libtraceevent.  Maybe your
> > libtraceevent is somewhat outdated and doesn't have the update.
> >
> Yes! When we've updated libtraceevent to 1.8.3, both show Z now, which is
> probably correct.

Cool.

> 
> Then it comes into my mind whether shouldn't perf-sched and perf-script
> both take it from libtraceevent to unify the approaches, rather than
> perf-sched using its own implementation and perf-script taking it from
> libtraceevent... ?

Yeah, I think it's better to use libtraceevent for it.

> 
> > > It also seems that the 'perf sched' cannot read the name of the zombie
> > > process, but that can be a different issue.
> > 
> > Hmm.. right.  It should update thread->comm using the sched_switch event.
> 
> According to the tests, the name seems that it used to be translated correctly
> in past, so it'd be nice if it worked again, althoug it is pretty clear which
> process it was, according to the PID. Thus, the test can be adjusted to search
> for the combination "PID, status Z", instead of "command, status Z"... I.e. to
> accept the current situation...

Sounds good.  Please send the patch.

Thanks,
Namhyung


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2024-11-02  6:20 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-10-24 16:36 perf sched timehist --state vs perf script vmolnaro
2024-10-30  0:34 ` Namhyung Kim
2024-10-31 11:32   ` Michael Petlan
2024-11-02  6:20     ` Namhyung Kim

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).