* kfree causing high latency on 3.4.4-rt13
@ 2013-01-29 20:59 Austin Hendrix
2013-02-04 11:49 ` Thomas Gleixner
0 siblings, 1 reply; 6+ messages in thread
From: Austin Hendrix @ 2013-01-29 20:59 UTC (permalink / raw)
To: linux-rt-users
Hi all,
I'm running 3.4.4-rt13 on my systems, and while the realtime
performance is great, I occasionally see non-realtime processes block
for several seconds. Running latencytop, it looks like the kfree
kernel process is the worst offender. Does anyone have advice on how I
can troubleshoot this further?
I've poked around the intercatz, but either my google-fu is weak, or
there aren't many other people having this problem.
Thanks,
Austin Hendrix
Robot Systems Administrator
Willow Garage
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: kfree causing high latency on 3.4.4-rt13
2013-01-29 20:59 kfree causing high latency on 3.4.4-rt13 Austin Hendrix
@ 2013-02-04 11:49 ` Thomas Gleixner
2013-02-04 21:54 ` Austin Hendrix
0 siblings, 1 reply; 6+ messages in thread
From: Thomas Gleixner @ 2013-02-04 11:49 UTC (permalink / raw)
To: Austin Hendrix; +Cc: linux-rt-users
On Tue, 29 Jan 2013, Austin Hendrix wrote:
> I'm running 3.4.4-rt13 on my systems, and while the realtime
> performance is great, I occasionally see non-realtime processes block
> for several seconds. Running latencytop, it looks like the kfree
> kernel process is the worst offender. Does anyone have advice on how I
There is no kfree process. kfree() is a function to release memory
allocated by kmalloc.
Can you provide the latencytop output please ?
Thanks,
tglx
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: kfree causing high latency on 3.4.4-rt13
2013-02-04 11:49 ` Thomas Gleixner
@ 2013-02-04 21:54 ` Austin Hendrix
2013-02-04 23:40 ` Sven-Thorsten Dietrich
0 siblings, 1 reply; 6+ messages in thread
From: Austin Hendrix @ 2013-02-04 21:54 UTC (permalink / raw)
To: Thomas Gleixner; +Cc: linux-rt-users
An example of what I'm seeing from latencytop is:
[kfree] 7937.6 msec 6.4 %
I poked around the kernel source a bit and was pretty stumped by this,
so I'm glad it's not just me.
Thanks,
-Austin
On Mon, Feb 4, 2013 at 3:49 AM, Thomas Gleixner <tglx@linutronix.de> wrote:
> On Tue, 29 Jan 2013, Austin Hendrix wrote:
>> I'm running 3.4.4-rt13 on my systems, and while the realtime
>> performance is great, I occasionally see non-realtime processes block
>> for several seconds. Running latencytop, it looks like the kfree
>> kernel process is the worst offender. Does anyone have advice on how I
>
> There is no kfree process. kfree() is a function to release memory
> allocated by kmalloc.
>
> Can you provide the latencytop output please ?
>
> Thanks,
>
> tglx
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: kfree causing high latency on 3.4.4-rt13
2013-02-04 21:54 ` Austin Hendrix
@ 2013-02-04 23:40 ` Sven-Thorsten Dietrich
2013-02-05 1:42 ` Austin Hendrix
0 siblings, 1 reply; 6+ messages in thread
From: Sven-Thorsten Dietrich @ 2013-02-04 23:40 UTC (permalink / raw)
To: Austin Hendrix; +Cc: Thomas Gleixner, linux-rt-users
On Mon, 2013-02-04 at 13:54 -0800, Austin Hendrix wrote:
> An example of what I'm seeing from latencytop is:
> [kfree] 7937.6 msec 6.4 %
you are overwhelming folks with data now... but just a few more
questions:
- e.g. what does the RT program you are running do?
- Code snippet that reproduces this issue?
- list of other processes on system.
- system specs; e.g. cpu speed, I am guessing around 8 - 12 Kilo Hz?
Thanks
Sven
>
> I poked around the kernel source a bit and was pretty stumped by this,
> so I'm glad it's not just me.
>
> Thanks,
> -Austin
>
> On Mon, Feb 4, 2013 at 3:49 AM, Thomas Gleixner <tglx@linutronix.de> wrote:
> > On Tue, 29 Jan 2013, Austin Hendrix wrote:
> >> I'm running 3.4.4-rt13 on my systems, and while the realtime
> >> performance is great, I occasionally see non-realtime processes block
> >> for several seconds. Running latencytop, it looks like the kfree
> >> kernel process is the worst offender. Does anyone have advice on how I
> >
> > There is no kfree process. kfree() is a function to release memory
> > allocated by kmalloc.
> >
> > Can you provide the latencytop output please ?
> >
> > Thanks,
> >
> > tglx
> --
> To unsubscribe from this list: send the line "unsubscribe linux-rt-users" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: kfree causing high latency on 3.4.4-rt13
2013-02-04 23:40 ` Sven-Thorsten Dietrich
@ 2013-02-05 1:42 ` Austin Hendrix
2013-02-05 10:12 ` Thomas Gleixner
0 siblings, 1 reply; 6+ messages in thread
From: Austin Hendrix @ 2013-02-05 1:42 UTC (permalink / raw)
To: Sven-Thorsten Dietrich; +Cc: Thomas Gleixner, linux-rt-users
On Mon, Feb 4, 2013 at 3:40 PM, Sven-Thorsten Dietrich
<sven@thebigcorporation.com> wrote:
> On Mon, 2013-02-04 at 13:54 -0800, Austin Hendrix wrote:
>> An example of what I'm seeing from latencytop is:
>> [kfree] 7937.6 msec 6.4 %
>
> you are overwhelming folks with data now... but just a few more
> questions:
>
> - e.g. what does the RT program you are running do?
> - Code snippet that reproduces this issue?
> - list of other processes on system.
> - system specs; e.g. cpu speed, I am guessing around 8 - 12 Kilo Hz?
>
> Thanks
>
> Sven
>
>
>>
>> I poked around the kernel source a bit and was pretty stumped by this,
>> so I'm glad it's not just me.
>>
>> Thanks,
>> -Austin
>>
>> On Mon, Feb 4, 2013 at 3:49 AM, Thomas Gleixner <tglx@linutronix.de> wrote:
>> > On Tue, 29 Jan 2013, Austin Hendrix wrote:
>> >> I'm running 3.4.4-rt13 on my systems, and while the realtime
>> >> performance is great, I occasionally see non-realtime processes block
>> >> for several seconds. Running latencytop, it looks like the kfree
>> >> kernel process is the worst offender. Does anyone have advice on how I
>> >
>> > There is no kfree process. kfree() is a function to release memory
>> > allocated by kmalloc.
>> >
>> > Can you provide the latencytop output please ?
>> >
>> > Thanks,
>> >
>> > tglx
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-rt-users" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
>
>
I wish I had more data too; that's one of the frustrating things about
the troubleshooting process. Most of my attempts to capture more data
or narrow the problem have instead caused the system to stop
exhibiting problems.
The system in question is a dual Xeon L5520 (quad-core, 2.27GHz), with
24GB of RAM. It's inside our robots:
https://willowgarage.com/pages/pr2/overview
The system load is around 2-3 under normal use.
My realtime process is an EtherCAT master, using about 30-40% of one
core. We use it for 1kHz motor control, so the realtime deadline is
pretty lax; in the 100's of us range.
The rest of the system load comes from a number of non-realtime
processes that are doing a significant amount of network I/O, along
with an NFS server.
Believe it or not, this system actually works quite well on the
3.0.6-rt17 kernel. I'm upgrading it to a newer kernel since I'm also
upgrading the base OS from Ubuntu Lucid to Precise.
I gave the 3.4.28-rt40 stable release a try today, and so far I
haven't seen the problems that I was seeing with 3.4.4-rt13. I'd still
like to know more about how the debug the problems I'm seeing on
3.4.4-rt13 so that I can do a better job of debugging if problems like
this come up in the future.
Thanks,
-Austin
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: kfree causing high latency on 3.4.4-rt13
2013-02-05 1:42 ` Austin Hendrix
@ 2013-02-05 10:12 ` Thomas Gleixner
0 siblings, 0 replies; 6+ messages in thread
From: Thomas Gleixner @ 2013-02-05 10:12 UTC (permalink / raw)
To: Austin Hendrix; +Cc: Sven-Thorsten Dietrich, linux-rt-users
On Mon, 4 Feb 2013, Austin Hendrix wrote:
> I gave the 3.4.28-rt40 stable release a try today, and so far I
> haven't seen the problems that I was seeing with 3.4.4-rt13. I'd still
> like to know more about how the debug the problems I'm seeing on
> 3.4.4-rt13 so that I can do a better job of debugging if problems like
> this come up in the future.
One possibility would be to use the kernel tracer and modify
latencytop in the kernel to issue a tracing_off() call when a certain
latency treshold is reached. That way you freeze the kernel trace on
the spot where the latency is detected and you can analyze what
happened right before that point in the system.
Thanks,
tglx
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2013-02-05 10:12 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-01-29 20:59 kfree causing high latency on 3.4.4-rt13 Austin Hendrix
2013-02-04 11:49 ` Thomas Gleixner
2013-02-04 21:54 ` Austin Hendrix
2013-02-04 23:40 ` Sven-Thorsten Dietrich
2013-02-05 1:42 ` Austin Hendrix
2013-02-05 10:12 ` Thomas Gleixner
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox