From: Wolfgang Grandegger <wolfgang.grandegger@domain.hid>
To: Fillod Stephane <stephane.fillod@domain.hid>
Cc: rtai@domain.hid, adeos-main@gna.org
Subject: [Adeos-main] Re: Interrupt Latency Question
Date: Sun, 17 Apr 2005 11:32:16 +0200 [thread overview]
Message-ID: <42622D20.3070609@domain.hid> (raw)
In-Reply-To: <1CFEB358338412458B21FAA0D78FE86D4F0D3F@rennsmail02.eu.thmulti.com>
Hello,
I just run flushy on a low-end PowerPC system:
XPC855xxZPnnD4 at 80 MHz: 4 kB I-Cache 4 kB D-Cache FEC present
Unfortunately the results are not that clear than expected. I see the
latency going up a bit but other activities do increase it as well
(telnet, ping -f). Furthermore, from run to run the latency results are
different. Well, I think it's a complex and arch-dependent interplay of
various parameters, e.g. on the system above, the caches are quite small
and therefore the influence of cache refills is low. When I have more
time I might repeat the tests on other PowerPC archs as well.
There are a few things you can do to reduce the influence of TLB misses,
e.g. pinning TLB entries and there are corresponding kernel option on
some PowerPC archs. With a small patch you can then also load kernel
modules into kmalloc instead of vmalloc space to profit from the
pinning. Unfortunately the latency improvement depends on your
application and PowerPC arch and requires tedious tuning, which is not
appropriate in general.
Apart from that, you can do little to reduce the latency degradation due
to cache refills and TLB misses (at least not in a portable way). Linux
simply requires it.
Wolfgang.
On 04/14/2005 04:55 PM Fillod Stephane wrote:
> Wolfgang Grandegger wrote:
>>It's also my experience, that the large latencies are
>>due to TLB misses and cache refills, especially the
>>latter one. What helps is L2 cache or fast memory.
>>For example, on an MPC 5200 I get significately better
>>latencies with DDR-RAM than with SDRAM (which is ca.
>>20% slower).
>
> I keep on hearing people are having feeling that their latency
> can be caused by TLB misses/cache refills, but never seen proof.
> Is there some literature about that subject? Nobody in the RTAI
> community had curiosity to explain and fix this interesting problem?
>
> If not, what about showing (or not) that the large latencies are due
> to TLB misses/cache refills with a tool like Flushy?
>
> Using Flushy would be like using low-end hardware. It's far easier to
> make
> performance improvements on low-end hardware than high-end. It works as
> a
> magnifying glass. It reminds me a comment on Gnome mailing list, where
> an
> end-user wished that developers had high-end compile machine, but slow
> hardware to test with.
>
>>>Have a look at http://rtai.dk/cgi-bin/gratiswiki.pl?Latency_Killer
>>>To get real bad cases, try the Flushy module.
>>>You can try also to disable caches for better predictability, but it
> really
>>>hurts :*)
>>
>>I will try it on an embedded PowerPC platform a.s.a.p.
>
> After thought, there would be a better design for Flushy. Instead of
> an infinite loop in a separate module(process), we should instead call
> the TLB flush/cache invalidate right before entering the RT world
> from ADEOS. Therefore, we should get "predictable" worst case latencies
> wrt
> TLB/cache conditions.
>
> Where is the best place in ADEOS to do that?
> The earlier, the better. Tapping at the exception level would be the
> best, right before saving registers, but we need couple registers to
> call the
> TLB/cache flush.
> Any idea?
>
> I've Cc:'d the adeos-main list to reach some more gurus.
>
>>>Note: if it turns out this latency is due to cache misses, then
> solutions
>>>exist.
>>
>>Can you be more precise here.
>
> With reproducible latencies, we can then use OProfile (where available)
> to
> spot slow areas. We have to sort out whether TLB misses, I-cache misses
> or
> D-cache misses is the bigger culprit. Make your guess :-)
> Modern processors have cache control instructions, like prefetch for
> read,
> zero cache line, writeback flush, etc. With nice cpp macros, we can use
> them (where available) ahead of time in the previously spotted places,
> to render the memory access latency predictable.
>
> Do you think that will do it? Anybody has experience to share?
>
>
> Thanks
next prev parent reply other threads:[~2005-04-17 9:32 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2005-04-14 14:55 [Adeos-main] RE: Interrupt Latency Question Fillod Stephane
2005-04-14 15:47 ` Philippe Gerum
2005-04-14 20:20 ` Michael Neuhauser
2005-04-14 17:15 ` [Adeos-main] " Paolo Mantegazza
2005-04-19 18:35 ` Max Krasnyansky
2005-04-20 8:06 ` Paolo Mantegazza
2005-04-20 18:10 ` Max Krasnyansky
2005-04-17 9:32 ` Wolfgang Grandegger [this message]
2005-05-03 7:12 ` Der Herr Hofrat
2005-05-03 16:11 ` Wolfgang Grandegger
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=42622D20.3070609@domain.hid \
--to=wolfgang.grandegger@domain.hid \
--cc=adeos-main@gna.org \
--cc=rtai@domain.hid \
--cc=stephane.fillod@domain.hid \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.