From: Jan Kiszka <jan.kiszka@domain.hid>
To: "\"Schlägl \\\"Manfred jun.\\\"\"" <manfred.schlaegl@domain.hid>
Cc: xenomai@xenomai.org
Subject: Re: [Xenomai-help] Adeos/Xenomai Arm Port
Date: Mon, 16 Oct 2006 18:24:17 +0200 [thread overview]
Message-ID: <4533B231.40001@domain.hid> (raw)
In-Reply-To: <1161013724.5503.65.camel@domain.hid>
[-- Attachment #1: Type: text/plain, Size: 7943 bytes --]
Schlägl Manfred jun. wrote:
> Hi!
>
> I've tried to implement Adeos 1.5 and Xenomai 2.2.3 on a
> Netsilicon-Platform (ns9750-evalboard with Arm926EJ).
>
> The Timer is implemented (with a manual __ipipe_trigger_irq if
> reload<3). The Interrupt-Demuxer is deactivated.
>
> The System boots with Adeos & Xenomai. Time is running normally. Things
Cool. Contributing patches is always welcome. ;)
> look generally very good, but there are some Problems:
>
>
> 1. Latencies are very high.
>
> dd if=/dev/zero of=/dev/null &
> ./latency -p 10000 -T 20
>
> RTS| 92.592| 95.486| 125.506| 0|
> 00:00:20/00:00:20
>
> I know, it's a common problem with Xenomai/ARM. Is it true, that it has
> something to do with a slow processor-cache-flush? Does anybody know
> more about that?
>
AFAIK, this is a generic ARM problem. You have to flush caches on MMU
context switch, i.e. also when waking up this real-time user-space
benchmark task. That's due to the fact that ARM is caching virtual
address, so that the cache content become invalid on context switch.
>
> 2. I've to use very high Periods on Latency-Tests.
>
> With load i've to use > 500 us
> Without load i've to use > 1000 us (!?!)
>
> If I run ./latency -p 1000 without any load I get latencies between 40
> and 70 us, but the Process is killed after a few seconds.
> - sometimes simply the message "Killed" is printed.
> In this case the system is running, but if i try to start another
> latency-Process the system crashes (most of time: "Unable to handle
> kernel paging request at virtual address xxxxxxxx".)
> - sometimes the system simply stands still (linux-timerisr /
> linux-timer_tick is'nt called anymore)
> jtag-debugger gdb-output:
>
> Program received signal SIGSTOP, Stopped (signal).
> 0x0000ba40 in ?? ()
> (gdb) stepi
> 0x0000ba40 in ?? ()
> Infinite loop detected
>
> The same happens if I run ./latency -p 500 with load.
>
Sounds like some gremlin is still sleeping in the code. You will have to
trace down the oddities and understand what is leading to the oops or
lock-up.
>
>
> 3. in-kernel periodic task & in-kernel timer benchmark
>
> -sh-3.00# ./latency -t 2 -T 10
> == Sampling period: 100 us
> == Test mode: in-kernel timer handler
> == All results in microseconds
> latency: failed to start in-kernel timer benchmark, code -25
> ---|------------|------------|------------|--------|-------------------------
> RTS|-1093001.752| 0.001| 93.252| 93340|
> 00:00:10/00:00:10
>
> -sh-3.00# ./latency -t 1 -T 10
> == Sampling period: 100 us
> == Test mode: in-kernel periodic task
> == All results in microseconds
> latency: failed to start in-kernel timer benchmark, code -25
> ---|------------|------------|------------|--------|-------------------------
> RTS|-1097339.416| 0.001| 93.252| 93340|
> 00:00:10/00:00:10
>
> What does "latency: failed to start in-kernel timer benchmark, code -25"
> mean? Are the results valid?
Nope, they aren't. Did you compile all the testing devices into the
kernel or did you load them all at once? I guess the latency test picks
the wrong device here. Try calling it with "-D1" as well. You can see
what devices are loaded under /proc/xenomai/rtdm/.
>
>
> 4. switchtest & switchbench
>
> -sh-3.00# ./switchtest
> == Testing FPU check routines...
> == FPU check routines: unimplemented, skipping FPU switches tests.
> == Threads: sleeper-0 rtk-1 rtk-2 rtup-3 rtup-4 rtus-5 rtus-6 rtuo-7
> rtuo-8
> RTT| 00:00:01
> RTH|ctx switches|-------total
> RTD| 450| 450
> RTD| 450| 900
> RTD| 459| 1359
> RTD| 450| 1809
> RTD| 459| 2268
> RTD| 459| 2727
> RTD| 450| 3177
> switchtest seems to run normally, but what does the output mean?
As long as you see increasing numbers on the right and no oopses,
everything should be ok. (These numbers indicate the ammount of
successful context switches per cycle.)
>
>
> The problems with switchbench are similar to the Problems discussed in
> #2.
> -sh-3.00# ./switchbench -n 100 -p 10000
> == Sampling period: 10000 us
> == Do not interrupt this program
> RTH| lat min| lat avg| lat max| lost
> RTD| 86443| 90060| 101634| 0
> -sh-3.00# ./switchbench -n 100 -p 1000
> == Sampling period: 1000 us
> == Do not interrupt this program
> RTH| lat min| lat avg| lat max| lost
> RTD| 77039| 81741| 98741| 0
> -sh-3.00# ./switchbench -n 100 -p 100
> == Sampling period: 100 us
> == Do not interrupt this program
> [ 122.670000] Unable to handle kernel paging request at virtual address
> e2822059
> [ 122.670000] pgd = c3b60000
> [ 122.670000] [e2822059] *pgd=00000000
> [ 122.670000] Internal error: Oops: 805 [#1]
> [ 122.670000] Modules linked in:
> [ 122.670000] CPU: 0
> [ 122.670000] pc : [<c0150930>] lr : [<c015080c>] Not tainted
> [ 122.670000] sp : c39d0014 ip : c3b54044 fp : c39d0040
> [ 122.670000] r10: c02d43e4 r9 : 00800000 r8 : c0322b2c
> [ 122.670000] r7 : c032212c r6 : 00000000 r5 : c01194e0 r4 :
> 00000000
> [ 122.670000] r3 : 00000000 r2 : e282200c r1 : c3b54000 r0 :
> c3b54000
> [ 122.670000] Flags: NzCv IRQs off FIQs on Mode SVC_32 Segment user
> [ 122.670000] Control: 5317F Table: 03B60000 DAC: 00000015
> [ 122.670000] Process worker (pid: 131, stack limit = 0xc39ce194)
> [ 122.670000] Stack: (0xc39d0014 to 0xc39d0000)
> [ 122.670000] Backtrace:
> [ 122.670000] Function entered at [<c01502e8>] from [<c014d25c>]
> [ 122.670000] Function entered at [<c014d1c4>] from [<c014d2a4>]
> [ 122.670000] r7 = 00000000 r6 = C02D2680 r5 = C02D2880 r4 =
> C02D2688
> [ 122.670000] Function entered at [<c014d288>] from [<c014a350>]
> [ 122.670000] Function entered at [<c014a1e8>] from [<c014a7a4>]
> [ 122.670000] Function entered at [<c014a700>] from [<c0126000>]
> [ 122.670000] r7 = C02D2680 r6 = C02D2880 r5 = 00000010 r4 =
> C02D2680
> [ 122.670000] Function entered at [<c0125e64>] from [<c01260a8>]
> [ 122.670000] Function entered at [<c0126010>] from [<c011f824>]
> [ 122.670000] Code: e3c3303f e593300c e5932004 e3a03001 (e5c2304d)
>
Compiled without debug symbols? Might be helpful to turn them in in
kernel config.
>
>
> 5. cyclictest
>
> Here some test-cases:
>
> -sh-3.00# ./cyclictest -l 100
> 0.03 0.01 0.00 1/21 53
>
> T: 0 ( 53) P: 0 I: 1000 C: 100 Min:-1087273 Act:-1087273 Max:
> -989708
> -sh-3.00# ./cyclictest -l 1000
> 0.02 0.01 0.00 2/22 56
>
> T: 0 ( 56) P: 0 I: 1000 C: 1000 Min:-1973899 Act:-1973899 Max:
> -989170
> -sh-3.00# ./cyclictest -l 10000
> 0.01 0.01 0.00 1/21 59
>
> T: 0 ( 59) P: 0 I: 1000 C: 10000 Min:-10848662 Act:-10848662 Max:
> -992290
> -sh-3.00# ./cyclictest -l 100000
> 0.01 0.00 0.00 2/22 62
>
> T: 0 ( 62) P: 0 I: 1000 C: 100000 Min:-99563368 Act:-99563368 Max:
> -989732
> -sh-3.00# ./cyclictest -l 1000000
> 0.98 0.22 0.07 3/22 65
>
> T: 0 ( 65) P: 0 I: 1000 C: 1000000 Min:-986731091 Act:-986731091
> Max: -989084
>
> cyclic-Test runs, but while it is running, timer_tick is never called.
>
This is totally broken. Do you run a manually compiled version of that
test? Check if it links against pthread_rt (the POSIX skin library).
Otherwise you benchmark vanilla Linux...
>
>
>
> I think that all above problems depends on the same root. I tried to
> find that root, but currently without success.
>
Again: Pick some easy-to-reproduce oops and try to understand its
history. Also, posting your patches may trigger further review and input.
Jan
[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 250 bytes --]
next prev parent reply other threads:[~2006-10-16 16:24 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-10-16 15:48 [Xenomai-help] Adeos/Xenomai Arm Port Schlägl Manfred jun.
2006-10-16 16:24 ` Jan Kiszka [this message]
2006-10-16 18:20 ` Gilles Chanteperdrix
2006-10-16 21:30 ` Philippe Gerum
2006-10-17 8:15 ` Gilles Chanteperdrix
2006-10-17 8:38 ` Gilles Chanteperdrix
2006-10-17 9:54 ` Philippe Gerum
2006-10-17 10:04 ` Philippe Gerum
2006-10-17 12:05 ` Gilles Chanteperdrix
-- strict thread matches above, loose matches on Subject: below --
2006-10-17 9:23 Schlägl Manfred jun.
2006-10-17 9:37 ` Jan Kiszka
2006-10-17 15:39 Schlägl Manfred jun.
2006-10-18 7:52 ` Schlägl Manfred jun.
2006-10-18 10:35 ` Jan Kiszka
2006-10-18 12:24 ` Schlägl Manfred jun.
2006-10-18 12:33 ` Jan Kiszka
2006-10-18 12:48 ` Gilles Chanteperdrix
2006-10-21 14:56 ` Schlägl Manfred jun.
2006-10-21 16:44 ` Jan Kiszka
2006-10-25 8:03 ` Schlägl Manfred jun.
2006-10-18 10:25 ` Philippe Gerum
2006-10-18 12:05 ` Gilles Chanteperdrix
2006-10-18 15:16 Schlägl Manfred jun.
2006-10-18 15:27 ` Jan Kiszka
2006-10-18 15:56 ` Schlägl Manfred jun.
2006-10-19 5:52 ` Jan Kiszka
2006-10-21 12:11 ` Schlägl Manfred jun.
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4533B231.40001@domain.hid \
--to=jan.kiszka@domain.hid \
--cc=manfred.schlaegl@domain.hid \
--cc=xenomai@xenomai.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.