All of lore.kernel.org
 help / color / mirror / Atom feed
From: Sebastian Smolorz <smolorz@domain.hid>
To: Jan Kiszka <jan.kiszka@domain.hid>
Cc: xenomai-core <xenomai@xenomai.org>,
	"Cornelius Köpp" <Cornelius.Koepp@domain.hid>
Subject: Re: [Xenomai-core] latencys drifting into negative (Xenomai	2.4.2/2.4.3)
Date: Wed, 02 Apr 2008 17:58:12 +0200	[thread overview]
Message-ID: <47F3AD14.4090306@domain.hid> (raw)
In-Reply-To: <47F37BF8.6000401@domain.hid>

Jan Kiszka wrote:
> Sebastian Smolorz wrote:
>> Jan Kiszka wrote:
>>> Cornelius Köpp wrote:
>>>> Hello,
>>>> I run the latency test from testsuite on several hard and software
>>>> configurations. Running on Xenomai 2.4.2, Linux 2.6.24 the results
>>>> shows a "strange" behavior: In Kernel mode (-t1) the latencys
>>>> constantly linear decrease. See attached plot
>>>> 'drifting_latencys_in_kernelmode.png' of latency test running 48h on
>>>> Pentium3 700. This effect could be reproduced, even on other hardware
>>>> (Pentium-M 1400).
>>> As our P3 boards did not support APIC-based timing (IIRC), your kernel
>>> has correctly disabled the related kernel support. But the Pentium M
>>> should be fine. So could you check if we are seeing some TSC clocks
>>> vs. PIT timer rounding issue by enabling the local APIC on the Pentium M?
>> There is no difference in enabling the local APIC on the Pentium M WRT
>> this bug.
>>
>>>> The usermode (-t0) did not show a drifting, but is influenced by a
>>>> test ran in kernelmode before.
>>> What do you mean with "is influenced"?
>> Cornelius saw the following behaviour: If the latency test was run in
>> user space first, no drift appeared over time. If latency was run in
>> kernel space (with the reported ngeative drift) a following latency test
>> in user space showed also negative values but with no additional drift
>> over time.

Correction: The initial negative drift when starting user mode latency 
does not depend on a former run of latency in kernel mode but on the 
time passed between system start and the starting point of latency -t0. 
Or, as explained below, it depends on the value of the TSC.

>>
>>>> I talked with Sebastian Smolorz about this and he builds his own
>>>> independent kernel-config to check. He got the same drifting-effect
>>>> with Xenomai 2.4.2 and Xenomai 2.4.3 running latency over several
>>>> hours. His kernel-config ist attached as
>>>> 'config-2.6.24-xenomai-2.4.3__ssm'.
>>>>
>>>> Our kernel-configs are both based on a config used with Xenomai 2.3.4
>>>> and Linux 2.6.20.15 without any drifting effects.
>>> 2.3.x did not incorporate the new TSC-to-ns conversion. Maybe it is
>>> not a PIC vs. APIC thing, but rather a rounding problem of larger TSC
>>> values (that naturally show up when the system runs for a longer time).
>> This hint seems to point into the right direction. I tried out a
>> modified pod_32.h (xnarch_tsc_to_ns() commented out) so that the old
>> implementation in include/asm-generic/bits/pod.h was used. The drifting
>> bug disappeared. So there seems so be a buggy x86-specific
>> implementation of this routine.
> 
> Hmm, maybe even a conceptional issue: the multiply-shift-based
> xnarch_tsc_to_ns is not as precise as the still multiply-divide-based
> xnarch_ns_to_tsc. So when converting from tsc over ns back to tsc, we
> may loose some bits, maybe too many bits...
> 
> It looks like this bites us in the kernel latency tests (-t2 should
> suffer as well). Those recalculate their timeouts each round based on
> absolute nanoseconds. In contrast, the periodic user mode task of -t0
> uses a periodic timer that is forwarded via a tsc-based interval.
> 
> You (or Cornelius) could try to analyse the calculation path of the
> involved timeouts, specifically to understand why the scheduled timeout
> of the underlying task timer (which is tsc-based) tend to diverge from
> the calculated one (ns-based).

So here comes the explanation. The error is inside the function 
rthal_llmulshft(). It returns wrong values which are too small - the 
higher the given TSC value the bigger the error. The function 
rtdm_clock_read_monotonic() calls rthal_llmulshft(). As 
rtdm_clock_read_monotonic() is called every time the latency kernel 
thread runs [1] the values reported by latency become smaller over time.

In contrast, the latency task in user space only uses the conversion 
from TSC to ns only once when calling rt_timer_inquire [2]. 
timer_info.date is too small, timer_info.tsc is right. So all calculated 
  deltas in [3] are shifted to a smaller value. This value is constant 
during the runtime of lateny in user space because no more conversion 
from TSC to ns occurs.


[1] 
http://www.rts.uni-hannover.de/xenomai/lxr/source/ksrc/drivers/testing/timerbench.c#166
[2] 
http://www.rts.uni-hannover.de/xenomai/lxr/source/src/testsuite/latency/latency.c#076
[3] 
http://www.rts.uni-hannover.de/xenomai/lxr/source/src/testsuite/latency/latency.c#111


-- 
Sebastian


  parent reply	other threads:[~2008-04-02 15:58 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-04-01 23:26 [Xenomai-core] latencys drifting into negative (Xenomai 2.4.2/2.4.3) Cornelius Köpp
2008-04-02  3:01 ` Tomas Kalibera
2008-04-02  9:04 ` Jan Kiszka
2008-04-02 12:00   ` Sebastian Smolorz
2008-04-02 12:28     ` Jan Kiszka
2008-04-02 12:46       ` Gilles Chanteperdrix
2008-04-02 13:00         ` Sebastian Smolorz
2008-04-02 15:28           ` Sebastian Smolorz
2008-04-02 15:58       ` Sebastian Smolorz [this message]
2008-04-02 16:05         ` Gilles Chanteperdrix
2008-04-02 16:24           ` Sebastian Smolorz
2008-04-03 12:17             ` Jan Kiszka
2008-04-03 12:27               ` Gilles Chanteperdrix
2008-04-03 12:50                 ` Jan Kiszka
2008-04-03 12:52                   ` Gilles Chanteperdrix
2008-04-03 13:15               ` Sebastian Smolorz
2008-04-03 21:52                 ` Jan Kiszka
2008-04-04  8:23                   ` Sebastian Smolorz
2008-04-04 10:45                     ` Jan Kiszka
2008-04-04 13:18                       ` Gilles Chanteperdrix
2008-04-04 13:25                         ` Jan Kiszka
2008-04-04 13:32                           ` Jan Kiszka
2008-04-04 13:32                           ` Gilles Chanteperdrix
2008-04-04 13:57                             ` Jan Kiszka
2008-04-04 14:09                               ` Gilles Chanteperdrix
2008-04-04 14:33                                 ` Jan Kiszka
2008-04-04 15:48                                   ` Gilles Chanteperdrix
2008-04-04 15:52                                     ` Philippe Gerum

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=47F3AD14.4090306@domain.hid \
    --to=smolorz@domain.hid \
    --cc=Cornelius.Koepp@domain.hid \
    --cc=jan.kiszka@domain.hid \
    --cc=xenomai@xenomai.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.