From: John <linux.kernel@free.fr>
To: Eric Dumazet <dada1@cosmosbay.com>
Cc: linux-net@vger.kernel.org, netdev@vger.kernel.org, linux.kernel@free.fr
Subject: Re: CLOCK_MONOTONIC datagram timestamps by the kernel
Date: Wed, 28 Feb 2007 17:07:10 +0100 [thread overview]
Message-ID: <45E5A8AE.3030606@free.fr> (raw)
In-Reply-To: <200702281555.10309.dada1@cosmosbay.com>
Eric Dumazet wrote:
> On Wednesday 28 February 2007 15:23, John wrote:
>> Eric Dumazet wrote:
>>>> John wrote:
>>>>> I know it's possible to have Linux timestamp incoming datagrams as soon
>>>>> as they are received, then for one to retrieve this timestamp later
>>>>> with an ioctl command or a recvmsg call.
>>>> Has it ever been proposed to modify struct skb_timeval to hold
>>>> nanosecond stamps instead of just microsecond stamps? Then make the
>>>> improved precision somehow available to user space.
>>> Most modern NICS are able to delay packet delivery, in order to reduce
>>> number of interrupts and benefit from better cache hits.
>>
>> You are referring to NAPI interrupt mitigation, right?
>
> Nope; I am referring to hardware features. NAPI is software.
>
> See ethtool -c eth0
>
> # ethtool -c eth0
> Coalesce parameters for eth0:
> Adaptive RX: off TX: off
> stats-block-usecs: 1000000
> sample-interval: 0
> pkt-rate-low: 0
> pkt-rate-high: 0
>
> rx-usecs: 300
> rx-frames: 60
> rx-usecs-irq: 300
> rx-frames-irq: 60
>
> tx-usecs: 200
> tx-frames: 53
> tx-usecs-irq: 200
> tx-frames-irq: 53
>
> You can see on this setup, rx interrupts can be delayed up to 300 us (up to 60
> packets might be delayed)
One can disable interrupt mitigation. Your argument that it introduces
latency therefore becomes irrelevant.
>> POSIX is moving to nanoseconds interfaces.
>> http://www.opengroup.org/onlinepubs/009695399/functions/clock_settime.html
You snipped too much. I also wrote:
struct timeval and struct timespec take as much space (64 bits).
If the hardware can indeed manage sub-microsecond accuracy, a struct
timeval forces the kernel to discard valuable information.
> The fact that you are able to give nanosecond timestamps inside kernel is not
> sufficient. It is necessary of course, but not sufficient. This precision is
> OK to time locally generated events. The moment you ask a 'nanosecond'
> timestamp, it's usually long before/after the real event.
>
> If you rely on nanosecond precision on network packets, then something is
> wrong with your algo. Even rt patches wont make sure your cpu caches are
> pre-filled, or that the routers/links between your machines are not busy.
> A cache miss cost 40 ns for example. A typical interrupt handler or rx
> processing can trigger 100 cache misses, or not at all if cache is hot.
Consider an idle Linux 2.6.20-rt8 system, equipped with a single PCI-E
gigabit Ethernet NIC, running on a modern CPU (e.g. Core 2 Duo E6700).
All this system does is time stamp 1000 packets per second.
Are you claiming that this platform *cannot* handle most packets within
less than 1 microsecond of their arrival?
If there are platforms that can achieve sub-microsecond precision, and
if it is not more expensive to support nanosecond resolution (I said
resolution not precision), then it makes sense to support nanosecond
resolution in Linux. Right?
> You said that rt gives highest priority to interrupt handlers :
> If you have several nics, what will happen if you receive packets on both
> nics, or if the NIC interrupt happens in the same time than timer interrupt ?
> One timestamp will be wrong for sure.
Again, this is irrelevant. We are discussing whether it would make sense
to support sub-microsecond resolution. If there is one platform that can
achieve sub-microsecond precision, there is a need for sub-microsecond
resolution. As long as we are changing the resolution, we might as well
use something standard like struct timespec.
> For sure we could timestamp packets with nanosecond resolution, and eventually
> with MONOTONIC value too, but it will give you (and others) false confidence
> on the real precision. us timestamps are already wrong...
IMHO, this is not true for all platforms.
Regards.
next prev parent reply other threads:[~2007-02-28 16:07 UTC|newest]
Thread overview: 34+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-02-28 10:18 CLOCK_MONOTONIC datagram timestamps by the kernel John
2007-02-28 13:37 ` John
2007-02-28 13:55 ` Eric Dumazet
2007-02-28 14:23 ` John
2007-02-28 14:55 ` Eric Dumazet
2007-02-28 16:07 ` John [this message]
2007-03-01 10:03 ` Evgeniy Polyakov
2007-03-01 11:30 ` Eric Dumazet
2007-03-01 15:54 ` Stephen Hemminger
2007-03-01 16:13 ` Eric Dumazet
2007-03-02 14:38 ` [PATCH] NET : convert network timestamps to ktime_t Eric Dumazet
2007-03-02 16:27 ` Stephen Hemminger
2007-03-02 21:02 ` Stephen Hemminger
2007-03-02 22:46 ` Eric Dumazet
2007-03-05 0:19 ` David Miller
2007-03-05 6:56 ` Eric Dumazet
2007-03-05 7:40 ` Eric Dumazet
2007-03-05 8:00 ` David Miller
2007-03-05 8:21 ` Eric Dumazet
2007-03-05 8:49 ` David Miller
2007-03-08 14:17 ` [PATCH] NET : Introduce SIOCGSTAMPNS ioctl to get timestamps with nanosec resolution Eric Dumazet
2007-03-08 16:28 ` Patrick McHardy
2007-03-08 16:42 ` Eric Dumazet
2007-03-08 16:45 ` Patrick McHardy
2007-03-09 4:39 ` David Miller
2007-03-09 18:39 ` [PATCH] NET : Adding SO_TIMESTAMPNS / SCM_TIMESTAMPNS support Eric Dumazet
2007-03-09 22:17 ` David Miller
2007-03-01 18:53 ` CLOCK_MONOTONIC datagram timestamps by the kernel Stephen Hemminger
2007-03-01 23:14 ` Eric Dumazet
2007-03-01 23:34 ` Stephen Hemminger
2007-03-02 0:56 ` Eric Dumazet
2007-03-02 9:26 ` John
2007-03-02 10:11 ` Eric Dumazet
2007-02-28 18:22 ` Stephen Hemminger
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=45E5A8AE.3030606@free.fr \
--to=linux.kernel@free.fr \
--cc=dada1@cosmosbay.com \
--cc=linux-net@vger.kernel.org \
--cc=netdev@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).