All of lore.kernel.org
 help / color / mirror / Atom feed
From: john stultz <johnstul@us.ibm.com>
To: "J. Bruce Fields" <bfields@fieldses.org>
Cc: "Patrick J. LoPresti" <lopresti@gmail.com>,
	Alan Cox <alan@lxorguk.ukuu.org.uk>,
	Andi Kleen <andi@firstfloor.org>,
	linux-fsdevel@vger.kernel.org, linux-nfs@vger.kernel.org,
	linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: Proposal: Use hi-res clock for file timestamps
Date: Wed, 18 Aug 2010 20:17:14 -0700	[thread overview]
Message-ID: <1282187834.3575.30.camel@localhost.localdomain> (raw)
In-Reply-To: <20100819023106.GB30151@fieldses.org>

On Wed, 2010-08-18 at 22:31 -0400, J. Bruce Fields wrote:
> On Wed, Aug 18, 2010 at 06:41:02PM -0700, john stultz wrote:
> > On Wed, Aug 18, 2010 at 11:12 AM, J. Bruce Fields <bfields@fieldses.org> wrote:
> > > I'm completely ignorant about higher-resolution time sources.  Any
> > > recommended reading?  What resolution do they actually provide, what's
> > > the expense of reading them, how reliable are they, and how do the
> > > answers to those questions vary across different hardware and kernel
> > > versions?  A quick look at drivers/clocksource/ doesn't suggest
> > > simple answers.
> > 
> > Yea, there aren't simple answers. Clocksource hardware varies
> > drastically in resolution and access time across systems and
> > architectures. Further, clocksources may change while the system is
> > up, so we don't really expose the hardware resolution.
> > 
> > On x86, access latency varies from ~50ns (TSC) to ~1.3us (ACPI PM).
> > (And that is ignoring the PIT, which can be 18us per call - luckily
> > almost no hardware uses that). The resolution similarly scales from
> > sub-ns (TSC @ > 1ghz cpus) to ~279ns (ACPI PM). Of course, across
> > architectures you will see even more variance.
> 
> The race in question occurs when you manage to check mtime between two
> file data updates, with all three operations occurring within a clock
> tick.
> 
> No idea if that's feasible in hundreds of nanoseconds.

I think this is what Andi meant that you'll always race with time and
that version counters are the only real solution here.

> I'm also not sure how to judge the access latency.  Certainly a
> microsecond is a lot compared to just reading a cached mtime value.
> 
> Will we ever see them go backwards?  (So if I know I wrote to file B
> after writing to file A, is there ever a case where I could end up with
> an earlier mtime on B than A?)

You should not. However, there have been bugs in the past, and there
will probably be a few more in the future.

There are also theoretical issues with SMP systems where the TSCs are
not perfectly synced, but the window for those races should be small
(ie: smaller then can be detected - otherwise we'll throw out the TSC).


thanks
-john



WARNING: multiple messages have this Message-ID (diff)
From: john stultz <johnstul-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
To: "J. Bruce Fields" <bfields-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>
Cc: "Patrick J. LoPresti"
	<lopresti-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	Alan Cox <alan-qBU/x9rampVanCEyBjwyrvXRex20P6io@public.gmane.org>,
	Andi Kleen <andi-Vw/NltI1exuRpAAqCnN02g@public.gmane.org>,
	linux-fsdevel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	linux-nfs-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	linux-kernel
	<linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>
Subject: Re: Proposal: Use hi-res clock for file timestamps
Date: Wed, 18 Aug 2010 20:17:14 -0700	[thread overview]
Message-ID: <1282187834.3575.30.camel@localhost.localdomain> (raw)
In-Reply-To: <20100819023106.GB30151-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org>

On Wed, 2010-08-18 at 22:31 -0400, J. Bruce Fields wrote:
> On Wed, Aug 18, 2010 at 06:41:02PM -0700, john stultz wrote:
> > On Wed, Aug 18, 2010 at 11:12 AM, J. Bruce Fields <bfields-uC3wQj2KruNg9hUCZPvPmw@public.gmane.org> wrote:
> > > I'm completely ignorant about higher-resolution time sources.  Any
> > > recommended reading?  What resolution do they actually provide, what's
> > > the expense of reading them, how reliable are they, and how do the
> > > answers to those questions vary across different hardware and kernel
> > > versions?  A quick look at drivers/clocksource/ doesn't suggest
> > > simple answers.
> > 
> > Yea, there aren't simple answers. Clocksource hardware varies
> > drastically in resolution and access time across systems and
> > architectures. Further, clocksources may change while the system is
> > up, so we don't really expose the hardware resolution.
> > 
> > On x86, access latency varies from ~50ns (TSC) to ~1.3us (ACPI PM).
> > (And that is ignoring the PIT, which can be 18us per call - luckily
> > almost no hardware uses that). The resolution similarly scales from
> > sub-ns (TSC @ > 1ghz cpus) to ~279ns (ACPI PM). Of course, across
> > architectures you will see even more variance.
> 
> The race in question occurs when you manage to check mtime between two
> file data updates, with all three operations occurring within a clock
> tick.
> 
> No idea if that's feasible in hundreds of nanoseconds.

I think this is what Andi meant that you'll always race with time and
that version counters are the only real solution here.

> I'm also not sure how to judge the access latency.  Certainly a
> microsecond is a lot compared to just reading a cached mtime value.
> 
> Will we ever see them go backwards?  (So if I know I wrote to file B
> after writing to file A, is there ever a case where I could end up with
> an earlier mtime on B than A?)

You should not. However, there have been bugs in the past, and there
will probably be a few more in the future.

There are also theoretical issues with SMP systems where the TSCs are
not perfectly synced, but the window for those races should be small
(ie: smaller then can be detected - otherwise we'll throw out the TSC).


thanks
-john


--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

  reply	other threads:[~2010-08-19  3:17 UTC|newest]

Thread overview: 76+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-08-13 18:25 Proposal: Use hi-res clock for file timestamps Patrick J. LoPresti
2010-08-13 18:45 ` john stultz
2010-08-13 18:45   ` john stultz
2010-08-13 18:57   ` Patrick J. LoPresti
2010-08-13 19:09     ` john stultz
2010-08-13 19:09       ` john stultz
2010-08-13 20:53       ` Patrick J. LoPresti
2010-08-13 20:53         ` Patrick J. LoPresti
2010-08-14 16:45         ` Patrick J. LoPresti
2010-08-15  1:50         ` Bret Towe
2010-08-13 19:57     ` Jim Rees
2010-08-13 20:26       ` john stultz
2010-08-13 20:52         ` Jim Rees
2010-08-17 14:54 ` Andi Kleen
2010-08-17 14:54   ` Andi Kleen
2010-08-17 17:41   ` J. Bruce Fields
2010-08-17 17:41     ` J. Bruce Fields
2010-08-17 18:29     ` Andi Kleen
2010-08-17 18:29       ` Andi Kleen
2010-08-17 18:50       ` Patrick J. LoPresti
2010-08-17 18:50         ` Patrick J. LoPresti
2010-08-17 19:04       ` J. Bruce Fields
2010-08-17 19:18         ` Patrick J. LoPresti
2010-08-17 19:18           ` Patrick J. LoPresti
2010-08-17 19:39           ` Alan Cox
2010-08-17 19:39             ` Alan Cox
2010-08-17 19:29             ` J. Bruce Fields
2010-08-17 19:29               ` J. Bruce Fields
2010-08-17 19:52               ` Alan Cox
2010-08-17 19:52                 ` Alan Cox
2010-08-18  5:53               ` Neil Brown
2010-08-18  5:53                 ` Neil Brown
2010-08-18 14:46                 ` Patrick J. LoPresti
2010-08-18 14:46                   ` Patrick J. LoPresti
2010-08-18 17:32                 ` J. Bruce Fields
2010-08-18 17:32                   ` J. Bruce Fields
2010-08-18 18:15                   ` Chuck Lever
2010-08-18 18:15                     ` Chuck Lever
2010-08-18 23:41                     ` Neil Brown
2010-08-18 23:41                       ` Neil Brown
2010-08-19  0:52                       ` Neil Brown
2010-08-19  0:52                         ` Neil Brown
2010-08-19  2:08                         ` J. Bruce Fields
2010-08-19  2:08                           ` J. Bruce Fields
2010-08-19  2:44                           ` Neil Brown
2010-08-19  2:44                             ` Neil Brown
2010-08-19 22:46                             ` J. Bruce Fields
2010-08-19 22:46                               ` J. Bruce Fields
2010-08-18 23:47                   ` Neil Brown
2010-08-18 23:47                     ` Neil Brown
2010-08-18 17:50                 ` Andi Kleen
2010-08-18 17:50                   ` Andi Kleen
2010-08-18 18:54                   ` J. Bruce Fields
2010-08-18 19:25                     ` Andi Kleen
2010-08-18 19:30                       ` J. Bruce Fields
2010-08-17 19:34             ` Patrick J. LoPresti
2010-08-17 19:34               ` Patrick J. LoPresti
2010-08-17 19:54               ` Alan Cox
2010-08-17 19:43                 ` Patrick J. LoPresti
2010-08-17 19:43                   ` Patrick J. LoPresti
2010-08-17 19:45                   ` J. Bruce Fields
2010-08-17 19:45                     ` J. Bruce Fields
2010-08-18 18:12               ` J. Bruce Fields
2010-08-18 18:12                 ` J. Bruce Fields
2010-08-19  1:41                 ` john stultz
2010-08-19  1:41                   ` john stultz
2010-08-19  2:31                   ` J. Bruce Fields
2010-08-19  2:31                     ` J. Bruce Fields
2010-08-19  3:17                     ` john stultz [this message]
2010-08-19  3:17                       ` john stultz
2010-08-19 22:53                       ` J. Bruce Fields
2010-08-18 18:20       ` David Woodhouse
2010-08-18 18:20         ` David Woodhouse
2010-08-18 18:32         ` Patrick J. LoPresti
2010-08-18 18:32           ` Patrick J. LoPresti
2010-08-18 18:53         ` Andi Kleen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1282187834.3575.30.camel@localhost.localdomain \
    --to=johnstul@us.ibm.com \
    --cc=alan@lxorguk.ukuu.org.uk \
    --cc=andi@firstfloor.org \
    --cc=bfields@fieldses.org \
    --cc=linux-fsdevel@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-nfs@vger.kernel.org \
    --cc=lopresti@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.