From: Petr Baudis <pasky@suse.cz>
To: Jay Soffian <jaysoffian@gmail.com>
Cc: Junio C Hamano <junkio@cox.net>, git@vger.kernel.org
Subject: Re: [PATCH] gitweb: Support caching projects list
Date: Fri, 14 Mar 2008 01:22:05 +0100 [thread overview]
Message-ID: <20080314002205.GL10335@machine.or.cz> (raw)
In-Reply-To: <76718490803131707g34fd40d4q21c69391c2597bc@mail.gmail.com>
On Thu, Mar 13, 2008 at 08:07:09PM -0400, Jay Soffian wrote:
> On Thu, Mar 13, 2008 at 7:14 PM, Petr Baudis <pasky@suse.cz> wrote:
> > diff --git a/gitweb/gitweb.css b/gitweb/gitweb.css
> > index 8e2bf3d..673077a 100644
> > --- a/gitweb/gitweb.css
> > +++ b/gitweb/gitweb.css
> > @@ -85,6 +85,12 @@ div.title, a.title {
> > color: #000000;
> > }
> >
> > +div.stale_info {
> > + display: block;
> > + text-align: right;
> > + font-style: italic;
> > +}
> > +
> > div.readme {
> > padding: 8px;
> > }
>
> What does this have to do with it?
The box shows that cached information is being shown.
> > diff --git a/gitweb/gitweb.perl b/gitweb/gitweb.perl
> > index bcb6193..0eee195 100755
> > --- a/gitweb/gitweb.perl
> > +++ b/gitweb/gitweb.perl
> > @@ -122,6 +122,15 @@ our $fallback_encoding = 'latin1';
>
> ...
>
> > + if ($cache_lifetime and -f $cache_file) {
> > + # Postpone timeout by two minutes so that we get
> > + # enough time to do our job.
> > + my $time = time() - $cache_lifetime + 120;
> > + utime $time, $time, $cache_file;
> > + }
>
> Race condition. I don't see any locking. Nothing keeps multiple instances from
> regenerating the cache concurrently...
>
> > + @projects = git_get_projects_details($projlist, $check_forks);
> > + if ($cache_lifetime and open (my $fd, '>'.$cache_file)) {
>
> ...and then clobbering each other here. You have two choices:
>
> 1) Use a lock file for the critical section.
>
> 2) Assume the race condition is rare enough, but you still need to account for
> it. In that case, you want to write to a temporary file and then rename to the
> cache file name. The rename is atomic, so though N instances of gitweb may
> regenerate the cache (at some CPU/IO overhead), at least the cache file won't
> get corrupted.
You are of course right - I wanted to do the rename, but forgot to write
it in the actual code. :-)
There is a more conceptual problem though - in case of such big sites,
it really makes more sense to explicitly regenerate the cache
periodically instead of making random clients to have to wait it out.
We could add a 'force_update' parameter to accept from localhost only
that will always regenerate the cache, but that feels rather kludgy -
can anyone think of a more elegant solution? (I don't think taking the
@projects generating code out of gitweb and then having to worry during
gitweb upgrades is any better.)
> Out of curiosity, repo.or.cz isn't running this as a CGI is it? If so, wouldn't
> running it as a FastCGI or modperl be a vast improvement?
Unlikely. Currently the machine is mostly IO-bound and only small
portion of CPU usage comes from gitweb itself.
--
Petr "Pasky" Baudis
Whatever you can do, or dream you can, begin it.
Boldness has genius, power, and magic in it. -- J. W. von Goethe
next prev parent reply other threads:[~2008-03-14 0:31 UTC|newest]
Thread overview: 30+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-03-13 23:14 [PATCH] gitweb: Support caching projects list Petr Baudis
2008-03-14 0:07 ` Jay Soffian
2008-03-14 0:22 ` Petr Baudis [this message]
2008-03-14 0:27 ` Jay Soffian
2008-03-14 0:30 ` J.H.
2008-03-14 12:17 ` Jakub Narebski
2008-03-14 0:36 ` J.H.
2008-03-17 17:49 ` repo.or.cz renovation Petr Baudis
2008-03-17 18:11 ` Petr Baudis
2008-03-17 18:44 ` J.H.
2008-03-17 20:41 ` Jakub Narebski
2008-03-17 21:09 ` Jakub Narebski
2008-03-14 15:29 ` [PATCH] gitweb: Support caching projects list Jakub Narebski
2008-03-14 21:11 ` Jay Soffian
2008-03-14 0:19 ` Junio C Hamano
2008-03-14 8:35 ` Frank Lichtenheld
2008-03-14 12:14 ` Jakub Narebski
2008-03-17 17:40 ` Petr Baudis
2008-03-15 21:44 ` Jakub Narebski
2008-03-16 0:56 ` Miklos Vajna
2008-03-16 11:41 ` Frank Lichtenheld
2008-03-16 16:52 ` J.H.
2008-03-16 18:37 ` Jakub Narebski
2008-03-16 22:37 ` J.H.
2008-03-16 23:39 ` Jakub Narebski
2008-03-17 18:10 ` repo.or.cz renovated Petr Baudis
2008-03-17 19:09 ` Junio C Hamano
2008-03-17 19:25 ` Petr Baudis
2008-03-17 19:34 ` Theodore Tso
2008-03-17 19:54 ` Petr Baudis
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080314002205.GL10335@machine.or.cz \
--to=pasky@suse.cz \
--cc=git@vger.kernel.org \
--cc=jaysoffian@gmail.com \
--cc=junkio@cox.net \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).