From: "Kenneth Ölwing" <kenneth@olwing.se>
To: Git List <git@vger.kernel.org>
Cc: Thomas Rast <trast@inf.ethz.ch>
Subject: Re: Collective wisdom about repos on NFS accessed by concurrent clients (== corruption!?)
Date: Fri, 05 Apr 2013 16:45:33 +0200 [thread overview]
Message-ID: <515EE38D.2000502@olwing.se> (raw)
In-Reply-To: <87li8xrt5f.fsf@linux-k42r.v.cablecom.net>
On 2013-04-05 15:42, Thomas Rast wrote:
> Can you run the same tests under strace or similar, and gather the
> relevant outputs? Otherwise it's probably very hard to say what is
> going wrong. In particular we've had some reports on lustre that
> boiled down to "impossible" returns from libc functions, not git
> issues. It's hard to say without some evidence.
Thomas, thanks for your reply.
I'm assuming I should strace the git commands as they're issued? I'm
already collecting regular stdout/err output in a log as I go. Is there
any debugging things I can turn on to make the calls issue internal
tracing of some sort?
The main issue I see is that I suspect it will generate so much data
that it'll overflow my disk ;-). Consider that my hammer consists of a
Perl script that forks a number of tasks (e.g. 15) that each loops doing
clone/commit/push/pull, with retrying on a few levels as errors occur
(usually expected ones due to the concurrency, i.e. someone else pushed
so a pull is necessary first, but occasionally the central repo is
broken enough that it can't be cloned from, or at least not checked out
master from...sometimes with printed errors that still give me a zero
exit code...). That is then also run on several machines to the same
repo to hopefully cause a breakage by sheer pounding...it's going to
generate huge collections of strace output I expect...
I have some variations of this (e.g. all tasks are working on different
branches, improving concurrency in some respects, but effects there have
been that at the end I was missing a branch or so...). The likelihood of
problems seems to increase when I actually use ssh in my ultimate setup,
so a loadbalancer roundrobins each call to any of several hosts. In that
case I must admit I don't know how to get in on the action since I guess
I would need to strace the git-upload/receive-pack processes on the
server side...?
Lastly, I don't know how much this will impact timings etc, or load. To
get a broken result I have sometimes needed to run for many hours,
others fairly quickly.
Well...I will try, it'll probably be a blast :-)
BTW, this is mostly done on Centos 6.3 and 6.4, locally built git-1.8.2.
ken1
next prev parent reply other threads:[~2013-04-06 16:49 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-03-28 10:22 Collective wisdom about repos on NFS accessed by concurrent clients (== corruption!?) Kenneth Ölwing
2013-04-05 12:35 ` Kenneth Ölwing
2013-04-05 13:42 ` Thomas Rast
2013-04-05 14:45 ` Kenneth Ölwing [this message]
2013-04-06 8:11 ` Thomas Rast
2013-04-06 11:49 ` Jason Pyeron
2013-04-07 18:56 ` Kenneth Ölwing
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=515EE38D.2000502@olwing.se \
--to=kenneth@olwing.se \
--cc=git@vger.kernel.org \
--cc=trast@inf.ethz.ch \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).