All of lore.kernel.org
 help / color / mirror / Atom feed
From: Pete Zaitcev <zaitcev@redhat.com>
To: Jeff Garzik <jeff@garzik.org>
Cc: Project Hail <hail-devel@vger.kernel.org>
Subject: Re: tabled RPM build fails before it succeeds
Date: Wed, 12 May 2010 18:49:03 -0600	[thread overview]
Message-ID: <20100512184903.34de24ff@redhat.com> (raw)
In-Reply-To: <4BC89B88.7050304@garzik.org>

On Fri, 16 Apr 2010 13:16:56 -0400
Jeff Garzik <jeff@garzik.org> wrote:

> Build #1 (fails on x86_64):
> http://koji.fedoraproject.org/koji/taskinfo?taskID=2119825

I think current tabled is much better; it should not stumble with
the "100s" thing as much. Unfortunately, it's not completely reliable
still. I see this (although very infrequently):

PASS: prep-db
chunkd[19052]: Waiting for CLD PortFile cld.port
cld[19051]: databases up
cld[19051]: Listening on port 56141
cld[19051]: initialized: nodebug
chunkd[19052]: Using CLD port 56141
tabled[19055]: Listening on port 44610
tabled[19055]: New CLD session created, sid 4C7619861D42473D
tabled[19055]: /chunk-default: open failed, retrying
chunkd[19053]: Listening on auto port 48660
PASS: start-daemon
PASS: pid-exists
PASS: daemon-running
tabled[19055]: /chunk-default: open failed, retrying
tabled[19055]: /chunk-default: open failed, retrying
tabled[19055]: /chunk-default: open failed, retrying
tabled[19055]: /chunk-default: open failed, retrying
 <------------ at this point tabled exits
cld[19051]: session timeout, addr ::1 sid 4C7619861D42473D
chunkd[19053]: New CLD session created, sid 4C7619861D42473D
chunkd[19053]: initialized
 <------------ great, too late
^Cmake[2]: *** [check-TESTS] Interrupt

So, tabled retries, but gives up too early. Of course the knee-jerk
reaction would be to change the max retries from 5 to 10... The
problem is I have a vague suspicion that something is fishy.
The root of the 100s problem was that CLD gets delayed just
a tiny bit, enough for clients to start and fail the first
round of sessions. That's fine, we deal with it now. But in
the above log CLD seems to be available enough for tabled to
initiate at least, so why does Chunk have to retry?

-- Pete

  parent reply	other threads:[~2010-05-13  0:49 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-04-16 17:16 tabled RPM build fails before it succeeds Jeff Garzik
2010-04-16 20:19 ` Pete Zaitcev
2010-05-13  0:49 ` Pete Zaitcev [this message]
2010-05-28 20:14 ` Pete Zaitcev

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100512184903.34de24ff@redhat.com \
    --to=zaitcev@redhat.com \
    --cc=hail-devel@vger.kernel.org \
    --cc=jeff@garzik.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.