All of lore.kernel.org
 help / color / mirror / Atom feed
From: Jeff Garzik <jeff@garzik.org>
To: Pete Zaitcev <zaitcev@redhat.com>
Cc: Rick Peralta <fbp@tiac.net>,
	Project Hail <hail-devel@vger.kernel.org>,
	zaitcev@redat.com
Subject: Re: HAIL volunteer Rick Peralta
Date: Wed, 29 Jul 2009 13:17:34 -0400	[thread overview]
Message-ID: <4A70842E.8020908@garzik.org> (raw)
In-Reply-To: <20090729105202.7d0410de.zaitcev@redhat.com>

Pete Zaitcev wrote:
> On Wed, 29 Jul 2009 10:31:56 -0400, Jeff Garzik <jeff@garzik.org> wrote:
> 
>>> Is there someone taking the point for the chunkd development?
>> That's me, for the moment.  :)
> 
> I have some short list todo for Chunk, after which I don't have
> any particular plans:
>  * Exit if CLD registration fails (maybe!).

Hopefully all this is wrapped up into libcldc, such that, an application 
needs to only worry about major, abstracted events after calling 
new-session:

* no master, after defined "hunt" procedure.

	This includes both init and master failure (as distinguished
	from fail-over).

	The application will need to be in the "no CLD session"
	state in both cases.

	And indeed, exit() might be the best way to do that.

* master fail-over

	Flush our [currently non-existent] CLD cache.

etc.


>  * Put ourhost into the CLD record, and the port.
>  * Use base directory instead of Cell.
>  * Switch to asprintf for CLD filenames, Geo.

agreed


> So far we managed hacking on same codebase with relative ease.
> Just make sure to post patches early.
> 
>> You should read the GoogleFS paper referenced on the chunkd wiki page: 
>> http://labs.google.com/papers/gfs-sosp2003.pdf  It describes the purpose 
>> and use of a chunk server, in the context of distributed cloud storage.
> 
> I think we're at a point where we have our own base of knowledge
> and evolved an overall architecture to the point we don't have to
> ape every little detail of Google architecture.

Well, until the wiki has a description of the basic idea of a chunk 
server, the Google paper will have to do.

The point is not that we are aping Google, but more to describe the 
general concept to someone who does not know what a chunk server is, and 
how a chunk server fits into the "grand design."


> In particular I'm
> going to fight hard any talk of Chunk doing its own replication,
> for now at least.

WRT chunkd and replication, yes, that's fine for version 1.0.

But consider which is more likely to have bandwidth to spare:

	a) client -> service
		or
	b) service -> service

Of the two, I'd say "a" is a bit more likely to be remote (WAN) and have 
a slow-upload situation like my home cable modem (1 mbps down, 50 kbps 
up), and "b" is more likely to be LAN.

Or to take converse logic -- is it likely that service->service 
replication is SLOWER than client->service replication?

Every way I look at it, client->{service,service,service} replication 
seems both easy... and potentially slower than alternatives :)

	Jeff



  parent reply	other threads:[~2009-07-29 17:17 UTC|newest]

Thread overview: 10+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <29025029.1248785350151.JavaMail.root@mswamui-andean.atl.sa.earthlink.net>
     [not found] ` <4A6F5A3E.1070907@garzik.org>
     [not found]   ` <4A704376.6000303@tiac.net>
2009-07-29 14:31     ` HAIL volunteer Rick Peralta Jeff Garzik
2009-07-29 16:52       ` Pete Zaitcev
2009-07-29 16:58         ` Fabian Deutsch
2009-07-29 17:17         ` Jeff Garzik [this message]
2009-07-29 17:59           ` Jeff Garzik
2009-07-29 18:19             ` Pete Zaitcev
2009-07-29 18:30               ` Jeff Garzik
2009-07-29 18:35             ` Fabian Deutsch
2009-07-29 19:14               ` Jeff Garzik
2009-07-31 16:37 Rick Peralta

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4A70842E.8020908@garzik.org \
    --to=jeff@garzik.org \
    --cc=fbp@tiac.net \
    --cc=hail-devel@vger.kernel.org \
    --cc=zaitcev@redat.com \
    --cc=zaitcev@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.