All of lore.kernel.org
 help / color / mirror / Atom feed
From: Pete Zaitcev <zaitcev@redhat.com>
To: Jeff Garzik <jeff@garzik.org>
Cc: hail-devel@vger.kernel.org, zaitcev@redhat.com
Subject: Re: chunkd design genesis, storage tech, and support for multiple key/value tables
Date: Tue, 10 Nov 2009 18:48:56 -0700	[thread overview]
Message-ID: <20091110184856.5550a4ce@redhat.com> (raw)
In-Reply-To: <4AF9D123.4090908@garzik.org>

On Tue, 10 Nov 2009 15:46:27 -0500, Jeff Garzik <jeff@garzik.org> wrote:

> Now the world has figured out giving a storage device the flexibility to 
> manage data on a per-object granular basis simplifies applications, and 
> gives underlying storage more ability to optimize.

This is a sleught of hand by OSD vendors, interested in selling for
more dollars per gigabyte.

> Thus, moving to generic key/value storage actually simplified 
> applications, by eliminating that mapping.

You're so sure about this, I wonder where it comes from.

The fact in case of tabled is, it must maintain a database of keys
of its own, primarily because (a) it cannot afford round-trips into
Chunk for every operation, and (b) to locate the chunks. Both of
these databases may be in RAM, but it does not make them non-existing.

> However, one glaring difference from SCSI OSD was chunkd's lack of 
> administrative partitions.  SCSI OSDs provide "partitions" within each 
> logical unit (LUN), each of contains a set of objects within a single 
> object id namespace.  Therefore, if you consider SCSI OSD object id as 
> the key, then SCSI OSD definitely has multiple key/value tables.

This is a completely bogus analogy. OSD vendors want to push their
wares into PC space, where one unit is all a computer has. But in
the cloud we have thousands of Chunk nodes per each application.
That is your partitioning right there: it's called <Cell></Cell>.

Look, I would not mind if all this partition stuff was free, but
it's not. You decided to embed a partition into a session, so
 - There's a round trip that you excuse by telling applications
   to keep long-living connections, thanks a lot
 - requests to different partitions cannot be pipelined (well,
   not easily).

-- Pete

      reply	other threads:[~2009-11-11  1:48 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-11-10 11:24 [PATCH] chunkd: add support for multiple key/value tables Jeff Garzik
2009-11-10 16:33 ` Pete Zaitcev
2009-11-10 19:45   ` Jeff Garzik
2009-11-10 19:50     ` Jeff Garzik
2009-11-10 20:46       ` chunkd design genesis, storage tech, and " Jeff Garzik
2009-11-11  1:48         ` Pete Zaitcev [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20091110184856.5550a4ce@redhat.com \
    --to=zaitcev@redhat.com \
    --cc=hail-devel@vger.kernel.org \
    --cc=jeff@garzik.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.