CEPH filesystem development
 help / color / mirror / Atom feed
From: John Spray <john.spray@redhat.com>
To: Sage Weil <sweil@redhat.com>, sjust@redhat.com
Cc: ceph-devel@vger.kernel.org
Subject: Re: sharded collection list
Date: Thu, 04 Jun 2015 14:30:11 +0100	[thread overview]
Message-ID: <557052E3.6060701@redhat.com> (raw)
In-Reply-To: <alpine.DEB.2.00.1506021533010.30975@cobra.newdream.net>


Talked about this elsewhere but for the benefit of the list:
  * The API suggested here looks nicer to me too
  * This depends on the new PGLS ordering OSD side, so that has to land 
before this
  * In the meantime I've rebased the #9964 (rados import/export) branch 
to not depend on sharded pgls

Cheers,
John

On 02/06/2015 23:54, Sage Weil wrote:
> Hey John-
>
> So the shared pgls stuff has collided a bit with the looming hobject
> sorting changes.  Sam and I just talked about it a bit and came up
> with what librados API would be most appealing:
>
>   - the listing API would have start/end markers
>
>   - it would be driven by a new opaque type rados_list_cursor_t, which is
> just data, no state, and internally is just an hobject_t.
>
>   - it would be totally stateless.. kill the [N]ListContext stuff in
> Objecter (and reimplement a simple wrapper in librados.cc or even .h).
> Note that the important bits of state there now are
>
>   epoch (needed for detecting split; this will go away with a better cursor)
>   result buffer (we can drop this)
>   nspace (part of the ioctx, it just tags each request)
>   cookie (this basically becomes the cursor .. it's just an hobject_t typedef)
>
>   - the list could take a start cursor, optional end cursor, and output the
> next cursor to continue from.
>
>   - we'd lose the buffering that ListContext currently does, which means
> that the request that goes over the wire will return the same number
> of entries that the C caller asks for.  The C++ interface is an iterator
> so it'll have to do its own buffering, but that should be pretty
> trivial...
>
>   - we should kill these calls, which were never used:
>
>   CEPH_RADOS_API uint32_t rados_nobjects_list_get_pg_hash_position(rados_list_ctx_t ctx);
>
>   CEPH_RADOS_API uint32_t rados_nobjects_list_seek(rados_list_ctx_t ctx,
>                                                    uint32_t pos);
>
>   - we'd add a new call that is something like
>
>   int rados_construct_iterator(ioctx, int n, int m, cursor *out);
>
> so that you can get a position partway through the pg.
>
> What do you think?  Unfortunately it is quite a departure from what you
> implemented already but I think it'll be a net simplification *and*
> let you do all the things we want, like
>
>   - get a set of ranges to list form
>   - change our mind partway through to break things into smaller shards
> without losing previous work
>   - start listing from a random position in the pool
>
> You could even list a single hash value by constructing a cursor with
> n=hash and n=hash=1 and m=2^32.
>
> What do you think?
> sage
>
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


      reply	other threads:[~2015-06-04 13:30 UTC|newest]

Thread overview: 2+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-06-02 22:54 sharded collection list Sage Weil
2015-06-04 13:30 ` John Spray [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=557052E3.6060701@redhat.com \
    --to=john.spray@redhat.com \
    --cc=ceph-devel@vger.kernel.org \
    --cc=sjust@redhat.com \
    --cc=sweil@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox