From: Daniel Phillips <phillips@phunq.net>
To: FUJITA Tomonori <fujita.tomonori@lab.ntt.co.jp>
Cc: zumastor@googlegroups.com, dm-devel@redhat.com, agk@redhat.com
Subject: Re: [PATCH RFC] dm snapshot: shared exception store
Date: Fri, 15 Aug 2008 01:43:55 -0700 [thread overview]
Message-ID: <200808150143.55776.phillips@phunq.net> (raw)
In-Reply-To: <20080815171801N.fujita.tomonori@lab.ntt.co.jp>
On Friday 15 August 2008 01:17, FUJITA Tomonori wrote:
> On Wed, 13 Aug 2008 17:14:08 -0700
> Daniel Phillips <phillips@phunq.net> wrote:
>
> > requests. I could take care of designing and implementing a kernel
> > interface between your port and the rest of ddsnapd that does such
> > things as respond to control messages and generate block delta
> > lists.
>
> As I said at the first submission, I plan to add such features to the
> new dm-snapshot code. Then we can have simple user-space code that
> focus on the replication.
Well, I suppose when you get it working we can always port it back to
ddsnap :-)
Ddsnap already has quite simple userspace code to do the replication,
or it would be simple if it were cleaned up a little. There is
nothing complex about this. But the kernel will have to generate the
block difference list because it needs access to the snapshot store
btree to do this.
> A daemon program requests delta from the
> kernel, and sends it to another daemon program on the remote
> server. The daemon on the remote server asks the kernel to apply
> delta.
The downstream server just writes the delta to the origin, there is no
need to ask the kernel to do this.
> The advantage of this approach, the above replication program can work
> with any snapshot implementation, which could live in dm or file
> systems like btrfs. File systems could implement the snapshot features
> more efficiently than dm.
When you replicate a volume you can just send a list of changed blocks
as ddsnap does. This is not the case with a filesystem delta, which
has to send the changed blocks of each filesystem object logically,
along with relevant metadata such as changed permissions, ownership,
file sizes etc.
> My question related with this issue is, any chance to modify
> Zumastor's ddsnapd in a such way. Well, I guess, it would be better to
> ask on Zumastor mailing list.
CC added. Yes, it is planned to modify ddsnap to implement a redirect
on write strategy where you essentially use a snapshot as the origin.
This will be a lot more practical after we have snapshots of snapshots
using the versioned pointer code. Versioned pointers by itself will
take a few months to go in and be stable. Things do not move awfully
fast with this storage work, I think that is some kind of tradition.
There is a lot that can still be done to improve efficiency even before
going to redirect on write. Probably another doubling of throughput is
possible by straightforward techniques such as batching up transfers
better and more improvements to the journalling code, or replacement
of the journal by a logging technique.
Regards,
Daniel
next prev parent reply other threads:[~2008-08-15 8:43 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-08-04 8:22 [PATCH RFC] dm snapshot: shared exception store FUJITA Tomonori
2008-08-06 19:14 ` Mikulas Patocka
2008-08-09 5:01 ` FUJITA Tomonori
2008-08-11 22:12 ` Mikulas Patocka
2008-08-11 23:34 ` FUJITA Tomonori
2008-08-12 0:15 ` Steve VanDeBogart
2008-08-12 12:30 ` Daniel Phillips
2008-08-14 0:14 ` Daniel Phillips
2008-08-15 8:17 ` FUJITA Tomonori
2008-08-15 8:43 ` Daniel Phillips [this message]
2008-08-15 9:25 ` FUJITA Tomonori
2008-08-16 20:14 ` Daniel Phillips
2008-08-15 13:53 ` Ryusuke Konishi
2008-08-12 12:56 ` Daniel Phillips
2008-08-12 13:14 ` FUJITA Tomonori
2008-08-12 19:00 ` Daniel Phillips
2008-08-12 23:24 ` FUJITA Tomonori
2008-08-12 23:29 ` FUJITA Tomonori
2008-08-13 0:28 ` Daniel Phillips
2008-08-12 19:07 ` Daniel Phillips
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=200808150143.55776.phillips@phunq.net \
--to=phillips@phunq.net \
--cc=agk@redhat.com \
--cc=dm-devel@redhat.com \
--cc=fujita.tomonori@lab.ntt.co.jp \
--cc=zumastor@googlegroups.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.