From: Daniel Lezcano <dlezcano-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org>
To: "Eric W. Biederman" <ebiederm-aS9lmoZGLiVWk0Htik3J/w@public.gmane.org>
Cc: Linux Containers <containers-qjLDD68F18O7TbgM5vRIOg@public.gmane.org>
Subject: Re: Checkpoint/Restart mini-summit
Date: Thu, 17 Jul 2008 18:15:39 +0200
Message-ID: <487F702B.4070706@fr.ibm.com>
In-Reply-To: <m1prpfc7hj.fsf-B27657KtZYmhTnVgQlOflh2eb7JE58TQ@public.gmane.org>
Eric W. Biederman wrote:
> Daniel Lezcano <dlezcano-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org> writes:
>
>> Hi all,
>>
>> Here is a proposal for a more detailed agenda for the checkpoint/restart
>> mini-summit. If everybody is ok with it, I will update the wiki.
>>
>> Comments are welcome :)
>
> A reading list is useful, even to help get some ideas circulating
> before we get there.
>
> Ultimately the technical details will need to be resolved by
> people discussing things and sending patches back and forth
> on the mailing lists.
>
> I don't think a detailed agenda is going to get us anywhere.
> Especially not one focused on the implementation details.
>
> I think we need to start by seeing what we can agree on. Certainly we
> agree that checkpoint/restart needs to be part of the picture. What
> are the problems that the linux community can solve with
> checkpoint/restart?
>
> Then we need to talk about what kind of implementation we want to
> merge into mainline. How do we sell it, and how do we implement
> it without affecting long term maintainability?
>
> I think the granularity of our operations, and what state we
> save is important. I don't think how we save it is important
> unless it affects one of our requirements.
>
> As for the POSIX draft and the historical Cray & SGI implementations:
> they were on the wrong track. They did not have namespace support
> so they could not in general restore their checkpoints.
>
> There are also a lot of things you have failed to touch on, that
> I'm not going to go into now.
>
> With any luck the mini-summit before OLS will be the start of a
> conversation that will go on all week, and continue on the mailing
> lists.
>
> The real question is how do we coordinate our efforts to build a good
> linux checkpoint/restart implementation.
>
>> * Documentation
>> * Zap : www.ncl.cs.columbia.edu/publications/usenix2007_fordist.pdf
>> * Metacluster : lxc.sourceforge.net/doc/ols2006/lxc-ols2006.pdf
>> * OpenVZ : http://wiki.openvz.org/Checkpointing_and_live_migration
>> * Checkpoint/Restart technology :
>> http://en.wikipedia.org/wiki/Application_checkpointing
>> * Virtual Servers and Checkpoint/Restart in Mainstream Linux : Sigops
>> document
>
> There is also the classic emacs undump.
> The very simple vmadump from bproc.
Thanks, Eric, for all your comments. I agree the agenda is a little big; I
will reduce it and add the points you raised. I also have other points
from Oren that I will add; perhaps that will cover more aspects of the
discussion.
-- Daniel