From: Daniel Lezcano <dlezcano-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org>
To: "Eric W. Biederman" <ebiederm-aS9lmoZGLiVWk0Htik3J/w@public.gmane.org>
Cc: Linux Containers <containers-qjLDD68F18O7TbgM5vRIOg@public.gmane.org>
Subject: Re: Checkpoint/Restart mini-summit
Date: Thu, 17 Jul 2008 18:15:39 +0200
Message-ID: <487F702B.4070706@fr.ibm.com>
In-Reply-To: <m1prpfc7hj.fsf-B27657KtZYmhTnVgQlOflh2eb7JE58TQ@public.gmane.org>
Eric W. Biederman wrote:
> Daniel Lezcano <dlezcano-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org> writes:
>
>> Hi all,
>>
>> Here is a proposal for a more detailed agenda for the checkpoint/restart
>> mini-summit. If everybody is ok with it, I will update the wiki.
>>
>> Comments are welcome :)
>
> A reading list is useful, even to help get some ideas circulating
> before we get there.
>
> Ultimately the technical details will need to be resolved by
> people discussing things and sending patches back and forth
> on the mailing lists.
>
> I don't think a detailed agenda is going to get us anywhere.
> Especially not one focused on the implementation details.
>
> I think we need to start by seeing what we can agree on. Certainly we
> agree that checkpoint/restart needs to be part of the picture. What
> are the problems that the linux community can solve with
> checkpoint/restart?
>
> Then we need to talk about what kind of implementation we want to
> merge into mainline. How do we sell it, and how do we implement
> it without affecting long term maintainability.
>
> I think the granularity of our operations, and what state we
> save is important. I don't think how we save it is important
> unless it affects one of our requirements.
>
> As for the POSIX draft and the historical Cray & SGI implementations:
> they were on the wrong track. They did not have namespace support
> so they could not in general restore their checkpoints.
>
> There are also a lot of things you have failed to touch on, that
> I'm not going to go into now.
>
> With any luck the mini-summit before OLS will be the start of a
> conversation that will go on all week, and continue on the mailing
> lists.
>
> The real question is how do we coordinate our efforts to build a good
> linux checkpoint/restart implementation.
>
>> * Documentation
>> * Zap : www.ncl.cs.columbia.edu/publications/usenix2007_fordist.pdf
>> * Metacluster : lxc.sourceforge.net/doc/ols2006/lxc-ols2006.pdf
>> * OpenVZ : http://wiki.openvz.org/Checkpointing_and_live_migration
>> * Checkpoint/Restart technology :
>> http://en.wikipedia.org/wiki/Application_checkpointing
>> * Virtual Servers and Checkpoint/Restart in Mainstream Linux : Sigops
>> document
>
> There is also the classic emacs undump,
> and the very simple vmadump from bproc.

Thanks Eric for all your comments. I agree the agenda is a little big; I
will reduce it and add the points you raised. I also have other points
from Oren that I will add; perhaps they will cover more aspects of the
discussion.

-- Daniel