All of lore.kernel.org
 help / color / mirror / Atom feed
* C/R minisummit notes
@ 2008-07-23 11:30 Daniel Lezcano
       [not found] ` <4887163F.5090801-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org>
  0 siblings, 1 reply; 20+ messages in thread
From: Daniel Lezcano @ 2008-07-23 11:30 UTC (permalink / raw)
  To: Linux Containers


  * What are the problems that the linux community can solve with the 
checkpoint/restart ?

	Eric Biederman reminds at the previous OLS nobody complained about the 
checkpoint/restart

	Pavel Emylianov : The startup of Oracle takes some minutes, if we 
checkpoint just after the startup, Oracle can be restarted from this 
point later and provide fast startup

	Oren Laaden : Time travel, we can do monotonic snapshot and go back on 
one of this snaphost.

	Eric Biedreman : Priority running, checkpoint/kill an application and 
run another application with a bigger priority

	Denis Lunev : Task migration, move application on one host to another host

	Daniel Lezcano : SSI (task migration)

  * Preparing the kernel internals

	OL : Can we implement a kernel module and move CR functionality into 
the kernel itself later ?

	EB : Better to add a little CR functionnality into the kernel itself 
and add more after.

	DLu : Problem with kernel version

	OL : Compatibility with intermediate kernel version should be possible 
with userspace conversion tools

	DLu : Non sequential file for checkpoint statefile is a challenge

	OL : yes, but possible and useful for compression/encryption

	We showed that there are five steps to realize a checkpoint:

	1 - Pre-dump
	2 - Freeze
	3 - Dump
	4 - Resume/kill
	5 - Post-dump

	At this point we state we want create a proof of concept and 
checkpoint/restart the simplest application.

	We will add iteratively more and more kernel resources.

	Process hierarchy created from kernel or userspace ?

	OL : Seems better to send a chunk of data to kernel and that restores 
the processes hierarchy
	PE : Agreed
	OL : We should be able to checkpoint from inside the container, keep 
that in mind for later.
	
	=> we need a syscall or a ioctl

	The first items to address before implementing the Checkpoint are:
	1 - Make a container object (the context)
	2 - Freeze the container (extend cgroup freezer ?)
	3 - syscall | ioctl

	First step:
		* simplest application : A single process, without any file, no 
checkpoint of text file (same file system for restart), no signals, no 
syscall in the application, no ipc/no msgq, no network

	Second step:
		* multiple processes + zombie state

	Third step:
		* files, pipe, signals, socketpair ?

	This proof of concept must came with a documentation describing what is 
supported, what is not supported and what we plan to do.

^ permalink raw reply	[flat|nested] 20+ messages in thread

end of thread, other threads:[~2008-07-26  7:32 UTC | newest]

Thread overview: 20+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-07-23 11:30 C/R minisummit notes Daniel Lezcano
     [not found] ` <4887163F.5090801-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org>
2008-07-23 14:20   ` Eric W. Biederman
2008-07-23 18:55   ` Oren Laadan
     [not found]     ` <48877EA7.1050206-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2008-07-23 20:18       ` Serge E. Hallyn
2008-07-23 20:23       ` [Devel] " Denis V. Lunev
2008-07-23 20:24       ` Daniel Lezcano
2008-07-23 21:18   ` Serge E. Hallyn
     [not found]     ` <20080723211818.GA10295-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2008-07-23 21:38       ` Oren Laadan
     [not found]         ` <4887A4CC.5070009-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2008-07-24  1:41           ` sukadev-r/Jw6+rmf7HQT0dZR+AlfA
     [not found]             ` <20080724014122.GA23105-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2008-07-24  3:26               ` Serge E. Hallyn
     [not found]                 ` <20080724032616.GB9839-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2008-07-24  9:58                   ` Eric W. Biederman
2008-07-24  9:55   ` C/R minisummit notes (namespace naming) Eric W. Biederman
     [not found]     ` <m1zlo7a9nq.fsf-B27657KtZYmhTnVgQlOflh2eb7JE58TQ@public.gmane.org>
2008-07-25 19:13       ` Serge E. Hallyn
     [not found]         ` <20080725191356.GE28136-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2008-07-25 19:26           ` Daniel Lezcano
     [not found]             ` <488A28E4.6080902-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org>
2008-07-25 19:34               ` Serge E. Hallyn
     [not found]                 ` <20080725193458.GA12356-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2008-07-25 19:52                   ` Oren Laadan
2008-07-25 20:09                   ` Daniel Lezcano
     [not found]                     ` <488A32FC.7020803-NmTC/0ZBporQT0dZR+AlfA@public.gmane.org>
2008-07-26  7:32                       ` Eric W. Biederman
2008-07-24 20:28   ` C/R minisummit notes Oren Laadan
     [not found]     ` <4888E5D3.807-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2008-07-25  2:14       ` Daniel Lezcano

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.