From: Daniel Lezcano
Subject: Re: [RFC v7][PATCH 0/9] Kernel based checkpoint/restart
Date: Wed, 22 Oct 2008 13:51:28 +0200
Message-ID: <48FF13C0.6000805@fr.ibm.com>
References: <1224481237-4892-1-git-send-email-orenl@cs.columbia.edu>
 <20081021122135.4bce362c.akpm@linux-foundation.org>
 <1224621667.1848.228.camel@nimitz>
 <20081022092024.GC12453@elte.hu>
In-Reply-To: <20081022092024.GC12453-X9Un+BFzKDI@public.gmane.org>
To: Ingo Molnar
Cc: Dave Hansen,
 containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org,
 hpa-YMNOUZJC4hwAvxtiuMwx3w@public.gmane.org,
 linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
 linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org,
 viro-RmSDqhL/yNMiFSDQTTA3OLVCufUGDwFn@public.gmane.org,
 linux-api-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
 Andrew Morton,
 torvalds-de/tnXTf+JLsfHDXvbKv3WD2FQJk+8+b@public.gmane.org,
 tglx-hfZtesqFncYOwBW4kG4KsQ@public.gmane.org
List-Id: linux-api@vger.kernel.org

Ingo Molnar wrote:
> * Dave Hansen wrote:
>
>> On Tue, 2008-10-21 at 12:21 -0700, Andrew Morton wrote:
>>> On Mon, 20 Oct 2008 01:40:28 -0400
>>> Oren Laadan wrote:
>>>> These patches implement basic checkpoint-restart [CR]. This version
>>>> (v7) supports basic tasks with simple private memory, and open files
>>>> (regular files and directories only).
>>> - how useful is this code as it stands in real-world usage?
>> Right now, an application must be specifically written to use these
>> new system calls. It must be a single process and not share any
>> resources with other processes. The only file descriptors that may be
>> open are simple files and may not include sockets or pipes.
>>
>> What this means in practice is that it is useful for a simple app
>> doing computational work.
>
> say a chemistry application doing calculations. Or a raytracer with a
> large job. Both can take many hours (days!) even on a very fast machine
> and the restrictions on rebootability can hurt in such cases.
>
> You should reach a minimal level of initial practical utility: say some
> helper tool that allows testers to checkpoint and restore a real PovRay
> session - without any modification to a stock distro PovRay.

The liblxc userspace tools do that.

http://sourceforge.net/projects/lxc/

They provide the lxc-checkpoint and lxc-restart commands, which can be
used to test Oren's patches with the external checkpoint Cedric did.
These commands are experimental and under development, so a hack may be
necessary for checkpoint/restart. I haven't tried them with Oren's
external checkpoint yet, but I think the commands should work.

These commands rely on the freezer, so the checkpoint command does
freeze, checkpoint, unfreeze (and kill if specified).

	lxc-create -n foo
	lxc-start -n foo mypovray
	lxc-checkpoint -s -n foo > myckptfile
	lxc-restart -n foo < myckptfile

Thanks
  -- Daniel
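
For illustration, the freeze, checkpoint, unfreeze (and optional kill)
ordering mentioned above could be sketched in shell roughly as follows.
This is only a sketch under assumptions, not what lxc-checkpoint actually
does internally: it assumes a cgroup-v1 style freezer group directory
exposing freezer.state and tasks files, and do_checkpoint is a purely
hypothetical stand-in for the real checkpoint step (it only dumps the task
list). All function names here are made up for the example.

```shell
#!/bin/sh
# Sketch only: freeze -> checkpoint -> unfreeze (-> optional kill).
# ASSUMPTION: $1 is a cgroup-v1 style freezer group directory that
# contains "freezer.state" and "tasks". All names are illustrative.

# Write the requested state (FROZEN or THAWED) to the freezer.
set_state() {
    printf '%s\n' "$2" > "$1/freezer.state"
}

# The freezer may take a moment to settle, so poll for FROZEN.
wait_frozen() {
    tries=0
    while [ "$(cat "$1/freezer.state")" != FROZEN ]; do
        tries=$((tries + 1))
        [ "$tries" -ge 10 ] && return 1
        sleep 1
    done
    return 0
}

# HYPOTHETICAL placeholder for the real checkpoint step: it only
# prints the task ids of the group to stdout.
do_checkpoint() {
    cat "$1/tasks"
}

# Send SIGKILL to every task id listed in the group.
kill_tasks() {
    while read -r pid; do
        kill -KILL "$pid" 2>/dev/null
    done < "$1/tasks"
    return 0
}

# usage: checkpoint_sequence <freezer-dir> <output-file> [yes|no]
checkpoint_sequence() {
    dir=$1
    out=$2
    kill_after=${3:-no}

    # 1. freeze; on failure thaw again and give up
    if ! set_state "$dir" FROZEN || ! wait_frozen "$dir"; then
        set_state "$dir" THAWED
        return 1
    fi

    # 2. checkpoint while nothing in the group can run
    rc=0
    do_checkpoint "$dir" > "$out" || rc=1

    # 3. unfreeze, whether or not the checkpoint succeeded
    set_state "$dir" THAWED

    # 4. kill only if requested and the checkpoint succeeded
    if [ "$rc" -eq 0 ] && [ "$kill_after" = yes ]; then
        kill_tasks "$dir"
    fi
    return "$rc"
}
```

With a freezer group mounted at some path (the path below is an
assumption and depends on where the freezer is mounted), it would be
called as

	checkpoint_sequence /cgroup/freezer/foo myckptfile yes

The group is thawed even when the checkpoint step fails, so that a failed
checkpoint does not leave the application stuck in the frozen state.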