From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Serge E. Hallyn" Subject: Re: [RFC v17][PATCH 00/60] Kernel based checkpoint/restart Date: Fri, 24 Jul 2009 14:09:53 -0500 Message-ID: <20090724190953.GA22641@us.ibm.com> References: <1248256822-23416-1-git-send-email-orenl@librato.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Content-Disposition: inline In-Reply-To: <1248256822-23416-1-git-send-email-orenl-RdfvBDnrOixBDgjK7y7TUQ@public.gmane.org> Sender: linux-api-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org To: Oren Laadan Cc: Andrew Morton , Linus Torvalds , containers-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org, linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, linux-mm-Bw31MaZKKs3YtjvyW6yDsg@public.gmane.org, linux-api-u79uwXL29TY76Z2rM5mHXA@public.gmane.org, Dave Hansen , Ingo Molnar , "H. Peter Anvin" , Alexander Viro , Pavel Emelyanov , Alexey Dobriyan List-Id: linux-api@vger.kernel.org Quoting Oren Laadan (orenl-RdfvBDnrOixBDgjK7y7TUQ@public.gmane.org): > Application checkpoint/restart (c/r) is the ability to save the state > of a running application so that it can later resume its execution > from the time at which it was checkpointed, on the same or a different > machine. > > This version introduces 'clone_with_pids()' syscall to preset pid(s) > for a child process. It is used by restart(2) to recreate process > hierarchy with the same pids as at checkpoint time. > > It also adds a freezer state CHECKPOINTING to safeguard processes > during a checkpoint. Other important changes include support for > threads and zombies, credentials, signal handling, and improved > restart logic. See below for a more detailed changelog. > > Compiled and tested against v2.6.31-rc3. With the s390 patch I recently sent on top of this set, all of my c/r tests pass, and ltp behaves the same as on plain v2.6.31-rc3 (up to and including hanging on mallocstress). -serge -- To unsubscribe from this list: send the line "unsubscribe linux-api" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753988AbZGXTKD (ORCPT ); Fri, 24 Jul 2009 15:10:03 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753833AbZGXTKB (ORCPT ); Fri, 24 Jul 2009 15:10:01 -0400 Received: from e39.co.us.ibm.com ([32.97.110.160]:60645 "EHLO e39.co.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753617AbZGXTKA (ORCPT ); Fri, 24 Jul 2009 15:10:00 -0400 Date: Fri, 24 Jul 2009 14:09:53 -0500 From: "Serge E. Hallyn" To: Oren Laadan Cc: Andrew Morton , Linus Torvalds , containers@lists.linux-foundation.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-api@vger.kernel.org, Dave Hansen , Ingo Molnar , "H. Peter Anvin" , Alexander Viro , Pavel Emelyanov , Alexey Dobriyan Subject: Re: [RFC v17][PATCH 00/60] Kernel based checkpoint/restart Message-ID: <20090724190953.GA22641@us.ibm.com> References: <1248256822-23416-1-git-send-email-orenl@librato.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1248256822-23416-1-git-send-email-orenl@librato.com> User-Agent: Mutt/1.5.18 (2008-05-17) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Quoting Oren Laadan (orenl@librato.com): > Application checkpoint/restart (c/r) is the ability to save the state > of a running application so that it can later resume its execution > from the time at which it was checkpointed, on the same or a different > machine. > > This version introduces 'clone_with_pids()' syscall to preset pid(s) > for a child process. It is used by restart(2) to recreate process > hierarchy with the same pids as at checkpoint time. > > It also adds a freezer state CHECKPOINTING to safeguard processes > during a checkpoint. Other important changes include support for > threads and zombies, credentials, signal handling, and improved > restart logic. See below for a more detailed changelog. > > Compiled and tested against v2.6.31-rc3. With the s390 patch I recently sent on top of this set, all of my c/r tests pass, and ltp behaves the same as on plain v2.6.31-rc3 (up to and including hanging on mallocstress). -serge From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail202.messagelabs.com (mail202.messagelabs.com [216.82.254.227]) by kanga.kvack.org (Postfix) with ESMTP id 828D36B0088 for ; Fri, 24 Jul 2009 15:09:59 -0400 (EDT) Received: from d03relay04.boulder.ibm.com (d03relay04.boulder.ibm.com [9.17.195.106]) by e37.co.us.ibm.com (8.14.3/8.13.1) with ESMTP id n6OJ9B4V006232 for ; Fri, 24 Jul 2009 13:09:11 -0600 Received: from d03av02.boulder.ibm.com (d03av02.boulder.ibm.com [9.17.195.168]) by d03relay04.boulder.ibm.com (8.13.8/8.13.8/NCO v9.2) with ESMTP id n6OJ9wdC189982 for ; Fri, 24 Jul 2009 13:09:59 -0600 Received: from d03av02.boulder.ibm.com (loopback [127.0.0.1]) by d03av02.boulder.ibm.com (8.12.11.20060308/8.13.3) with ESMTP id n6OJ9wER017927 for ; Fri, 24 Jul 2009 13:09:58 -0600 Date: Fri, 24 Jul 2009 14:09:53 -0500 From: "Serge E. Hallyn" Subject: Re: [RFC v17][PATCH 00/60] Kernel based checkpoint/restart Message-ID: <20090724190953.GA22641@us.ibm.com> References: <1248256822-23416-1-git-send-email-orenl@librato.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1248256822-23416-1-git-send-email-orenl@librato.com> Sender: owner-linux-mm@kvack.org To: Oren Laadan Cc: Andrew Morton , Linus Torvalds , containers@lists.linux-foundation.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, linux-api@vger.kernel.org, Dave Hansen , Ingo Molnar , "H. Peter Anvin" , Alexander Viro , Pavel Emelyanov , Alexey Dobriyan List-ID: Quoting Oren Laadan (orenl@librato.com): > Application checkpoint/restart (c/r) is the ability to save the state > of a running application so that it can later resume its execution > from the time at which it was checkpointed, on the same or a different > machine. > > This version introduces 'clone_with_pids()' syscall to preset pid(s) > for a child process. It is used by restart(2) to recreate process > hierarchy with the same pids as at checkpoint time. > > It also adds a freezer state CHECKPOINTING to safeguard processes > during a checkpoint. Other important changes include support for > threads and zombies, credentials, signal handling, and improved > restart logic. See below for a more detailed changelog. > > Compiled and tested against v2.6.31-rc3. With the s390 patch I recently sent on top of this set, all of my c/r tests pass, and ltp behaves the same as on plain v2.6.31-rc3 (up to and including hanging on mallocstress). -serge -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org