From: Cedric Le Goater <legoater@free.fr>
To: Nathan Lynch <ntl@pobox.com>
Cc: containers@lists.osdl.org, Oren Laadan <orenl@cs.columbia.edu>,
linuxppc-dev@ozlabs.org
Subject: Re: [PATCH 1/3] powerpc: bare minimum checkpoint/restart implementation
Date: Tue, 17 Mar 2009 07:55:37 +0100 [thread overview]
Message-ID: <49BF4969.3080308@free.fr> (raw)
In-Reply-To: <20090316133745.4f636979@thinkcentre.lan>
> Again, how would 'cr' obtain exit status for these tasks, and how would
> it distinguish failure from normal operation?
Here's our solution to this issue.
mcr maintains in its kernel container object an exitcode attribute for
the mcr-restart process. This process is detached from the fork tree of
the restarted application.
when the restart is finished, an mcr-wait command can be called to reap
this exitcode. This make it possible to distinguish an exit of the
application process from an exit of the mcr-restart process.
This is a must-have for batch managers in an HPC environment.
Cheers,
C.
next prev parent reply other threads:[~2009-03-17 6:55 UTC|newest]
Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <1233182478-27113-1-git-send-email-ntl@pobox.com>
[not found] ` <1233182478-27113-1-git-send-email-ntl-e+AXbWqSrlAAvxtiuMwx3w@public.gmane.org>
2009-01-28 22:41 ` [PATCH 1/3] powerpc: bare minimum checkpoint/restart implementation Nathan Lynch
2009-01-30 3:55 ` Serge E. Hallyn
[not found] ` <1233182478-27113-2-git-send-email-ntl-e+AXbWqSrlAAvxtiuMwx3w@public.gmane.org>
2009-01-29 6:41 ` Oren Laadan
2009-01-29 21:40 ` Nathan Lynch
[not found] ` <49814FA2.9060108-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-01-29 21:40 ` Nathan Lynch
2009-01-30 4:01 ` Serge E. Hallyn
[not found] ` <20090129214035.GB6913@localdomain>
2009-01-30 0:11 ` Oren Laadan
2009-01-30 0:11 ` Oren Laadan
[not found] ` <49824599.5030503@cs.columbia.edu>
[not found] ` <49824599.5030503-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-01-30 20:25 ` Nathan Lynch
2009-01-30 20:25 ` Nathan Lynch
2009-02-17 7:03 ` Nathan Lynch
2009-02-17 7:03 ` Nathan Lynch
[not found] ` <20090217010355.58afd5cf@thinkcentre.lan>
2009-02-24 19:58 ` Serge E. Hallyn
2009-02-24 21:11 ` Nathan Lynch
[not found] ` <20090224151152.29e98b5f-4v5LP+xe+1byhTdZtsIeww@public.gmane.org>
2009-03-13 3:36 ` Oren Laadan
[not found] ` <20090217010355.58afd5cf-4v5LP+xe+1byhTdZtsIeww@public.gmane.org>
2009-02-17 20:02 ` [PATCH 1/3 v2] powerpc: heckpoint/restart implementation Nathan Lynch
2009-03-13 3:31 ` [PATCH 1/3] powerpc: bare minimum checkpoint/restart implementation Oren Laadan
[not found] ` <49B9D37A.1070503-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-13 15:42 ` Cedric Le Goater
2009-03-16 18:37 ` Nathan Lynch
2009-03-17 6:55 ` Cedric Le Goater [this message]
2009-03-18 9:15 ` Oren Laadan
2009-01-30 4:01 ` Serge E. Hallyn
2009-01-30 3:55 ` Serge E. Hallyn
2009-02-04 3:39 ` Benjamin Herrenschmidt
2009-02-04 3:39 ` Benjamin Herrenschmidt
[not found] ` <1233718789.16867.156.camel@pasglop>
2009-02-04 15:54 ` Serge E. Hallyn
[not found] ` <20090204155406.GA2039-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-02-04 20:58 ` Benjamin Herrenschmidt
2009-02-04 20:58 ` Benjamin Herrenschmidt
[not found] ` <1233781099.4612.1.camel@pasglop>
2009-02-04 23:44 ` Oren Laadan
2009-02-04 23:44 ` Oren Laadan
[not found] ` <498A284E.4050501@cs.columbia.edu>
[not found] ` <498A284E.4050501-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-02-05 0:16 ` Benjamin Herrenschmidt
2009-02-05 0:16 ` Benjamin Herrenschmidt
[not found] ` <1233793012.4612.32.camel@pasglop>
2009-02-05 3:30 ` Oren Laadan
2009-02-05 3:30 ` Oren Laadan
2009-02-05 16:09 ` Serge E. Hallyn
2009-02-05 16:09 ` Serge E. Hallyn
[not found] ` <20090205160946.GF27410@us.ibm.com>
2009-02-05 21:01 ` Benjamin Herrenschmidt
[not found] ` <20090205160946.GF27410-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-02-05 21:01 ` Benjamin Herrenschmidt
2009-01-28 22:41 ` [PATCH 2/3] powerpc: wire up checkpoint and restart syscalls Nathan Lynch
2009-01-28 22:41 ` [PATCH 3/3] allow checkpoint/restart on powerpc Nathan Lynch
2009-01-28 22:41 ` Nathan Lynch
[not found] ` <1233182478-27113-4-git-send-email-ntl@pobox.com>
[not found] ` <1233182478-27113-4-git-send-email-ntl-e+AXbWqSrlAAvxtiuMwx3w@public.gmane.org>
2009-01-30 4:10 ` Serge E. Hallyn
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=49BF4969.3080308@free.fr \
--to=legoater@free.fr \
--cc=containers@lists.osdl.org \
--cc=linuxppc-dev@ozlabs.org \
--cc=ntl@pobox.com \
--cc=orenl@cs.columbia.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox