All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Serge E. Hallyn" <serue-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
To: Cedric Le Goater <legoater-GANU6spQydw@public.gmane.org>
Cc: Containers <containers-qjLDD68F18O7TbgM5vRIOg@public.gmane.org>,
	Sukadev Bhattiprolu
	<sukadev-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>,
	"David C. Hansen"
	<haveblue-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>,
	"Eric W. Biederman"
	<ebiederm-aS9lmoZGLiVWk0Htik3J/w@public.gmane.org>,
	Dave Hansen
	<dave-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org>
Subject: Re: [PATCH 0/6] /proc/pid/checkpointable
Date: Thu, 26 Mar 2009 08:29:12 -0500	[thread overview]
Message-ID: <20090326132912.GC13639@us.ibm.com> (raw)
In-Reply-To: <49CB504A.2080400-GANU6spQydw@public.gmane.org>

Quoting Cedric Le Goater (legoater-GANU6spQydw@public.gmane.org):
> Serge E. Hallyn wrote:
> > Quoting Eric W. Biederman (ebiederm-aS9lmoZGLiVWk0Htik3J/w@public.gmane.org):
> >> Dave Hansen <dave-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org> writes:
> >>
> >>> On Wed, 2009-03-18 at 13:03 -0700, Mike Waychison wrote:
> >>>> Polluting the dmesg buffer with messages from common failures (consider 
> >>>> a multi-user cluster where checkpoints may or may not succeed) isn't 
> >>>> very useful.
> >>> Yeah, I've already gotten an earful from Serge and Dan S. about this. :)
> >>>
> >>> Serge suggested that, perhaps, the audit framework could be used.  We
> >>> might also use an ftrace buffer if we want to keep a whole ton of
> >>> messages around, too.
> >>>
> >>> dmesg is definitely not workable long-term at all.
> >> How about having place holder objects in the generated checkpoint.
> >> Then instead of having a failure you have a non-restoreable checkpoint.
> >> But you know which fd, or which mmaped region, or which other thing
> >> is causing the problem and if you want more information you can
> >> look at that resource.
> >>
> >> That gives user space the freedom and scrub out the non-checkpointable
> >> bits and replace them with something like /dev/null so that we can
> >> continue on and restore the checkpoint anyway, if we think our
> >> app can cope with some things going away.
> >>
> >> Eric
> > 
> > I like this idea.
> 
> yes. This is something required to replace stdios for example, when 
> you execute an application under ssh, checkpoint and then restart on 
> an other host. This a topical scenario for a batch manager in an HPC 
> environment. 
> 
> identified resources of the container are tracked to be ignored by 
> checkpoint and to be replaced by similar ones at restart.

So in that case how are the resources identified?  Does the user
specify them at checkpoint?  Do you look for specific strings
(/dev/pts/*) at restart?

-serge

  parent reply	other threads:[~2009-03-26 13:29 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-03-17  6:27 [PATCH 0/6] /proc/pid/checkpointable Sukadev Bhattiprolu
     [not found] ` <20090317062754.GA2377-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-17  6:38   ` Sukadev Bhattiprolu
2009-03-17  6:38   ` Sukadev Bhattiprolu
2009-03-17  6:39   ` Sukadev Bhattiprolu
2009-03-17  6:39   ` Sukadev Bhattiprolu
2009-03-17  6:39   ` Sukadev Bhattiprolu
     [not found]     ` <20090317063940.GF2377-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-18  8:55       ` Oren Laadan
     [not found]         ` <49C0B6FF.5030104-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-18 13:59           ` Serge E. Hallyn
     [not found]             ` <20090318135953.GE22636-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-18 16:16               ` Oren Laadan
     [not found]                 ` <49C11E61.4010505-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-18 16:24                   ` Dave Hansen
2009-03-18 17:48                     ` Oren Laadan
     [not found]                       ` <49C133F9.2020505-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-18 18:06                         ` Dave Hansen
2009-03-18 16:23               ` Oren Laadan
     [not found]                 ` <49C1201A.3050604-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-18 17:18                   ` Serge E. Hallyn
     [not found]                     ` <20090318171840.GA29523-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-18 17:50                       ` Oren Laadan
     [not found]                         ` <49C1347F.3000601-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-18 20:03                           ` Mike Waychison
     [not found]                             ` <49C153AF.7070504-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2009-03-18 20:13                               ` Dave Hansen
2009-03-25 12:25                                 ` Eric W. Biederman
     [not found]                                   ` <m17i2dx00b.fsf-+imSwln9KH6u2/kzUuoCbdi2O/JbrIOy@public.gmane.org>
2009-03-25 17:29                                     ` Serge E. Hallyn
     [not found]                                       ` <20090325172938.GA18957-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-26  9:52                                         ` Cedric Le Goater
     [not found]                                           ` <49CB504A.2080400-GANU6spQydw@public.gmane.org>
2009-03-26 13:29                                             ` Serge E. Hallyn [this message]
2009-03-18 14:42           ` Dave Hansen
2009-03-17  6:39   ` Sukadev Bhattiprolu
     [not found]     ` <20090317063958.GG2377-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-18  8:56       ` Oren Laadan
     [not found]         ` <49C0B750.4050109-eQaUEPhvms7ENvBUuze7eA@public.gmane.org>
2009-03-18 13:53           ` Serge E. Hallyn
2009-03-17  6:55   ` Sukadev Bhattiprolu
  -- strict thread matches above, loose matches on Subject: below --
2009-03-17 17:43 Sukadev Bhattiprolu
     [not found] ` <20090317174359.GA10796-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org>
2009-03-18  9:28   ` Oren Laadan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090326132912.GC13639@us.ibm.com \
    --to=serue-r/jw6+rmf7hqt0dzr+alfa@public.gmane.org \
    --cc=containers-qjLDD68F18O7TbgM5vRIOg@public.gmane.org \
    --cc=dave-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org \
    --cc=ebiederm-aS9lmoZGLiVWk0Htik3J/w@public.gmane.org \
    --cc=haveblue-r/Jw6+rmf7HQT0dZR+AlfA@public.gmane.org \
    --cc=legoater-GANU6spQydw@public.gmane.org \
    --cc=sukadev-23VcF4HTsmIX0ybBhKVfKdBPR1lH4CV8@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.