Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

Subject: Re: [OMPI devel] CRS/CRIU: add code to actually checkpoint a process
From: Josh Hursey (jjhursey_at_[hidden])
Date: 2014-02-17 16:45:49


It look fine except that the restart state is not flagged. When a process
is restarted does it resume execution inside the criu_dump() function? If
so, is there a way to tell from its return code (or some other mechanism)
that it is being restarted versus continuing after checkpointing?

On Mon, Feb 17, 2014 at 2:00 PM, Ralph Castain <rhc_at_[hidden]> wrote:

> Great - looks fine to me!!
>
>
> On Feb 17, 2014, at 11:39 AM, Adrian Reber <adrian_at_[hidden]> wrote:
>
> > I have prepared a patch I would like to commit which adds to code to
> > actually checkpoint a process. Thanks for the pointers about the string
> > variables I tried to do implement it correctly.
> >
> > CRIU currently has problems with the new OOB usock but I will contact
> > the CRIU developers about this error. Using tcp, checkpointing works.
> >
> > CRIU also has problems with --np > 1, but I am sure this can also be
> > resolved.
> >
> > The patch is at:
> >
> >
> https://lisas.de/git/?p=open-mpi.git;a=commitdiff;h=89c9c27c87598706e8f798f84fe9520ee5884492
> >
> > Adrian
> > _______________________________________________
> > devel mailing list
> > devel_at_[hidden]
> > http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
> _______________________________________________
> devel mailing list
> devel_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>

-- 
Joshua Hursey
Assistant Professor of Computer Science
University of Wisconsin-La Crosse
http://cs.uwlax.edu/~jjhursey