Open MPI User's Mailing List Archives

Subject: [OMPI users] check point restart
From: Erik Nelson (nelsonerikd_at_[hidden])
Date: 2013-07-19 15:34:23

I run MPI on an NSF computer. One of the conditions of use is that jobs are
limited to a 24-hour duration to provide democratic allotment to its users.

A long program can require many restarts, so it becomes necessary to store
the state of the program in memory, print it, recompile, and read the state
back in to start again.
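
For illustration, the manual save-and-reload cycle described above might look
roughly like the sketch below. It is plain Python rather than MPI code, and
every name in it (the `rank` argument, the `state_rank<N>.json` file naming,
the `step` and `total` fields) is a hypothetical stand-in. In a real MPI job
each process would presumably write and read its own file in this way.

```python
import json
import os
import tempfile


def checkpoint_path(directory, rank):
    # Hypothetical naming scheme: one checkpoint file per MPI rank.
    return os.path.join(directory, f"state_rank{rank}.json")


def save_state(directory, rank, state):
    # Write to a temporary file first, then rename it into place, so a job
    # killed mid-write does not leave a truncated checkpoint behind.
    path = checkpoint_path(directory, rank)
    fd, tmp = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
    os.replace(tmp, path)


def load_state(directory, rank):
    # Return the saved state, or a fresh one if no checkpoint exists yet.
    path = checkpoint_path(directory, rank)
    if not os.path.exists(path):
        return {"step": 0, "total": 0.0}
    with open(path) as f:
        return json.load(f)


def run(directory, rank, max_steps, steps_per_job):
    # Resume from the last checkpoint and advance by at most steps_per_job
    # steps, standing in for one time-limited allocation.
    state = load_state(directory, rank)
    stop = min(max_steps, state["step"] + steps_per_job)
    while state["step"] < stop:
        state["total"] += state["step"]  # placeholder for the real work
        state["step"] += 1
    save_state(directory, rank, state)
    return state
```

Calling `run` repeatedly with the same directory picks up where the previous
call stopped, which is the behavior each resubmitted job needs. No recompile
is required, because the state is read from a file at startup.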

I seem to remember a simpler approach (checkpoint/restart?) in which the
state of the running executable is saved and then simply restarted from its
current position.

Is there something like this for restarting an MPI program?

Thanks, Erik

Erik Nelson
Howard Hughes Medical Institute
6001 Forest Park Blvd., Room ND10.124
Dallas, Texas 75235-9050
p : 214 645 5981
f : 214 645 5948