Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Open MPI Checkpoint Restart
From: Neel Sunil Desai (Neel.Desai_at_[hidden])
Date: 2013-05-31 19:54:32


Hi Ralph,

Thanks for the help. The path and ld_path were not set to the correct
location. I was able to execute the ompi-checkpoint command. But, I got the
following error.

[ndesai_at_vcainternmpi01 ~]$ ompi-checkpoint 1803
--------------------------------------------------------------------------
Error: Unable to find the requested, active MPIRUN process on this machine.
       This could be due to one of the following:
        - The jobid specified by the '--hnp-jobid' option is not
          correct.
        - The PID specified (1803) is not that of an active MPIRUN.
        - The application with this PID is not checkpointable
        - The application with this PID is not an Open MPI application.
        - The session directory location could not be parsed.
       ompi-checkpoint attempted to use the session directory:
         /tmp/openmpi-sessions-ndesai_at_vcainternmpi01_0
Thanks,
Neel.

On Fri, May 31, 2013 at 4:34 PM, Ralph Castain <rhc_at_[hidden]> wrote:

> Check that your path and ld_library_path are set to point to the directory
> where you installed the version you built (the --prefix=<> you provided).
>
> On May 31, 2013, at 4:31 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]>
> wrote:
>
> Hi Ralph,
>
> I did install open mpi with the --with-ft=cr option.
>
> Thanks,
> Neel.
>
> On Fri, May 31, 2013 at 4:25 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>
>> Okay, it should work it that version. It sounds like you didn't configure
>> OMPI with the --with-ft=cr option - yes? Take a look at "./configure -h"
>> for the ft-related options and ensure you build what you need. C/R support
>> is not built by default.
>>
>>
>> On May 31, 2013, at 3:59 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]>
>> wrote:
>>
>> Open MPI 1.5.4
>>
>> On Fri, May 31, 2013 at 3:31 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>>
>>> What OMPI version?
>>>
>>> On May 31, 2013, at 3:17 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]>
>>> wrote:
>>>
>>> > Hi,
>>> >
>>> > I forgot to add. I watched the video of Joshua Hursey and when I type
>>> ompi_info | grep FT, I get FT Checkpoint Support: no ( checkpoint thread :
>>> no). I do not get anything when I type ompi_info | grep crs.
>>> >
>>> > Thanks,
>>> > Neel.
>>> > _______________________________________________
>>> > users mailing list
>>> > users_at_[hidden]
>>> > http://www.open-mpi.org/mailman/listinfo.cgi/users
>>>
>>>
>>
>>
>
>