Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Open MPI Checkpoint Restart
From: Ralph Castain (rhc_at_[hidden])
Date: 2013-05-31 20:24:42


Did you check the items on the list given in the error? I'm no expert on ompi-checkpoint, but the error means that one of those conditions isn't being met.

On May 31, 2013, at 4:54 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]> wrote:

> Hi Ralph,
>
> Thanks for the help. The path and ld_path were not set to the correct location. I was able to execute the ompi-checkpoint command. But, I got the following error.
>
> [ndesai_at_vcainternmpi01 ~]$ ompi-checkpoint 1803
> --------------------------------------------------------------------------
> Error: Unable to find the requested, active MPIRUN process on this machine.
> This could be due to one of the following:
> - The jobid specified by the '--hnp-jobid' option is not
> correct.
> - The PID specified (1803) is not that of an active MPIRUN.
> - The application with this PID is not checkpointable
> - The application with this PID is not an Open MPI application.
> - The session directory location could not be parsed.
> ompi-checkpoint attempted to use the session directory:
> /tmp/openmpi-sessions-ndesai_at_vcainternmpi01_0
> Thanks,
> Neel.
>
> On Fri, May 31, 2013 at 4:34 PM, Ralph Castain <rhc_at_[hidden]> wrote:
> Check that your path and ld_library_path are set to point to the directory where you installed the version you built (the --prefix=<> you provided).
>
> On May 31, 2013, at 4:31 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]> wrote:
>
>> Hi Ralph,
>>
>> I did install open mpi with the --with-ft=cr option.
>>
>> Thanks,
>> Neel.
>>
>> On Fri, May 31, 2013 at 4:25 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>> Okay, it should work it that version. It sounds like you didn't configure OMPI with the --with-ft=cr option - yes? Take a look at "./configure -h" for the ft-related options and ensure you build what you need. C/R support is not built by default.
>>
>>
>> On May 31, 2013, at 3:59 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]> wrote:
>>
>>> Open MPI 1.5.4
>>>
>>> On Fri, May 31, 2013 at 3:31 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>>> What OMPI version?
>>>
>>> On May 31, 2013, at 3:17 PM, Neel Sunil Desai <Neel.Desai_at_[hidden]> wrote:
>>>
>>> > Hi,
>>> >
>>> > I forgot to add. I watched the video of Joshua Hursey and when I type ompi_info | grep FT, I get FT Checkpoint Support: no ( checkpoint thread : no). I do not get anything when I type ompi_info | grep crs.
>>> >
>>> > Thanks,
>>> > Neel.
>>> > _______________________________________________
>>> > users mailing list
>>> > users_at_[hidden]
>>> > http://www.open-mpi.org/mailman/listinfo.cgi/users
>>>
>>>
>>
>>
>
>