I'm Hiep, I'm trying to use checkpoint/restart feature in Open MPI. I had
read information about this feature in
and Open-MPI-FT-CR-Draft-v1.pdf. I had built Open MPI from "trunk" which
gotten by Subversion.
But I don't know how to enable checkpoint/restart fault tolerance in Open
So that, I get this error when I try this command: ompi-checkpoint.
bash: ompi-checkpoint: command not found
I want to ask you how to build and use checkpoint/restart feature in Open
Please tell me in details, I'm a new user.