Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] checkpointing
From: Ifeanyi (ifeanyeg2012_at_[hidden])
Date: 2012-06-14 08:12:48


Hi

Please help.

I have installed openmpi-1.6, I have also tested the installation with
different mpi applications and my application executed successfully.

Whenever I ran NPB-3.3 LU without checkpointing, NPB-3.3 completes
successfully.
however whenever I checkpointing the application, it aborts without
checkpointing with the following error

"mpirun noticed that process rank 1 with PID 1048 on node node1 exited on
signal 10 (User defined signal 1).
--------------------------------------------------------------------------
2 total processes killed (some possibly by mpirun during cleanup)"

However, when I ran HPL and checkpoint - checkpointing was successfully
completed as well as the application.
I have tried to resolved this without success.

Please I need assistance - I am new user of open mpi.

Regards,
Ifeanyi