When you receive that callback the MPI has ben put in a quiescent state. As such it does not allow MPI communication until the checkpoint is completely finished. So you cannot call barrier in the checkpoint callback. Since Open MPI did doing a coordinated checkpoint, you can assume that all processes are calling the same callback at about the same time (the coordination algorithm synchronizes them for you)
Dear Group,I wanted to do a synchronization check with 'MPI_Barrier(MPI_COMM_WORLD)' in 'opal_crs_self_user_checkpoint(char **restart_cmd)' call. Although every process is present in this call, it fails to synchronize. Is there any reason why cant we use barrier?Thanks in advance.Kind regards,Faisal
_______________________________________________
users mailing list
users@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users