Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] job abort on MPI task exit
From: Ralph Castain (rhc_at_[hidden])
Date: 2008-10-27 09:15:17


This was added to the 1.3 version - it was not back-ported to the
1.2.x series.

Ralph

On Oct 27, 2008, at 5:46 AM, David Singleton wrote:

>
> Apologies if this has been covered in a previous thread - I
> went back through a lot of posts without seeing anything
> similar.
>
> In an attempt to protect some users from themselves, I was hoping
> that OpenMPI could be configured so that an MPI task calling
> exit before calling MPI_Finalize() would cause job cleanup, i.e.
> behave effectively as if MPI_Abort() was called. The reason is
> that many users don't realise they need to use MPI_Abort()
> instead of Fortran stop or C exit. If exit is called, all
> other processes get stuck in the next blocking call and, for a
> large walltime limit batch job, that can be a real waste of
> resources.
>
> I think LAM terminated the job if a task exited with non-zero
> exit status or due to a signal. OpenMPI appears to clean up
> only in the case of a signalled task. Ideally, any exit before
> MPI_Finalize() should be terminal. Why is this not the case?
>
> Thanks,
> David