Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Question on MPMD runs
From: Ralph Castain (rhc_at_[hidden])
Date: 2013-05-30 10:18:03

There is such an option in the 1.7 series and on the trunk, but I don't see it in v1.6.

-mca orte_abort_non_zero_exit 0

On May 30, 2013, at 3:40 AM, Victor Vysotskiy <Victor.Vysotskiy_at_[hidden]> wrote:

> Dear OpenMPI Developers and Users,
> I have general question on signal trapping/handling within mpiexec/mpirun. Let me assume that I have 2 cores and I start two different (independent) prog1 and prog2 programs in parallel via the mpirun/mpiexec strartup command:
> mpiexec -n 1 prog1 : -n 1 prog2
> What happens if one of the programs just is abnormally crashed/terminated while the second one is still running normally? Is it correct observation that in such case the OpenMPI immediately starts a cleanup process and automatically terminates all spawned/running jobs? If it is like that, is there any way to force mpiexec/mpirun to don't cleanup all processes on error and wait until all spawned processes either successfully complete or abnormally terminate their execution?
> Thank you in advance!
> Victor.
> _______________________________________________
> users mailing list
> users_at_[hidden]