Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] MPI_Abort under slurm
From: Ralph Castain (rhc_at_[hidden])
Date: 2013-02-25 15:29:40


On Feb 25, 2013, at 10:38 AM, Bokassa <bokassa_at_[hidden]> wrote:

> Hi,
> I noticed that MPI_Abort() does not abort the tasks if the mpi program is started using srun.
> I call MPI_Abort() from rank 0, this process exit, but the other ranks keep running or waiting for IO
> on the other nodes. The only way to kill the job is to use scancel.
> However if I use mpirun under a slurm allocation then MPI_Abort() works as expected aborting
> all tasks.
>
> Is this a known issue?

What version of OMPI are you using? Slurm should detect the process failure and kill the job, unless it was configured not to do so.

>
> Thanks, David
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users