Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |  

This web mail archive is frozen.

This page is part of a frozen web archive of this mailing list.

You can still navigate around this archive, but know that no new mails have been added to it since July of 2016.

Click here to be taken to the new web archives of this list; it includes all the mails that are in this frozen archive plus all new mails that have been sent to the list since it was migrated to the new archives.

Subject: Re: [OMPI users] trouble using openmpi under slurm
From: Ralph Castain (rhc_at_[hidden])
Date: 2010-07-07 13:26:55


I'm afraid the bottom line is that OMPI simply doesn't support core-level allocations. I tried it on a slurm machine available to me, using our devel trunk as well as 1.4, with the same results.

Not sure why you are trying to run that way, but I'm afraid you can't do it with OMPI.

On Jul 6, 2010, at 3:20 PM, David Roundy wrote:

> On Tue, Jul 6, 2010 at 12:31 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>> Thanks - that helps.
>>
>> As you note, the issue is that OMPI doesn't support the core-level allocation options of slurm - never has, probably never will. What I found interesting, though, was that your envars don't anywhere indicate that this is what you requested. I don't see anything there that would case the daemon to crash.
>>
>> So I'm left to guess that this is an issue where slurm doesn't like something OMPI does because it violates that core-level option. Can you add --display-devel-map to your mpirun command? It would be interesting to see where it thinks the daemon should go.
>>
>> Just to check - the envars you sent in your other note came from the sbatch -c 2 run, yes?
>
> Yes indeed.
>
> Just for good measure, I'm attaching my current test script submit.sh
> and its complete output, also run with sbatch -c 2. Oddly enough
> adding --display-devel-map doesn't cause mpirun to generate any output
> before crashing. Does this give you any sort of a hint? :( Any
> other suggestions for tracking the source of this down? I'd really
> hoped you'd tell me that one of the env vars told you that my slurm
> config was messed up, since that would seem pretty easy to fix, once I
> knew how it was messed up...
>
> David
> <submit.sh><slurm-2833.out>_______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users