Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] Nodes already filled when spawning
From: Ralph Castain (rhc_at_[hidden])
Date: 2011-12-15 13:33:08


Yes: pass --oversubscribe to mpirun, or set OMPI_MCA_rmaps_base_oversubscribe=1 in the environment.
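
A minimal sketch of those two options, assuming a POSIX shell. The binary name ./spawn_test and the process count are placeholders, not taken from any MTT configuration, and the script only launches when both mpirun and the binary are actually found:

```shell
#!/bin/sh
# Hypothetical test binary and process count; substitute your own.
APP=./spawn_test
NP=4

# Option 1: set the MCA parameter through the environment so that every
# mpirun started from this shell (for example by an MTT run) inherits it.
export OMPI_MCA_rmaps_base_oversubscribe=1

# Option 2: pass the flag on a single command line instead.
CMD="mpirun --oversubscribe -np $NP $APP"

# Only launch when mpirun and the test binary are actually present;
# otherwise just print the command that would have been run.
if command -v mpirun >/dev/null 2>&1 && [ -x "$APP" ]; then
    $CMD
else
    echo "would run: $CMD"
fi
```

Either option alone should be enough; the environment variable is probably the easier fit for a test harness, since it needs no change to the individual command lines.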

On Dec 15, 2011, at 11:27 AM, TERRY DONTJE wrote:

> There's an oversubscribe option I can set in my case, right?
>
> Thanks,
>
> --td
>
> On 12/15/2011 1:22 PM, Ralph Castain wrote:
>>
>> This is fixed, to a degree, with r25659. However, note that there is one big change that occurred back when we first committed the mapping change.
>>
>> As I noted at that time, we changed the default for RM-given allocations to be no-oversubscribe. So your MTT runs may well fail if they weren't updated, since all those tests oversubscribe the nodes and run in RM environments.
>>
>>
>> On Dec 15, 2011, at 8:37 AM, TERRY DONTJE wrote:
>>
>>> Last night's MTT test results for 1.7a1r25652 from IU and Oracle are showing failures during some of the spawn tests; see http://www.open-mpi.org/mtt/index.php?do_redir=2036.
>>>
>>> Essentially, the tests are failing with the message:
>>> All nodes which are allocated for this job are already filled.
>>>
>>> I wonder if this is related to some of the hostfile changes done lately. Anyway, I am
>>> working on narrowing down the revision that introduced it, but if someone figures this out
>>> before me, that would be great.
>>>
>>> Terry D. Dontje | Principal Software Engineer
>>> Developer Tools Engineering | +1.781.442.2631
>>> Oracle - Performance Technologies
>>> 95 Network Drive, Burlington, MA 01803
>>> Email terry.dontje_at_[hidden]
>>>
>>>
>>>
>>> _______________________________________________
>>> devel mailing list
>>> devel_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>
>>
>