Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] Error when using OpenMPI with SGE multiple hosts
From: Reuti (reuti_at_[hidden])
Date: 2010-11-15 16:33:33


Correction:

On 15.11.2010 at 20:23, Terry Dontje wrote:

> On 11/15/2010 02:11 PM, Reuti wrote:
>> Just to give my understanding of the problem:
>>
>> On 15.11.2010 at 19:57, Terry Dontje wrote:
>>
>>
>>> On 11/15/2010 11:08 AM, Chris Jewell wrote:
>>>
>>>>> Sorry, I am still trying to grok all of your email and what problem you
>>>>> are trying to solve. So is the issue that two jobs with processes on the
>>>>> same node should be able to bind their processes to different
>>>>> resources? Like core 1 for the first job and cores 2 and 3 for the 2nd job?
>>>>>
>>>>> --td
>>>>>
>>>>>
>>>> That's exactly it. Each MPI process needs to be bound to 1 processor in a way that reflects GE's slot allocation scheme.
>>>>
>>>>
>>>>
>>> I actually don't think that I got it. So you give two cases:
>>>
>>> Case 1:
>>> $ qsub -pe mpi 8 -binding pe linear:1 myScript.com
>>>
>>> and my pe_hostfile looks like:
>>>
>>> exec6.cluster.stats.local 2 batch.q@exec6.cluster.stats.local 0,1
>>>
>> Shouldn't two cores be reserved here for exec6, since it got two slots?
>>
>>
>>
> That's what I was wondering.

You can't get 2 slots on a machine, as it's limited by the core count to one here, so such a slot allocation shouldn't occur at all.

==

If you want exactly N cores per machine, then the allocation_rule of the PE should also be set to N.
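
A minimal sketch of the arithmetic behind this, assuming the PE uses a fixed integer allocation_rule (N slots on every host) and that the slot count given to -pe must then be a multiple of N. The helper name hosts_needed is hypothetical and not part of SGE; the qsub line in the comment is an assumed variation of Case 1 above, not a command taken from this thread.

```shell
# Hypothetical helper (not an SGE command): number of hosts a job spans
# when the PE's allocation_rule is a fixed integer N (slots per host).
hosts_needed() {
    total_slots=$1   # value passed to "-pe mpi <slots>"
    per_host=$2      # assumed allocation_rule N of the PE
    if [ $((total_slots % per_host)) -ne 0 ]; then
        echo "slot count $total_slots is not a multiple of $per_host" >&2
        return 1
    fi
    echo $((total_slots / per_host))
}

# Assumed submission for N=2, with the PE "mpi" set to allocation_rule 2:
#   qsub -pe mpi 8 -binding pe linear:2 myScript.com
hosts_needed 8 2   # prints 4
```

With the binding amount and the allocation_rule both set to N, the slots granted per host and the cores bound per host should match, which avoids a mismatch like the one in Case 1.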

-- Reuti