Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Strange problem
From: Gabriele Fatigati (g.fatigati_at_[hidden])
Date: 2009-03-09 03:59:54


Hai Ralph,
i've tried --nolocal flag, but doesn't works .. :(
The error is the same.

2009/2/20 Ralph Castain <rhc_at_[hidden]>:
> Hi Gabriele
>
> Could be we have a problem in our LSF support - none of us have a way of
> testing it, so this is somewhat of a blind programming case for us.
>
> From the message, it looks like there is some misunderstanding about how
> many slots were allocated vs how many were mapped to a specific host. I
> don't see your cmd line here - could you pass it along too?
>
> My initial guess is that mpirun is running on node0023, and that we then
> mapped procs local to mpirun such that we exceeded LSF's slot allocation on
> that node. We don't account for mpirun taking a process slot in our mapping,
> and LSF does - hence the error. I think...
>
> You could test this by adding --nolocal to your cmd line. This will force
> mpirun to map all procs on other nodes. If my analysis is correct, the job
> should run.
>
> Ralph
>
> On Feb 20, 2009, at 6:46 AM, Gabriele Fatigati wrote:
>
>> Dear OpenMPi developers,
>> i'm running my MPI code compiled with OpenMPI 1.3 over Infiniband and
>> LSF scheduler. But i got the error attached. I suppose that spawning
>> process doesn't works well. The same program under OpenMPI 1.2.5 works
>> well. Could you help me?
>>
>> Thanks in advance.
>>
>> --
>> Ing. Gabriele Fatigati
>>
>> Parallel programmer
>>
>> CINECA Systems & Tecnologies Department
>>
>> Supercomputing Group
>>
>> Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>>
>> www.cineca.it                    Tel:   +39 051 6171722
>>
>> g.fatigati [AT] cineca.it
>> <job.196571.err>_______________________________________________
>> users mailing list
>> users_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>

-- 
Ing. Gabriele Fatigati
Parallel programmer
CINECA Systems & Tecnologies Department
Supercomputing Group
Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
www.cineca.it                    Tel:   +39 051 6171722
g.fatigati [AT] cineca.it