Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] connect() fails - inhomogeneous cluster
From: Reuti (reuti_at_[hidden])
Date: 2014-06-17 09:44:41


Am 17.06.2014 um 14:53 schrieb borno_borno_at_[hidden]:

> I should have written that...
>
> mpirun -np n --hostfile host.cfg
>
> mpi_at_Ries slots=n_1 max_slots=n_1
> mpi_at_Euler slots=n_2 max_slots=n_2

Although it's defined to use characters in a case insensitive manner in hostnames, my experience is that not all calls are mapping it in a proper way. To avoid any confusion because of this, it's best to have them all in lowercase. I don't know whether this is related to your observation.

-- Reuti

> It is arranged that the sum over the n_i is equal to n.
>
> Kurt
> Gesendet: Dienstag, 17. Juni 2014 um 14:25 Uhr
> Von: Reuti <reuti_at_[hidden]>
> An: "Open MPI Users" <users_at_[hidden]>
> Betreff: Re: [OMPI users] connect() fails - inhomogeneous cluster
> Hi,
>
> Am 17.06.2014 um 13:00 schrieb Borno Knuttelski:
>
> > this is the first time I contact this list. I'm using OpenMPI 1.6.5 on an inhomogeneous cluster with 2 machines. Short: With few processes everything works fine but with some more my application crashes. (Yes, I can guarantee that in every scenario I start processes on both machines). I posted the problem already with all details on stackoverflow (http://stackoverflow.com/questions/24164825/mpi-connect-fails-inhomogeneous-cluster). Please have a look at it. What exactly is the problem and how can I fix it?
>
> How do you start the program - just with `mpiexec` and a proper hostfile and number of slots?
>
> -- Reuti
>
>
> > Every help and guess is appreciated and will be tested...
> > Thanks in advance,
> >
> > Kurt
> > _______________________________________________
> > users mailing list
> > users_at_[hidden]
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> > Link to this post: http://www.open-mpi.org/community/lists/users/2014/06/24662.php
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: http://www.open-mpi.org/community/lists/users/2014/06/24663.php
> _______________________________________________
> users mailing list
> users_at_[hidden]
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: http://www.open-mpi.org/community/lists/users/2014/06/24664.php