Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] openmpi hangs when running on more than one node (unless i use --debug-daemons )
From: David Zhang (solarbikedz_at_[hidden])
Date: 2010-12-27 16:16:03


have you tested your ssh key setup, fire wall, and switch settings to ensure
all nodes are talking to each other?

On Mon, Dec 27, 2010 at 1:07 AM, Advanced Computing Group University of
Padova <acg.unipd_at_[hidden]> wrote:

> using openmpi 1.4.2
>
>
> On Fri, Dec 24, 2010 at 11:17 AM, Advanced Computing Group University of
> Padova <acg.unipd_at_[hidden]> wrote:
>
>> Hi,
>> i am building a small 16 nodes cluster gentoo based.
>> I succesfully installed openmpi and i succesfully tried some simple small
>> test parallel program on a single host but...
>> i can't run parallel program on more than one nodes
>>
>>
>> The nodes are cloned (so they are equals).
>> The mpiuser (and their ssh certificates) uses /home/mpiuser that is a nfs
>> share.
>> I modified .bashrc
>>
>> -------------------------
>> PATH=/usr/bin:$PATH ; export PATH ;
>> LD_LIBRARY_PATH=/usr/lib64:$LD_LIBRARY_PATH ; export LD_LIBRARY_PATH ;
>>
>> # already present below
>> if [[ $- != *i* ]] ; then
>> # Shell is non-interactive. Be done now!
>> return
>> fi
>> ---------------------
>>
>> The very very strange behaviour is that using the --debug-daemons let my
>> program run succesfully.....
>>
>> Thank you in advance and sorry for my bad english
>>
>>
>>
>>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>

-- 
David Zhang
University of California, San Diego