Am 18.04.2011 um 15:40 schrieb chenjie gu:
> I am a green hand on Openmpi, I have the following Openmpi structure, however it has problem when running across multiple nodes.
> I am trying to build a Bewolf Cluster between 6 nodes of our serve (HP Proliant G460 G7), I have installed the Openmpi on one node (assuming at /mirror),
> ./configure --prefix=/mirror/openmpi CC=icc CXX=icpc F77=ifort FC=ifort
> make all install
> using NFS, the directory of /mirror was successfully exported to the rest of 5 nodes. Now as I test the Openmpi, it runs very well on a single node,
> however it hangs across multiple nodes.
> Now one possible reason as I know is that Openmpi uses TCP to exchange data between different nodes, so I am worried about
> whether there are firewalls between each nodes, which can be factory integrated at somewhere(switch/NIC). Could anyone give me some
> information on this point?
It's not only about MPI communcation. Before you need some means to allow the startup of the local orte daemons on each machine by passphraseless ssh-keys or better hostbased authentication http://arc.liv.ac.uk/SGE/howto/hostbased-ssh.html , or enable `rsh` on the machines and tell Open MPI to use it. Is:
giving you a list of the involved machines?
> Thanks a lot,
> Nanyang Technological University
> users mailing list