Dear all,I am a green hand on Openmpi, I have the following Openmpi structure, however it has problem when running across multiple nodes.
I am trying to build a Bewolf Cluster between 6 nodes of our serve (HP Proliant G460 G7), I have installed the Openmpi on one node (assuming at /mirror),
./configure --prefix=/mirror/openmpi CC=icc CXX=icpc F77=ifort FC=ifort
make all install
using NFS, the directory of /mirror was successfully exported to the rest of 5 nodes. Now as I test the Openmpi, it runs very well on a single node,
however it hangs across multiple nodes.
Now one possible reason as I know is that Openmpi uses TCP to exchange data between different nodes, so I am worried about
whether there are firewalls between each nodes, which can be factory integrated at somewhere(switch/NIC). Could anyone give me some
information on this point?
Thanks a lot,
Regards,
ArchyGU
Nanyang Technological University