Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] Troubleshooting mpirun with tree spawn hang
From: Anthony Alba (ascanio.alba7_at_[hidden])
Date: 2014-04-11 00:50:10


Is there a way to troubleshoot
plm_rsh_no_tree_spawn=true hang?

I have a set of passwordless-ssh nodes, each node can ssh into any other.,
i.e.,

for h1 in A B C D; do for h2 in A B C D; do ssh $h1 ssh $h2 hostname; done;
done

works perfectly.

Generally tree spawn works, however there is one host where
launching mpirun with tree spawn hangs as soon as there are 6 or more host
(with launch node also in the host list). If the launcher is not in the
host list the hang happens with five hosts.

- Anthony