Hi all,
I'm trying to add EC2 instances into my local cluster with openMPI. So far
openMPI works well on the local cluster, and I have set up passwordless SSH
between the local cluster and the Amazon EC2 instance.
Howver, when I add the public DNS into a file (defaulthostfiletest)
comp1 slots=2 max-slots=8
comp2 slots=2 max-slots=8
comp3 slots=2 max-slots=4
ec2-174-129-183-64.compute-1.amazonaws.com slots=2 max-slots=2
and then run:
[/home/ntlp/cashmoney/mainFrame]$mpirun -np 6 --hostfile defaulthostfiletest
hostname
foretell
foretell
augur
augur
predict
predict
it works, but trying to use the amazon cluster I get:
[/home/ntlp/cashmoney/mainFrame]$mpirun -np 8 --hostfile defaulthostfiletest
hostname (it hangs so I kill it)
^C^Cmpirun: killing job...
--------------------------------------------------------------------------
mpirun noticed that the job aborted, but has no info as to the process
that caused that situation.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun was unable to cleanly terminate the daemons on the nodes shown
below. Additional manual cleanup may be required - please refer to
the "orte-clean" tool for assistance.
--------------------------------------------------------------------------
ec2-174-129-183-64.compute-1.amazonaws.com - daemon did not report
back when launched
Any advice? are there any settings in /etc/sssh/sshd_config that I might
need to change?
Theo
--
Theodore Van Rooy
http://greentheo.scroggles.com
--
Theodore Van Rooy
http://greentheo.scroggles.com
|