On Feb 14, 2007, at 12:28 PM, Mark Kosmowski wrote:
> Everything is working properly now. I needed to reinstall Linux on
> one of my nodes after a botched attempt at a network install - mpirun
> ... hostname worked, but my application hung and gave a connect()
> errno 110.
> At this point I decided to give up and try mpich instead. During the
> mpich sanity checking, there was a more verbose error message
> regarding the failed node, so I reinstalled the OS, reconfigured my
> environment variables for OpenMPI and everything is now working.
Blah. We definitely need to work on our error messages.
FWIW, what did MPICH say for the error?