On Feb 14, 2007, at 12:28 PM, Mark Kosmowski wrote:
> Everything is working properly now. I needed to reinstall Linux on
> one of my nodes after a botched attempt at a network install - mpirun
> ... hostname worked, but my application hung and gave a connect()
> errno 110.
>
> At this point I decided to give up and try mpich instead. During the
> mpich sanity checking, there was a more verbose error message
> regarding the failed node, so I reinstalled the OS, reconfigured my
> environment variables for OpenMPI and everything is now working.
Blah. We definitely need to work on our error messages.
FWIW, what did MPICH say for the error?
--
Jeff Squyres
Server Virtualization Business Unit
Cisco Systems
|