Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] Problem with 1.3.2 - need tips on debugging
From: Jeff Layton (laytonjb_at_[hidden])
Date: 2009-05-29 10:03:33

Good morning,

I just built 1.3.2 on a ROCKS 4.something system. I built my code
(CFL3D) with the Intel 10.1 compilers. I also linked in the
OpenMPI libs and the Intel libraries to make sure I had the paths
correct. When I try running my code, I get the following,

error: executing task of job 2951 failed: execution daemon on host
"compute-2-3.local" didn't accept task
A daemon (pid 12015) died unexpectedly with status 1 while attempting
to launch so we are aborting.

There may be more information reported by the environment (see above).

This may be because the daemon was unable to find all the needed shared
libraries on the remote node. You may set your LD_LIBRARY_PATH to have the
location of the shared libraries on the remote nodes and this will
automatically be forwarded to the remote nodes.
mpirun noticed that the job aborted, but has no info as to the process
that caused that situation.
mpirun: clean termination accomplished

Everything seems correct. I checked that the mpirun was correct
and the binary has the correct libraries (checked using ldd).

Can anyone tell me what the "status 1" means? Any tips on debugging
the problem?