Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] OpenMPI 1.4.2 with Myrinet MX, mpirun seg faults
From: Raymond Muno (muno_at_[hidden])
Date: 2010-10-20 21:22:23

  On 10/20/2010 7:59 PM, Ralph Castain wrote:
> The error message seems to imply that mpirun itself didn't segfault, but that something else did. Is that segfault pid from mpirun?
> This kind of problem usually is caused by mismatched builds - i.e., you compile against your new build, but you pick up the Myrinet build when you try to run because of path and ld_library_path issues. You might check to ensure you are running against what you built with.
The PATH and LD_LIBRARY_PATH are set explicitly (through modules) on the
frontend and each node. The PGI compiler and the OpenMPI I am trying to
run is set for each.

ldd /share/apps/opt/OpenMPI/1.4.2/PGI/10.4/bin/mpirun
(0x00002b6a16552000) =>
(0x00002b6a167aa000) => /lib64/ (0x0000003a7dc00000) => /lib64/ (0x0000003a80400000) => /lib64/ (0x0000003a88a00000) => /lib64/ (0x0000003a7e000000) => /lib64/ (0x0000003a7d800000) => /lib64/ (0x0000003a7d400000) =>
         /lib64/ (0x0000003a7d000000)

The one that works from the other tree

ldd /opt/openmpi-myrinet_mx/bin/mpirun =>
/opt/openmpi-myrinet_mx/lib/ (0x00002b51c71b0000) =>
/opt/openmpi-myrinet_mx/lib/ (0x00002b51c7430000) => /lib64/ (0x0000003a7dc00000) => /lib64/ (0x0000003a80400000) => /lib64/ (0x0000003a88a00000) => /lib64/ (0x0000003a7d800000) => /lib64/ (0x0000003a7e000000) => /lib64/ (0x0000003a7d400000)
         /lib64/ (0x0000003a7d000000)