Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] Segmentation fault - Address not mapped
From: Catalin David (catalindavid2003_at_[hidden])
Date: 2009-07-06 08:50:52

Dear all,

I have recently started working on a project using OpenMPI. Basically,
I have been given some c++ code, a cluster to play with and a deadline
in order to make the c++ code run faster. The cluster was a bit
crowded, so I started working on my laptop (g++ 4.3.3 -- Ubuntu repos,
OpenMPI 1.3.2 -- compiled with no options ) and after one week I
actually had something that was running on my computer, therefore
decided to move to the cluster. Since the cluster is very old and it
was using g++ 3.2 and an old version of OpenMPI, I decided to install
both of them from source in my home folder (g++ 4.4, OpenMPI 1.3.2).
The issue is that when I run the program (after being compiled
flawless on the machine), I get these error messages:

[denali:30134] *** Process received signal ***
[denali:30134] Signal: Segmentation fault (11)
[denali:30134] Signal code: Address not mapped (1)
[denali:30134] Failing at address: 0x18

(more in the attached file -- mpirun -np 4 ray-trace)

All this morning, I have gone through the mailing lists, found people
experiencing my problems, but their solution did not work for me. By
using simple debugging (cout), I was able to determine where the error
comes from:

//Initialize step
//Here it breaks!!! Memory allocation issue!
MPI_Comm_size(MPI_COMM_WORLD, &pool);
std::cout<<"I'm here"<<std::endl; //this statement is never reached
MPI_Comm_rank(MPI_COMM_WORLD, &myid);

When trying to debug via gdb, the problem seems to be:

Program received signal SIGSEGV, Segmentation fault.
0xb7524772 in ompi_comm_invalid (comm=Could not find the frame base
for "ompi_comm_invalid".) at communicator.h:261
261 communicator.h: No such file or directory.
        in communicator.h

which might indicate a problem with paths. For now, my LD_LIBRARY_PATH
is set to "/users/cluster/cdavid/local/lib/" (the local folder in my
home folder emulates the directory structure of the / folder).

Moreover, I wanted to see if the installation is actually ok and I
tried running this program:

with exactly the same results; the code breaks when the memory address
of variable pool is referenced.

So, if you have any ideas or you think I might have missed something,
please let me know.



Catalin David
B.Sc. Computer Science 2010
Jacobs University Bremen
Phone: +49-(0)1577-49-38-667
College Ring 4, #343
Bremen, 28759