Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] segmentation fault with openmpi-1.6.2
From: Ralph Castain (rhc_at_[hidden])
Date: 2012-09-10 13:03:46


Yes, 1.6.2rc1 had a problem - now fixed, will be in tomorrow's nightly 1.6 tarball.

On Sep 10, 2012, at 9:50 AM, Siegmar Gross <Siegmar.Gross_at_[hidden]> wrote:

> Hi,
>
> thank you very much for your fast answer.
>
>> On 10/09/2012 15:41, Siegmar Gross wrote:
>>> Hi,
>>>
>>> I have built openmpi-1.6.2rc1 and get the following error.
>>>
>>> tyr small_prog 123 mpicc -showme
>>> cc -I/usr/local/openmpi-1.6.2_32_cc/include -mt
>>> -L/usr/local/openmpi-1.6.2_32_cc/lib -lmpi -lm -lkstat -llgrp
>>> -lsocket -lnsl -lrt -lm
>>> tyr small_prog 124 mpiexec -np 2 -host tyr init_finalize
>>>
>>> Hello!
>>> Hello!
>>>
>>> tyr small_prog 125 mpiexec -np 2 -host sunpc4 init_finalize
>>> key_from_blob: remaining bytes in key blob 81
>>>
>>> Hello!
>>> Hello!
>>>
>>> tyr small_prog 126 mpiexec -np 2 -host tyr,sunpc4 init_finalize
>>> [tyr:23956] *** Process received signal ***
>>> [tyr:23956] Signal: Segmentation Fault (11)
>>> [tyr:23956] Signal code: Address not mapped (1)
>>> [tyr:23956] Failing at address: 18
>>> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:0x15434c
>>> /lib/libc.so.1:0xcad04
>>> /lib/libc.so.1:0xbf3b4
>>> /lib/libc.so.1:0xbf59c
>>>
> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:orte_rmaps_base_get_target_nod
> es+0x1cc [ Signal 11 (SEGV)]
>>> /.../openmpi-1.6.2_32_cc/lib/openmpi/mca_rmaps_round_robin.so:0x1ec8
>>>
> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:orte_rmaps_base_map_job+0xe4
>>>
> /.../openmpi-1.6.2_32_cc/lib/libopen-rte.so.4.0.0:orte_plm_base_setup_job+0xc4
>>>
> /.../openmpi-1.6.2_32_cc/lib/openmpi/mca_plm_rsh.so:orte_plm_rsh_launch+0x1b0
>>> /.../openmpi-1.6.2_32_cc/bin/orterun:orterun+0x16a8
>>> /.../openmpi-1.6.2_32_cc/bin/orterun:main+0x24
>>> /.../openmpi-1.6.2_32_cc/bin/orterun:_start+0xd8
>>> [tyr:23956] *** End of error message ***
>>> Segmentation fault
>>>
>>> Do you have any ideas or suggestions? As I wrote in my email from
>>> yesterday, I had to add "#include <math.h>" into file
>>> openmpi-1.6.2rc1/ompi/contrib/vt/vt/extlib/otf/tools/otfaux/otfaux.cpp
>>> to have a prototype for function "rint" in line 834. Thank you very
>>> much for any help in advance.
>>>
>>>
>>> Kind regards
>>>
>>> Siegmar
>>>
>>>
>>> _______________________________________________
>>> users mailing list
>>> users_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>>
>>
>> Did you compile OMPI usins '--with-pmi' option? I saw this error when
>
> No, I didn't. I used the following options.
>
> ../openmpi-1.6.2rc1/configure --prefix=/usr/local/openmpi-1.6.2_32_cc \
> LDFLAGS="-m32" \
> CC="cc" CXX="CC" F77="f77" FC="f95" \
> CFLAGS="-m32" CXXFLAGS="-m32 -library=stlport4" FFLAGS="-m32" \
> FCFLAGS="-m32" \
> CPP="cpp" CXXCPP="cpp" \
> CPPFLAGS="" CXXCPPFLAGS="" \
> C_INCL_PATH="" C_INCLUDE_PATH="" CPLUS_INCLUDE_PATH="" \
> OBJC_INCLUDE_PATH="" MPIHOME="" \
> --without-udapl --without-openib \
> --enable-mpi-f90 --with-mpi-f90-size=small \
> --enable-heterogeneous --enable-cxx-exceptions \
> --enable-orterun-prefix-by-default \
> --with-threads=posix --enable-mpi-thread-multiple \
> --enable-opal-multi-threads \
> --with-hwloc=internal --with-ft=LAM --enable-sparse-groups \
> |& tee log.configure.$SYSTEM_ENV.$MACHINE_ENV.32_cc
>
>
>> you have allocated job on one console, but running the test on another.
>> Try to run the task on the console where you allocated it.
>
> It worked for openmpi-1.6.
>
> tyr small_prog 110 mpicc -showme
> cc -I/usr/local/openmpi-1.6_32_cc/include -mt -L/usr/local/openmpi-1.6_32_cc/lib
> -lmpi -lm -lkstat -llgrp -lsocket -lnsl -lrt -lm
> tyr small_prog 111 mpiexec -np 2 -host tyr,sunpc4 init_finalize
> key_from_blob: remaining bytes in key blob 81
>
> Hello!
> Hello!
>
> tyr small_prog 112
>
>
> Kind regards
>
> Siegmar
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users