Open MPI Development Mailing List Archives

Subject: [OMPI devel] ddt-raw "make check" failure
From: Jeff Squyres (jsquyres) (jsquyres_at_[hidden])
Date: 2013-09-23 17:14:54


George --

See the stack trace below. Can you diagnose/fix?

Begin forwarded message:

> From: Siegmar Gross <Siegmar.Gross_at_[hidden]>
> Subject: Re: [OMPI users] Error with "make check" in openmpi-1.7.3a1r29220
> Date: September 21, 2013 11:38:58 AM EDT
> To: <users_at_[hidden]>
> Reply-To: Siegmar Gross <Siegmar.Gross_at_[hidden]>, Open MPI Users <users_at_[hidden]>
>
> Hello Jeff,
>
>
>> Can you send a stack trace from the failed DDT-raw test?
>
> Here is the output from gdb.
>
>
> tyr .libs 41 ddt_raw
>
>
> #
> * TEST INVERSED VECTOR
> #
>
> Bus error (core dumped)
> tyr .libs 42 gdb ddt_raw core
> GNU gdb 5.3
> Copyright 2002 Free Software Foundation, Inc.
> GDB is free software, covered by the GNU General Public License, and you are
> welcome to change it and/or distribute copies of it under certain conditions.
> Type "show copying" to see the conditions.
> There is absolutely no warranty for GDB. Type "show warranty" for details.
> This GDB was configured as "sparc-sun-solaris2.9"...
> Core was generated by `ddt_raw'.
> Program terminated with signal 10, Bus error.
> Reading symbols from /usr/local/openmpi-1.7_32_gcc/lib/libmpi.so.1...done.
> Loaded symbols for /usr/local/openmpi-1.7_32_gcc/lib/libmpi.so.1
> Reading symbols from /usr/local/openmpi-1.7_32_gcc/lib/libopen-rte.so.5...done.
> Loaded symbols for /usr/local/openmpi-1.7_32_gcc/lib/libopen-rte.so.5
> Reading symbols from /usr/local/openmpi-1.7_32_gcc/lib/libopen-pal.so.5...done.
> Loaded symbols for /usr/local/openmpi-1.7_32_gcc/lib/libopen-pal.so.5
> Reading symbols from /usr/lib/libsendfile.so.1...done.
> Loaded symbols for /usr/lib/libsendfile.so.1
> Reading symbols from /usr/lib/libpicl.so.1...done.
> Loaded symbols for /usr/lib/libpicl.so.1
> Reading symbols from /usr/lib/libkstat.so.1...done.
> Loaded symbols for /usr/lib/libkstat.so.1
> Reading symbols from /usr/lib/liblgrp.so.1...done.
> Loaded symbols for /usr/lib/liblgrp.so.1
> Reading symbols from /usr/lib/libsocket.so.1...done.
> Loaded symbols for /usr/lib/libsocket.so.1
> Reading symbols from /usr/lib/libnsl.so.1...done.
> Loaded symbols for /usr/lib/libnsl.so.1
> Reading symbols from /usr/lib/librt.so.1...done.
> Loaded symbols for /usr/lib/librt.so.1
> Reading symbols from /usr/lib/libm.so.2...done.
> Loaded symbols for /usr/lib/libm.so.2
> Reading symbols from /usr/lib/libpthread.so.1...
> warning: Lowest section in /usr/lib/libpthread.so.1 is .dynamic at 00000074
> done.
> Loaded symbols for /usr/lib/libpthread.so.1
> Reading symbols from /usr/lib/libc.so.1...done.
> Loaded symbols for /usr/lib/libc.so.1
> Reading symbols from /usr/local/gcc-4.8.0/lib/libgcc_s.so.1...done.
> Loaded symbols for /usr/local/gcc-4.8.0/lib/libgcc_s.so.1
> Reading symbols from /usr/lib/libdoor.so.1...done.
> Loaded symbols for /usr/lib/libdoor.so.1
> Reading symbols from /usr/lib/libaio.so.1...done.
> Loaded symbols for /usr/lib/libaio.so.1
> Reading symbols from /usr/lib/libmd.so.1...done.
> Loaded symbols for /usr/lib/libmd.so.1
> Reading symbols from /platform/SUNW,A70/lib/libc_psr.so.1...done.
> Loaded symbols for /platform/SUNW,A70/lib/libc_psr.so.1
> #0 0xfeed9f38 in opal_convertor_raw (pConvertor=0x23ce8, iov=0x22870, iov_count=0xffbfe604, length=0xffbfe600)
> at ../../../openmpi-1.7.3a1r29220/opal/datatype/opal_convertor_raw.c:49
> 49 assert( (*iov_count) > 0 );
> (gdb) where
> #0 0xfeed9f38 in opal_convertor_raw (pConvertor=0x23ce8, iov=0x22870, iov_count=0xffbfe604, length=0xffbfe600)
> at ../../../openmpi-1.7.3a1r29220/opal/datatype/opal_convertor_raw.c:49
> #1 0x00011c2c in local_copy_ddt_raw (pdt=0x23a48, count=100, iov_num=5) at sys/atomic_impl.h:59
> #2 0x00011e94 in main (argc=1, argv=0xffbfe7b4) at sys/atomic_impl.h:59
> (gdb) bt
> #0 0xfeed9f38 in opal_convertor_raw (pConvertor=0x23ce8, iov=0x22870, iov_count=0xffbfe604, length=0xffbfe600)
> at ../../../openmpi-1.7.3a1r29220/opal/datatype/opal_convertor_raw.c:49
> #1 0x00011c2c in local_copy_ddt_raw (pdt=0x23a48, count=100, iov_num=5) at sys/atomic_impl.h:59
> #2 0x00011e94 in main (argc=1, argv=0xffbfe7b4) at sys/atomic_impl.h:59
> (gdb) q
> tyr .libs 43
>
>
> And here is the output from dbx.
>
> tyr .libs 45 dbx ddt_raw core
> For information about new features see `help changes'
> To remove this message, put `dbxenv suppress_startup_message 7.9' in your .dbxrc
> Reading ddt_raw
> core file header read successfully
> Reading ld.so.1
> Reading libmpi.so.1.1.2
> Reading libopen-rte.so.5.0.2
> Reading libopen-pal.so.5.0.2
> Reading libsendfile.so.1
> Reading libpicl.so.1
> Reading libkstat.so.1
> Reading liblgrp.so.1
> Reading libsocket.so.1
> Reading libnsl.so.1
> Reading librt.so.1
> Reading libm.so.2
> Reading libpthread.so.1
> Reading libc.so.1
> Reading libgcc_s.so.1
> Reading libdoor.so.1
> Reading libaio.so.1
> Reading libmd.so.1
> Reading libc_psr.so.1
> t_at_1 (l_at_1) program terminated by signal BUS (invalid address alignment)
> 0xfeed9f38: opal_convertor_raw+0x0154: ld [%g1], %g1
> (dbx) where
> current thread: t_at_1
> =>[1] opal_convertor_raw(0x23ce8, 0x22870, 0xffbfe604, 0xffbfe600, 0xfec423f0, 0xffbfe610), at 0xfeed9f38
> [2] local_copy_ddt_raw(0x23a48, 0x64, 0x5, 0xfec4c118, 0x1, 0xfec475aa), at 0x11c24
> [3] main(0x1, 0xffbfe7b4, 0xffbfe7bc, 0x20e98, 0xfee50100, 0x0), at 0x11e8c
> (dbx) exit
> tyr .libs 46
>
>
>
> Kind regards
>
> Siegmar
>
>
>
>
>> On Sep 21, 2013, at 10:05 AM, "Siegmar Gross" <Siegmar.Gross_at_[hidden]> wrote:
>>
>>> Hi,
>>>
>>> Today I installed 32- and 64-bit versions of openmpi-1.7.3a1r29220 on
>>> Solaris Sparc with Sun C 5.12 and gcc-4.8.0. "make check" has the
>>> following problem with both versions and both compilers.
>>> "make check" works fine with "openmpi-1.6.6a1r29175".
>>>
>>> tyr openmpi-1.7.3a1r29220-SunOS.sparc.64_gcc 122
>>> more log.make-check.SunOS.sparc.64_gcc
>>> ...
>>> CCLD ddt_raw
>>> CC to_self.o
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c: In function
>>> 'create_indexed_constant_gap_ddt':
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c:48:5: warning:
>>> 'MPI_Type_struct' is deprecated (declared at ../../ompi/include/mpi.h:1712):
>>> MPI_Type_struct is superseded by MPI_Type_create_struct in MPI-2.0
>>> [-Wdeprecated-declarations]
>>> MPI_Type_struct( number, bLength, displ, types, &dt );
>>> ^
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c: In function
>>> 'create_indexed_gap_ddt':
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c:89:5: warning:
>>> 'MPI_Address' is deprecated (declared at ../../ompi/include/mpi.h:1161):
>>> MPI_Address is superseded by MPI_Get_address in MPI-2.0
>>> [-Wdeprecated-declarations]
>>> MPI_Address( &(dt[0].is[0].i[0]), &(displ[0]) );
>>> ^
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c:90:5: warning:
>>> 'MPI_Address' is deprecated (declared at ../../ompi/include/mpi.h:1161):
>>> MPI_Address is superseded by MPI_Get_address in MPI-2.0
>>> [-Wdeprecated-declarations]
>>> MPI_Address( &(dt[0].is[0].f), &(displ[1]) );
>>> ^
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c:93:5: warning:
>>> 'MPI_Type_struct' is deprecated (declared at ../../ompi/include/mpi.h:1712):
>>> MPI_Type_struct is superseded by MPI_Type_create_struct in MPI-2.0
>>> [-Wdeprecated-declarations]
>>> MPI_Type_struct( 2, bLength, displ, types, &dt1 );
>>> ^
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c:99:5: warning:
>>> 'MPI_Address' is deprecated (declared at ../../ompi/include/mpi.h:1161):
>>> MPI_Address is superseded by MPI_Get_address in MPI-2.0
>>> [-Wdeprecated-declarations]
>>> MPI_Address( &(dt[0].v1), &(displ[0]) );
>>> ^
>>> ../../../openmpi-1.7.3a1r29220/test/datatype/to_self.c:100:5: warning:
>>> 'MPI_Address' is deprecated (declared at ../../ompi/include/mpi.h:1161
>>> /ddt_raw
>>> ...skipping
>>> PASS: ddt_test
>>> /bin/bash: line 5: 20020 Bus Error ${dir}$tst
>>> FAIL: ddt_raw
>>> ========================================================
>>> 1 of 5 tests failed
>>> Please report to http://www.open-mpi.org/community/help/
>>> ========================================================
>>> make[3]: *** [check-TESTS] Error 1
>>> make[3]: Leaving directory `/export2/src/openmpi-1.7/openmpi-1.7.3a1r29220-SunOS.sparc.64_gcc/test/datatype'
>>> make[2]: *** [check-am] Error 2
>>> make[2]: Leaving directory `/export2/src/openmpi-1.7/openmpi-1.7.3a1r29220-SunOS.sparc.64_gcc/test/datatype'
>>> make[1]: *** [check-recursive] Error 1
>>> make[1]: Leaving directory `/export2/src/openmpi-1.7/openmpi-1.7.3a1r29220-SunOS.sparc.64_gcc/test'
>>> make: *** [check-recursive] Error 1
>>>
>>>
>>>
>>> Everything is fine with openmpi-1.6.6a1r29175.
>>>
>>> tyr openmpi-1.6.6a1r29175-SunOS.sparc.64_gcc 126
>>> more log.make-check.SunOS.sparc.64_gcc
>>> ...
>>> CCLD ddt_raw
>>> CC to_self.o
>>> CCLD to_self
>>> CC ddt_pack.o
>>> CCLD ddt_pack
>>> make[3]: Leaving directory `/export2/src/openmpi-1.6.6/openmpi-1.6.6a1r29175-SunOS.sparc.64_gcc/test/datatype'
>>> make check-TESTS
>>> /ddt_raw
>>> ...skipping
>>> raw extraction in 2 microsec
>>> >>--------------------------------------------<<
>>> PASS: ddt_raw
>>> ==================
>>> All 5 tests passed
>>> ==================
>>> ...
>>>
>>>
>>> I would be grateful if somebody could fix the problem. Thank you
>>> very much for any help in advance.
>>>
>>>
>>> Kind regards
>>>
>>> Siegmar
>>>
>>> _______________________________________________
>>> users mailing list
>>> users_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users

-- 
Jeff Squyres
jsquyres_at_[hidden]
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/