This has happened to me many times, and simply rebuilding Open MPI has generally fixed it.
I had a perfectly working (and tested) MPI application until a few days ago.
Since then, there have been some upgrades to my Linux OS (basically, the ones the system picks up automatically).
Now, when I rebuild and rerun the same application, I get this:
mpirun: orterun.c:486: orterun: Assertion `((0xdeafbeedULL << 32) +
0xdeafbeedULL) == ((opal_object_t *) (&cmd_line))->obj_magic_id'
[Morpheus:24572] *** Process received signal ***
[Morpheus:24572] Signal: Aborted (6)
[Morpheus:24572] Signal code: (-6)
[Morpheus:24572] [ 0] /lib64/libpthread.so.0(+0xf2d0) [0x7fd23c9312d0]
[Morpheus:24572] [ 1] /lib64/libc.so.6(gsignal+0x35) [0x7fd23c5e7ab5]
[Morpheus:24572] [ 2] /lib64/libc.so.6(abort+0x186) [0x7fd23c5e8fb6]
[Morpheus:24572] [ 3] /lib64/libc.so.6(__assert_fail+0xf5) [0x7fd23c5e0385]
[Morpheus:24572] [ 4] /home/Morpheus/CustomInstalledPrograms/OpenMPI-1.4.2/bin/mpirun() [0x4037ed]
[Morpheus:24572] [ 5] /home/Morpheus/CustomInstalledPrograms/OpenMPI-1.4.2/bin/mpirun() [0x4032a4]
[Morpheus:24572] [ 6] /lib64/libc.so.6(__libc_start_main+0xfd) [0x7fd23c5d3bfd]
[Morpheus:24572] [ 7] /home/Morpheus/CustomInstalledPrograms/OpenMPI-1.4.2/bin/mpirun() [0x4031c9]
[Morpheus:24572] *** End of error message ***
I have a custom installation of Open MPI, configured with the following options:
--enable-mem-profile --enable-mem-debug --enable-debug --enable-trace --enable-memchecker --with-valgrind=/usr
The rebuild of my application was successful.
Where should I fix it, and how?
Thanks for any ideas.
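
One possible cause, and this is an assumption rather than something the output above confirms, is that after the OS upgrade mpirun resolves libopen-pal or libopen-rte from a different installation than the one it was built against. As far as I know, the obj_magic_id check only exists in builds configured with --enable-debug, so a debug mpirun paired with a non-debug library could trip it. The sketch below lists any Open MPI libraries that mpirun would load from outside its own install prefix. The helper names prefix_of and foreign_libs are made up for illustration, and the script assumes the usual "name => path (address)" output format of ldd.

```shell
#!/bin/sh
# Sketch: check whether mpirun loads Open MPI libraries from its own prefix.

# Print the install prefix implied by an mpirun path
# (<prefix>/bin/mpirun -> <prefix>).
prefix_of() {
    dirname "$(dirname "$1")"
}

# Read ldd output on stdin and print the Open MPI libraries that do NOT
# resolve under the prefix given as $1. A "not found" entry is printed too,
# with "not" in place of the path.
foreign_libs() {
    awk -v p="$1" '
        /libopen-(rte|pal)|libmpi/ && $2 == "=>" && index($3, p "/") != 1 {
            print $1 " => " $3
        }'
}

# Only run the check if an mpirun is actually on the PATH.
MPIRUN=$(command -v mpirun || true)
if [ -n "$MPIRUN" ]; then
    PREFIX=$(prefix_of "$MPIRUN")
    echo "mpirun: $MPIRUN (prefix: $PREFIX)"
    ldd "$MPIRUN" | foreign_libs "$PREFIX"
fi
```

If this prints any library paths, checking LD_LIBRARY_PATH and PATH would be a reasonable next step. If it prints nothing beyond the mpirun line, a clean rebuild of Open MPI with the same configure options, followed by a rebuild of the application, is still worth trying.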