Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] ga-4.1 on mx segmentation violation
From: SLIM H.A. (h.a.slim_at_[hidden])
Date: 2008-10-21 10:14:32


I have built the release candidate for ga-4.1 with OpenMPI 1.2.3 and
portland compilers 7.0.2 for Myrinet mx.

Running the test.x for 3 Myrinet nodes each with 4 cores I get the
following error messages:

warning:regcache incompatible with malloc
libibverbs: Fatal: couldn't read uverbs ABI version.
------------------------------------------------------------------------

--
[0,1,3]: OpenIB on host node057 was unable to find any HCAs.
Another transport will be used instead, although this may result in 
lower performance.
------------------------------------------------------------------------
--
(repetition of the above message)
------------------------------------------------------------------------
--
ARMCI configured for 3 cluster nodes. Network protocol is 'MPI-SPAWN'.
0:Segmentation Violation error, status=: 11
0:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
4:Segmentation Violation error, status=: 11
4:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
6:Segmentation Violation error, status=: 11
6:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
Last System Error Message from Task 6:: No child processes
[node027:09712] *** An error occurred in MPI_Send
[node027:09712] *** on communicator MPI_COMM_WORLD
[node027:09712] *** MPI_ERR_COMM: invalid communicator
[node027:09712] *** MPI_ERRORS_ARE_FATAL (goodbye)
7:Segmentation Violation error, status=: 11
7:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
Last System Error Message from Task 7:: Resource temporarily unavailable
[node027:09713] MPI_ABORT invoked on rank 7 in communicator
MPI_COMM_WORLD with errorcode 0
Last System Error Message from Task 0:: No child processes
[node020:14119] *** An error occurred in MPI_Send
[node020:14119] *** on communicator MPI_COMM_WORLD
[node020:14119] *** MPI_ERR_COMM: invalid communicator
[node020:14119] *** MPI_ERRORS_ARE_FATAL (goodbye)
1:Segmentation Violation error, status=: 11
1:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
Last System Error Message from Task 1:: Resource temporarily unavailable
[node020:14120] MPI_ABORT invoked on rank 1 in communicator
MPI_COMM_WORLD with errorcode 0
2:Segmentation Violation error, status=: 11
2:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
Last System Error Message from Task 2:: Resource temporarily unavailable
[node020:14121] MPI_ABORT invoked on rank 2 in communicator
MPI_COMM_WORLD with errorcode 0
3:Segmentation Violation error, status=: 11
3:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
Last System Error Message from Task 3:: No child processes
[node057:15779] *** An error occurred in MPI_Send
[node057:15779] *** on communicator MPI_COMM_WORLD
[node057:15779] *** MPI_ERR_COMM: invalid communicator
[node057:15779] *** MPI_ERRORS_ARE_FATAL (goodbye)
Last System Error Message from Task 4:: Resource temporarily unavailable
[node057:15780] MPI_ABORT invoked on rank 4 in communicator
MPI_COMM_WORLD with errorcode 0
5:Segmentation Violation error, status=: 11
5:ARMCI DASSERT fail. signaltrap.c:SigSegvHandler():299 cond:0
Last System Error Message from Task 5:: Resource temporarily unavailable
[node057:15781] MPI_ABORT invoked on rank 5 in communicator
MPI_COMM_WORLD with errorcode 0
mpirun noticed that job rank 4 with PID 15780 on node node057 exited on
signal 42 (Real-time s
ignal 8). 
What could be the problem here?
Thanks
Henk