Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

From: Tim Prins (tprins_at_[hidden])
Date: 2007-10-01 22:28:14


Hi Joao,

Unfortunately Comm_spawn is a bit broken right now on the Open MPI trunk. We
are currently working on some major changes to the runtime system, so I would
rather not dig into this until these changes have made it onto the trunk.

I do not know of a timeline for when this these changes will be put in the
trunk and Comm_spawn (especially with threads) will be expected to work
correctly again.

Tim

On Monday 01 October 2007 03:40:46 pm Joao Vicente Lima wrote:
> Hi all!
> I'm getting a error on call MPI_Init_thread and MPI_Comm_spawn.
> am I mistaking something?
> the attachments contains my ompi_info and source ...
>
> thank!
> Joao
>
> ....
> char *arg[]= {"spawn1", (char *)0};
>
> MPI_Init_thread (&argc, &argv, MPI_THREAD_MULTIPLE, &provided);
> MPI_Comm_spawn ("./spawn_slave", arg, 1,
> MPI_INFO_NULL, 0, MPI_COMM_SELF, &slave,
> MPI_ERRCODES_IGNORE);
> .....
>
> and the error:
>
> opal_mutex_lock(): Resource deadlock avoided
> [c8:13335] *** Process received signal ***
> [c8:13335] Signal: Aborted (6)
> [c8:13335] Signal code: (-6)
> [c8:13335] [ 0] [0xb7fbf440]
> [c8:13335] [ 1] /lib/libc.so.6(abort+0x101) [0xb7abd5b1]
> [c8:13335] [ 2] /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0 [0xb7e2933c]
> [c8:13335] [ 3] /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0 [0xb7e2923a]
> [c8:13335] [ 4] /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0 [0xb7e292e3]
> [c8:13335] [ 5] /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0 [0xb7e29fa7]
> [c8:13335] [ 6] /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0 [0xb7e29eda]
> [c8:13335] [ 7] /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0 [0xb7e2adec]
> [c8:13335] [ 8]
> /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0(ompi_proc_unpack+ 0x181)
> [0xb7e2b142]
> [c8:13335] [ 9]
> /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0(ompi_comm_connect
> _accept+0x57c) [0xb7e0fb70]
> [c8:13335] [10]
> /usr/local/openmpi/openmpi-svn/lib/libmpi.so.0(PMPI_Comm_spawn+0 x395)
> [0xb7e5e285]
> [c8:13335] [11] ./spawn(main+0x7f) [0x80486ef]
> [c8:13335] [12] /lib/libc.so.6(__libc_start_main+0xdc) [0xb7aa7ebc]
> [c8:13335] [13] ./spawn [0x80485e1]
> [c8:13335] *** End of error message ***
> --------------------------------------------------------------------------
> mpirun has exited due to process rank 0 with PID 13335 on
> node c8 calling "abort". This will have caused other processes
> in the application to be terminated by signals sent by mpirun
> (as reported here).
> --------------------------------------------------------------------------