Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] open-mpi error: unable to create listen socket
From: Ralph Castain (rhc_at_[hidden])
Date: 2009-03-17 09:08:32


Hi Andria

The problem is a permissions one - your system has been setup so that
only root has permission to open a TCP socket. I don't know what
system you are running - you might want to talk to your system admin
or someone knowledgeable on that operating system to ask them how to
revise the required permissions.

Ralph

On Mar 17, 2009, at 3:12 AM, -andria- wrote:

> Dear all,
>
> I am still learning how to create a parallel program with open-mpi.
>
> I try to run a mpihello program on my cluster, but it gives error
> when it is executed as ordinary (public) user. however, it gives the
> correct result when it is run by root user.
>
> why this happen? how can it be solved?
>
> attached you can find ompi_info --all output.
>
> the code:
>
> #include "mpi.h"
> #include "stdio.h"
>
> int main(int argc, char** argv) {
> int numprocs, rank, namelen;
> char processor_name[MPI_MAX_PROCESSOR_NAME];
>
> MPI_Init(&argc, &argv);
> MPI_Comm_size(MPI_COMM_WORLD, &numprocs);
> MPI_Comm_rank(MPI_COMM_WORLD, &rank);
> MPI_Get_processor_name(processor_name, &namelen);
> printf("Process %d on %s out of %d\n", rank, processor_name,
> numprocs);
> MPI_Finalize();
>
> return 0;
> }
>
> output:
> [public_at_cisitu01 ~]$ mpicc mpihello.c -o mpihello
>
> ### as public ###
> [public_at_cisitu01 ~]$ mpirun -np 4 -hostfile nodes.lst mpihello
> [cisitu02:02897] mca_oob_tcp_create_listen: bind() failed:
> Permission denied (13)
> [cisitu02:02897] mca_oob_tcp_init: unable to create listen socket
> [cisitu02:02898] mca_oob_tcp_create_listen: bind() failed:
> Permission denied (13)
> [cisitu02:02898] mca_oob_tcp_init: unable to create listen socket
> [cisitu02][0,1,1][btl_tcp_component.c:
> 412:mca_btl_tcp_component_create_listen] bind() failed with errno=13
> [cisitu02][0,1,3][btl_tcp_component.c:
> 412:mca_btl_tcp_component_create_listen] bind() failed with errno=13
> [cisitu02:02897] [0,1,1] ORTE_ERROR_LOG: Not found in file
> gpr_proxy_deliver_notify_msg.c at line 139
> [cisitu02:02898] [0,1,3] ORTE_ERROR_LOG: Not found in file
> gpr_proxy_deliver_notify_msg.c at line 139
> ^Cmpirun: killing job...
>
> mpirun noticed that job rank 0 with PID 2976 on node cisitu01 exited
> on signal 15 (Terminated).
> 3 additional processes aborted (not shown)
>
> ### as root ###
> -bash-3.2# mpirun -np 4 -hostfile nodes.lst mpihello
> Process 0 on cisitu01 out of 4
> Process 1 on cisitu02 out of 4
> Process 3 on cisitu02 out of 4
> Process 2 on cisitu01 out of 4
> -bash-3.2#
>
> thank you in advance,
>
> regards,
> -andria
> <ompi_info.all>_______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users