Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Help diagnosing problem: not being able to run MPI code across computers
From: Ralph Castain (rhc_at_[hidden])
Date: 2013-05-06 08:52:23


On May 6, 2013, at 2:10 AM, Angel de Vicente <angelv_at_[hidden]> wrote:

> Hi,
>
> Ralph Castain <rhc_at_[hidden]> writes:
>
>> On May 4, 2013, at 4:54 PM, Angel de Vicente <angelv_at_[hidden]> wrote:
>>>
>>> Is there any way to dump details of what OpenMPI is trying to do in each
>>> node, so I can see if it is looking for different libraries in each
>>> node, or something similar?
>
>
> thanks for the suggestions, but I'm still stuck:
>
>> What I do is simply "ssh ompi_info -V" to each remote node and compare
>> results - you should get the same answer everywhere.
>
> exactly the same information in the three connected machines

So you should then be getting the same libraries

>
>> Another option in these situations is to configure
>> --enable-orterun-prefix-by-default. If you install in the same
>> location on each node (e.g., on an NSF mount), then this will ensure
>> you get that same library.
>
> Re-configured and re-compiled OpenMPI, but I get the same behaviour.
>
> I'm starting to think that perhaps is a firewall issue? I don't have
> root access in these machines but I'll try to investigate.

Given that result, then yes - check iptables. I suspect they are running and TCP socket comm is being blocked.

>
> Cheers,
> --
> Ángel de Vicente
> http://angel-de-vicente.blogspot.com/
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users