Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Problem with sending messages from one of the machines
From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2010-11-11 15:43:19


On Nov 11, 2010, at 3:23 PM, Krzysztof Zarzycki wrote:

> No, unfortunately specification of interfaces is a little more complicated... eth0/1/2 is not common for both machines.

Can you define "common"? Do you mean that eth0 on one machine is on a different network then eth0 on the other machine?

Is there any way that you can make them the same? It would certainly make things easier.

> I've tried to play with (oob/btl)_tcp_ if_include, but actually... I don't know exactly how.

See my other mail:

    http://www.open-mpi.org/community/lists/users/2010/11/14737.php

> Anyway, do you have any ideas how to further debug the communication problem?

The connect() is not getting through somehow. Sadly, we don't have enough debug messages to show exactly what is going wrong when these kinds of things happen; I have a half-finished branch that has much better debug/error messages, but I've never had the time to finish it (indeed, I think there's a bug in that development branch right now, otherwise I'd recommend giving it a whirl). :-\

-- 
Jeff Squyres
jsquyres_at_[hidden]
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/