What IP interfaces are configured on the cluster? In particular, are there IPoIB interfaces that are configured? If you use the dynamic connection method but restrict either the number or type of IP interfaces to be used via oob_tcp_if_{include,exclude}, do you still see the problem?
using the flag --mca mpi_preconnect_mpi seems to solved the issue with the oob connection manager.This solution is not scalable but it looks more and more like a connection establishment problem.I'm still trying to figure out what is the root cause of this and how to solve it.Any ideas will be more then welcome.Thanks,DoronOn Tue, Jan 18, 2011 at 3:29 PM, Terry Dontje <terry.dontje@oracle.com> wrote:
On 01/18/2011 07:48 AM, Jeff Squyres wrote:No I think I meant OMPI oob.IBCM is broken and disabled (has been for a long time). Did you mean RDMACM?
sorry,
--![]()
Terry D. Dontje | Principal Software Engineer
Developer Tools Engineering | +1.781.442.2631
Oracle - Performance Technologies
95 Network Drive, Burlington, MA 01803
Email terry.dontje@oracle.com
_______________________________________________
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel