On Wed, Jan 30, 2008 at 12:05:50PM -0500, George Bosilca wrote:
> What is the real issue behind this whole discussion?
Hanging connections. See
https://svn.open-mpi.org/trac/ompi/ticket/1206
The multi-address peer tries to connect, but btl_tcp_proc_accept denies
due to not matching addresses. (less btl_endpoints than possible source
addresses)
r17331 and r17332 haven't fixed the issue. Don't code when leaving the
office ;) I'll have a look at it tomorrow.
Sorry for all the noise in the trunk.
> multiple IP addresses by interface the connection step will work. Now
> I can see a benefit of having multiple socket over the same link (and
> it's already implemented in Open MPI), but I don't see the interest of
> using multiple IP in this case.
I have an easy to reproduce testcase for #1206. If you like, we can step
through the debugger in a shared screen (screen -x) or VNC session.
Just mail me if you're interested. ;)
--
Cluster and Metacomputing Working Group
Friedrich-Schiller-Universität Jena, Germany
private: http://adi.thur.de
|