Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

From: Gleb Natapov (glebn_at_[hidden])
Date: 2007-06-13 13:37:22

On Wed, Jun 13, 2007 at 12:45:01PM -0400, Jeff Squyres wrote:
> On Jun 13, 2007, at 12:08 PM, Gleb Natapov wrote:
> > I am not committing this yet. I want people to review my logic and the
> > patch. If the change is OK with everyone how cares then I want this
> > change to go into 1.2 branch.
> >
> > I don't care how this change will get to the trunk. I can use patched
> > version for a while. If you branch is in working state right now I can
> > merge this change into it tomorrow.
> I was just bitten yesterday by a problem that I've known about for a
> while but had never gotten around to looking into (I could have sworn
> that there was an open trac ticket on this, but I can't find one
> anywhere).
> I have 2 hosts: one with 3 active ports and one with 2 active ports.
> If I run an MPI job between them, the openib BTL wireup got badly and
> it aborts. So handling a heterogeneous number of ports is not
> currently handled properly in the code.
Are the all in the same subnet? If not I fixed some bug yesterday that
may help.

> I don't know if Gleb's patch addresses this situation or not; I'll
> look at his patch this afternoon.
This patch address different problem.