Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] trunk hangs since r19010
From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2008-07-29 09:47:21


Ok. FWIW, Pasha and I think that openib has supported "send-to-self"
for a while (we don't know exactly when, but Pasha thinks the missing
check for self in add_procs is very old code). But it only broke
recently.

On Jul 29, 2008, at 9:31 AM, George Bosilca wrote:

> I ran a few tests, and the only combination leading to a deadlock is
> openib and self. As openib is the only BTL supporting self
> communications (except self, of course), I guess it interferes with
> self in some more or less strange way. I haven't had time to dig
> deeper yet to see what exactly happens there; I'll schedule this
> for later today.
>
> george.
>
> On Jul 29, 2008, at 8:52 AM, Pavel Shamis (Pasha) wrote:
>
>> Jeff Squyres wrote:
>>>
>>> This used to be true, but I think we changed it a while ago
>>> (Pasha: do you remember?) because Mellanox HCAs are capable of
>>> send-to-self (process) and there were no code changes necessary to
>>> enable it. So it allowed a slightly simpler command line. This
>>> was quite a while ago, IIRC.
>> Yep, correct.
>>
>> FYI. In my MTT testing I also see a lot of killed tests.
>> _______________________________________________
>> devel mailing list
>> devel_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel

-- 
Jeff Squyres
Cisco Systems