Just out of curiosity - what happens when you add the following MCA option to your openib runs?
-mca btl_openib_flags 305
Los Alamos National Laboratory
On May 13, 2011, at 2:38 PM, Brock Palen wrote:
> On May 13, 2011, at 4:09 PM, Dave Love wrote:
>> Jeff Squyres <jsquyres_at_[hidden]> writes:
>>> On May 11, 2011, at 3:21 PM, Dave Love wrote:
>>>> We can reproduce it with IMB. We could provide access, but we'd have to
>>>> negotiate with the owners of the relevant nodes to give you interactive
>>>> access to them. Maybe Brock's would be more accessible? (If you
>>>> contact me, I may not be able to respond for a few days.)
>>> Brock has replied off-list that he, too, is able to reliably reproduce the issue with IMB, and is working to get access for us. Many thanks for your offer; let's see where Brock's access takes us.
>> Good. Let me know if we could be useful
>>>>> -- we have not closed this issue,
>>>> Which issue? I couldn't find a relevant-looking one.
>> Thanks. In csse it's useful info, it hangs for me with 1.5.3 & np=32 on
>> connectx with more than one collective I can't recall.
> Extra data point, that ticket said it ran with mpi_preconnect_mpi 1, well that doesn't help here, both my production code (crash) and IMB still hang.
> Brock Palen
> Center for Advanced Computing
>> Excuse the typping -- I have a broken wrist
>> users mailing list
> users mailing list