I failed to run on different nodes or on the same node via self,openib
I checked this out some more and I believe it is ticket #1378 related. We lock up if SM is included in the BTL's, which is what I had done on my test. If I ^sm, I can run fine.On Jul 28, 2008, at 6:41 AM, Ralph Castain wrote:It could also be something new. Brad and I noted on Fri that IB was locking up as soon as we tried any cross-node communications. Hadn't seen that before, and at least I haven't explored it further - planned to do so today._______________________________________________On Jul 28, 2008, at 6:01 AM, Lenny Verkhovsky wrote:I believe it it.On 7/28/08, Jeff Squyres <jsquyres@cisco.com> wrote:On Jul 28, 2008, at 7:51 AM, Jeff Squyres wrote:
Is this related to r1378?
Gah -- I meant #1378, meaning the "PML ob1 deadlock" ticket.
On Jul 28, 2008, at 7:13 AM, Lenny Verkhovsky wrote:
Hi,
I experience hanging of tests ( latency ) since r19010
Best Regards
Lenny.
_______________________________________________
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
--
Jeff Squyres
Cisco Systems
--
Jeff Squyres
Cisco Systems
_______________________________________________
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel
_______________________________________________
devel mailing list
devel@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel