Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] Hangs on the trunk
From: Rolf vandeVaart (rvandevaart_at_[hidden])
Date: 2014-07-14 17:09:28


Thanks. That fixed my issue. Will check MTT after tonight's run.

>-----Original Message-----
>From: devel [mailto:devel-bounces_at_[hidden]] On Behalf Of Nathan
>Hjelm
>Sent: Monday, July 14, 2014 5:01 PM
>To: Open MPI Developers
>Subject: Re: [OMPI devel] Hangs on the trunk
>
>Found the typo. Fixed in r32232
>
>-Nathan
>
>On Mon, Jul 14, 2014 at 02:44:06PM -0600, Nathan Hjelm wrote:
>> Yeah. I think it is a problem I introduced. I am trying to track it
>> down now. Probably messed up one of the conversions in ob1.
>>
>> -Nathan
>>
>> On Mon, Jul 14, 2014 at 01:38:38PM -0700, Rolf vandeVaart wrote:
>> > I have noticed that I am seeing some tests hang on the trunk. For
>> > example:
>> >
>> >
>> >
>> > $ mpirun --mca btl_tcp_if_include eth0 --host
>> > drossetti-ivy0,drossetti-ivy1 -np 2 --mca pml ob1 --mca btl sm,tcp,self
>> > --mca coll_mdisable_allgather 1 --mca btl_openib_warn_default_gid_prefix 0
>> > send
>> >
>> >
>> >
>> > It is not unusual for this test to take several minutes, particularly on
>> > slow networks.
>> >
>> > Please be patient.
>> >
>> > NOTICE: Using max message size: 10485760
>> >
>> > Progress: [=====
>> >
>> >
>> >
>> > Is anyone else seeing this? (This is really a hang in spite of the
>> > message saying it should take a few minutes)
>> >
>> >
>> >
>> > This started with the changes Nathan did for renaming the descriptor
>> > fields - r32196 through r32202.
>> >
>> > From what I can tell, it looks like it hangs the second time the
>> > rendezvous protocol is used to send the data.
>> >
>> >
>> >
>> > Rolf
>> >
>>
>> > _______________________________________________
>> > devel mailing list
>> > devel_at_[hidden]
>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
>> > Link to this post:
>> > http://www.open-mpi.org/community/lists/devel/2014/07/15136.php
>>
>
>
>