Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

Subject: Re: [OMPI devel] Quiet Time on Trunk - ORCA Integration
From: Josh Hursey (jjhursey_at_[hidden])
Date: 2012-06-26 20:25:22


So I can confirm that it is not linking properly on the Mac. It -is-
running correctly on Linux (which is where I have been testing).

>From what I can tell this is a linking issue specific to the Mac. I'm
digging into it a bit at the moment.

Interesting way to tell is by using ompi_info as a canary. If you
compile with (what is default):
 ../../../ompi/libmpi.la -L/Users/jju/work/open-mpi/ompi-trunk/orte/ -lopen-rte
It will display components from OMPI,ORCA,OPAL
If you change that too:
 ../../../ompi/libmpi.la ../../../orte/libopen-rte.la
you get the same thing, but if you change it to:
 ./../../ompi/libmpi.la ../../../orte/libopen-rte.la ./../../ompi/libmpi.la
Then it only displays the ORTE,OPAL components.

So I am thinking that this is an issue with using two .la's in a
single linking - something that is not showing up on Linux.

Any pointers on what might be going on here would be appreciated as I
dig further.

-- Josh

On Tue, Jun 26, 2012 at 7:40 PM, Josh Hursey <jjhursey_at_[hidden]> wrote:
> That is odd. I did not see that when testing on Linux. I'll take a look.
>
> -- josh
>
> On Tue, Jun 26, 2012 at 7:37 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>> FWIW: it built fine on my Mac, but doesn't run. Just hangs when attempting to execute any MPI application. Will execute non-MPI apps though.
>>
>> Looking deeper, it looks like the app is stuck waiting to receive the sync registration "ack". Mpirun thinks it sent it, but the app is unable to receive it, so I would guess the app is failing to progress the RTE.
>>
>> If you can't resolve that quickly, I would suggest backing this out and the two of us looking at it in the morning. Somehow, you've lost the progress loop that was driving the RTE thru orte_init.
>>
>>
>> On Jun 26, 2012, at 3:44 PM, Josh Hursey wrote:
>>
>>> r26670 is the first of the ORCA commits. I am switching machines for
>>> testing. Hang on for a couple more hours while the initial testing is
>>> underway.
>>>
>>> -- Josh
>>>
>>> On Tue, Jun 26, 2012 at 4:34 PM, Josh Hursey <jjhursey_at_[hidden]> wrote:
>>>> I am requesting a quiet time on the trunk for ORCA integration
>>>> starting -now- (as previously announced). I will post back when
>>>> everything is committed and ready to go.
>>>>
>>>> Some reading while you are waiting:
>>>>  http://www.open-mpi.org/community/lists/devel/2012/06/11109.php
>>>>
>>>> Thanks,
>>>> Josh
>>>>
>>>> --
>>>> Joshua Hursey
>>>> Postdoctoral Research Associate
>>>> Oak Ridge National Laboratory
>>>> http://users.nccs.gov/~jjhursey
>>>
>>>
>>>
>>> --
>>> Joshua Hursey
>>> Postdoctoral Research Associate
>>> Oak Ridge National Laboratory
>>> http://users.nccs.gov/~jjhursey
>>>
>>> _______________________________________________
>>> devel mailing list
>>> devel_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>
>>
>> _______________________________________________
>> devel mailing list
>> devel_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>
>
> --
> Joshua Hursey
> Postdoctoral Research Associate
> Oak Ridge National Laboratory
> http://users.nccs.gov/~jjhursey

-- 
Joshua Hursey
Postdoctoral Research Associate
Oak Ridge National Laboratory
http://users.nccs.gov/~jjhursey