Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

Subject: Re: [OMPI devel] Quiet Time on Trunk - ORCA Integration
From: Josh Hursey (jjhursey_at_[hidden])
Date: 2012-06-26 21:04:48


So I'm spinning my wheels on this one. I am going to need someone with
more knowledge about linking to help (Jeff or Brian maybe?).

I'll see what I can do to back this out of the trunk :(

I wish this issue would have come up earlier - I just don't ever build
on my mac, so I never saw it.

-- Josh

On Tue, Jun 26, 2012 at 8:25 PM, Josh Hursey <jjhursey_at_[hidden]> wrote:
> So I can confirm that it is not linking properly on the Mac. It -is-
> running correctly on Linux (which is where I have been testing).
>
> From what I can tell this is a linking issue specific to the Mac. I'm
> digging into it a bit at the moment.
>
> Interesting way to tell is by using ompi_info as a canary. If you
> compile with (what is default):
>  ../../../ompi/libmpi.la -L/Users/jju/work/open-mpi/ompi-trunk/orte/ -lopen-rte
> It will display components from OMPI,ORCA,OPAL
> If you change that too:
>  ../../../ompi/libmpi.la  ../../../orte/libopen-rte.la
> you get the same thing, but if you change it to:
>  ./../../ompi/libmpi.la  ../../../orte/libopen-rte.la ./../../ompi/libmpi.la
> Then it only displays the ORTE,OPAL components.
>
> So I am thinking that this is an issue with using two .la's in a
> single linking - something that is not showing up on Linux.
>
> Any pointers on what might be going on here would be appreciated as I
> dig further.
>
> -- Josh
>
>
> On Tue, Jun 26, 2012 at 7:40 PM, Josh Hursey <jjhursey_at_[hidden]> wrote:
>> That is odd. I did not see that when testing on Linux. I'll take a look.
>>
>> -- josh
>>
>> On Tue, Jun 26, 2012 at 7:37 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>>> FWIW: it built fine on my Mac, but doesn't run. Just hangs when attempting to execute any MPI application. Will execute non-MPI apps though.
>>>
>>> Looking deeper, it looks like the app is stuck waiting to receive the sync registration "ack". Mpirun thinks it sent it, but the app is unable to receive it, so I would guess the app is failing to progress the RTE.
>>>
>>> If you can't resolve that quickly, I would suggest backing this out and the two of us looking at it in the morning. Somehow, you've lost the progress loop that was driving the RTE thru orte_init.
>>>
>>>
>>> On Jun 26, 2012, at 3:44 PM, Josh Hursey wrote:
>>>
>>>> r26670 is the first of the ORCA commits. I am switching machines for
>>>> testing. Hang on for a couple more hours while the initial testing is
>>>> underway.
>>>>
>>>> -- Josh
>>>>
>>>> On Tue, Jun 26, 2012 at 4:34 PM, Josh Hursey <jjhursey_at_[hidden]> wrote:
>>>>> I am requesting a quiet time on the trunk for ORCA integration
>>>>> starting -now- (as previously announced). I will post back when
>>>>> everything is committed and ready to go.
>>>>>
>>>>> Some reading while you are waiting:
>>>>>  http://www.open-mpi.org/community/lists/devel/2012/06/11109.php
>>>>>
>>>>> Thanks,
>>>>> Josh
>>>>>
>>>>> --
>>>>> Joshua Hursey
>>>>> Postdoctoral Research Associate
>>>>> Oak Ridge National Laboratory
>>>>> http://users.nccs.gov/~jjhursey
>>>>
>>>>
>>>>
>>>> --
>>>> Joshua Hursey
>>>> Postdoctoral Research Associate
>>>> Oak Ridge National Laboratory
>>>> http://users.nccs.gov/~jjhursey
>>>>
>>>> _______________________________________________
>>>> devel mailing list
>>>> devel_at_[hidden]
>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>
>>>
>>> _______________________________________________
>>> devel mailing list
>>> devel_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>
>>
>>
>> --
>> Joshua Hursey
>> Postdoctoral Research Associate
>> Oak Ridge National Laboratory
>> http://users.nccs.gov/~jjhursey
>
>
>
> --
> Joshua Hursey
> Postdoctoral Research Associate
> Oak Ridge National Laboratory
> http://users.nccs.gov/~jjhursey

-- 
Joshua Hursey
Postdoctoral Research Associate
Oak Ridge National Laboratory
http://users.nccs.gov/~jjhursey