Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

Subject: Re: [OMPI devel] Merging in the CPC work
From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2008-04-24 08:37:44


Thanks! That's a result of some [helpful] error messages and handling
that I added yesterday when ibcm is not configured on the host.

Fixed in r18273.

On Apr 24, 2008, at 8:22 AM, Pavel Shamis (Pasha) wrote:

> The patch below resolves the segfault :
>
> -- btl_openib_connect_ibcm.c.orig 2008-04-24 15:14:28.500676000
> +0300
> +++ btl_openib_connect_ibcm.c 2008-04-24 15:15:08.961168000 +0300
> @@ -328,7 +328,7 @@
> {
> int rc;
> modex_msg_t *msg;
> - ibcm_module_t *m;
> + ibcm_module_t *m = NULL;
> opal_list_item_t *item;
> ibcm_listen_cm_id_t *cmh;
> ibcm_module_list_item_t *imli;
>
>
> Jeff Squyres wrote:
>> I had a linker error with the rdmacm library yesterday that I fixed
>> later, sorry.
>>
>> Could you try it again? You'll need to svn up, re-autogen, etc.
>> It should be obvious whether I fixed it -- even trivial apps will
>> work or not work.
>>
>> Thanks.
>>
>>
>> On Apr 24, 2008, at 6:24 AM, Gleb Natapov wrote:
>>
>>> On Thu, Apr 24, 2008 at 11:50:10AM +0300, Pavel Shamis (Pasha)
>>> wrote:
>>>> Jeff,
>>>> All my tests fail.
>>>> XRC disabled tests failed with:
>>>> mtt/installs/Zq_9/install/lib/openmpi/mca_btl_openib.so: undefined
>>>> symbol: rdma_create_event_channel
>>>> XRC enabled failed with segfault , I will take a look later today.
>>> Well it is a little bit better for me. I compiled only OOB
>>> connection
>>> manager and ompi passes simple testing.
>>>
>>>>
>>>> Pasha
>>>>
>>>> Jeff Squyres wrote:
>>>>> As we discussed yesterday, I have started the merge from the /tmp-
>>>>> public/openib-cpc2 branch. "oob" is currently the default.
>>>>>
>>>>> Unfortunately, it caused quite a few conflicts when I merged
>>>>> with the
>>>>> trunk, so I created a new temp branch and put all the work
>>>>> there: /tmp-
>>>>> public/openib-cpc3.
>>>>>
>>>>> Could all the IB and iWARP vendors and any other interested
>>>>> parties
>>>>> please try this branch before we bring it back to the trunk?
>>>>> Please
>>>>> test all functionality that you care about -- XRC, etc. I'd
>>>>> like to
>>>>> bring it back to the trunk COB Thursday. Please let me know if
>>>>> this
>>>>> is too soon.
>>>>>
>>>>> You can force the selection of a different CPC with the
>>>>> btl_openib_cpc_include MCA param:
>>>>>
>>>>> mpirun --mca btl_openib_cpc_include oob ...
>>>>> mpirun --mca btl_openib_cpc_include xoob ...
>>>>> mpirun --mca btl_openib_cpc_include rdma_cm ...
>>>>> mpirun --mca btl_openib_cpc_include ibcm ...
>>>>>
>>>>> You might want to concentrate on testing oob and xoob to ensure
>>>>> that
>>>>> we didn't cause any regressions. The ibcm and rdma_cm CPCs
>>>>> probably
>>>>> still have some rough edges (and the IBCM package in OFED itself
>>>>> may
>>>>> not be 100% -- that's one of the things we're evaluating. It's
>>>>> known
>>>>> to not install properly on RHEL4U4, for example -- you have to
>>>>> manually mknod and chmod a device in /dev/infiniband for every
>>>>> HCA in
>>>>> the host).
>>>>>
>>>>> Thanks.
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Pavel Shamis (Pasha)
>>>> Mellanox Technologies
>>>>
>>>> _______________________________________________
>>>> devel mailing list
>>>> devel_at_[hidden]
>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>
>>> --
>>> Gleb.
>>> _______________________________________________
>>> devel mailing list
>>> devel_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>
>>
>
>
> --
> Pavel Shamis (Pasha)
> Mellanox Technologies
>

-- 
Jeff Squyres
Cisco Systems