Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] Merging in the CPC work
From: Pavel Shamis (Pasha) (pasha_at_[hidden])
Date: 2008-04-24 08:22:58


The patch below resolves the segfault:

--- btl_openib_connect_ibcm.c.orig 2008-04-24 15:14:28.500676000 +0300
+++ btl_openib_connect_ibcm.c 2008-04-24 15:15:08.961168000 +0300
@@ -328,7 +328,7 @@
 {
     int rc;
     modex_msg_t *msg;
-    ibcm_module_t *m;
+    ibcm_module_t *m = NULL;
     opal_list_item_t *item;
     ibcm_listen_cm_id_t *cmh;
     ibcm_module_list_item_t *imli;

Jeff Squyres wrote:
> I had a linker error with the rdmacm library yesterday that I fixed
> later, sorry.
>
> Could you try it again? You'll need to svn up, re-autogen, etc. It
> should be obvious whether I fixed it -- even trivial apps will work or
> not work.
>
> Thanks.
>
>
> On Apr 24, 2008, at 6:24 AM, Gleb Natapov wrote:
>
>> On Thu, Apr 24, 2008 at 11:50:10AM +0300, Pavel Shamis (Pasha) wrote:
>>> Jeff,
>>> All my tests fail.
>>> XRC disabled tests failed with:
>>> mtt/installs/Zq_9/install/lib/openmpi/mca_btl_openib.so: undefined
>>> symbol: rdma_create_event_channel
>>> XRC enabled failed with a segfault; I will take a look later today.
>> Well it is a little bit better for me. I compiled only OOB connection
>> manager and ompi passes simple testing.
>>
>>>
>>> Pasha
>>>
>>> Jeff Squyres wrote:
>>>> As we discussed yesterday, I have started the merge from the
>>>> /tmp-public/openib-cpc2 branch. "oob" is currently the default.
>>>>
>>>> Unfortunately, it caused quite a few conflicts when I merged with the
>>>> trunk, so I created a new temp branch and put all the work there:
>>>> /tmp-public/openib-cpc3.
>>>>
>>>> Could all the IB and iWARP vendors and any other interested parties
>>>> please try this branch before we bring it back to the trunk? Please
>>>> test all functionality that you care about -- XRC, etc. I'd like to
>>>> bring it back to the trunk COB Thursday. Please let me know if this
>>>> is too soon.
>>>>
>>>> You can force the selection of a different CPC with the
>>>> btl_openib_cpc_include MCA param:
>>>>
>>>> mpirun --mca btl_openib_cpc_include oob ...
>>>> mpirun --mca btl_openib_cpc_include xoob ...
>>>> mpirun --mca btl_openib_cpc_include rdma_cm ...
>>>> mpirun --mca btl_openib_cpc_include ibcm ...
>>>>
>>>> You might want to concentrate on testing oob and xoob to ensure that
>>>> we didn't cause any regressions. The ibcm and rdma_cm CPCs probably
>>>> still have some rough edges (and the IBCM package in OFED itself may
>>>> not be 100% -- that's one of the things we're evaluating. It's known
>>>> to not install properly on RHEL4U4, for example -- you have to
>>>> manually mknod and chmod a device in /dev/infiniband for every HCA in
>>>> the host).
>>>>
>>>> Thanks.
>>>>
>>>>
>>>
>>>
>>> --
>>> Pavel Shamis (Pasha)
>>> Mellanox Technologies
>>>
>>> _______________________________________________
>>> devel mailing list
>>> devel_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>
>> --
>> Gleb.
>
>

-- 
Pavel Shamis (Pasha)
Mellanox Technologies