On Nov 24, 2013, at 8:30 AM, Jörg Bornschein <jb@capsec.org> wrote:

On 24.11.2013, at 10:22, Ralph Castain <rhc@open-mpi.org> wrote:

The CUDA support in the 1.7 series has been evolving: a number of patches have been applied since 1.7.3 was released, and I see another (for optimization) scheduled.

You might try the 1.7.4 nightly tarball and see if the problem has been fixed.


Same problem with 1.7.4-nightly.

But I compiled and started my little test program on a machine with actual InfiniBand hardware,
and the problem disappeared! I guess on machines with InfiniBand hardware OB1 is not
selected at runtime? Is this correct?

Sounds like a bug to me - if CUDA is being used, we need to select ob1 regardless. I'll have to let Rolf figure that one out.



I still believe that ompi/mca/pml/ob1/* is not linked to common_cuda.*, although it
should be. I'm slightly overwhelmed by automake, so I don't know how to add this
reference and try it myself.

Try the attached - should fix the problem.