On Jun 2, 2010, at 1:31 PM, guillaume ranquet wrote:
> granquet_at_bordeplage-9 ~/openmpi-1.4.2 $ ~/openmpi-1.4.2-bin/bin/mpirun
> - --mca btl openib,sm,self --mca pml ^cm ~/bwlat/mpi_helloworld
> Hello world from process 0 of 1
> granquet_at_bordeplage-9 ~/openmpi-1.4.2 $
> I can tell it works :)
Ok. I think that OMPI is trying to open the MX MTL first. It fails at mx_init() (the first error message) but it had already created some mpool resources. It then tries to open the MX BTL and it skips the MX initialization and returns SUCCESS. The MX BTL then tries to call mx_get_info() which fails and prints the second message.
Try the attached patch. It tries to clean up if mx_init() fails and does not return SUCCESS on subsequent attempts to initialize MX.