> I wonder if we can narrow this down a bit to perhaps a PML protocol
> Start by disabling RDMA by using:
> -mca btl_gm_flags 1
This helps some, I at least now see the start up of HPL, but i never
get a single pass, output ends at:
- Computational tests pass if scaled residuals are less
On the other-hand, with OB1 using btl_gm_flags 1 fixed the error
problem with OMPI! Which is a great first step.
mpirun -np 4 --mca btl_gm_flags 1 ./xhpl
Allowed HPL to run with no errors. I verified the performance was
better than when ran without gm
(added --mca btl ^gm )
So still a problem with DR which i dont need but im willing to help
Can we look into why leaving RDMA on if causing a problem?
> Let's see if that helps things out at all.
> - Galen
>> users mailing list
> users mailing list