Hi all, I have "inherited" a small cluster with a head node and four compute nodes which I have to administer. The nodes are connected via infiniband (OFED), but the head is not. I am a complete novice to the infiniband stuff and here is my problem: The infiniband configuration seems to be OK. The usual tests suggested in the OFED install guide give the expected output, e.g.
ibv_devinfo on the nodes:
0,1,0]: uDAPL on host n01 was unable to find any NICs. Another transport will be used instead, although this may result in lower performance. -------------------------------------------------------------------------- -------------------------------------------------------------------------- [0,1,2]: uDAPL on host n01 was unable to find any NICs. Another transport will be used instead, although this may result in lower performance. -------------------------------------------------------------------------- -------------------------------------------------------------------------- [0,1,3]: uDAPL on host n02 was unable to find any NICs. Another transport will be used instead, although this may result in lower performance. -------------------------------------------------------------------------- -------------------------------------------------------------------------- [0,1,1]: uDAPL on host n02 was unable to find any NICs. Another transport will be used instead, although this may result in lower performance. --------------------------------------------------------------------------MPI with normal GB Etherrnet and IP networking just works fine, but the infinband doesn't. The MPI libs I am using