Hello,
We have a random hungs of some applications (NAMD, Molpro, ...) when
using openib BTL.
We are using ompi 1.4.3 and ompi 1.3.4 compiled with icc intel compiler.
linux kernel : 2.6.18-128 RH, node have 8 cores.
OFED version : 3.2
ibv_devifno seems to be ok on all nodes.
Note that we dont have problems when running with TCP.
when i do strace -p value I got this infinite output :
poll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=6,
events=POLLIN}, {fd=7, events=POLLIN}
..
..
Any idea?
Than you for your help.
nixter
|