Dear OpenMPI developers,
I'm running my MPI application over Infiniband connection net over 128
processors. During the execution my application, i get a strange time
out error:
checkPAMRESActionTab: action 63 connecting to RES on host <node0389> timed
out after 200 seconds
Is a net problem or an application problem? How can i solve it?
Thanks in advance.
--
Ing. Gabriele Fatigati
Parallel programmer
CINECA Systems & Tecnologies Department
Supercomputing Group
Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
www.cineca.it Tel: +39 051 6171722
g.fatigati [AT] cineca.it
|