Open MPI User's Mailing List Archives

Subject: [OMPI users] Hanging vs Stopping behaviour in communication failures
From: Constantinos Makassikis (cmakassikis_at_[hidden])
Date: 2009-12-09 03:47:20


Dear all,

Sometimes, when running Open MPI jobs, the application hangs. Looking at the
output, I find the following error message:

[ic17][[34562,1],74][../../../../../ompi/mca/btl/tcp/btl_tcp_frag.c:216:mca_btl_tcp_frag_recv] mca_btl_tcp_frag_recv: readv failed: No route to host (113)
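
The number in parentheses is the errno value reported by the failed readv call; on Linux, 113 is EHOSTUNREACH. The sketch below is a minimal illustration in Python, not Open MPI code: it extracts the code from the message text and applies a hypothetical retry-or-abort policy. The names parse_errno, is_fatal and TRANSIENT are made up for this example, and the choice of which errno values count as transient is an assumption.

```python
import errno
import re

# The error text from the log line above.
LOG_LINE = "mca_btl_tcp_frag_recv: readv failed: No route to host (113)"

# Hypothetical policy (an assumption, not Open MPI's behaviour):
# these errno values are worth retrying, everything else is fatal.
TRANSIENT = {errno.EINTR, errno.EAGAIN, errno.EWOULDBLOCK}


def parse_errno(line):
    """Return (message, code) from a '...: <message> (<code>)' log line."""
    match = re.search(r":\s*([^:()]+?)\s*\((\d+)\)\s*$", line)
    if match is None:
        raise ValueError("no errno found in line")
    return match.group(1), int(match.group(2))


def is_fatal(code):
    """True if the hypothetical policy would abort instead of retrying."""
    return code not in TRANSIENT


message, code = parse_errno(LOG_LINE)
# errno.errorcode maps numbers to symbolic names for the platform
# running this script, so the name may differ outside Linux.
name = errno.errorcode.get(code, "unknown")
print(f"{message} ({code}) -> {name}, fatal={is_fatal(code)}")
```

Under this policy an unreachable host counts as fatal, which is the behaviour I would have expected from the job.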

I would expect Open MPI to eventually quit with an error in such situations.
Is the observed behaviour (i.e., hanging) the intended one?

If so, what is the reason for choosing to hang rather than to stop with an
error?

Best Regards,

--
Constantinos