Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Crashes over TCP/ethernet but not on shared memory
From: George Bosilca (bosilca_at_[hidden])
Date: 2008-10-10 14:54:58

On Oct 10, 2008, at 12:42 PM, V. Ram wrote:

> Can anyone else suggest why the code might be crashing when running
> over
> ethernet and not over shared memory? Any suggestions on how to debug
> this or interpret the error message issued from btl_tcp_frag.c ?

Unfortunately this is a standard error message which do not enlighten
us on what the real error is/was. It simply state that one node failed
to read data from a socket, which usually happens when the remote peer
died unexpectedly (such as a seg-fault).