Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] OpenMPI 1.2.5 race condition / core dump with MPI_Reduce and MPI_Gather
From: Gleb Natapov (glebn_at_[hidden])
Date: 2008-02-28 11:31:42

On Wed, Feb 27, 2008 at 10:01:06AM -0600, Brian W. Barrett wrote:
> The only solution to this problem is to suck it up and audit all the code
> to eliminate calls to opal_progress() in situations where infinite
> recursion can result. It's going to be long and painful, but there's no
> quick fix (IMHO).
The trick is to call progress only from functions that are called
directly by a user process. Never call progress from a callback functions.
The main offenders of this rule are calls to OMPI_FREE_LIST_WAIT(). They
should be changed to OMPI_FREE_LIST_GET() and dial with NULL return value.