Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] RFC: Resilient ORTE
From: Josh Hursey (jjhursey_at_[hidden])
Date: 2011-06-17 17:32:55


Does this include a fix for the problem I reported with mpirun-hosted processes?

If not, I would ask that we hold off on putting it into the trunk
until that particular bug is addressed. In my experience, tackling
this particular issue requires some code refactoring, which should
probably go into the trunk as one commit rather than as two possibly
disruptive ones.

-- Josh

On Fri, Jun 17, 2011 at 5:18 PM, Wesley Bland <wbland_at_[hidden]> wrote:
> This is a reminder that the Resilient ORTE RFC is set to go into the trunk
> on Monday at COB.
> I've updated the code with a few of the changes that were mentioned on and
> off the list (moved code out of orted_comm.c, errmgr_set_callback returns
> previous callback, post_startup function, corrected normal termination
> issues). Please take another look at it if you have any interest. The code
> can be found here:
> https://bitbucket.org/wesbland/resilient-orte/
> Thanks,
> Wesley Bland

-- 
Joshua Hursey
Postdoctoral Research Associate
Oak Ridge National Laboratory
http://users.nccs.gov/~jjhursey