What's the latest status on Open-MPI fault tolerance? Is there any progress?
I am only interested to intercept problems when they occur (such as a
node crash) without taking the whole MPI_WORLD down with it. At least, I
want to cope with such situation.
I did use ERRORS_THROW_EXCEPTIONS; however, it did not work the way I
want it to.
We can't resolve problems by using the same kind of thinking we used
when we created them.