Hello all,
I'm new to the forum, or at least this is the first time I have posted.
I'm working with Open MPI and would like to try a small experiment: I want to
pass one process off as another.
For example, suppose two processes, ranks 1 and 2, are communicating, and
there is a third process, rank 3. I would like rank 3 (we can assume its node
was marked down in the initial hostfile) to take the place of rank 2, while
rank 1 still believes it is communicating with rank 2 when in fact it is
communicating with rank 3.
I guess I'll have to modify structures such as orte_job_map_t and
orte_proc_t, but I wanted to know whether anyone already has experience doing
something similar and could give me at least some guidance.
In principle, what the processes actually communicate does not matter, so I
will not need checkpoint/restart for now.