Is there a way to detect a situation than one of the processes in an MPI application exits without even calling MPI_Init()?

I have a case in which all the processes except one are stuck forever in MPI_Init(), and that one exits before being able to call MPI_Init()…

I tried using the mca params that I thought might be related - orte_startup_timeout, orte_abort_timeout, but that didn’t help.




Jacob M. Katz | jacob.katz@intel.com | Work: +972-4-865-5726 | iNet: (8)-465-5726


Intel Israel (74) Limited

This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.