When my MPI code fails (seg fault), it usually cause the rest of the mpi process to abort as well. Perhaps rather than calling abort(), perhaps you could do a divide-by-zero operation to halt the program?
I've got a mpi program that is supposed to to generate a core file if
problems arise on any of the nodes. I tried to do this by adding a
call to abort() to my exit routines but this doesn't work; I get no core
file, and worse, mpirun doesn't detect that one of my nodes has
aborted(?) and doesn't kill off the entire job, except in the trivial
case where the number of processors I'm running on is 1. I've replaced
abort with MPI_Abort, which kills everything off, but leaves no core
file. Any suggestions how I can get one and still have mpi exit?
Thanks in advance.
users mailing list