Open MPI User's Mailing List Archives

From: Anthony Chan (chan_at_[hidden])
Date: 2006-01-04 14:08:06


On Wed, 4 Jan 2006, Carsten Kutzner wrote:

> On Tue, 3 Jan 2006, Anthony Chan wrote:
>
> > MPE/MPE2 logging (or clog/clog2) does not impose any limitation on the
> > number of processes. Could you explain what difficulty or error
> > message you encountered when using >32 processes ?
>
> Either my program quits without writing the logfile (and without
> complaining) or it crashes in MPI_Finalize. I get the message
> "33 additional processes aborted (not shown)".

This is not an MPE error message. If logging crashes in MPI_Finalize,
it usually means that merging the logging data from the child nodes failed.
Since you didn't get any MPE error messages, the cause of the crash is
something MPE does not anticipate. Does anyone know whether "33 additional
processes aborted (not shown)" comes from Open MPI?

Since I don't know the real cause of the crash, this is what I would do:

1. Set MPE_TMPDIR or TMPDIR to a directory on a larger local filesystem to
   make sure that disk space is not the issue here.

2. Run <MPE_INSTALL_DIR>/share/examples_logging/cpilog with >32 processes
   to see if you get the same error message. If the same error occurs,
   there may be some other fundamental issue, e.g. a networking
   problem.
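
For step 1, a quick way to rule out disk space is to check how much room is
free in the directory the logging would use. The following is only a minimal
sketch in Python, not part of MPE. The helper names logging_tmpdir and
free_megabytes are made up for illustration, and the lookup order
(MPE_TMPDIR, then TMPDIR, then /tmp) is an assumption that should be checked
against the documentation of your MPE version.

```python
import os
import shutil


def logging_tmpdir(env=None):
    """Return the directory this sketch assumes is used for temporary log data."""
    env = os.environ if env is None else env
    # Assumed precedence (verify against your MPE docs):
    # MPE_TMPDIR first, then TMPDIR, then /tmp as a last resort.
    return env.get("MPE_TMPDIR") or env.get("TMPDIR") or "/tmp"


def free_megabytes(path):
    """Free space, in megabytes, on the filesystem that contains path."""
    return shutil.disk_usage(path).free / (1024 * 1024)


if __name__ == "__main__":
    tmpdir = logging_tmpdir()
    print(f"{tmpdir}: {free_megabytes(tmpdir):.0f} MB free")
```

Since each process writes to its own node's local filesystem, the check is
only meaningful if it is run on every node that takes part in the job.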

A.Chan

> Since this looks weird I think I will recompile with the newer MPE
> version you suggested. (When I do not link with MPE libraries my program
> runs fine.)
>
> Thanks,
> Carsten