Open MPI User's Mailing List Archives

From: Jens Klostermann (Jens.Klostermann_at_[hidden])
Date: 2006-06-12 10:51:05


This morning I was running

mpirun -v --mca btl mvapi,self -np 12 --hostfile ompimachinefile
oodles . les_test1100k -parallel >> ./les_test1100k/log12 &

with openmpi-1.2a1r10111 and everything worked and still works as
expected.

Then I tried to start a second, identical job, which failed with the
following error message:

[stokes:29489] [0,0,0] ORTE_ERROR_LOG: Error in file
runtime/orte_init_stage1.c at line 302
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  orte_session_dir failed
  --> Returned value -1 instead of ORTE_SUCCESS

--------------------------------------------------------------------------
[stokes:29489] [0,0,0] ORTE_ERROR_LOG: Error in file
runtime/orte_system_init.c at line 42
[stokes:29489] [0,0,0] ORTE_ERROR_LOG: Error in file runtime/orte_init.c
at line 49
--------------------------------------------------------------------------
Open RTE was unable to initialize properly. The error occured while
attempting to orte_init(). Returned value -1 instead of ORTE_SUCCESS.

Does anybody have an idea what the error might be or how to track it down?

Regards,
Jens