Open MPI Development Mailing List Archives

From: Ralph H Castain (rhc_at_[hidden])
Date: 2007-08-06 16:33:31


On 8/6/07 1:51 PM, "Jeff Squyres" <jsquyres_at_[hidden]> wrote:

> On Aug 6, 2007, at 11:49 AM, Ralph H Castain wrote:
>
>> 1. If everything is being done on localhost, I do not see any of the IO
>> from the child process. Mpirun executes and completes cleanly, however.
>> Because the spawned child terminates so quickly, I haven't been able to
>> positively confirm it is actually running - though I have some indication
>> that it is.
>
> This is probably my fault somehow;

Isn't everything?? :-)

> I can look into this but not
> immediately. I'm guessing this is related to the IOF fix that I put
> in last week sometime. If you can do without IO from the
> COMM_SPAWN children for a little while, I can look at it in a few
> days...

No problem, really - just wanted to ensure someone was aware of it.

>
>> 2. if running on multiple hosts, I see the output from the child processes,
>> but mpirun "hangs" in MPI_Comm_disconnect. A ctrl-C is able to kill the
>> entire job.
>
> I can't comment on this one...

Could be related - let's fix the first and see if the second goes away.

Thanks
Ralph

>
>> Any ideas on what might have happened? This was all working not that long
>> ago...can't swear to an r-level at the moment, but am hoping someone has an
>> idea before I start having to blindly work backwards to find out what broke
>> it.
>>
>> Thanks
>> Ralph