Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

From: Xiaoning (David) Yang (xyang_at_[hidden])
Date: 2006-03-02 13:58:54


Yes, that's it! I do have an if statement for interactive shells. Now I
know. Thanks.

David

***** Correspondence *****

> From: Brian Barrett <brbarret_at_[hidden]>
> Reply-To: Open MPI Users <users_at_[hidden]>
> Date: Thu, 2 Mar 2006 12:09:18 -0500
> To: Open MPI Users <users_at_[hidden]>
> Subject: Re: [OMPI users] tcsh: orted: Not Found
>
> On Mar 2, 2006, at 11:34 AM, Xiaoning (David) Yang wrote:
>
>> Thank you for the help. I did include path to orted in my .tcshrc
>> file on
>> mac2, but I put the path at the end of the file. It is interesting
>> that when
>> I logged into mac with ssh, the path was included and orted was in
>> my path.
>> But when I ran "ssh mac2 which orted", orted was not found. It
>> finds orted
>> only after I move the path from the end of .tcshrc to the beginning
>> of the
>> file. Strange. Again, thanks and at least I may make MPI work.
>
> Do you have a test like if ( $?prompt ) exit towards the end of
> your .tcshrc? Most .tcshrc files do, and the end is only evaluated
> for interactive shells (which the one to start the orted is not).
> This is probably why moving it to the top helped.
>
> Anyway, glad to hear things are working for you.
>
> Brian
>
>
>
>>> From: Brian Barrett <brbarret_at_[hidden]>
>>> Reply-To: Open MPI Users <users_at_[hidden]>
>>> Date: Thu, 2 Mar 2006 00:24:27 -0500
>>> To: Open MPI Users <users_at_[hidden]>
>>> Subject: Re: [OMPI users] tcsh: orted: Not Found
>>>
>>> On Mar 1, 2006, at 5:26 PM, Xiaoning (David) Yang wrote:
>>>
>>>> I installed Open MPI 1.0.1 on two Mac G5s (one with two cpus and
>>>> the other
>>>> with 4 cpus.). I set up ssh on both machines according to the FAQ.
>>>> My mpi
>>>> jobs work fine if I run the jobs on only one computer. But when I
>>>> ran a job
>>>> across the two Macs from the first Mac mac1, I got:
>>>>
>>>> mac1: mpirun -np 6 --hostfiles /Users/me/my_hosts hello_world
>>>> tcsh: orted: Command not found.
>>>> [mac1:01019] ERROR: A daemon on node mac2 failed to start as
>>>> expected.
>>>> [mac1:01019] ERROR: There may be more information available from
>>>> [mac1:01019] ERROR: the remote shell (see above).
>>>> [mac1:01019] ERROR: The daemon exited unexpectedly with status 1.
>>>> 2 processes killed (possibly by Open MPI)
>>>>
>>>> File my_hosts looks like
>>>>
>>>> mac1 slots=2
>>>> mac2 slots=4
>>>>
>>>> The orted is definitely on my path on both machines. Any idea?
>>>> Help is
>>>> greatly appreciated!
>>>
>>> I'm guessing that the issue is with your shell configuration. mpirun
>>> starts the orted on the remote node through rsh/ssh, which will start
>>> a non-login shell on the remote node. Unfortunately, the set of
>>> dotfiles evaluated when a non-login shell is different than when
>>> starting a login shell. The easiest way to tell if this is the issue
>>> is to check whether orted is in your path when started in a non-login
>>> shell with a command like:
>>>
>>> ssh remote_host which orted
>>>
>>> More information on how to configure your particular shell for use
>>> with Open MPI can be found in our FAQ at:
>>>
>>> http://www.open-mpi.org/faq/?category=running
>>>
>>>
>>> Hope this helps,
>>>
>>> Brian
>>>
>>> --
>>> Brian Barrett
>>> Open MPI developer
>>> http://www.open-mpi.org/
>>>
>>>
>>> _______________________________________________
>>> users mailing list
>>> users_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>>
>> _______________________________________________
>> users mailing list
>> users_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users