Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] can't run mpi-jobs on remote host
From: Ralph Castain (rhc_at_[hidden])
Date: 2014-04-14 10:08:59


I'm confused: how are you building OMPI? You normally have to do:

1. ./configure --prefix=<foo> .... --enable-debug is a configure option, so this is where you would add it.

2. make clean all install

You then run your mpirun command as you've done.
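
As a rough sketch, the two steps above might look like this when run from the top of the unpacked Open MPI source tree. The install prefix ($HOME/openmpi-debug) is only a placeholder for <foo>, and the ./configure guard is there just so the snippet does nothing harmful if run from the wrong directory:

```shell
# Hypothetical install prefix standing in for <foo>; any writable path works.
PREFIX="${PREFIX:-$HOME/openmpi-debug}"

if [ -x ./configure ]; then
    # Step 1: --enable-debug is given to configure, not to mpirun.
    ./configure --prefix="$PREFIX" --enable-debug &&
    # Step 2: rebuild from scratch and install under $PREFIX.
    make clean all install &&
    echo "installed under $PREFIX; rerun mpirun from $PREFIX/bin"
else
    echo "no ./configure here; cd into the Open MPI source directory first" >&2
fi
```

Any other configure options you normally use would go on the same ./configure line.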

On Apr 14, 2014, at 12:52 AM, Lubrano Francesco <lubrano.francesco_at_[hidden]> wrote:

> I can't set --enable-debug (command not found; the help output only lists --enable-recovery), but the other commands work properly. The output is:
>
> francesco_at_linux-hldu:~> mpirun -mca plm_base_verbose 10 --debug-daemons --host Frank_at_158.110.39.110 hostname
> [linux-hldu.site:02234] mca: base: components_register: registering plm components
> [linux-hldu.site:02234] mca: base: components_register: found loaded component isolated
> [linux-hldu.site:02234] mca: base: components_register: component isolated has no register or open function
> [linux-hldu.site:02234] mca: base: components_register: found loaded component rsh
> [linux-hldu.site:02234] mca: base: components_register: component rsh register function successful
> [linux-hldu.site:02234] mca: base: components_register: found loaded component slurm
> [linux-hldu.site:02234] mca: base: components_register: component slurm register function successful
> [linux-hldu.site:02234] mca: base: components_open: opening plm components
> [linux-hldu.site:02234] mca: base: components_open: found loaded component isolated
> [linux-hldu.site:02234] mca: base: components_open: component isolated open function successful
> [linux-hldu.site:02234] mca: base: components_open: found loaded component rsh
> [linux-hldu.site:02234] mca: base: components_open: component rsh open function successful
> [linux-hldu.site:02234] mca: base: components_open: found loaded component slurm
> [linux-hldu.site:02234] mca: base: components_open: component slurm open function successful
> [linux-hldu.site:02234] mca:base:select: Auto-selecting plm components
> [linux-hldu.site:02234] mca:base:select:( plm) Querying component [isolated]
> [linux-hldu.site:02234] mca:base:select:( plm) Query of component [isolated] set priority to 0
> [linux-hldu.site:02234] mca:base:select:( plm) Querying component [rsh]
> [linux-hldu.site:02234] mca:base:select:( plm) Query of component [rsh] set priority to 10
> [linux-hldu.site:02234] mca:base:select:( plm) Querying component [slurm]
> [linux-hldu.site:02234] mca:base:select:( plm) Skipping component [slurm]. Query failed to return a module
> [linux-hldu.site:02234] mca:base:select:( plm) Selected component [rsh]
> [linux-hldu.site:02234] mca: base: close: component isolated closed
> [linux-hldu.site:02234] mca: base: close: unloading component isolated
> [linux-hldu.site:02234] mca: base: close: component slurm closed
> [linux-hldu.site:02234] mca: base: close: unloading component slurm
> Daemon was launched on linux-o5sl.site - beginning to initialize
> [linux-o5sl.site:02271] mca: base: components_register: registering plm components
> [linux-o5sl.site:02271] mca: base: components_register: found loaded component rsh
> [linux-o5sl.site:02271] mca: base: components_register: component rsh register function successful
> [linux-o5sl.site:02271] mca: base: components_open: opening plm components
> [linux-o5sl.site:02271] mca: base: components_open: found loaded component rsh
> [linux-o5sl.site:02271] mca: base: components_open: component rsh open function successful
> [linux-o5sl.site:02271] mca:base:select: Auto-selecting plm components
> [linux-o5sl.site:02271] mca:base:select:( plm) Querying component [rsh]
> [linux-o5sl.site:02271] mca:base:select:( plm) Query of component [rsh] set priority to 10
> [linux-o5sl.site:02271] mca:base:select:( plm) Selected component [rsh]
> Daemon [[33734,0],1] checking in as pid 2271 on host linux-o5sl
> [linux-o5sl.site:02271] [[33734,0],1] orted: up and running - waiting for commands!
> [linux-o5sl.site:02271] mca: base: close: component rsh closed
> [linux-o5sl.site:02271] mca: base: close: unloading component rsh
> [linux-hldu.site:02234] [[33734,0],0] orted_cmd: received exit cmd
> [linux-hldu.site:02234] [[33734,0],0] orted_cmd: all routes and children gone - exiting
> [linux-hldu.site:02234] mca: base: close: component rsh closed
> [linux-hldu.site:02234] mca: base: close: unloading component rsh
>
> Is orted on linux-o5sl receiving any commands?
> Thank you for your cooperation
>
> (I don't know if it matters, but I have the same problem when I use the first PC as the remote host and the second as the local one.)
>
> regards
>
> Francesco