Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] problem with rankfile and openmpi-1.6.4rc3r27923
From: Siegmar Gross (Siegmar.Gross_at_[hidden])
Date: 2013-01-29 13:54:17


Hi

today I have installed openmpi-1.6.4rc3r27923. Unfortunately I
still have a problem with rankfiles, if I start a process on a
remote machine.

tyr rankfiles 114 ssh linpc1 ompi_info | grep "Open MPI:"
                Open MPI: 1.6.4rc3r27923

tyr rankfiles 115 cat rf_linpc1
rank 0=linpc1 slot=0:0-1,1:0-1

tyr rankfiles 116 mpiexec -report-bindings -np 1 \
  -rf rf_linpc1 hostname
------------------------------------------------------------------
All nodes which are allocated for this job are already filled.
------------------------------------------------------------------

The following command still works.

tyr rankfiles 119 mpiexec -report-bindings -np 1 -host linpc1 \
  -cpus-per-proc 4 -bycore -bind-to-core hostname
[linpc1:32262] MCW rank 0 bound to socket 0[core 0-1]
  socket 1[core 0-1]: [B B][B B]
linpc1
tyr rankfiles 120

Everything is fine, if I use the rankfile on the local machine.

linpc1 rankfiles 103 ompi_info | grep "Open MPI:"
 Open MPI: 1.6.4rc3r27923

linpc1 rankfiles 104 cat rf_linpc1
rank 0=linpc1 slot=0:0-1,1:0-1

linpc1 rankfiles 105 mpiexec -report-bindings -np 1 \
  -rf rf_linpc1 hostname
[linpc1:32385] MCW rank 0 bound to socket 0[core 0-1]
  socket 1[core 0-1]: [B B][B B] (slot list 0:0-1,1:0-1)
linpc1
linpc1 rankfiles 106

In my opinion it should also work if I start a process on a
remote machine. Can somebody look once more into this issue?
Thank you very much for your help in advance.

Kind regards

Siegmar