Hi
today I have installed openmpi-1.6.4rc3r27923. Unfortunately I
still have a problem with rankfiles, if I start a process on a
remote machine.
tyr rankfiles 114 ssh linpc1 ompi_info | grep "Open MPI:"
Open MPI: 1.6.4rc3r27923
tyr rankfiles 115 cat rf_linpc1
rank 0=linpc1 slot=0:0-1,1:0-1
tyr rankfiles 116 mpiexec -report-bindings -np 1 \
-rf rf_linpc1 hostname
------------------------------------------------------------------
All nodes which are allocated for this job are already filled.
------------------------------------------------------------------
The following command still works.
tyr rankfiles 119 mpiexec -report-bindings -np 1 -host linpc1 \
-cpus-per-proc 4 -bycore -bind-to-core hostname
[linpc1:32262] MCW rank 0 bound to socket 0[core 0-1]
socket 1[core 0-1]: [B B][B B]
linpc1
tyr rankfiles 120
Everything is fine, if I use the rankfile on the local machine.
linpc1 rankfiles 103 ompi_info | grep "Open MPI:"
Open MPI: 1.6.4rc3r27923
linpc1 rankfiles 104 cat rf_linpc1
rank 0=linpc1 slot=0:0-1,1:0-1
linpc1 rankfiles 105 mpiexec -report-bindings -np 1 \
-rf rf_linpc1 hostname
[linpc1:32385] MCW rank 0 bound to socket 0[core 0-1]
socket 1[core 0-1]: [B B][B B] (slot list 0:0-1,1:0-1)
linpc1
linpc1 rankfiles 106
In my opinion it should also work if I start a process on a
remote machine. Can somebody look once more into this issue?
Thank you very much for your help in advance.
Kind regards
Siegmar
|