hi all,
Many time, there are no difference between "proc" and "slot". But when
you use "mpirun -cpus-per-proc X", slot have X procs.
On orte/mca/rmaps/base/rmaps_base_common_mappers.c, there are a
confusion between proc and slot. this little error impact mapping action:
On OMPI last version with 32 cores compute node:
salloc -n 8 -c 8 mpirun -bind-to-core -bycore ./a.out
[rank:0]<stdout>: host:compute18
[rank:1]<stdout>: host:compute19
[rank:2]<stdout>: host:compute18
[rank:3]<stdout>: host:compute19
[rank:4]<stdout>: host:compute18
[rank:5]<stdout>: host:compute19
[rank:6]<stdout>: host:compute18
[rank:7]<stdout>: host:compute19
with patch:
[rank:0]<stdout>: host:compute18
[rank:1]<stdout>: host:compute18
[rank:2]<stdout>: host:compute18
[rank:3]<stdout>: host:compute18
[rank:4]<stdout>: host:compute19
[rank:5]<stdout>: host:compute19
[rank:6]<stdout>: host:compute19
[rank:7]<stdout>: host:compute19
Can you say, if my patch is correct ?
Thanks you
Damien
|