Resending, as I didn't get any answer...
email@example.com wrote on 01/27/2012 05:38:34
> From: "nadia.derbey" <Nadia.Derbey@bull.net>
> To: Open MPI Developers <firstname.lastname@example.org>
> Date: 01/27/2012 05:35 PM
> Subject: [OMPI devel] btl/openib: get_ib_dev_distance
> processes as bound if the job has been launched by srun
> Sent by: email@example.com
> If a job is launched using "srun --resv-ports --cpu_bind:..."
> is configured with:
> each rank of that job is in a cpuset that contains a single CPU.
> Now, if we use carto on top of this, the following happens in
> get_ib_dev_distance() (in btl/openib/btl_openib_component.c):
> . opal_paffinity_base_get_processor_info() is called to get the
> number of logical processors (we get 1 due to the singleton cpuset)
> . we loop over that # of processors to check whether our process is
> bound to one of them. In our case the loop will be executed only
> once and we will never get the correct binding
> . if the process is bound actually get the distance to
> in our case we won't execute that part of the
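
The steps above can be illustrated with a small standalone model. This is
only a sketch under stated assumptions: it does not call any Open MPI or
paffinity function, the type cpu_mask_t and all three helper names are
hypothetical, and the visible processor count is modelled as the number of
CPUs in the cpuset. The full-scan variant is one possible way to avoid the
problem, not necessarily what the attached patch does.

```c
#include <stdint.h>

/* Hypothetical model: bit i of the mask is set when the process is
 * allowed to run on CPU id i. It stands in for the real affinity mask. */
typedef uint64_t cpu_mask_t;

/* Assumption: inside a cpuset, the number of processors reported to the
 * process equals the number of CPUs in that cpuset (bits set in mask). */
static int visible_processors(cpu_mask_t mask)
{
    int n = 0;
    while (mask) {
        n += (int)(mask & 1u);
        mask >>= 1;
    }
    return n;
}

/* Lookup as described in the mail: loop over [0, visible count) and test
 * each index against the mask. With a singleton cpuset on CPU 5 the loop
 * only tests index 0, so the binding is never found. */
static int find_bound_cpu_by_count(cpu_mask_t mask)
{
    int n = visible_processors(mask);
    for (int i = 0; i < n; i++) {
        if (mask & ((cpu_mask_t)1 << i)) {
            return i;
        }
    }
    return -1; /* process looks unbound */
}

/* Alternative sketch: scan every CPU id the mask can represent, so the
 * result does not depend on the visible processor count. */
static int find_bound_cpu_full_scan(cpu_mask_t mask)
{
    for (int i = 0; i < 64; i++) {
        if (mask & ((cpu_mask_t)1 << i)) {
            return i;
        }
    }
    return -1;
}
```

With a mask holding only CPU 5, find_bound_cpu_by_count() reports the
process as unbound, while find_bound_cpu_full_scan() returns 5.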
> The attached patch is a proposal to fix the issue.
> [attachment "get_ib_dev_distance.patch" deleted by Nadia
> BULL]