Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

Subject: Re: [OMPI devel] Error after ompi-restart
From: Leonardo Fialho (lfialho_at_[hidden])
Date: 2008-11-04 10:16:56


I´m not sure, but I think that line 659 on file
orte/mca/ess/env/ess_env_module.c should contain

if (ORTE_SUCCESS != (ret =
orte_ess_base_build_nidmap(orte_process_info.sync_buf, &nidmap, *jmap*))) {

But actually it contains

if (ORTE_SUCCESS != (ret =
orte_ess_base_build_nidmap(orte_process_info.sync_buf, &nidmap,
*&jmap->pmap*))) {

No?

Leonardo

Leonardo Fialho escribió:
> Hi All,
>
> I think that exists an error in the trunk version while trying to
> restore a checkpoint.
>
> The function orte_util_decode_pidmap while attempts to execute the
> following code
>
> /* store the data */
> for (i=0; i < num_procs; i++) {
> pmap.node = nodes[i];
> pmap.local_rank = local_rank[i];
> pmap.node_rank = node_rank[i];
> opal_value_array_set_item(procs, i, &pmap);
> }
>
> produces a segmentation fault
>
> [nodo2:18027] *** Process received signal ***
> [nodo2:18027] Signal: Segmentation fault (11)
> [nodo2:18027] Signal code: Address not mapped (1)
> [nodo2:18027] Failing at address: (nil)
>
> I was trying to trace the problem and I think that it occurs in the
> line opal_value_array_set_item(procs, i, &pmap);
>
> Thanks,
>

-- 
Leonardo Fialho
Computer Architecture and Operating Systems Department - CAOS
Universidad Autonoma de Barcelona - UAB
ETSE, Edifcio Q, QC/3088
http://www.caos.uab.es
Phone: +34-93-581-2888
Fax: +34-93-581-2478