Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: [OMPI users] more XGrid Problems with openmpi1.2.9
From: Ricardo Fernández-Perea (rfernandezperea_at_[hidden])
Date: 2009-02-27 07:43:41


Hi
It seems to me more like time issues.
All the runs end with something similar to

Exception Type: EXC_BAD_ACCESS (SIGSEGV)
Exception Codes: KERN_INVALID_ADDRESS at 0x0000000045485308
Crashed Thread: 0

Thread 0 Crashed:
0 libSystem.B.dylib 0x95208f04 strcmp + 84
1 libopen-rte.0.dylib 0x000786fd
orte_pls_base_get_active_daemons + 45
2 mca_pls_xgrid.so 0x00271725
orte_pls_xgrid_terminate_orteds + 117 (pls_xgrid_module.m:133)
3 mpirun 0x000020ec orterun + 1896 (orterun.c:468)
4 mpirun 0x00001982 main + 24 (main.c:14)
5 mpirun 0x0000193e start + 54

Thread 1:

A simple mpirun -n 4 ring give the following results

 Process 0 sending 10 to 1 tag 201 (
    4 processes in ring)
 Process 1 exiting
 Process 2 exiting
 Process 3 exiting
 Process 0 sent to 1
 Process 0 decremented value: 9
 Process 0 decremented value: 8
 Process 0 decremented value: 7
 Process 0 decremented value: 6
 Process 0 decremented value: 5
 Process 0 decremented value: 4
 Process 0 decremented value: 3
 Process 0 decremented value: 2
 Process 0 decremented value: 1
 Process 0 decremented value: 0
 Process 0 exiting
[nexus11:38502] *** Process received signal ***
[nexus11:38502] Signal: Segmentation fault (11)
[nexus11:38502] Signal code: Address not mapped (1)
[nexus11:38502] Failing at address: 0x45485308
[nexus11:38502] [ 0] 2 libSystem.B.dylib 0x9526c2bb
_sigtramp + 43
[nexus11:38502] [ 1] 3 ??? 0xffffffff 0x0
+ 4294967295
[nexus11:38502] [ 2] 4 libopen-rte.0.dylib 0x000786fd
orte_pls_base_get_active_daemons + 45
[nexus11:38502] [ 3] 5 mca_pls_xgrid.so 0x00271725
orte_pls_xgrid_terminate_orteds + 117
[nexus11:38502] [ 4] 6 mpirun 0x000020ec
orterun + 1896
[nexus11:38502] [ 5] 7 mpirun 0x00001982 main
+ 24
[nexus11:38502] [ 6] 8 mpirun 0x0000193e
start + 54
[nexus11:38502] [ 7] 9 ??? 0x00000004 0x0
+ 4
[nexus11:38502] *** End of error message ***
Segmentation fault

Any idea of what I can do?

Ricardo

my ompi_info is
                Open MPI: 1.2.9
   Open MPI SVN revision: r20259
                Open RTE: 1.2.9
   Open RTE SVN revision: r20259
                    OPAL: 1.2.9
       OPAL SVN revision: r20259
                  Prefix: /opt/openmpi
 Configured architecture: i386-apple-darwin9.6.0
           Configured by: sofhtest
           Configured on: Fri Feb 27 11:02:30 CET 2009
          Configure host: nexus10.nlroc
                Built by: sofhtest
                Built on: Fri Feb 27 12:00:08 CET 2009
              Built host: nexus10.nlroc
              C bindings: yes
            C++ bindings: yes
      Fortran77 bindings: yes (single underscore)
      Fortran90 bindings: yes
 Fortran90 bindings size: small
              C compiler: gcc-4.2
     C compiler absolute: /usr/bin/gcc-4.2
            C++ compiler: g++-4.2
   C++ compiler absolute: /usr/bin/g++-4.2
      Fortran77 compiler: gfortran-4.2
  Fortran77 compiler abs: /usr/bin/gfortran-4.2
      Fortran90 compiler: gfortran-4.2
  Fortran90 compiler abs: /usr/bin/gfortran-4.2
             C profiling: yes
           C++ profiling: yes
     Fortran77 profiling: yes
     Fortran90 profiling: yes
          C++ exceptions: no
          Thread support: posix (mpi: no, progress: no)
  Internal debug support: no
     MPI parameter check: runtime
Memory profiling support: no
Memory debugging support: no
         libltdl support: yes
   Heterogeneous support: yes
 mpirun default --prefix: no
           MCA backtrace: execinfo (MCA v1.0, API v1.0, Component v1.2.9)
              MCA memory: darwin (MCA v1.0, API v1.0, Component v1.2.9)
           MCA maffinity: first_use (MCA v1.0, API v1.0, Component v1.2.9)
               MCA timer: darwin (MCA v1.0, API v1.0, Component v1.2.9)
         MCA installdirs: env (MCA v1.0, API v1.0, Component v1.2.9)
         MCA installdirs: config (MCA v1.0, API v1.0, Component v1.2.9)
           MCA allocator: basic (MCA v1.0, API v1.0, Component v1.0)
           MCA allocator: bucket (MCA v1.0, API v1.0, Component v1.0)
                MCA coll: basic (MCA v1.0, API v1.0, Component v1.2.9)
                MCA coll: self (MCA v1.0, API v1.0, Component v1.2.9)
                MCA coll: sm (MCA v1.0, API v1.0, Component v1.2.9)
                MCA coll: tuned (MCA v1.0, API v1.0, Component v1.2.9)
                  MCA io: romio (MCA v1.0, API v1.0, Component v1.2.9)
               MCA mpool: rdma (MCA v1.0, API v1.0, Component v1.2.9)
               MCA mpool: sm (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA pml: cm (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA pml: ob1 (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA bml: r2 (MCA v1.0, API v1.0, Component v1.2.9)
              MCA rcache: vma (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA btl: self (MCA v1.0, API v1.0.1, Component v1.2.9)
                 MCA btl: sm (MCA v1.0, API v1.0.1, Component v1.2.9)
                 MCA btl: tcp (MCA v1.0, API v1.0.1, Component v1.0)
                MCA topo: unity (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA osc: pt2pt (MCA v1.0, API v1.0, Component v1.2.9)
              MCA errmgr: hnp (MCA v1.0, API v1.3, Component v1.2.9)
              MCA errmgr: orted (MCA v1.0, API v1.3, Component v1.2.9)
              MCA errmgr: proxy (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA gpr: null (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA gpr: proxy (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA gpr: replica (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA iof: proxy (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA iof: svc (MCA v1.0, API v1.0, Component v1.2.9)
                  MCA ns: proxy (MCA v1.0, API v2.0, Component v1.2.9)
                  MCA ns: replica (MCA v1.0, API v2.0, Component v1.2.9)
                 MCA oob: tcp (MCA v1.0, API v1.0, Component v1.0)
                 MCA ras: dash_host (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA ras: gridengine (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA ras: localhost (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA ras: xgrid (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA rds: hostfile (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA rds: proxy (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA rds: resfile (MCA v1.0, API v1.3, Component v1.2.9)
               MCA rmaps: round_robin (MCA v1.0, API v1.3, Component v1.2.9)
                MCA rmgr: proxy (MCA v1.0, API v2.0, Component v1.2.9)
                MCA rmgr: urm (MCA v1.0, API v2.0, Component v1.2.9)
                 MCA rml: oob (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA pls: gridengine (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA pls: proxy (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA pls: rsh (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA pls: xgrid (MCA v1.0, API v1.3, Component v1.2.9)
                 MCA sds: env (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA sds: pipe (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA sds: seed (MCA v1.0, API v1.0, Component v1.2.9)
                 MCA sds: singleton (MCA v1.0, API v1.0, Component v1.2.9)