Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

From: Bas van der Vlies (basv_at_[hidden])
Date: 2007-04-13 03:41:42


Can somebody explain these errors or did i something stupid?

I have done some more testing and i can run with a very small problem
sizes the maximum is 1440 (looks like a mtu problem).

If i increase the value to 1441 it crashes.

=== output ==

============================================================================
T/V N NB P Q Time Gflops
----------------------------------------------------------------------------
WR01L2C8 1440 60 2 4 0.19 1.056e+01

Bas van der Vlies wrote:
> Hello,
>
> I am trying to run a xhpl (linpack) and after a while a get these
> errors. I use a simple blas library
>
>
>
> {{{
> B-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x317700 (3241728) extent
> 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x326180
> (3301760) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x334c00
> (3361792) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x343680
> (3421824) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x352100
> (3481856) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x360b80
> (3541888) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 3661 disp 0x609327c0
> (1620256704) extent 8 (size 29288)
> -------G---[---][---] MPI_END_LOOP prev 61 elements first elem
> displacement 0 size of data 2074088
>
> [ib-r5n6.irc.sara.nl:11140] ../../ompi/datatype/datatype_pack.h:38
> Pointer 0xa7f36278 size 1960 is outside
> [0xa7bcd980,0x85073a8] for
> base ptr 0xa7bcd980 count 1 and data
> [ib-r5n6.irc.sara.nl:11140] Datatype 0x8462888[]
> size 2074088 align 4 id 0 length 184 used 61
> true_lb 0 true_ub 1620285992 (true_extent
> 1620285992) lb 0 ub 1620285992 (extent 1620285992)
> nbElems 2592type 11 count ints 62 count disp 61
> count datatype 61
> ints: 61 4200 4200 4200 4200 4200 4200 4200 4200
> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200
> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200
> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200
> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 3661
> MPI_Aint: 0 60032 120064 180096 240128 300160 360192
> 420224 480256 540288 600320 660352 720384 780416 840448 900480 960512
> 1020544 1080576 1140608 1200640 1260672 1320704 1380736 1440768
> 1500800 1560832 1620864 1680896 1740928 1800960 1860992 1921024
> 1981056 2041088 2101120 2161152 2221184 2281216 2341248 2401280
> 2461312 2521344 2581376 2641408 2701440 2761472 2821504 2881536
> 2941568 3001600 3061632 3121664 3181696 3241728 3301760 3361792
> 3421824 3481856 3541888 1620043488
> types: (61 * MPI_DOUBLE)
> 61 loops 0 flags 2 (commited )-c-----G---[---][---]
> contain MPI_DOUBLE
> cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x352100
> (3481856) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x360b80
> (3541888) extent 8 (size 34080)
> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 3661 disp 0x609327c0
> (1620256704) extent 8 (size 29288)
> -------G---[---][---] MPI_END_LOOP prev 61 elements first elem
> displacement 0 size of data 2074088
>
> [ib-r5n6.irc.sara.nl:11140] ../../ompi/datatype/datatype_pack.h:38
> Pointer 0x8500140 size 29288 is outside
> [0xa7bcd980,0x85073a8] for
> base ptr 0xa7bcd980 count 1 and data
> [ib-r5n6.irc.sara.nl:11140] Datatype 0x8462888[]
> size 2074088 align 4 id 0 length 184 used 61
> true_lb 0 true_ub 1620285992 (true_extent
> 1620285992) lb 0 ub 1620285992 (extent 1620285992)
> nbElems 259261 loops 0 flags 2 (commited )-c-----G---
> [---][---]
> contain MPI_DOUBLE
> }}}
>
> ompi_info:
> {{{
> Open MPI: 1.2.1a0r14297M
> Open MPI SVN revision: r14297M
> Open RTE: 1.2.1a0r14297M
> Open RTE SVN revision: r14297M
> OPAL: 1.2.1a0r14297M
> OPAL SVN revision: r14297M
> Prefix: /usr/local/gnu-openmpi-1.2.1p0
> Configured architecture: i686-pc-linux-gnu
> Configured by: root
> Configured on: Wed Apr 11 13:11:09 CEST 2007
> Configure host: ib-r1n1.irc.sara.nl
> Built by: root
> Built on: Wed Apr 11 13:16:36 CEST 2007
> Built host: ib-r1n1.irc.sara.nl
> C bindings: yes
> C++ bindings: yes
> Fortran77 bindings: yes (all)
> Fortran90 bindings: yes
> Fortran90 bindings size: small
> C compiler: gcc
> C compiler absolute: /usr/bin/gcc
> C++ compiler: g++
> C++ compiler absolute: /usr/bin/g++
> Fortran77 compiler: gfortran
> Fortran77 compiler abs: /usr/bin/gfortran
> Fortran90 compiler: gfortran
> Fortran90 compiler abs: /usr/bin/gfortran
> C profiling: yes
> C++ profiling: yes
> Fortran77 profiling: yes
> Fortran90 profiling: yes
> C++ exceptions: no
> Thread support: posix (mpi: no, progress: no)
> Internal debug support: yes
> MPI parameter check: runtime
> Memory profiling support: yes
> Memory debugging support: yes
> libltdl support: yes
> Heterogeneous support: yes
> mpirun default --prefix: yes
> MCA backtrace: execinfo (MCA v1.0, API v1.0, Component
> v1.2.1)
> MCA memory: ptmalloc2 (MCA v1.0, API v1.0, Component
> v1.2.1)
> MCA paffinity: linux (MCA v1.0, API v1.0, Component v1.2.1)
> MCA maffinity: first_use (MCA v1.0, API v1.0, Component
> v1.2.1)
> MCA timer: linux (MCA v1.0, API v1.0, Component v1.2.1)
> MCA allocator: basic (MCA v1.0, API v1.0, Component v1.0)
> MCA allocator: bucket (MCA v1.0, API v1.0, Component v1.0)
> MCA coll: basic (MCA v1.0, API v1.0, Component v1.2.1)
> MCA coll: self (MCA v1.0, API v1.0, Component v1.2.1)
> MCA coll: sm (MCA v1.0, API v1.0, Component v1.2.1)
> MCA coll: tuned (MCA v1.0, API v1.0, Component v1.2.1)
> MCA io: romio (MCA v1.0, API v1.0, Component v1.2.1)
> MCA mpool: openib (MCA v1.0, API v1.0, Component v1.2.1)
> MCA mpool: sm (MCA v1.0, API v1.0, Component v1.2.1)
> MCA pml: cm (MCA v1.0, API v1.0, Component v1.2.1)
> MCA pml: ob1 (MCA v1.0, API v1.0, Component v1.2.1)
> MCA bml: r2 (MCA v1.0, API v1.0, Component v1.2.1)
> MCA rcache: rb (MCA v1.0, API v1.0, Component v1.2.1)
> MCA rcache: vma (MCA v1.0, API v1.0, Component v1.2.1)
> MCA btl: openib (MCA v1.0, API v1.0.1, Component
> v1.2.1)
> MCA btl: self (MCA v1.0, API v1.0.1, Component v1.2.1)
> MCA btl: sm (MCA v1.0, API v1.0.1, Component v1.2.1)
> MCA btl: tcp (MCA v1.0, API v1.0.1, Component v1.0)
> MCA topo: unity (MCA v1.0, API v1.0, Component v1.2.1)
> MCA osc: pt2pt (MCA v1.0, API v1.0, Component v1.2.1)
> MCA errmgr: hnp (MCA v1.0, API v1.3, Component v1.2.1)
> MCA errmgr: orted (MCA v1.0, API v1.3, Component v1.2.1)
> MCA errmgr: proxy (MCA v1.0, API v1.3, Component v1.2.1)
> MCA gpr: null (MCA v1.0, API v1.0, Component v1.2.1)
> MCA gpr: proxy (MCA v1.0, API v1.0, Component v1.2.1)
> MCA gpr: replica (MCA v1.0, API v1.0, Component
> v1.2.1)
> MCA iof: proxy (MCA v1.0, API v1.0, Component v1.2.1)
> MCA iof: svc (MCA v1.0, API v1.0, Component v1.2.1)
> MCA ns: proxy (MCA v1.0, API v2.0, Component v1.2.1)
> MCA ns: replica (MCA v1.0, API v2.0, Component
> v1.2.1)
> MCA oob: tcp (MCA v1.0, API v1.0, Component v1.0)
> MCA ras: dash_host (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA ras: gridengine (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA ras: localhost (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA ras: slurm (MCA v1.0, API v1.3, Component v1.2.1)
> MCA ras: tm (MCA v1.0, API v1.3, Component v1.2.1)
> MCA rds: hostfile (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA rds: proxy (MCA v1.0, API v1.3, Component v1.2.1)
> MCA rds: resfile (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA rmaps: round_robin (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA rmgr: proxy (MCA v1.0, API v2.0, Component v1.2.1)
> MCA rmgr: urm (MCA v1.0, API v2.0, Component v1.2.1)
> MCA rml: oob (MCA v1.0, API v1.0, Component v1.2.1)
> MCA pls: gridengine (MCA v1.0, API v1.3, Component
> v1.2.1)
> MCA pls: proxy (MCA v1.0, API v1.3, Component v1.2.1)
> MCA pls: rsh (MCA v1.0, API v1.3, Component v1.2.1)
> MCA pls: slurm (MCA v1.0, API v1.3, Component v1.2.1)
> MCA pls: tm (MCA v1.0, API v1.3, Component v1.2.1)
> MCA sds: env (MCA v1.0, API v1.0, Component v1.2.1)
> MCA sds: pipe (MCA v1.0, API v1.0, Component v1.2.1)
> MCA sds: seed (MCA v1.0, API v1.0, Component v1.2.1)
> MCA sds: singleton (MCA v1.0, API v1.0, Component
> v1.2.1
> }}}
> --
> Bas van der Vlies
> basv_at_[hidden]
>
>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users

-- 
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv_at_[hidden]      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************