Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

From: Bas van der Vlies (basv_at_[hidden])
Date: 2007-09-20 08:10:43


To answer my self. I have just installed OFED 1.2.5.1 and openmpi 1.2.3.
With this combo i have no errors wirh a linpack run (hpl) ;-)

Regards

Bas van der Vlies wrote:
> Can somebody explain these errors or did i something stupid?
>
> I have done some more testing and i can run with a very small problem
> sizes the maximum is 1440 (looks like a mtu problem).
>
> If i increase the value to 1441 it crashes.
>
> === output ==
>
> ============================================================================
> T/V N NB P Q Time Gflops
> ----------------------------------------------------------------------------
> WR01L2C8 1440 60 2 4 0.19 1.056e+01
>
>
>
>
>
> Bas van der Vlies wrote:
>> Hello,
>>
>> I am trying to run a xhpl (linpack) and after a while a get these
>> errors. I use a simple blas library
>>
>>
>>
>> {{{
>> B-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x317700 (3241728) extent
>> 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x326180
>> (3301760) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x334c00
>> (3361792) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x343680
>> (3421824) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x352100
>> (3481856) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x360b80
>> (3541888) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 3661 disp 0x609327c0
>> (1620256704) extent 8 (size 29288)
>> -------G---[---][---] MPI_END_LOOP prev 61 elements first elem
>> displacement 0 size of data 2074088
>>
>> [ib-r5n6.irc.sara.nl:11140] ../../ompi/datatype/datatype_pack.h:38
>> Pointer 0xa7f36278 size 1960 is outside
>> [0xa7bcd980,0x85073a8] for
>> base ptr 0xa7bcd980 count 1 and data
>> [ib-r5n6.irc.sara.nl:11140] Datatype 0x8462888[]
>> size 2074088 align 4 id 0 length 184 used 61
>> true_lb 0 true_ub 1620285992 (true_extent
>> 1620285992) lb 0 ub 1620285992 (extent 1620285992)
>> nbElems 2592type 11 count ints 62 count disp 61
>> count datatype 61
>> ints: 61 4200 4200 4200 4200 4200 4200 4200 4200
>> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200
>> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200
>> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200
>> 4200 4200 4200 4200 4200 4200 4200 4200 4200 4200 3661
>> MPI_Aint: 0 60032 120064 180096 240128 300160 360192
>> 420224 480256 540288 600320 660352 720384 780416 840448 900480 960512
>> 1020544 1080576 1140608 1200640 1260672 1320704 1380736 1440768
>> 1500800 1560832 1620864 1680896 1740928 1800960 1860992 1921024
>> 1981056 2041088 2101120 2161152 2221184 2281216 2341248 2401280
>> 2461312 2521344 2581376 2641408 2701440 2761472 2821504 2881536
>> 2941568 3001600 3061632 3121664 3181696 3241728 3301760 3361792
>> 3421824 3481856 3541888 1620043488
>> types: (61 * MPI_DOUBLE)
>> 61 loops 0 flags 2 (commited )-c-----G---[---][---]
>> contain MPI_DOUBLE
>> cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x352100
>> (3481856) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 4260 disp 0x360b80
>> (3541888) extent 8 (size 34080)
>> -cC---P-DB-[ C ][ERR] MPI_DOUBLE count 3661 disp 0x609327c0
>> (1620256704) extent 8 (size 29288)
>> -------G---[---][---] MPI_END_LOOP prev 61 elements first elem
>> displacement 0 size of data 2074088
>>
>> [ib-r5n6.irc.sara.nl:11140] ../../ompi/datatype/datatype_pack.h:38
>> Pointer 0x8500140 size 29288 is outside
>> [0xa7bcd980,0x85073a8] for
>> base ptr 0xa7bcd980 count 1 and data
>> [ib-r5n6.irc.sara.nl:11140] Datatype 0x8462888[]
>> size 2074088 align 4 id 0 length 184 used 61
>> true_lb 0 true_ub 1620285992 (true_extent
>> 1620285992) lb 0 ub 1620285992 (extent 1620285992)
>> nbElems 259261 loops 0 flags 2 (commited )-c-----G---
>> [---][---]
>> contain MPI_DOUBLE
>> }}}
>>
>> ompi_info:
>> {{{
>> Open MPI: 1.2.1a0r14297M
>> Open MPI SVN revision: r14297M
>> Open RTE: 1.2.1a0r14297M
>> Open RTE SVN revision: r14297M
>> OPAL: 1.2.1a0r14297M
>> OPAL SVN revision: r14297M
>> Prefix: /usr/local/gnu-openmpi-1.2.1p0
>> Configured architecture: i686-pc-linux-gnu
>> Configured by: root
>> Configured on: Wed Apr 11 13:11:09 CEST 2007
>> Configure host: ib-r1n1.irc.sara.nl
>> Built by: root
>> Built on: Wed Apr 11 13:16:36 CEST 2007
>> Built host: ib-r1n1.irc.sara.nl
>> C bindings: yes
>> C++ bindings: yes
>> Fortran77 bindings: yes (all)
>> Fortran90 bindings: yes
>> Fortran90 bindings size: small
>> C compiler: gcc
>> C compiler absolute: /usr/bin/gcc
>> C++ compiler: g++
>> C++ compiler absolute: /usr/bin/g++
>> Fortran77 compiler: gfortran
>> Fortran77 compiler abs: /usr/bin/gfortran
>> Fortran90 compiler: gfortran
>> Fortran90 compiler abs: /usr/bin/gfortran
>> C profiling: yes
>> C++ profiling: yes
>> Fortran77 profiling: yes
>> Fortran90 profiling: yes
>> C++ exceptions: no
>> Thread support: posix (mpi: no, progress: no)
>> Internal debug support: yes
>> MPI parameter check: runtime
>> Memory profiling support: yes
>> Memory debugging support: yes
>> libltdl support: yes
>> Heterogeneous support: yes
>> mpirun default --prefix: yes
>> MCA backtrace: execinfo (MCA v1.0, API v1.0, Component
>> v1.2.1)
>> MCA memory: ptmalloc2 (MCA v1.0, API v1.0, Component
>> v1.2.1)
>> MCA paffinity: linux (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA maffinity: first_use (MCA v1.0, API v1.0, Component
>> v1.2.1)
>> MCA timer: linux (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA allocator: basic (MCA v1.0, API v1.0, Component v1.0)
>> MCA allocator: bucket (MCA v1.0, API v1.0, Component v1.0)
>> MCA coll: basic (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA coll: self (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA coll: sm (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA coll: tuned (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA io: romio (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA mpool: openib (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA mpool: sm (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA pml: cm (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA pml: ob1 (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA bml: r2 (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA rcache: rb (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA rcache: vma (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA btl: openib (MCA v1.0, API v1.0.1, Component
>> v1.2.1)
>> MCA btl: self (MCA v1.0, API v1.0.1, Component v1.2.1)
>> MCA btl: sm (MCA v1.0, API v1.0.1, Component v1.2.1)
>> MCA btl: tcp (MCA v1.0, API v1.0.1, Component v1.0)
>> MCA topo: unity (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA osc: pt2pt (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA errmgr: hnp (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA errmgr: orted (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA errmgr: proxy (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA gpr: null (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA gpr: proxy (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA gpr: replica (MCA v1.0, API v1.0, Component
>> v1.2.1)
>> MCA iof: proxy (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA iof: svc (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA ns: proxy (MCA v1.0, API v2.0, Component v1.2.1)
>> MCA ns: replica (MCA v1.0, API v2.0, Component
>> v1.2.1)
>> MCA oob: tcp (MCA v1.0, API v1.0, Component v1.0)
>> MCA ras: dash_host (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA ras: gridengine (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA ras: localhost (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA ras: slurm (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA ras: tm (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA rds: hostfile (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA rds: proxy (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA rds: resfile (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA rmaps: round_robin (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA rmgr: proxy (MCA v1.0, API v2.0, Component v1.2.1)
>> MCA rmgr: urm (MCA v1.0, API v2.0, Component v1.2.1)
>> MCA rml: oob (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA pls: gridengine (MCA v1.0, API v1.3, Component
>> v1.2.1)
>> MCA pls: proxy (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA pls: rsh (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA pls: slurm (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA pls: tm (MCA v1.0, API v1.3, Component v1.2.1)
>> MCA sds: env (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA sds: pipe (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA sds: seed (MCA v1.0, API v1.0, Component v1.2.1)
>> MCA sds: singleton (MCA v1.0, API v1.0, Component
>> v1.2.1
>> }}}
>> --
>> Bas van der Vlies
>> basv_at_[hidden]
>>
>>
>>
>> _______________________________________________
>> users mailing list
>> users_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
> --
> ********************************************************************
> * *
> * Bas van der Vlies e-mail: basv_at_[hidden] *
> * SARA - Academic Computing Services phone: +31 20 592 8012 *
> * Kruislaan 415 fax: +31 20 6683167 *
> * 1098 SJ Amsterdam *
> * *
> ********************************************************************
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users

-- 
--
********************************************************************
*                                                                  *
*  Bas van der Vlies                     e-mail: basv_at_[hidden]      *
*  SARA - Academic Computing Services    phone:  +31 20 592 8012   *
*  Kruislaan 415                         fax:    +31 20 6683167    *
*  1098 SJ Amsterdam                                               *
*                                                                  *
********************************************************************