Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] MPICH2 is working OpenMPI Not
From: Scott Atchley (atchley_at_[hidden])
Date: 2010-07-19 10:16:09


Hi Bibrak,

The message about malloc looks like an MX message. Which interconnects did you compile support for?

If you are using MX, does it appear when you run with:

$ mpirun --mca pml cm -np 4 ./exec 98

which uses the MX MTL instead of the MX BTL?

Scott

On Jul 18, 2010, at 9:23 AM, Bibrak Qamar wrote:

> Hello,
>
> I have developed a code which I tested on MPICH2, and it is working fine.
>
> But when I compile and run it with OpenMPI, it's not working.
>
> The output of the program, with the errors from OpenMPI, is below.
>
> --------------
>
>
> bibrak_at_barq:~/XXX> mpirun -np 4 ./exec 98
>
>
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> warning:regcache incompatible with malloc
> Send count -- >> 25
> Send count -- >> 25
> Send count -- >> 24
> Send count -- >> 24
> Dis -- >> 0
> Dis -- >> 25
> Dis -- >> 50
> Dis -- >> 74
>
>
>
>
> 0 d[0] = -14.025975
> 1 d[0] = -14.025975
> -- 1 --
> 2 d[0] = -14.025975
> -- 2 --
> -- 0 --
> 3 d[0] = -14.025975
> --3 --
> [barq:27118] *** Process received signal ***
> [barq:27118] Signal: Segmentation fault (11)
> [barq:27118] Signal code: Address not mapped (1)
> [barq:27118] Failing at address: 0x51681f96
> [barq:27121] *** Process received signal ***
> [barq:27121] Signal: Segmentation fault (11)
> [barq:27121] Signal code: Address not mapped (1)
> [barq:27121] Failing at address: 0x77b5685
> [barq:27118] [ 0] [0xffffe410]
> [barq:27118] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d20f3c]
> [barq:27118] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27118] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cc9705]
> [barq:27121] [ 0] [0xffffe410]
> [barq:27121] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7d0ef3c]
> [barq:27121] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27121] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7cb7705]
> [barq:27121] [ 4] ./exec [0x8048b01]
> [barq:27121] *** End of error message ***
> [barq:27118] [ 4] ./exec [0x8048b01]
> [barq:27118] *** End of error message ***
> --------------------------------------------------------------------------
> mpirun noticed that process rank 3 with PID 27121 on node barq exited on signal 11 (Segmentation fault).
> --------------------------------------------------------------------------
> [barq:27120] *** Process received signal ***
> [barq:27120] Signal: Segmentation fault (11)
> [barq:27120] Signal code: Address not mapped (1)
> [barq:27120] Failing at address: 0x4bd1ca3e
> [barq:27120] [ 0] [0xffffe410]
> [barq:27120] [ 1] /lib/libc.so.6(cfree+0x9c) [0xb7c97f3c]
> [barq:27120] [ 2] ./exec(main+0x2214) [0x804ad8d]
> [barq:27120] [ 3] /lib/libc.so.6(__libc_start_main+0xe5) [0xb7c40705]
> [barq:27120] [ 4] ./exec [0x8048b01]
> [barq:27120] *** End of error message ***
>
>
> ----------------
>
> Because of the "warning:regcache incompatible with malloc" message, I ran:
>
> bibrak_at_barq:~/XXX> export MX_RCACHE=2
>
> The warning no longer appears, but the error still remains.
>
> I would appreciate any help.
>
> Bibrak Qamar
> NUST-SEECS
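
The "Send count" and "Dis" values in the quoted output (25, 25, 24, 24 and 0, 25, 50, 74 for an argument of 98 on 4 ranks) look like the count and displacement arrays typically passed to a call such as MPI_Scatterv, though the program's source is not shown in this thread. As a rough sanity check, the sketch below recomputes a block distribution of 98 items over 4 ranks and verifies that the blocks cover the range with no gaps or overlaps. The function names `block_partition` and `is_consistent` are hypothetical, and the assumption that the first `n % p` ranks receive the extra item is inferred only from the printed numbers. It does not diagnose the segmentation fault, which the backtrace places inside a free() call made from main.

```python
def block_partition(n, p):
    """Split n items over p ranks; the first n % p ranks get one extra item.

    Assumption: this mirrors the distribution implied by the printed
    send counts, not the original program's actual code.
    """
    base, extra = divmod(n, p)
    counts = [base + 1 if rank < extra else base for rank in range(p)]
    # Each displacement is the sum of the counts of all lower ranks.
    displs = [sum(counts[:rank]) for rank in range(p)]
    return counts, displs


def is_consistent(counts, displs, n):
    """Return True if the blocks tile [0, n) with no gaps or overlaps."""
    offset = 0
    for count, displ in zip(counts, displs):
        if count < 0 or displ != offset:
            return False
        offset += count
    return offset == n


counts, displs = block_partition(98, 4)
print(counts)  # [25, 25, 24, 24]
print(displs)  # [0, 25, 50, 74]
print(is_consistent(counts, displs, 98))  # True
```

The printed values match the ones in the quoted output, so the counts and displacements themselves appear consistent; buffer sizes allocated from them would still need to be checked in the actual source.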