Based on my experience, I fully agree with David Zhang.
A segmentation fault is usually caused by a coding mistake or a typo.
First, make sure that the array sizes and dimensions are correct.
I have also found that if Open MPI is compiled with the GNU compilers (rather than Intel), the error output points out exactly which subroutine the fault occurs in. Give it a try.
Best,
AA
When I got segmentation faults, it was always because of my own coding mistakes. Perhaps your code is not robust against a number of processes that is not divisible by 2?

On Wed, Jun 30, 2010 at 8:47 AM, Jack Bryan <dtustudy68@hotmail.com> wrote:
Dear All,

I am using Open MPI, and I got the following error:

[n337:37664] *** Process received signal ***
[n337:37664] Signal: Segmentation fault (11)
[n337:37664] Signal code: Address not mapped (1)
[n337:37664] Failing at address: 0x7fffcfe90000
[n337:37664] [ 0] /lib64/libpthread.so.0 [0x3c50e0e4c0]
[n337:37664] [ 1] /lustre/home/rhascheduler/RhaScheduler-0.4.1.1/mytest/nmn2 [0x414ed7]
[n337:37664] [ 2] /lib64/libc.so.6(__libc_start_main+0xf4) [0x3c5021d974]
[n337:37664] [ 3] /lustre/home/rhascheduler/RhaScheduler-0.4.1.1/mytest/nmn2(__gxx_personality_v0+0x1f1) [0x412139]
[n337:37664] *** End of error message ***

After searching for answers, it seems that some functions fail.

My program runs well on 1, 2, or 10 processors, but fails when the number of tasks cannot be divided evenly by the number of processes.

Any help is appreciated.

Thanks,
Jack
June 30 2010
_______________________________________________
users mailing list
users@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
--
David Zhang
University of California, San Diego
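
To illustrate the failure mode Jack describes (tasks that cannot be divided evenly among processes), here is a minimal sketch of a block distribution that handles the remainder explicitly. It is plain Python with no MPI calls and is not taken from Jack's program; the names block_range and all_blocks, and the 10-tasks-on-4-processes example, are assumptions made up for illustration. A common mistake is to give every rank num_tasks // num_procs tasks (dropping the remainder) or to round up on every rank (so that the last rank indexes past the end of its array), and the second variant is one plausible way to get an "Address not mapped" fault.

```python
def block_range(num_tasks, num_procs, rank):
    """Return (start, count) of the contiguous task block owned by `rank`.

    Hypothetical helper: the first `num_tasks % num_procs` ranks get one
    extra task, so the counts always sum to exactly `num_tasks`.
    """
    base, extra = divmod(num_tasks, num_procs)
    # Ranks below `extra` take one additional task each.
    count = base + (1 if rank < extra else 0)
    # Every earlier rank contributed `base` tasks, plus one for each
    # earlier rank that received an extra task.
    start = rank * base + min(rank, extra)
    return start, count


def all_blocks(num_tasks, num_procs):
    """Return the (start, count) pair for every rank, in rank order."""
    return [block_range(num_tasks, num_procs, r) for r in range(num_procs)]


if __name__ == "__main__":
    # Assumed example: 10 tasks on 4 processes (10 is not divisible by 4).
    for rank, (start, count) in enumerate(all_blocks(10, 4)):
        print(f"rank {rank}: tasks [{start}, {start + count})")
```

In an MPI program, per-rank counts and offsets of this kind are what one would typically pass as the sendcounts and displs arguments of MPI_Scatterv, and each rank would size its receive buffer from its own count. Note that a rank can legitimately end up with zero tasks when there are more processes than tasks, so loops over the local block should tolerate an empty range.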