Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |  

This web mail archive is frozen.

This page is part of a frozen web archive of this mailing list.

You can still navigate around this archive, but know that no new mails have been added to it since July of 2016.

Click here to be taken to the new web archives of this list; it includes all the mails that are in this frozen archive plus all new mails that have been sent to the list since it was migrated to the new archives.

From: George Bosilca (bosilca_at_[hidden])
Date: 2007-01-11 03:48:39


Tim,

Thanks for the bug report. I just commit a patch in our development
version (revision 13079). It will go into the 1.2b2 soon, after some
soak time. Until then please use the latest nightly tar (with a
version bigger than 13079) from our website.

   Thanks,
     george.

On Jan 10, 2007, at 5:19 PM, Tim Campbell wrote:

> Greetings,
>
> Attached is a small test fortran program that triggers a failure in
> the mpi_waitall. The problem is that the after a couple of calls
> to mpi_startall and mpi_waitall some of the mpi_requests become
> corrupted. This causes the next call to mpi_startall to fail.
> Here is output from a 2 cpu run.
>
> [44]% mpif90 -g test_ompi.f
> [45]% mpirun -np 2 a.out
> TEST(A): 0 1 | 2 3 4 5
> TEST(B): 0 1 | 2 3 4 5
> OUTPUT: 0 1 | 100 100 101 101
> TEST(A): 0 2 | 2 3 4 5
> TEST(B): 0 2 | -32766 -32766 4 5
> OUTPUT: 0 2 | 200 200 201 201
> TEST(A): 1 1 | 2 3 4 5
> TEST(B): 1 1 | 2 3 4 5
> OUTPUT: 1 1 | 101 101 100 100
> TEST(A): 1 2 | 2 3 4 5
> TEST(B): 1 2 | -32766 -32766 4 5
> OUTPUT: 1 2 | 201 201 200 200
> ^Cmpirun: killing job...
>
> The "-32766" values show up in the mpi_request array after the
> second call to mpi_waitall. Using prints in the OpenMPI code I
> have tracked the problem to
>
> ompi/request/req_wait.c:ompi_request_wait_all().
>
> I find upon entry to ompi_request_wait_all() that the values of
> request[:]->req_f_to_c_index are valid. However, upon exit of
> ompi_request_wait_all() the first two entries of request[:]-
> >req_f_to_c_index have the value of -32766.
>
> I am testing with OpenMPI version 1.2b2. This problem occurs on
> both x86_64 and Intel i386 and it occurs for both Portland Group
> compilers and for GCC/G95.
>
> Cheers,
> Tim Campbell
> Naval Research Laboratory
> <test_ompi.f.gz>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users