I am new user of Open MPI, I've used MPICH before.
There is performance bug with the following scenario:
For message size 8MB, proc_B calls MPI_Test 88 times. It means that
point to point communication costs 88 seconds.
Btw, bandwidth isn't the problem (interconnection network: InfiniBand)
Obviously, there is the problem with progress of the asynchronous
How can I avoid this problem?