I would suggest that before this is checked in this be tested on a system
that has N-way network parallelism, where N is as large as you can find.
This is a key bit of code for MPI correctness, and out-of-order operations
will break it, so you want to maximize the chance for such operations.
On 12/11/07 10:54 AM, "Gleb Natapov" <glebn_at_[hidden]> wrote:
> I did a rewrite of matching code in OB1. I made it much simpler and 2
> times smaller (which is good, less code - less bugs). I also got rid
> of huge macros - very helpful if you need to debug something. There
> is no performance degradation, actually I even see very small performance
> improvement. I ran MTT with this patch and the result is the same as on
> trunk. I would like to commit this to the trunk. The patch is attached
> for everybody to try.
> devel mailing list