WHAT: Add CUDA registration of host memory in sm and openib BTLs.
DETAILS: In order to improve performance of sending GPU device memory,
we need to register the host memory with the CUDA framework. These
changes allow that to happen. These changes are somewhat different
from what I proposed a while ago and I think a lot cleaner. There is
a new memory pool flag that indicates whether a piece of memory
should be registered. This allows us to register the sm memory and
the pre-posted openib memory.
The CUDA specific code is in the ompi/mca/common/cuda directory.
Do not look at the configure.m4 code, as that is still not done.
Here a link to the proposed changes:
Here is a list of files that would change.