WHAT: Add CUDA registration of host memory in sm and openib BTLs.
DETAILS: In order to improve performance of sending GPU device memory,
we need to register the host memory with the CUDA framework. These
changes allow that to happen. These changes are somewhat different
from what I proposed a while ago and I think a lot cleaner. There is
a new memory pool flag that indicates whether a piece of memory
should be registered. This allows us to register the sm memory and
the pre-posted openib memory.
The CUDA specific code is in the ompi/mca/common/cuda directory.
Do not look at the configure.m4 code, as that is still not done.
Here a link to the proposed changes:
Here is a list of files that would change.
This email message is for the sole use of the intended recipient(s) and may contain confidential information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message.