Dear Open MPI developers,
sm BTL to transfer data with high bandwidth via shared memory. My question is that what about Open MPI collectives on shared memory? Were they implemented and optimized on top of point-to-point communication or utilizing shared memory separately?
Best Regards,
Shigang Li.