Dear Open MPI developers,
I'm a graduate student and recently test some minibenchmarks using Open MPI
library on Xeon X5650 cluster. From the website of Open MPI, I know that
Open MPI can use
*sm BTL* to transfer data with high bandwidth via shared memory. My
question is that what about Open MPI collectives on shared memory? Were
they implemented and optimized on top of point-to-point communication or
utilizing shared memory separately?