Finally -- I think I have it working:
Please, everyone, give coll sm as much testing as you can on as many
different platforms as you can. Use "--mca coll_sm_priority 100" to
activate it (and run >1 ppn, of course!).