On Thu, 30 Apr 2009, Ralph Castain wrote:
> well, that's only because the code's doing something it shouldn't.
> Have a look at comm_cid.c:185 - there's the check we added to the
> multi-threaded case (which was the only case when we added it).
> The cid generation should never generate a number larger than
> mca_pml.pml_max_contextid. I'm actually somewhat amazed this fails
> gracefully, as OB1 doesn't appear to check it got a valid cid in
> add_comm, which it should probably do.
> Actually, as an FYI: it doesn't fail gracefully. It just hangs...ick.
Ah. Patch to change the hang into an abort coming RSN.