Hi Brice,
but the manual does not say that get_cpubind() returns the logical OR of the bindings of all threads. I had always understood that it returns the binding of the caller, where the caller can be a process or a thread.
I'm mixing process and thread binding, and I've noticed that if the process and the threads are on the same NUMA node, it works well, even when they are on different cores.
If the NUMA node of the process is different from the NUMA node of the threads, there is a problem.