if(tid==0){
set_membind(HWLOCMEMBIND_BIND, node 0)malloc(array)...
}
if (tid==1){set_membind(HWLOCMEMBIND_BIND, node 1)
for(i...)array(i)}
end parallel region
array is allocated on node 1, not node 0 as I expected So it seems set_membind() of second thread influence in some manner array allocation using first touch.