> Hi.
>
> We have started to scale up one of our codes and sometimes we get messages
> like this:
>
> [c9-13.local:31125] Memory 0x2aaab7b64000:217088 cannot be freed from
> the registration cache. Possible memory corruption.
>
> It seems like the application runs normally and it does not crash becaus of
> this. Should we be worried? We have tested the code with up to 1700 cores
> and the message becomes more frequent as we scale up.
Nevermind, this turned out to be an application bug. A buffer was freed before
Isend completed.
r.
--
The Computer Center, University of Tromsø, N-9037 TROMSØ Norway.
phone:+47 77 64 41 07, fax:+47 77 64 41 00
Roy Dragseth, Team Leader, High Performance Computing
Direct call: +47 77 64 62 56. email: roy.dragseth_at_[hidden]
|