On Nov 29, 2006, at 8:44 AM, Scott Atchley wrote:
> My last few runs all completed successfully without hanging. The job
> I am currently running just hung one node (can respond to ping,
> cannot ssh into it, cannot use any terminals connected to it).
>
> There are no messages in dmesg and vmstat shows that the node is not
> swapping (before it hung).
>
> Any ideas where I should look next?
>
> Scott

I just had another job hang at the start of the HPL portion. As
before, I do not see anything in dmesg to indicate any problems.
vmstat did not show any paging (60% of memory free).

Scott