Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] intermittent node file error running with torque/maui integration
From: Noam Bernstein (noam.bernstein_at_[hidden])
Date: 2013-09-20 10:04:43


On Sep 20, 2013, at 9:55 AM, Noam Bernstein <noam.bernstein_at_[hidden]> wrote:
>
> This is completely unrepeatable - resubmitting the same job almost
> always works the second time around. The line appears to be
> associated with looking for the torque/maui generated node file,
> and when I do something like
> echo $PBS_NODEFILE
> cat $PBS_NODEFILE
> it appears that the file is present and correct.

Never mind - I was sure that my earlier tests showed that the $PBS_NODEFILE
was there, but now it seems like every time the job fails it's because this
file really is missing. Time to check why torque isn't always creating
the nodefile.

                                                                                                Noam