John,
Any progress on this?
John Ouellette wrote:
> Hi Tim,
>
> Hmmmm, nope. I recompiled OpenMPI to produce the static libs, and even
> recompiled my app statically, and received the same error messages.
>
> If orted isn't starting on the compute nodes, is there any way I can debug
> this to find out where it is failing?
>
You could try running orted standalone to verify that it starts up correctly:

    bpsh <node> orted --debug

> Actually, I just tried your suggestion of running ldd on one of the compute
> nodes (should've tried this before recompiling, I guess...). I get a strange
> error, which probably indicates a problem with bproc:
>
> ldd /usr/local/intel-libs/9.0/bin/orted
> /proc/self/fd/3: line 125: cat: command not found
>
> I know I've run ldd on a node without problems before... I don't know
> if this is related to my OpenMPI problems, but I'm going to have to look into
> it.
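
The "cat: command not found" message suggests ldd itself is what failed
rather than orted: on glibc systems ldd is normally a shell script, and it
calls helpers such as cat that may not exist on a bproc compute node. One way
around that, as a rough sketch only, is to run ldd on the head node and then
check each resolved library path on the compute node. The function names
ldd_paths and check_node_libs are made up for illustration, and the sketch
assumes that bpsh can run "test" on the node and that the head node and the
compute nodes use the same library paths.

```shell
# Print the absolute library paths found in `ldd` output read from stdin.
# Covers both "libfoo.so => /path/libfoo.so (0x...)" and
# "/lib/ld-linux.so.2 (0x...)" lines; lines without a path are skipped.
ldd_paths() {
    awk '{ for (i = 1; i <= NF; i++) if ($i ~ /^\//) { print $i; break } }'
}

# Hypothetical helper: report libraries that a binary needs but that are
# not visible on the given compute node.
# Assumption: `bpsh <node> test -e <path>` works on this cluster.
check_node_libs() {
    node=$1
    binary=$2
    ldd "$binary" | ldd_paths | while read -r lib; do
        # Redirect stdin so bpsh does not swallow the rest of the list.
        bpsh "$node" test -e "$lib" < /dev/null \
            || echo "missing on node $node: $lib"
    done
}
```

For example, check_node_libs <node> /usr/local/intel-libs/9.0/bin/orted
would print one line per library that could not be found on that node, and
nothing if they are all there.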
Tim