Hi,
I am trying to run with Open MPI 1.3 on a cluster using PBS Pro:
pbs_version = PBSPro_9.2.0.81361
However, after compiling with these options:
../configure
--prefix=/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads CC=/opt/intel/cce/10.1.015/bin/icc CXX=/opt/intel/cce/10.1.015/bin/icpc CPP="/opt/intel/cce/10.1.015/bin/icc -E" FC=/opt/intel/fce/10.1.015/bin/ifort F90=/opt/intel/fce/10.1.015/bin/ifort F77=/opt/intel/fce/10.1.015/bin/ifort --enable-mpi-f90 --with-tm=/usr/pbs/ --enable-mpi-threads=yes --enable-contrib-no-build=vt
I get runtime errors when running on more than one reserved node
even /bin/hostname:
/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads/bin/mpirun -np 5 /bin/hostname
/home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads/bin/mpirun: symbol lookup error: /home_nfs/parma/x86_64/UNITE/packages/openmpi/1.3-intel10.1-64bit-dynamic-threads/lib/openmpi/mca_plm_tm.so: undefined symbol: tm_init
When running on one node only, I don't get this error.
Now, I see that I only have static PBS libraries so I tried to compile
this component statically. I added to the above configure:
"--enable-mca-static=ras-tm,pls-tm"
However, nothing changed. The same errors occurr.
But if I compile Open MPI only with static libraries ("--enable-static
--disable-shared"), the MPI (or non-MPI) programs run OK.
Can you help me here ?
Thanks,
Kiril
--
Dipl.-Inf. Kiril Dichev
Tel.: +49 711 685 60492
E-mail: dichev_at_[hidden]
High Performance Computing Center Stuttgart (HLRS)
Universität Stuttgart
70550 Stuttgart
Germany
|