Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] error while loading shared libraries:
From: Syed Ahsan Ali (ahsanshah01_at_[hidden])
Date: 2013-02-07 11:37:02

Dear John,
Thanks for the reply. I will need help from the list to solve this problem.
I am not an expert in HPC, so this will be a learning experience for me as
well. Let me add that the cluster is based on Platform Cluster Manager (PCM)
by IBM Computing. The compute nodes NFS-mount directories from the installer
node, so the directory containing the binary rca.x is also present on the
compute nodes. Unfortunately, I was trying to copy the gfortran libraries
from the installer node to the compute nodes using rsync, but something went
wrong and the model binary rca.x stopped working. I have recompiled the
binary after reinstalling HDF as well as netCDF, which the model uses during
compilation. All paths are set in bashrc as well.
Below is the output of ldd on the master node as well as on a compute node:

[pmdtest_at_pmd HadGEM]$ ldd rca.x
=> /usr/local/lib64/ (0x00002b6a9503c000)
=> /usr/local/lib/ (0x00002b6a95344000)
=> /usr/local/lib/ (0x00002b6a95798000)
=> /usr/local/lib/ (0x00002b6a95aa1000)
=> /usr/local/lib/ (0x00002b6a95f5c000)
=> /usr/local/lib/ (0x00002b6a9618b000)
=> /usr/local/lib/ (0x00002b6a9639f000)
=> /home/openmpi/lib/ (0x00002b6a965b4000)
=> /home/openmpi/lib/ (0x00002b6a967b7000)
=> /home/openmpi/lib/ (0x00002b6a969ee000)
=> /home/openmpi/lib/ (0x00002b6a96cb6000)
=> /home/openmpi/lib/ (0x00002b6a96f16000)
=> /lib64/ (0x00000033e0e00000)
=> /lib64/ (0x00000033e2200000)
=> /lib64/ (0x00000033ee400000)
=> /lib64/ (0x00000033e1200000)
=> /lib64/ (0x00000033e1600000)
=> /lib64/ (0x00000033e0a00000)
=> /usr/local/lib64/ (0x00002b6a971a0000)
/lib64/ (0x00000033e0600000)
=> /lib64/ (0x000000362ac00000)
=> /opt/intel/Compiler/11.1/064/lib/intel64/ (0x00002b6a973b5000)
=> /opt/intel/Compiler/11.1/064/lib/intel64/ (0x00002b6a974ef000)
=> (0x00002b6a97765000)
=> (0x00002b6a97c2f000)
=> (0x00002b6a984f5000)
=> (0x00002b6a98743000)
=>

[pmdtest_at_pmd HadGEM]$ ssh compute-01-18

ssh: connect to host compute-01-18 port 22: No route to host

[pmdtest_at_pmd HadGEM]$ ssh compute-01-13

Last login: Mon Jan 28 07:48:08 2013 from

[pmdtest_at_compute-01-13 ~]$ ldd rca.x

ldd: ./rca.x: No such file or directory

[pmdtest_at_compute-01-13 ~]$ ls
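In the session above, the prompt on compute-01-13 shows the home directory (~) rather than HadGEM, which would explain why ldd reports "No such file or directory" there. Running ldd against the absolute path of the binary avoids depending on the current directory. The following is only a rough sketch of such a check: the path /path/to/HadGEM/rca.x is a placeholder (the real location is not given in this message), the function names are made up for illustration, and passwordless ssh to the node is assumed.

```shell
#!/bin/sh
# Sketch: report shared libraries that ldd cannot resolve on a node.
# The function names and the binary path used below are illustrative
# assumptions, not taken from the cluster.

# Read ldd-style output on stdin and keep only the problem lines:
# unresolved libraries ("not found") or a missing binary ("No such file").
missing_libs() {
    grep -E 'not found|No such file' || true
}

# Run ldd on an absolute path on one remote node and print any problems.
# Assumes passwordless ssh to the node is already set up.
check_node() {
    node=$1
    bin=$2
    echo "== $node"
    ssh "$node" "ldd '$bin'" 2>&1 | missing_libs
}

# Example invocation (placeholder path; substitute the real one):
# check_node compute-01-13 /path/to/HadGEM/rca.x
```

If a node prints nothing below its "== node" header, ldd resolved every library there. Where pdsh is installed, the same ldd command can be sent to several nodes at once with its -w option.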

On Thu, Feb 7, 2013 at 7:40 PM, John Hearns <hearnsj_at_[hidden]> wrote:

> ldd rca.x
> Try logging in to each node and run this command.
> Even better use pdsh