On Jun 28, 2011, at 9:05 AM, yanyg_at_[hidden] wrote:
> Hello All,
> I installed Open MPI 1.4.3 on our new HPC blades, with Infiniband
> My system environments are as:
> 1)uname -a output:
> Linux gulftown 2.6.18-194.el5 #1 SMP Tue Mar 16 21:52:39 EDT
> 2010 x86_64 x86_64 x86_64 GNU/Linux
> 2) /home is mounted over all nodes, and mpirun is started under
> Open MPI and application codes are compiled with intel(R)
> compilers V11. Infiniband stack is Mellanox OFED 1.5.2.
> I have two questions about mpirun:
> a) how could I get to know what is the network interconnect
> protocol used by the MPI application?
> I specify "--mca btl openib,self,sm,tcp" to mpirun, but I want to
> make sure it really uses infiniband interconnect.
Why specify tcp if you don't want it used? Just leave that off and it will have no choice but to use IB.
> b) when I run mpirun, I get the following message:
> It seems orted is not found on slave nodes. If I set the PATH and
> LD_LIBRARY_PATH through --prefix to mpirun, or --path, or -x
> options to mpirun, to make the orted and related dynamic libs
> available on slave nodes, it does not work as expected from mpirun
> manual page. The only working case is that I set PATH and
> LD_LIBRARY_PATH in ~/.bashrc for mpirun, and this .bashrc is
> invoked by slave nodes too for login shell. I do not want to set PATH
> and LD_LIBRARY_PATH in ~/.bashrc, but instead to set options to
> mpirun directly.
Should work with either prefix or -x options, assuming the right syntax with the latter.
I take it your default shell is bash, and that you are using the rsh launcher (as opposed to something like torque)? Are you launching from your default shell, or did you perhaps change shell?
Can you send the actual mpirun command you typed?
> users mailing list