Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] using OpenMPI + SGE in a heterogeneous network
From: Reuti (reuti_at_[hidden])
Date: 2008-06-07 05:09:07


Hi,

On 07.06.2008 at 10:50, SLIM H.A. wrote:

> I tried this but the problem I ran into was that the value of the
> environment variable OMPI_MCA_btl set in the PE start script is not

Exactly. That is why I mentioned "starter_method", which is set in the
queue definition. You need the "start_proc_args" in the PE definition
anyway to prepare the machinefile.
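
As a rough sketch of such a starter_method, here is a variant that keys
on the $PE variable instead of the hostname. The PE name pattern
"ompi_ib*" and the helper select_btl are only assumptions for
illustration; adjust them to whatever your PEs are called.

```shell
#!/bin/sh
# Sketch of an SGE starter_method. The PE name pattern "ompi_ib*" is
# hypothetical; substitute the names of your own parallel environments.

# Print the BTL to use for the parallel environment name given in $1.
select_btl() {
    case "$1" in
        ompi_ib*) echo openib ;;   # assumed PE of the InfiniBand hostgroup
        *)        echo tcp ;;      # anything else falls back to TCP
    esac
}

# SGE puts the name of the granted PE into $PE in the job environment.
OMPI_MCA_btl=$(select_btl "${PE:-}")
export OMPI_MCA_btl

# Hand over to the real job script, which SGE passes as the arguments.
if [ "$#" -gt 0 ]; then
    exec "$@"
fi
```

The script would be entered as "starter_method" in the queue
configuration (e.g. via `qconf -mq <queue>`), so the exported variable
is already in the environment when the job script calls mpirun.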

-- Reuti

> propagated to the job script. Therefore the mpirun environment is
> not aware of its value. I must be missing something here, do you
> have any idea?
> (I associate each hostgroup with its own PE, similar to the
> suggestion in that url, but with a common start script)
>
> Thanks
>
> Henk
>
>
> A transparent solution would be to create a "starter_method" in the
> parallel queue, which will check the node that it's running on and
> hence you know which type of interface to use. Export the variable
> and exec the real script; i.e. something like:
>
> #!/bin/sh
> if [ "${HOSTNAME%%[0-9]*}" = "node_ib" ]; then
> export OMPI_MCA_btl=openib
> else
> export OMPI_MCA_btl=tcp
> fi
> exec "$@"
>
>
> I think you get the idea of the script. Maybe you have some special
> names for these machines. Another indicator would be the $PE or
> $QUEUE of SGE. If you want to be sure to get only nodes of one and
> the same type (but you don't care which at submission time), you will
> need to set up something like:
>
> http://gridengine.info/articles/2006/02/14/grouping-jobs-to-nodes-via-wildcard-pes
>
> anyway.
>
> HTH - Reuti
>
>
> >
> > mpirun -np $NSLOTS executable
> >
> > as I have seen in some of the OpenMPI FAQs.
> >
> > Thanks
> >
> > Henk
> >
> > From: users-bounces_at_[hidden] on behalf of Reuti
> > Sent: Fri 6/6/2008 6:47 PM
> > To: Open MPI Users
> > Subject: Re: [OMPI users] using OpenMPI + SGE in a heterogeneous
> > network
> >
> > On 06.06.2008 at 19:31, Patrick Geoffray wrote:
> >
> > > SLIM H.A. wrote:
> > >> I would be grateful for any advice
> > >
> > > Just to check, you are not using the MTL for MX, right? Only the BTL
> > > interface allows choosing between several devices at run time.
> >
> > At least there would be the option to build two binaries and decide
> > at runtime which one to use in your jobscript, depending on the
> > machine you landed on.
> >
> > -- Reuti
> > _______________________________________________
> > users mailing list
> > users_at_[hidden]
> > http://www.open-mpi.org/mailman/listinfo.cgi/users