On 14 December 2010 17:32, Lydia Heck <lydia.heck_at_[hidden]> wrote:
>
> I have experimented a bit more and found that if I set
>
> OMPI_MCA_plm_rsh_num_concurrent=1024
>
> a job with more than 2,500 processes will start and run.
>
> However, when I searched the open-mpi web site for the variable I could
> not find any mention of it.
Lydia, a quick search finds this page:
http://docs.sun.com/source/820-3176-10/appb-mca.html
It may be out of date, but it does describe the parameters.
What is your setting for plm_rsh_agent (i.e. are you using ssh or rsh)?
Also, have you tried setting plm_rsh_debug?
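
As a rough sketch of how to check these yourself, assuming ompi_info from your
Open MPI installation is on the PATH: any MCA parameter can be set through an
OMPI_MCA_-prefixed environment variable, and ompi_info can list the parameters
the rsh launcher accepts. The application name and process count in the
commented mpirun line are placeholders, and newer Open MPI releases may need
an extra "--level 9" to show every parameter.

```shell
# Set the MCA parameter through the environment (value taken from the message above).
export OMPI_MCA_plm_rsh_num_concurrent=1024

# If ompi_info is available, list the rsh launcher's parameters and current values.
if command -v ompi_info >/dev/null 2>&1; then
    ompi_info --param plm rsh
fi

# Equivalent command-line form; ./my_app and -np 2500 are placeholders:
# mpirun --mca plm_rsh_num_concurrent 1024 -np 2500 ./my_app
```

The listing should show plm_rsh_agent and plm_rsh_num_concurrent along with
their current values, which would answer the ssh-or-rsh question directly.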