Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] run orterun with more than 200 processes
From: Ralph Castain (rhc_at_[hidden])
Date: 2012-03-02 14:21:57


On Mar 2, 2012, at 12:14 PM, Jianzhang He wrote:

> Sorry that I did not make it clear. Actually, we are able to run in this way with fewer than 100 processes.
> I put ‘hostname1.domain.com,1,2,3,4,5,6,7,8,9,…..,196,197,198,199’ in a hostfile and it does not work. I wonder what might be the equivalent format in a hostfile.

The correct format is one host name per line :-)
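
A minimal sketch of turning a comma-separated -host list into that one-name-per-line format. The host names (node1.example.com and so on), the function name host_list_to_hostfile, and the file name myhosts are all hypothetical placeholders, not taken from the setup described in this thread.

```python
def host_list_to_hostfile(host_list: str) -> str:
    """Convert a comma-separated host list into hostfile text, one host per line."""
    # Split on commas and drop surrounding whitespace and empty entries.
    hosts = [h.strip() for h in host_list.split(",") if h.strip()]
    return "\n".join(hosts) + "\n"


if __name__ == "__main__":
    # Hypothetical host names; substitute the real list that was passed to -host.
    example = "node1.example.com,node2.example.com,node3.example.com"
    # Print the hostfile text; redirect it to a file such as "myhosts".
    print(host_list_to_hostfile(example), end="")
```

If the output is saved as myhosts, it could then be passed as `orterun --hostfile myhosts -np 200 CMD` in place of the long -host argument.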

>
> Thanks,
> Jianzhang
>
> From: users-bounces_at_[hidden] [mailto:users-bounces_at_[hidden]] On Behalf Of Ralph Castain
> Sent: Thursday, March 01, 2012 3:05 PM
> To: Open MPI Users
> Subject: Re: [OMPI users] run orterun with more than 200 processes
>
> You might try putting that list of hosts in a hostfile instead of on the cmd line - you may be hitting some limits there.
>
> I also don't believe that you can add an orted in that manner - orterun will have no idea how it got there and is likely to abort.
>
> On Mar 1, 2012, at 3:20 PM, Jianzhang He wrote:
>
>
> Hi,
>
> I am not sure if this is the right place to post this question. If you know where it is appropriate, please let me know.
>
> I need to run an application that launches 200 processes with the command:
> 1) orterun --prefix ./ -np 200 -wd ./ -host hostname1.domain.com,1,2,3,4,5,6,7,8,9,…..,196,197,198,199 CMD
>
> Later, I will communicate with 1) using a command like:
> 2) orted -mca ess env -mca orte_ess_ -mca orte_ess_vpid 100 -mca orte_ess_num_procs 200 --hnp-uri "job#;tcp:/ hostname1.domain.com /:port#"
>
> The problem I have is I can only run with about 100 nodes. If the number is higher, 1) will not invoke CMD and the total number of processes is about 130 or so.
>
> My question is how to remove that limit?
>
> Thanks in advance.
>
> Jianzhang
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users