I'm afraid you're right... I was testing it with Open MPI on my laptop, but later on the cluster I had some problems... Probably a colleague has uploaded mpich...
But I thought the behavior I see might be "implementation-independant". Probably sounds stupid... :)
Thanks anyway :)

 

2009/10/12 <users-request@open-mpi.org>
Send users mailing list submissions to
       users@open-mpi.org

To subscribe or unsubscribe via the World Wide Web, visit
       http://www.open-mpi.org/mailman/listinfo.cgi/users
or, via email, send a message with subject or body 'help' to
       users-request@open-mpi.org

You can reach the person managing the list at
       users-owner@open-mpi.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of users digest..."


Today's Topics:

  1. problem with mpiexec/mpirun (Jovana Knezevic)
  2. Re: problem with mpiexec/mpirun (Ralph Castain)


----------------------------------------------------------------------

Message: 1
Date: Mon, 12 Oct 2009 17:01:03 +0200
From: Jovana Knezevic <jovana.knezevic.83@gmail.com>
Subject: [OMPI users] problem with mpiexec/mpirun
To: users@open-mpi.org
Message-ID:
       <9d13e50c0910120801p4058214n7e5de181c09b0e3@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1

Hello everyone!

I am trying to run 11 instances of my program on 6 dual-core Opterons
(it is not time-consuming application anyway, takes 10 seconds at
one-core laptop :)).
so, when I type:

 mpiexec -machinefile hostfile -n 11 ./program

nothing happens!

The output of:

"mpdtrace -l"  command (from the machine I type the command at) is:

lx64a171_41469 (10.156.70.171)
lx64a176_47945 (10.156.70.176)
lx64a175_42990 (10.156.70.175)
lx64a174_39601 (10.156.70.174)
lx64a173_45387 (10.156.70.173)
lx64a172_55297 (10.156.70.172)

(seems that all 6 machines are there)

Does anyone have any idea/clue what the reason could be?

Thanks in advance!

Regards,
Jovana


------------------------------

Message: 2
Date: Mon, 12 Oct 2009 09:10:51 -0600
From: Ralph Castain <rhc@open-mpi.org>
Subject: Re: [OMPI users] problem with mpiexec/mpirun
To: Open MPI Users <users@open-mpi.org>
Message-ID: <A99B8347-8E81-4DD7-8B37-B541FA0AB700@open-mpi.org>
Content-Type: text/plain; charset=us-ascii; format=flowed; delsp=yes

Hate to say this, but you don't appear to be using Open MPI.
"mpdtrace" is an MPICH command, last I checked.

You might try their mailing list, or check which mpiexec you are using
and contact them.


On Oct 12, 2009, at 9:01 AM, Jovana Knezevic wrote:

> Hello everyone!
>
> I am trying to run 11 instances of my program on 6 dual-core Opterons
> (it is not time-consuming application anyway, takes 10 seconds at
> one-core laptop :)).
> so, when I type:
>
> mpiexec -machinefile hostfile -n 11 ./program
>
> nothing happens!
>
> The output of:
>
> "mpdtrace -l"  command (from the machine I type the command at) is:
>
> lx64a171_41469 (10.156.70.171)
> lx64a176_47945 (10.156.70.176)
> lx64a175_42990 (10.156.70.175)
> lx64a174_39601 (10.156.70.174)
> lx64a173_45387 (10.156.70.173)
> lx64a172_55297 (10.156.70.172)
>
> (seems that all 6 machines are there)
>
> Does anyone have any idea/clue what the reason could be?
>
> Thanks in advance!
>
> Regards,
> Jovana
> _______________________________________________
> users mailing list
> users@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users



------------------------------

_______________________________________________
users mailing list
users@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users

End of users Digest, Vol 1373, Issue 2
**************************************