Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] FW: Re: [MTT users] Is the stock MPI that comes with OSX leopard broken with xgrid?
From: John Fink (john.fink_at_[hidden])
Date: 2008-12-17 11:07:33


Boy, it sure looks that way, doesn't it? So I guess the solution is to
compile a new openmpi directly on my xgrid controller and push that binary
to all the agents. I should probably do this anyway as the program I am
trying to run is a FORTRAN program, and I'm meant to understand that Apple's
OpenMPI doesn't support FORTRAN either. Hoo boy. Thanks Doug, Ethan and
Jeff.

jf

On Wed, Dec 17, 2008 at 10:29 AM, Doug Reeder <dlr_at_[hidden]> wrote:

> I believe that the openmpi that comes with leopard doesn't support xgrid.
> If you type ompi_info|grep xgrid you get nothing. I'm not sure what apple
> was thinking.
>
> Doug Reeder
>
> On Dec 17, 2008, at 6:30 AM, Ethan Mallove wrote:
>
> Hi John,
>>
>> I'm forwarding your question to the Open MPI users list.
>>
>> Regards,
>> Ethan
>>
>> On Wed, Dec/17/2008 08:35:00AM, John Fink wrote:
>>
>>> Hello OpenMPI folks,
>>>
>>> I've got a large pool of Macs running Leopard that are all on an xgrid.
>>> However, I can't seem to use the mpirun that comes with Leopard with
>>> the
>>> xgrid. I've got my grid and password environment variables set up okay
>>> on
>>> my controller, all the xgrid command line commands work (displaying
>>> grid
>>> IDs, things like that) but mpirun only wants to run things on the local
>>> host.
>>>
>>> I'm extremely new to OpenMPI and only slightly less new to Macs so
>>> there's
>>> probably something very obvious that I'm missing, but I'm trying what's
>>> detailed on this page:
>>> http://www.macresearch.org/runing_mpi_job_through_xgrid (the
>>> /bin/hostname
>>> example). Here's my output:
>>>
>>> as-0003-l:~ locadmin$ mpirun -n 8 /bin/hostname
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>> as-0003-l.lib.mcmaster.ca
>>>
>>> Issuing the same command with -nolocal yields the following:
>>>
>>> as-0003-l:~ locadmin$ mpirun --nolocal -n 8 /bin/hostname
>>>
>>> --------------------------------------------------------------------------
>>> There are no available nodes allocated to this job. This could be
>>> because
>>> no nodes were found or all the available nodes were already used.
>>>
>>> Note that since the -nolocal option was given no processes can be
>>> launched on the local node.
>>>
>>> --------------------------------------------------------------------------
>>> [as-0003-l.lib.mcmaster.ca:82776] [0,0,0] ORTE_ERROR_LOG: Temporarily
>>> out
>>> of resource in file
>>>
>>> /SourceCache/openmpi/openmpi-5/openmpi/orte/mca/rmaps/base/rmaps_base_support_fns.c
>>> at line 168
>>> [as-0003-l.lib.mcmaster.ca:82776] [0,0,0] ORTE_ERROR_LOG: Temporarily
>>> out
>>> of resource in file
>>>
>>> /SourceCache/openmpi/openmpi-5/openmpi/orte/mca/rmaps/round_robin/rmaps_rr.c
>>> at line 402
>>> [as-0003-l.lib.mcmaster.ca:82776] [0,0,0] ORTE_ERROR_LOG: Temporarily
>>> out
>>> of resource in file
>>>
>>> /SourceCache/openmpi/openmpi-5/openmpi/orte/mca/rmaps/base/rmaps_base_map_job.c
>>> at line 210
>>> [as-0003-l.lib.mcmaster.ca:82776] [0,0,0] ORTE_ERROR_LOG: Temporarily
>>> out
>>> of resource in file
>>> /SourceCache/openmpi/openmpi-5/openmpi/orte/mca/rmgr/urm/rmgr_urm.c at
>>> line 372
>>> [as-0003-l.lib.mcmaster.ca:82776] mpirun: spawn failed with errno=-3
>>>
>>> Thanks very much for any help you can provide!
>>>
>>> jf
>>>
>>> --
>>> http://libgrunt.blogspot.com -- library culture and technology.
>>>
>>> References
>>>
>>> Visible links
>>> . http://www.macresearch.org/runing_mpi_job_through_xgrid
>>> . http://as-0003-l.lib.mcmaster.ca/
>>> . http://as-0003-l.lib.mcmaster.ca/
>>> . http://as-0003-l.lib.mcmaster.ca/
>>> . file:///tmp/http:/as-0003-l.lib.mcmaster.ca:82776
>>> . file:///tmp/http:/as-0003-l.lib.mcmaster.ca:82776
>>> . file:///tmp/http:/as-0003-l.lib.mcmaster.ca:82776
>>> . file:///tmp/http:/as-0003-l.lib.mcmaster.ca:82776
>>> . file:///tmp/http:/as-0003-l.lib.mcmaster.ca:82776
>>> . http://libgrunt.blogspot.com/
>>>
>>
>> _______________________________________________
>>> mtt-users mailing list
>>> mtt-users_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users
>>>
>>
>> _______________________________________________
>> users mailing list
>> users_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>
>

-- 
http://libgrunt.blogspot.com -- library culture and technology.