Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Error when attempting to run LAMMPS on Centos 6.2 with OpenMPI
From: #YEO JINGJIE# (JYEO1_at_[hidden])
Date: 2013-01-28 02:35:54


I obtained exactly the same error:

[NTU-2:24680] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file ess_hnp_module.c at line 194
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
  orte_plm_base_select failed
  --> Returned value Not found (-13) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
[NTU-2:24680] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file runtime/orte_init.c at line 128
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
  orte_ess_set_name failed
  --> Returned value Not found (-13) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
[NTU-2:24680] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file orterun.c at line 616

This does seem to be incredibly perplexing, I will attempt a proper (non-packaged) installation for my cluster once more and determine whether it works. Thank you so much for all the help!

Regards,
Jingjie Yeo
Ph.D. Student
School of Mechanical and Aerospace Engineering
Nanyang Technological University, Singapore

________________________________________
From: Ralph Castain [rhc.openmpi_at_[hidden]] on behalf of Ralph Castain [rhc_at_[hidden]]
Sent: Monday, 28 January, 2013 12:24:23 AM
To: #YEO JINGJIE#; Open MPI Users
Subject: Re: [OMPI users] Error when attempting to run LAMMPS on Centos 6.2 with OpenMPI

On Jan 26, 2013, at 11:18 PM, #YEO JINGJIE# <JYEO1_at_[hidden]> wrote:

> So I should run the job as:
>
> /usr/lib64/openmpi/bin/mpirun -mca mca_component_show_load_errors 1 -n 16 /opt/lammps-21Jan13/lmp_linux < zigzag.in
>
> Is that correct?

Yes, thanks - though for our purposes, why don't you simplify it to:

/usr/lib64/openmpi/bin/mpirun -mca mca_component_show_load_errors 1 -n 1 hostname

>
> Regards,
> Jingjie Yeo
> Ph.D. Student
> School of Mechanical and Aerospace Engineering
> Nanyang Technological University, Singapore
>
> ________________________________________
> From: users-bounces_at_[hidden] [users-bounces_at_[hidden]] on behalf of Ralph Castain [rhc_at_[hidden]]
> Sent: Sunday, 27 January, 2013 11:58:51 AM
> To: Open MPI Users
> Subject: Re: [OMPI users] Error when attempting to run LAMMPS on Centos 6.2 with OpenMPI
>
> One thing you might try: add "-mca mca_component_show_load_errors 1" to your mpirun cmd line. This will tell us if the libraries have some missing dependencies.
>
> It's the main reason I dislike installing from a package - the package assumes that your system is configured identically to that of the one used to generate the package. This is rarely the case - much easier to just download an OMPI tarball, configure and compile it yourself.
>
>
> On Jan 26, 2013, at 7:32 PM, #YEO JINGJIE# <JYEO1_at_[hidden]> wrote:
>
>> Hi Jeff,
>>
>> Sorry the original error info was lost along the way, I'm terribly new to linux and I am trying to compile OMPI and to run a program, LAMMPS using the command:
>>
>> /usr/lib64/openmpi/bin/mpirun -n 16 /opt/lammps-21Jan13/lmp_linux < zigzag.in
>>
>> And I received the errors:
>>
>> [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file ess_hnp_module.c at line 194
>> --------------------------------------------------------------------------
>> It looks like orte_init failed for some reason; your parallel process is
>> likely to abort. There are many reasons that a parallel process can
>> fail during orte_init; some of which are due to configuration or
>> environment problems. This failure appears to be an internal failure;
>> here's some additional information (which may only be relevant to an
>> Open MPI developer):
>> orte_plm_base_select failed
>> --> Returned value Not found (-13) instead of ORTE_SUCCESS
>> --------------------------------------------------------------------------
>> [NTU-2:24127] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file runtime/orte_init.c at line 128
>> --------------------------------------------------------------------------
>> It looks like orte_init failed for some reason; your parallel process is
>> likely to abort. There are many reasons that a parallel process can
>> fail during orte_init; some of which are due to configuration or
>> environment problems. This failure appears to be an internal failure;
>> here's some additional information (which may only be relevant to an
>> Open MPI developer):
>> orte_ess_set_name failed
>> --> Returned value Not found (-13) instead of ORTE_SUCCESS
>> --------------------------------------------------------------------------
>> [NTU-2:24127] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file orterun.c at line 616
>>
>>
>> I have attached the output for "ompi_info --all" in this email. Any advise and guidance is greatly appreciated, thank you!
>>
>>
>> Regards,
>> Jingjie Yeo
>> Ph.D. Student
>> School of Mechanical and Aerospace Engineering
>> Nanyang Technological University, Singapore
>>
>> ________________________________________
>> From: users-bounces_at_[hidden] [users-bounces_at_[hidden]] on behalf of Jeff Squyres (jsquyres) [jsquyres_at_[hidden]]
>> Sent: Saturday, 26 January, 2013 9:51:43 PM
>> To: Open MPI Users
>> Subject: Re: [OMPI users] Error when attempting to run LAMMPS on Centos 6.2 with OpenMPI
>>
>> Can you provide more information? You really haven't said *what* the problem is that is occurring (even in your followup emails to Ralph). Without such information, it is very hard for us to guess what is wrong.
>>
>> Please send all the information listed here:
>>
>> http://www.open-mpi.org/community/help/
>>
>>
>>
>> On Jan 25, 2013, at 8:51 PM, #YEO JINGJIE# <JYEO1_at_[hidden]>
>> wrote:
>>
>>> I tried to follow the installation instructions over here:
>>>
>>> http://amusecode.org/doc/install/install-prerequisites-redhat.html
>>>
>>> And I am using bash and yum.
>>>
>>> Regards,
>>> Jingjie Yeo
>>> Ph.D. Student
>>> School of Mechanical and Aerospace Engineering
>>> Nanyang Technological University, Singapore
>>> _______________________________________________
>>> users mailing list
>>> users_at_[hidden]
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>>
>>
>> --
>> Jeff Squyres
>> jsquyres_at_[hidden]
>> For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/
>>
>>
>> _______________________________________________
>> users mailing list
>> users_at_[hidden]
>> http://www.open-mpi.org/mailman/listinfo.cgi/users>_______________________________________________
>> users mailing list
>> users_at_[hidden]
>>
http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users