Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |  

This web mail archive is frozen.

This page is part of a frozen web archive of this mailing list.

You can still navigate around this archive, but know that no new mails have been added to it since July of 2016.

Click here to be taken to the new web archives of this list; it includes all the mails that are in this frozen archive plus all new mails that have been sent to the list since it was migrated to the new archives.

Subject: Re: [OMPI users] MTT -trivial :All tests are not getting passed
From: Ethan Mallove (ethan.mallove_at_[hidden])
Date: 2009-12-22 09:12:34


Hi Vishal,

This is an MTT question for mtt-users_at_[hidden] (see comments
below).

On Tue, Dec/22/2009 03:54:08PM, vishal shorghar wrote:
> Hi All,
>
> I have one issue with MTT trivial tests.All tests are not getting
> passed,Please read below for detailed description.
>
> Today I ran mtt trivial tests with latest ofed package
> OFED-1.5-20091217-0600 (ompi-1.4), between two machines,I was able to run
> the MTT trivial tests manually but not through MTT framework. I think we
> are missing some configuration steps since it is unable to find the test
> executables in the test run phase of the MTT.
>
> -> When we ran it through MTT it gave us the error and exits.
> I ran the test as "cat developer.ini trivial.ini | ../client/mtt
> --verbose - "
>
> -> When we analyzed error from
> /root/mtt-svn/samples/Test_Run-trivial-my_installation-1.4.txt file we
> found it is not getting the executable files of the different test to
> execute.
>
> -> Then we found that those executables were being generated only on one
> of the machine out of two machines. So, we manually copied the tests from
> /root/mtt-svn/samples/installs/nRpF/tests/trivial/test_get__trivial/c_ring
> to another machine.
>
> -> And we ran it manually as shown below and it worked fine:
> mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self
> --prefix
> /usr/mpi/gcc/openmpi-1.4/root/mtt-svn/samples/installs/nRpF/tests/trivial/test_get__trivial/c_ring
>
> -> I am attaching file trivial.ini,developer.ini and
> /root/mtt-svn/samples/Test_Run-trivial-my_installation-1.4.txt.
>
> Let us know if I am missing some configuration steps.
>

You need to set your scratch directory (via the --scratch option) to
an NFS share that is accessible to all nodes in your hostlist. MTT
won't copy local files onto each node for you.

Regards,
Ethan

> NOTE:
> ====
> It gave me following output at the end of execution of test command and
> the same is saved in /root/mtt-svn/samples/All_phase-summary.txt
>
> hostname: nizam
> uname: Linux nizam 2.6.18-128.el5 #1 SMP Wed Jan 21 10:41:14 EST 2009
> x86_64 x86_64 x86_64 GNU/Linux
> who am i:
>
> +-------------+-----------------+-------------+----------+------+------+----------+------+------------------------------------------------------+
> | Phase | Section | MPI Version | Duration | Pass | Fail |
> Time out | Skip | Detailed report |
> +-------------+-----------------+-------------+----------+------+------+----------+------+------------------------------------------------------+
> | MPI Install | my installation | 1.4 | 00:00 | 1 |
> | | | MPI_Install-my_installation-my_installation-1.4.html |
> | Test Build | trivial | 1.4 | 00:01 | 1 |
> | | | Test_Build-trivial-my_installation-1.4.html |
> | Test Run | trivial | 1.4 | 00:10 | | 8
> | | | Test_Run-trivial-my_installation-1.4.html |
> +-------------+-----------------+-------------+----------+------+------+----------+------+------------------------------------------------------+
>
> Total Tests: 10
> Total Failures: 8
> Total Passed: 2
> Total Duration: 11 secs. (00:11)
>
> Thanks & Regards,
>
> Vishal shorghar
> MTS
> Chelsio Communication

> #
> # Copyright (c) 2007 Sun Microystems, Inc. All rights reserved.
> #
>
> # Template MTT configuration file for Open MPI developers. The intent
> # for this template file is to establish at least some loose
> # guidelines for what Open MPI core developers should be running
> # before committing changes to the OMPI repository. This file is not
> # intended to be an exhaustive sample of all possible fields and
> # values that MTT offers. Each developer will undoubtedly have to
> # edit this template for their own needs (e.g., pick compilers to use,
> # etc.), but this file provides a baseline set of configurations that
> # we intend for you to run.
> #
> # Sample usage:
> # cat developer.ini intel.ini | client/mtt - alreadyinstalled_dir=/your/install
> # cat developer.ini trivial.ini | client/mtt - alreadyinstalled_dir=/your/install
> #
>
> [MTT]
> # No overrides to defaults
>
> # Fill this field in
>
> #hostlist = 102.77.77.63 102.77.77.54 102.77.77.64 102.77.77.68
> #hostlist = 102.77.77.66 102.77.77.68 102.77.77.63 102.77.77.64 102.77.77.53 102.77.77.54 102.77.77.243 102.77.77.65
> hostlist = 102.77.77.64 102.77.77.68
> hostlist_max_np = 2
> max_np = 2
> force = 1
> #prefix = /usr/mpi/gcc/openmpi-1.3.4/bin
>
> #----------------------------------------------------------------------
>
> [MPI Details: Open MPI]
>
> exec = mpirun @hosts@ -np &test_np() @mca@ --prefix &test_prefix() &test_executable() &test_argv()
>
> mca = --mca btl openib,sm,self
>
> hosts = <<EOT
> &if(
> &have_hostfile(),
> &stringify("--hostfile ", &hostfile()),
> &if(
> &have_hostlist(),
> &stringify("--host ", &hostlist()),
> ""
> )
> )
> EOT
>
> # Here is a good place to put any cleanup commands (e.g.,
> # kill a stale process, remove a leftover file, etc.)
> after_all_exec = &shell("echo done.")
>
> #----------------------------------------------------------------------
>
> [MPI get: My Installation]
> mpi_details = Open MPI
> # Leave this parameter blank to
> # have MTT search your path for an MPI
> # alreadyinstalled_dir = /your/installation
> module = AlreadyInstalled
>
> #----------------------------------------------------------------------
>
> [MPI install: My Installation]
> mpi_get = My Installation
> module = Analyze::OMPI
>
> #----------------------------------------------------------------------
>
> [Reporter: text file backup]
> module = TextFile
>
> textfile_filename = $phase-$section-$mpi_name-$mpi_version.txt
>
> # User-defined report headers/footers
> textfile_summary_header = <<EOT
> hostname: &shell("hostname")
> uname: &shell("uname -a")
> who am i: &shell("who am i")
> EOT
>
> textfile_summary_footer =
> textfile_detail_header =
> textfile_detail_footer =
>
> textfile_textwrap = 78

>
>
> +--------------------------+-------------------------------------------------------------------------------+
> | Field | Value |
> +--------------------------+-------------------------------------------------------------------------------+
> | description | |
> | environment | |
> | exit_signal | -1 |
> | launcher | mpirun |
> | mpi_install_section_name | my installation |
> | mpi_name | my installation |
> | mpi_version | 1.4 |
> | network | |
> | np | 2 |
> | parameters | |
> | phase | Test Run |
> | resource_manager | none |
> | result_stderr | |
> | test_build_section_name | trivial |
> | variant | 1 |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./c_ring |
> | duration | 1 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./c_ring |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107371 |
> | start_timestamp_human | Fri Dec 18 03:36:11 2009 |
> | test_name | c_ring |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./cxx_ring |
> | duration | 2 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./cxx_ring |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107372 |
> | start_timestamp_human | Fri Dec 18 03:36:12 2009 |
> | test_name | cxx_ring |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./cxx_hello |
> | duration | 2 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./cxx_hello |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107374 |
> | start_timestamp_human | Fri Dec 18 03:36:14 2009 |
> | test_name | cxx_hello |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./f90_hello |
> | duration | 2 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./f90_hello |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107376 |
> | start_timestamp_human | Fri Dec 18 03:36:16 2009 |
> | test_name | f90_hello |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./f90_ring |
> | duration | 1 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./f90_ring |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107378 |
> | start_timestamp_human | Fri Dec 18 03:36:18 2009 |
> | test_name | f90_ring |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./c_hello |
> | duration | 4 seconds |
> | exit_value | 0 |
> | result_message | Failed; timeout expired (2 seconds) ) |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./c_hello |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | | mpirun: abort is already in progress...hit ctrl-c again to forcibly terminate |
> | start_timestamp | 1261107379 |
> | start_timestamp_human | Fri Dec 18 03:36:19 2009 |
> | test_name | c_hello |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./f77_hello |
> | duration | 1 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./f77_hello |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107383 |
> | start_timestamp_human | Fri Dec 18 03:36:23 2009 |
> | test_name | f77_hello |
> | | |
> | command | mpirun --host 102.77.77.64,102.77.77.68 -np 2 --mca btl openib,sm,self |
> | | --prefix /usr/mpi/gcc/openmpi-1.4/ ./f77_ring |
> | duration | 1 seconds |
> | exit_value | 131 |
> | result_message | Failed; exit status: 131 |
> | result_stdout | -------------------------------------------------------------------------- |
> | | mpirun was unable to launch the specified application as it could not access |
> | | or execute an executable: |
> | | |
> | | Executable: ./f77_ring |
> | | Node: 102.77.77.64 |
> | | |
> | | while attempting to start process rank 0. |
> | | -------------------------------------------------------------------------- |
> | start_timestamp | 1261107385 |
> | start_timestamp_human | Fri Dec 18 03:36:25 2009 |
> | test_name | f77_ring |
> | | |
> +--------------------------+-------------------------------------------------------------------------------+
>

> #
> # Copyright (c) 2007 Sun Microystems, Inc. All rights reserved.
> #
>
> #
> # This INI file is an addendum to samples/developer.ini (which only
> # contains Trivial tests).
> #
> # Sample usage:
> # cat developer.ini trivial.ini | client/mtt - alreadyinstalled_dir=/your/install
> #
>
> [Test get: trivial]
> module = Trivial
>
> #----------------------------------------------------------------------
>
> [Test build: trivial]
> test_get = trivial
> save_stdout_on_success = 1
> merge_stdout_stderr = 1
> stderr_save_lines = 100
>
> module = Trivial
>
> #----------------------------------------------------------------------
>
> [Test run: trivial]
> test_build = trivial
> pass = &and(&cmd_wifexited(), &eq(&cmd_wexitstatus(), 0))
> timeout = &test_np()
> save_stdout_on_pass = 1
> merge_stdout_stderr = 1
> stdout_save_lines = 100
> np = &env_max_procs()
>
> specify_module = Simple
> simple_only:tests = &find_executables(".")
>
> #----------------------------------------------------------------------
>

> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users