Subject: Re: [MTT users] MTT trivial tests fails to complete on Centos5.3-x86_64 bit platform with OFED 1.5
From: Ethan Mallove (ethan.mallove_at_[hidden])
Date: 2009-11-19 12:31:56


On Thu, Nov/19/2009 08:53:12AM, Venkat Venkatsubra wrote:
> Hi Ethan,
>
> Yes, we are using http://svn.open-mpi.org/svn/mtt/trunk.
> After "svn update", this is what the screen output looks like when I
> rerun:
>
> Debug is 1, Verbose is 1
> *** MTT: ../client/mtt --debug -
> *** Running on mughal
> Chdir ../client
> Chdir /root/mtt-svn/samples
> Copying: stdin to /tmp/XzxTdnyg85.ini
> Expanding include_file(s) parameters in /tmp/XzxTdnyg85.ini
> Reading ini file: stdin
> *** ERROR: Syntax error in stdin: Line 36 in file /tmp/UqPeD61ett.ini is mal-formed:
> <<<<<<< .mine
> Line 41 in file /tmp/UqPeD61ett.ini is mal-formed:
> =======
> Line 53 in file /tmp/UqPeD61ett.ini is mal-formed:
> >>>>>>> .r1337 at /root/mtt-svn/lib/MTT/Messages.pm line 121.

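Those "<<<<<<< .mine", "=======", and ">>>>>>> .r1337" lines are SVN
conflict markers, so "svn update" left conflicts in your
samples/developer.ini file. Do you have your own local changes to that
file, e.g., near the "hosts" parameter? If you don't need them, I think
you just want to take the repository's version of the conflicting
sections with: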

  $ svn resolve --accept=theirs-conflict samples/developer.ini
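
If you want to check a file for leftover conflict markers before piping
it into mtt, something like the following should work. It's only a
sketch: the function name find_conflict_markers is made up for
illustration and is not part of MTT or SVN, and it assumes the three
marker forms shown in your error output ("<<<<<<< label", "=======",
">>>>>>> label").

```python
import re

# Assumption: conflict markers look like the ones in the error output above,
# i.e. seven "<" or ">" followed by a space and a label, or exactly seven "=".
MARKER = re.compile(r"^(<{7} |={7}$|>{7} )")


def find_conflict_markers(text):
    """Return (line_number, line) pairs for lines that look like conflict markers."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        if MARKER.match(line):
            hits.append((number, line))
    return hits


if __name__ == "__main__":
    import sys

    # Print every suspicious line as path:line_number: content.
    for path in sys.argv[1:]:
        with open(path) as handle:
            for number, line in find_conflict_markers(handle.read()):
                print(f"{path}:{number}: {line}")
```

Saved under any name you like, it takes the .ini files as arguments,
e.g. samples/developer.ini; no output means no markers were found.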

-Ethan

>
> Attached are the new developer.ini and the /tmp/UqPeD61ett.ini files.
>
> Venkat
>
> -----Original Message-----
> From: mtt-users-bounces_at_[hidden]
> [mailto:mtt-users-bounces_at_[hidden]] On Behalf Of Ethan Mallove
> Sent: Thursday, November 19, 2009 8:36 AM
> To: General user list for the MPI Testing Tool
> Subject: Re: [MTT users] MTT trivial tests fails to complete on
> Centos5.3-x86_64 bit platform with OFED 1.5
>
> On Wed, Nov/18/2009 12:17:24PM, Venkat Venkatsubra wrote:
> > Attached.
>
> That developer.ini file is out-of-date. Can you "svn update", and try
> to run again? I assume the SVN URL you're using is
> http://svn.open-mpi.org/svn/mtt/trunk?
>
> -Ethan
>
> >
> > -----Original Message-----
> > From: mtt-users-bounces_at_[hidden]
> > [mailto:mtt-users-bounces_at_[hidden]] On Behalf Of Ethan Mallove
> > Sent: Wednesday, November 18, 2009 1:41 PM
> > To: General user list for the MPI Testing Tool
> > Subject: Re: [MTT users] MTT trivial tests fails to complete on
> > Centos5.3-x86_64 bit platform with OFED 1.5
> >
> > Could you run with --debug (instead of --verbose), and send the
> > output?
> >
> > Thanks,
> > Ethan
> >
> > On Wed, Nov/18/2009 11:08:18AM, Venkat Venkatsubra wrote:
> > >
> > >
> > >
> > >
> > > From: Venkat Venkatsubra
> > > Sent: Wednesday, November 18, 2009 12:54 PM
> > > To: 'mtt-users_at_[hidden]'
> > > Subject: MTT trivial tests fails to complete on Centos5.3-x86_64 bit
> > > platform with OFED 1.5
> > >
> > >
> > >
> > > Hello All,
> > >
> > >
> > >
> > > How do I debug this problem? Attached are the developer.ini and
> > > trivial.ini files.
> > >
> > > I can provide any other information that you need.
> > >
> > >
> > >
> > > [root_at_samples]# cat /etc/issue
> > >
> > > CentOS release 5.3 (Final)
> > >
> > > Kernel \r on an \m
> > >
> > >
> > >
> > > [root_at_samples]# uname -a
> > >
> > > Linux 2.6.18-128.el5 #1 SMP Wed Jan 21 10:41:14 EST 2009 x86_64 x86_64 x86_64 GNU/Linux
> > >
> > >
> > >
> > > I am running OFED-1.5-20091029-0617 daily build.
> > >
> > >
> > >
> > > Started trivial tests using the following command:
> > >
> > >
> > >
> > > [root_at_samples]# cat developer.ini trivial.ini | ../client/mtt --verbose -
> > >
> > > ....
> > >
> > > ....
> > >
> > > >> Initializing reporter module: TextFile
> > >
> > > *** Reporter initialized
> > >
> > > *** MPI Get phase starting
> > >
> > > >> MPI Get: [mpi get: my installation]
> > >
> > > Checking for new MPI sources...
> > >
> > > Using MPI in: /usr/mpi/gcc/openmpi-1.3.2/
> > >
> > > *** WARNING: alreadyinstalled_mpi_type was not specified, defaulting to "OMPI".
> > >
> > > Got new MPI sources: version 1.3.2
> > >
> > > *** MPI Get phase complete
> > >
> > > *** MPI Install phase starting
> > >
> > > >> MPI Install [mpi install: my installation]
> > >
> > > Installing MPI: [my installation] / [1.3.2] / [my installation]...
> > >
> > > >> Reported to text file
> > >
> > >
> > >
> > > /root/mtt-svn/samples/MPI_Install-my_installation-my_installation-1.3.2.html
> > >
> > > >> Reported to text file
> > >
> > >
> > >
> > > /root/mtt-svn/samples/MPI_Install-my_installation-my_installation-1.3.2.txt
> > >
> > > Completed MPI Install successfully
> > >
> > > *** MPI Install phase complete
> > >
> > > *** Test Get phase starting
> > >
> > > >> Test Get: [test get: trivial]
> > >
> > > Checking for new test sources...
> > >
> > > Got new test sources
> > >
> > > *** Test Get phase complete
> > >
> > > *** Test Build phase starting
> > >
> > > >> Test Build [test build: trivial]
> > >
> > > Building for [my installation] / [1.3.2] / [my installation] / [trivial]
> > >
> > > >> Reported to text file
> > >
> > >
> > > /root/mtt-svn/samples/Test_Build-trivial-my_installation-1.3.2.html
> > >
> > > >> Reported to text file
> > >
> > >
> > > /root/mtt-svn/samples/Test_Build-trivial-my_installation-1.3.2.txt
> > >
> > > Completed test build successfully
> > >
> > > *** Test Build phase complete
> > >
> > > *** Test Run phase starting
> > >
> > > >> Test Run [trivial]
> > >
> > > >> Running with [my installation] / [1.3.2] / [my installation]
> > >
> > > Using MPI Details [open mpi] with MPI Install [my installation]
> > >
> > >
> > >
> > > During this stage the test stalls.
> > >
> > > After about 10 minutes the test gets killed.
> > >
> > > dmesg on the machine running the test displays the following output:
> > >
> > >
> > >
> > > ==========
> > >
> > > Dmesg output
> > >
> > > ==========
> > >
> > > Out of memory: Killed process 5346 (gdmgreeter).
> > >
> > > audispd invoked oom-killer: gfp_mask=0x201d2, order=0, oomkilladj=0
> > >
> > >
> > >
> > > Call Trace:
> > >
> > > [<ffffffff800c39dd>] out_of_memory+0x8e/0x2f5
> > >
> > > [<ffffffff8000f2eb>] __alloc_pages+0x245/0x2ce
> > >
> > > [<ffffffff80012a62>] __do_page_cache_readahead+0x95/0x1d9
> > >
> > > [<ffffffff80215932>] sock_readv+0xb7/0xd1
> > >
> > > [<ffffffff80088896>] __wake_up_common+0x3e/0x68
> > >
> > > [<ffffffff80013401>] filemap_nopage+0x148/0x322
> > >
> > > [<ffffffff80008863>] __handle_mm_fault+0x1f8/0xe5c
> > >
> > > [<ffffffff80066b9a>] do_page_fault+0x4cb/0x830
> > >
> > > [<ffffffff8005dde9>] error_exit+0x0/0x84
> > >
> > >
> > >
> > > Thanks!
> > >
> > >
> > >
> > > Venkat
> >
> >
> >
> > > _______________________________________________
> > > mtt-users mailing list
> > > mtt-users_at_[hidden]
> > > http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users
> >