Subject: Re: [MTT users] MTT trivial tests fails to completeon Centos5.3-x86_64 bit platform with OFED 1.5
From: Ethan Mallove (ethan.mallove_at_[hidden])
Date: 2009-11-19 09:35:39


On Wed, Nov/18/2009 12:17:24PM, Venkat Venkatsubra wrote:
> Attached.

That developer.ini file is out-of-date. Can you "svn update", and try
to run again? I assume the SVN URL you're using is
http://svn.open-mpi.org/svn/mtt/trunk?

-Ethan

>
> -----Original Message-----
> From: mtt-users-bounces_at_[hidden]
> [mailto:mtt-users-bounces_at_[hidden]] On Behalf Of Ethan Mallove
> Sent: Wednesday, November 18, 2009 1:41 PM
> To: General user list for the MPI Testing Tool
> Subject: Re: [MTT users] MTT trivial tests fails to completeon
> Centos5.3-x86_64 bit platform with OFED 1.5
>
> Could you run with --debug (instead of --verbose), and send the
> output.
>
> Thanks,
> Ethan
>
> On Wed, Nov/18/2009 11:08:18AM, Venkat Venkatsubra wrote:
> >
> >
> >
> >
> > From: Venkat Venkatsubra
> > Sent: Wednesday, November 18, 2009 12:54 PM
> > To: 'mtt-users_at_[hidden]'
> > Subject: MTT trivial tests fails to complete on Centos5.3-x86_64
> bit
> > platform with OFED 1.5
> >
> >
> >
> > Hello All,
> >
> >
> >
> > How do I debug this problem ? Attached are the developer.ini and
> > trivial.ini files.
> >
> > I can provide any other information that you need.
> >
> >
> >
> > [root_at_samples]# cat /etc/issue
> >
> > CentOS release 5.3 (Final)
> >
> > Kernel \r on an \m
> >
> >
> >
> > [root_at_samples]# uname -a
> >
> > Linux 2.6.18-128.el5 #1 SMP Wed Jan 21 10:41:14 EST 2009 x86_64
> x86_64
> > x86_64 GNU/Linux
> >
> >
> >
> > I am running OFED-1.5-20091029-0617 daily build.
> >
> >
> >
> > Started trivial tests using the following command:
> >
> >
> >
> > [root_at_samples]# cat developer.ini trivial.ini | ../client/mtt
> --verbose -
> >
> > ....
> >
> > ....
> >
> > >> Initializing reporter module: TextFile
> >
> > *** Reporter initialized
> >
> > *** MPI Get phase starting
> >
> > >> MPI Get: [mpi get: my installation]
> >
> > Checking for new MPI sources...
> >
> > Using MPI in: /usr/mpi/gcc/openmpi-1.3.2/
> >
> > *** WARNING: alreadyinstalled_mpi_type was not specified,
> defaulting to
> >
> > "OMPI".
> >
> > Got new MPI sources: version 1.3.2
> >
> > *** MPI Get phase complete
> >
> > *** MPI Install phase starting
> >
> > >> MPI Install [mpi install: my installation]
> >
> > Installing MPI: [my installation] / [1.3.2] / [my
> installation]...
> >
> > >> Reported to text file
> >
> >
> >
> /root/mtt-svn/samples/MPI_Install-my_installation-my_installation-1.3.2.
> htm
> >
> > l
> >
> > >> Reported to text file
> >
> >
> >
> /root/mtt-svn/samples/MPI_Install-my_installation-my_installation-1.3.2.
> txt
> >
> > Completed MPI Install successfully
> >
> > *** MPI Install phase complete
> >
> > *** Test Get phase starting
> >
> > >> Test Get: [test get: trivial]
> >
> > Checking for new test sources...
> >
> > Got new test sources
> >
> > *** Test Get phase complete
> >
> > *** Test Build phase starting
> >
> > >> Test Build [test build: trivial]
> >
> > Building for [my installation] / [1.3.2] / [my installation] /
> > [trivial]
> >
> > >> Reported to text file
> >
> >
> /root/mtt-svn/samples/Test_Build-trivial-my_installation-1.3.2.html
> >
> > >> Reported to text file
> >
> >
> /root/mtt-svn/samples/Test_Build-trivial-my_installation-1.3.2.txt
> >
> > Completed test build successfully
> >
> > *** Test Build phase complete
> >
> > *** Test Run phase starting
> >
> > >> Test Run [trivial]
> >
> > >> Running with [my installation] / [1.3.2] / [my installation]
> >
> > Using MPI Details [open mpi] with MPI Install [my installation]
> >
> >
> >
> > During this stage the test stalls.
> >
> > After about ~10 minutes the test gets killed.
> >
> > dmesg on which the test is running displays the following output:
> >
> >
> >
> > ==========
> >
> > Dmesg output
> >
> > ==========
> >
> > Out of memory: Killed process 5346 (gdmgreeter).
> >
> > audispd invoked oom-killer: gfp_mask=0x201d2, order=0,
> oomkilladj=0
> >
> >
> >
> > Call Trace:
> >
> > [<ffffffff800c39dd>] out_of_memory+0x8e/0x2f5
> >
> > [<ffffffff8000f2eb>] __alloc_pages+0x245/0x2ce
> >
> > [<ffffffff80012a62>] __do_page_cache_readahead+0x95/0x1d9
> >
> > [<ffffffff80215932>] sock_readv+0xb7/0xd1
> >
> > [<ffffffff80088896>] __wake_up_common+0x3e/0x68
> >
> > [<ffffffff80013401>] filemap_nopage+0x148/0x322
> >
> > [<ffffffff80008863>] __handle_mm_fault+0x1f8/0xe5c
> >
> > [<ffffffff80066b9a>] do_page_fault+0x4cb/0x830
> >
> > [<ffffffff8005dde9>] error_exit+0x0/0x84
> >
> >
> >
> > Thanks!
> >
> >
> >
> > Venkat
>
>
>
> > _______________________________________________
> > mtt-users mailing list
> > mtt-users_at_[hidden]
> > http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users
>
> _______________________________________________
> mtt-users mailing list
> mtt-users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/mtt-users