Open MPI Development Mailing List Archives

From: Terry Dontje (Terry.Dontje_at_[hidden])
Date: 2007-09-17 13:00:05


Jeff Squyres wrote:
> I think Terry was asking about running at larger scale on a regular
> basis for correctness testing (i.e., nightly snapshot tarballs via MTT).
>
>
I was even hoping for something less regular, such as when certain people
go down to Sandia to see whether they can start up an np=4096 job.

--td
> I, for one, would love to see the labs run some of our nightly
> tarballs at larger scale so that we have a more consistent datapoint
> of what "works" and what "doesn't work" at scale (even if it's a
> coarse-grained measurement of the tests we have in ompi-tests). Even
> if the tests are not run nightly -- running even a subset of them
> even once a week even at "medium" scale would be great. I realize
> that even with large clusters, we're all resource-constrained
> (needing to let real users run and all that), but any testing on a
> regular basis (even if it's sparse) would be really, really great/
> useful/good for the code/good for the community/etc.
>
> (yes, this is a not-so-subtle hint :-) )
>
>
>
> On Sep 17, 2007, at 11:15 AM, Andrew Friedley wrote:
>
>
>> I won't speak for the labs as a whole, but I generally don't run things
>> at scale unless there's something specific I'm after, i.e., benchmarks or
>> apps I'm using as a benchmark, rather than test suites.
>>
>> You might look at some of the purple benchmarks:
>>
>> http://www.llnl.gov/asci/platforms/purple/rfp/benchmarks/limited/code_list.html
>>
>> Andrew
>>
>> Terry Dontje wrote:
>>
>>> What about Sandia and LANL? Is there anything that is run on their
>>> large clusters to confirm things seem to work at high np?
>>>
>>> --td
>>>
>>> Jeff Squyres wrote:
>>>
>>>> Cisco is not yet testing that large, but we plan to shortly start
>>>> testing at np>=128 (I'm waiting for an internal cluster within Cisco
>>>> to be set up properly).
>>>>
>>>>
>>>> On Sep 11, 2007, at 5:31 PM, Rolf.Vandevaart_at_[hidden] wrote:
>>>>
>>>>
>>>>
>>>>> I am curious which tests are being used when running tests on larger
>>>>> clusters. And by larger clusters, I mean anything with np > 128.
>>>>> (I realize that is not very large, but it is bigger than most of the
>>>>> clusters I assume tests are being run on.)
>>>>> I ask this because I planned on using some of the Intel tests, but
>>>>> they clearly have limitations starting at np=64.
>>>>>
>>>>> To avoid mailing list clutter, feel free to just email me and I will
>>>>> summarize.
>>>>>
>>>>> Rolf
>>>>>
>>>>>
>>>>> _______________________________________________
>>>>> devel mailing list
>>>>> devel_at_[hidden]
>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>>>>>
>>>>>
>>>>
>>
>
>
>