Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |  

This web mail archive is frozen.

This page is part of a frozen web archive of this mailing list.

You can still navigate around this archive, but know that no new mails have been added to it since July of 2016.

Click here to be taken to the new web archives of this list; it includes all the mails that are in this frozen archive plus all new mails that have been sent to the list since it was migrated to the new archives.

From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2007-09-17 11:28:38

I think Terry was asking about running at larger scale on a regular
basis for correctness testing (i.e., nightly snapshot tarballs via MTT).

I, for one, would love to see the labs run some of our nightly
tarballs at larger scale so that we have a more consistent datapoint
of what "works" and what "doesn't work" at scale (even if it's a
coarse-grained measurement of the tests we have in ompi-tests). Even
if the tests are not run nightly -- running even a subset of them
even once a week even at "medium" scale would be great. I realize
that even with large clusters, we're all resource-constrained
(needing to let real users run and all that), but any testing on a
regular basis (even if it's sparse) would be really, really great/
useful/good for the code/good for the community/etc.

(yes, this is a not-so-subtle hint :-) )

On Sep 17, 2007, at 11:15 AM, Andrew Friedley wrote:

> I won't speak for the labs as a whole, but I generally don't run
> things
> at scale unless theres something specific I'm after, ie benchmarks or
> apps I'm using as a benchmark, rather than test suites.
> You might look at some of the purple benchmarks:
> code_list.html
> Andrew
> Terry Dontje wrote:
>> What about Sandia and LANL? Is there anything that is ran on their
>> large clusters to confirm things seem to work at high np's?
>> --td
>> Jeff Squyres wrote:
>>> Cisco is not yet testing that large, but we plan to shortly start
>>> testing at np>=128 (I'm waiting for an internal cluster within Cisco
>>> to be setup properly).
>>> On Sep 11, 2007, at 5:31 PM, Rolf.Vandevaart_at_[hidden] wrote:
>>>> I am curious which tests are being used when running tests on
>>>> larger
>>>> clusters. And by larger clusters, I mean anything with np > 128.
>>>> (Although I realize that is not very large, but it is bigger
>>>> than most
>>>> of the clusters I assume tests are being run on)
>>>> I ask this because I planned on using some of the intel tests, but
>>>> they
>>>> clearly have limitations starting at np=64.
>>>> To avoid mailing list clutter, feel free to just email me and I
>>>> will
>>>> summarize.
>>>> Rolf
>>>> _______________________________________________
>>>> devel mailing list
>>>> devel_at_[hidden]
>> _______________________________________________
>> devel mailing list
>> devel_at_[hidden]
> _______________________________________________
> devel mailing list
> devel_at_[hidden]

Jeff Squyres
Cisco Systems