On Mon, Feb 20, 2012 at 06:11:53PM -0500, Rayson Ho wrote:
> BTW, since most of the interviewees are opensource project
> maintainers, next time can you ask them how much external contribution
> they get (%), and who are the main external contributors (students?
> HPC labs? Industry?), and how do they handle external contributions
> (need copyright assignment?). And how do they handle testing, and
> performance regression...

external contributions: I wish I had more quantitative numbers for
you. I don't develop on a Lustre system, so we were grateful to the
community for contributing and testing an improved Lustre driver for
ROMIO. Weikuan Yu started work on a Lustre driver while he worked at
Oak Ridge, then Sun/CFS contributed some more improvements. Pascal
Deveze from Bull and Martin Pokorny from NRAO helped carry it over
the finish line, contributing some important bug fixes and nice little
performance improvement tweaks.

IBM has been a great industry partner, contributing improvements to
all of MPICH2. For BlueGene, IBM contributed a block-aligned
collective I/O implementation and an I/O aggregation strategy that
works better for the BlueGene topology. They also contributed what
we are calling "64 bit MPI_Aint", which works around a problem with MPI
file views on platforms where MPI_Aint is only a 32 bit integer.
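
To make the 32 bit problem concrete, here is a rough sketch in plain Python (no MPI involved). It assumes the trouble is a byte displacement in a file view that no longer fits in a signed 32 bit integer; that reading is my assumption, and the helper names are made up for illustration.

```python
# Sketch only: models a signed 32 bit integer with plain Python arithmetic.
# fits_in_int32 and wrap_to_int32 are hypothetical helpers, not MPI or ROMIO calls.
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1

def fits_in_int32(value):
    """True if value can be stored in a signed 32 bit integer."""
    return INT32_MIN <= value <= INT32_MAX

def wrap_to_int32(value):
    """What value becomes if truncated to a signed 32 bit integer."""
    return (value - INT32_MIN) % 2**32 + INT32_MIN

# Assumed example: a displacement 3 GiB into a file.
displacement = 3 * 2**30

print(fits_in_int32(displacement))   # False
print(wrap_to_int32(displacement))   # -1073741824
```

A file larger than 2 GiB is routine on these machines, so a displacement type that wraps like this cannot describe the whole file.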

Our best academic partner -- and really at this point we should
consider them co-maintainers -- is Northwestern University. I've
worked with Wei-Keng for a decade and am always happy to see a
question, suggestion, or patch from him in my mailbox. Northwestern
provided us with some great students over the years as well. Avery Ching
and Kenin Coloma did a lot of good work on MPI-IO before the Bay Area
lured them to industry.

ROMIO's testing and performance regression framework is honestly a
shambles. Part of that is a challenge with the MPI-IO interface
itself. For MPI messaging you exercise the API and you have pretty
much covered everything. MPI-IO, though, introduces hints. These
hints are great for tuning but make the testing "surface area" a lot
larger. We are probably going to have a chance to improve things
greatly with some recently funded proposals.
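
To give a feel for how hints blow up the testing "surface area", here is a small sketch in Python that just counts configurations. The hint names are real ROMIO hints, but the subset and the two buffer sizes are picked arbitrarily for illustration, and hint_configurations is a made-up helper, not part of any test suite.

```python
from itertools import product

# Illustrative subset of ROMIO hints; the values listed for
# cb_buffer_size are arbitrary picks, not recommended settings.
HINTS = {
    "romio_cb_read": ["enable", "disable", "automatic"],
    "romio_cb_write": ["enable", "disable", "automatic"],
    "romio_ds_read": ["enable", "disable", "automatic"],
    "romio_ds_write": ["enable", "disable", "automatic"],
    "cb_buffer_size": ["1048576", "16777216"],
}

def hint_configurations(hints):
    """Yield one dict per combination of hint values (hypothetical helper)."""
    keys = list(hints)
    for values in product(*(hints[k] for k in keys)):
        yield dict(zip(keys, values))

configs = list(hint_configurations(HINTS))
print(len(configs))  # 3 * 3 * 3 * 3 * 2 = 162
```

Five hints already mean 162 variants of every single I/O test, before considering different file systems or access patterns.
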

Mathematics and Computer Science Division
Argonne National Lab, IL USA