Open MPI logo

Open MPI Development Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Development mailing list

Subject: Re: [OMPI devel] Hang in collectives involving shared memory
From: Ashley Pittman (ashley_at_[hidden])
Date: 2009-06-10 11:57:04


On Wed, 2009-06-10 at 09:07 -0600, Ralph Castain wrote:
> Hi Ashley
>
> Thanks! I would definitely be interested and will look at the tool.

Great. My plan was to introduce the tool to this list today or tomorrow
anyway but this problem falls right it's it's target area so I brought
it up early.

> Meantime, I have filed a bunch of data on this in ticket #1944, so
> perhaps you might take a glance at that and offer some thoughts?
>
> https://svn.open-mpi.org/trac/ompi/ticket/1944

One thing that springs to mind is does the fortran reproducer hang on
other machines if you use the same process geometry. That would tell us
if we are looking for a pure OpenMPI problem or a wider issue,
potentially eliminating any questions about numa memory layout.

> Will be back after I look at the tool.

Ashley,

-- 
Ashley Pittman, Bath, UK.
Padb - A parallel job inspection tool for cluster computing
http://padb.pittman.org.uk