On Aug 11, 2009, at 5:17 AM, Ashley Pittman wrote:
> On Tue, 2009-08-11 at 03:03 -0600, Ralph Castain wrote:
>> If it isn't already there, try putting a print statement tight at
>> program start, another just prior to MPI_Init, and another just after
>> MPI_Init. It could be that something is hanging somewhere during
>> program startup since it sounds like everything is launching just
> If you suspect a hang then you can use the command orte-ps (on the
> where the mpirun is running) and it should show you your job. This
> tell you if the job is started and still running or if there was a
> problem launching.
> If the program did start and has really hung then you can get more
> in-depth information about it using padb which is linked to in my
FWIW: we use padb for this purpose, and it is very helpful!
> Ashley Pittman, Bath, UK.
> Padb - A parallel job inspection tool for cluster computing
> users mailing list