Open MPI logo

MTT Devel Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all MTT Devel mailing list

Subject: Re: [MTT devel] Analysis of hung jobs.
From: Ethan Mallove (ethan.mallove_at_[hidden])
Date: 2009-10-08 09:51:42


On Wed, Oct/07/2009 09:38:07PM, Ashley Pittman wrote:
> On Wed, 2009-10-07 at 16:21 -0400, Ethan Mallove wrote:
>
> > No secret file (/home/em162155/.padb-secret)
> > Error: Could not load secret file on this node
>
> You need to do this once to set a secret key for security purposes, run
> the following two commands and try again.
>
> echo secret=ochi4aeZ > /home/em162155/.padb-secret
> chmod 0600 /home/em162155/.padb-secret

Getting closer ...

  $ padb --verbose --debug=all --config-option rmgr=mpirun --full-report=6336
  ...
  full job report for job 6336

  Attaching to job 6336
  mpirun resource manager requires pdsh to be installed
  Use of uninitialized value in printf at padb line 729.
  Use of uninitialized value in printf at padb line 729.
  DEBUG (verbose): 0: There are 0 processes over 0 hosts
  Fatal problem setting up the resource manager: mpirun

I assume it's referring to the below "pdsh"?

  http://sourceforge.net/projects/pdsh

-Ethan

>
> Ashley,
>
> --
>
> Ashley Pittman, Bath, UK.
>
> Padb - A parallel job inspection tool for cluster computing
> http://padb.pittman.org.uk
>
> _______________________________________________
> mtt-devel mailing list
> mtt-devel_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/mtt-devel