On Jan 3, 2013, at 3:01 AM, Ake Sandgren <ake.sandgren_at_[hidden]> wrote:
> On Thu, 2013-01-03 at 11:54 +0100, Ake Sandgren wrote:
>> On Thu, 2013-01-03 at 11:15 +0100, Ake Sandgren wrote:
>>> The grpcomm component hier seems to have vanished between 1.6.1 and
>>> It seems that the version of slurm we are using (not the latest at the
>>> moment) is using it for startup.
It should be using PMI if you are directly launching processes via srun, and should not be using hier any more.
>> Hmm it seems it is the ess_slurmd_module.c that is using grpcomm=hier.
Yes - that is the *only* scenario (a direct launch of procs via srun) that should use hier
> orte/mca/plm/base/plm_base_rsh_support.c also tries to use the hier
Something is very wrong if that is true. How was this configured, and how are you starting this job?
> users mailing list