Subject: Re: [OMPI users] PML add procs failed --> Returned "Unreachable" (-12) instead of "Success" (0)
From: Ralph Castain (rhc_at_[hidden])
Date: 2009-03-26 13:30:14
On Mar 26, 2009, at 10:59 AM, Alessandro Surace wrote:
> Hi Ralph,
> what do you mean to create/define a directly interface?
> The 3 hosts are network connected and ssh pub key enabled. Every
> hosts can see the other but they are not all on the same direct
> connected network . More in detail:
> grid01 and grid04 are in the same network
> grid03 is on different network.
This is the problem. If grid03 is on a different network, then there
is no way that an MPI process on that node can directly communicate
with one on grid04 or grid01. Grid03 must have a common network
interface with each of the machines, though it can be different for
In other words, grid03 and grid01 -must- have at least one network in
common. And grid04 and grid03 must also share at least one network,
though it can be different from the one that grid03 and grid01 share.
Does that help clarify?
> Although this difference the jobs on grid01 and grid03 run properly
> like that on grid01 and grid04. But the jobs that include
> simultaneously grid03 and grid04 fail.
> users mailing list