or do i just need to compile two versions, one with IB and one without?
On Thu, Jan 24, 2013 at 9:09 AM, Sabuj Pattanayek <sabujp_at_[hidden]> wrote:
> ahha, with --display-allocation I'm getting :
> mca: base: component_find: unable to open
> libpsm_infinipath.so.1: cannot open shared object file: No such file
> or directory (ignored)
> I think the system I compiled it on has different ib libs than the
> nodes. I'll need to recompile and then see if it runs, but is there
> anyway to get it to ignore IB and just use gigE? Not all of our nodes
> have IB and I just want to use any node.
> On Thu, Jan 24, 2013 at 8:52 AM, Ralph Castain <rhc_at_[hidden]> wrote:
>> How did you configure OMPI? If you add --display-allocation to your cmd line, does it show all the nodes?
>> On Jan 24, 2013, at 6:34 AM, Sabuj Pattanayek <sabujp_at_[hidden]> wrote:
>>> I'm submitting a job through torque/PBS, the head node also runs the
>>> Moab scheduler, the .pbs file has this in the resources line :
>>> #PBS -l nodes=2:ppn=4
>>> I've also tried something like :
>>> #PBS -l procs=56
>>> and at the end of script I'm running :
>>> mpirun -np 8 cat /dev/urandom > /dev/null
>>> mpirun -np 56 cat /dev/urandom > /dev/null
>>> ...depending on how many processors I requested. The job starts,
>>> $PBS_NODEFILE has the nodes that the job was assigned listed, but all
>>> the cat's are piled onto the first node. Any idea how I can get this
>>> to submit jobs across multiple nodes? Note, I have OSU mpiexec working
>>> without problems with mvapich and mpich2 on our cluster to launch jobs
>>> across multiple nodes.
>>> users mailing list
>> users mailing list