Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] error mesages appeared but program runs successfully?
From: Daofeng Li (lidaof_at_[hidden])
Date: 2010-12-12 22:14:01


Hi Gus Correa,

Thank you so much for such a detailed explanation :)
I understand that now.

Best wishes ^_^

On Mon, Dec 6, 2010 at 7:41 AM, Gustavo Correa <gus_at_[hidden]> wrote:

> Hi Daofeng
>
> It is hard to tell what is happening on the Infiniband side of the problem.
> Did somebody perhaps remove the Infiniband card from this machine?
> Was it ever there?
> Did somebody perhaps change the Linux kernel modules that are loaded
> (perhaps changing /etc/module.config or similar)?
> Maybe other people in your organization know.
>
> If this is a single computer, not a cluster, you don't lose anything by not
> having Infiniband.
> In this case, you can reinstall OpenMPI without Infiniband support by
> doing "make distclean" in the OpenMPI build directory (to clean up what is there),
> then "./configure --prefix=/wherever/you/want/to/install --without-openib",
> then "make", and "make install".
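The recipe above could be scripted roughly as follows. This is only a sketch: the `run` and `rebuild_without_openib` helpers, the `DRY_RUN` switch, and the example paths are illustrative assumptions, not part of Open MPI. By default it only prints the commands it would execute.

```shell
#!/bin/sh
# Sketch of the rebuild recipe. DRY_RUN is a hypothetical switch:
# 1 (default) only prints each command, 0 actually runs it.
DRY_RUN="${DRY_RUN:-1}"

run() {
    # Echo the command; execute it only when DRY_RUN=0.
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

# rebuild_without_openib SRC_DIR PREFIX
#   SRC_DIR: the Open MPI build directory (where ./configure lives)
#   PREFIX:  where to install; both values are placeholders you supply
# The body is a subshell, so the caller's working directory is unchanged.
rebuild_without_openib() (
    cd "$1" || exit 1
    run make distclean || true   # fails harmlessly if the tree was never configured
    run ./configure --prefix="$2" --without-openib || exit 1
    run make || exit 1
    run make install || exit 1
)
```

For example, `rebuild_without_openib ~/src/openmpi "$HOME/openmpi-noib"` prints the four commands, and prefixing the call with `DRY_RUN=0` would run them. Both paths here are made-up examples.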
>
> Alternatively, you can continue to use what you already have by adding the
> "-mca btl ^openib" flag to mpirun.
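Typing the flag on every run gets tedious. As far as I know, Open MPI also reads MCA parameters from the environment (variables named `OMPI_MCA_<param>`) and from a per-user file, `$HOME/.openmpi/mca-params.conf`. The sketch below shows all three forms; the `disable_openib_for_user` helper is a made-up name for illustration, and only that function is actually defined here.

```shell
#!/bin/sh
# Three ways to exclude the openib BTL (the first two are shown as comments):
#
# 1) Per run, on the command line (quoting '^' is harmless and avoids
#    surprises in shells that treat it specially):
#      mpirun --mca btl '^openib' -n 8 hello_cxx
#
# 2) Per shell session, through the environment:
#      export OMPI_MCA_btl='^openib'
#
# 3) Per user, in an MCA parameter file, written by the helper below.

# disable_openib_for_user [FILE]
#   FILE defaults to the per-user parameter file.
disable_openib_for_user() {
    conf="${1:-$HOME/.openmpi/mca-params.conf}"
    mkdir -p "$(dirname "$conf")" || return 1
    # Leave the file alone if it already sets the btl parameter.
    if ! grep -q '^btl[[:space:]]*=' "$conf" 2>/dev/null; then
        echo 'btl = ^openib' >> "$conf"
    fi
}
```

Calling `disable_openib_for_user` with no argument would write to your real per-user file, so try it on a scratch path first.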
>
> If this is a cluster, of course you would benefit from Infiniband, which is a
> faster network than Ethernet or Gigabit Ethernet.
> In this case you need to ask for help from somebody who knows more about your
> cluster hardware, to restore the Infiniband to a sane and healthy state.
> Or, if there is no Infiniband hardware, or if it is broken, just reinstall
> OpenMPI following the little recipe above. You will be able to run your
> programs using Ethernet (I assume the cluster would have Ethernet).
> Not very fast, but it will work.
>
> My two cents,
> Gus Correa
>
>
> On Dec 4, 2010, at 4:47 AM, Daofeng Li wrote:
>
> > Hi Gus,
> >
> > Thank you for your response.
> > I think this is mostly about hardware, which I know little about :)
> > It may be that the machine I used doesn't have the card you mentioned, since when I run:
> > /usr/sbin/ibstat
> > ibwarn: [4260] umad_init: can't read ABI version from /sys/class/infiniband_mad/abi_version (No such file or directory): is ib_umad module loaded?
> > ibpanic: [4260] main: can't init UMAD library: (No such file or directory)
> >
> > but you really helped me as:
> >
> > $ mpirun -mca btl ^openib -n 8 hello_cxx
> > Hello, world! I am 6 of 8
> > Hello, world! I am 0 of 8
> > Hello, world! I am 4 of 8
> > Hello, world! I am 7 of 8
> > Hello, world! I am 5 of 8
> > Hello, world! I am 2 of 8
> > Hello, world! I am 1 of 8
> > Hello, world! I am 3 of 8
> >
> > that's really cool~
> >
> > thank you all:)
> >
> > Best Wishes.
> > On Sat, Dec 4, 2010 at 11:12 AM, Gus Correa <gus_at_[hidden]> wrote:
> > Hi Daofeng
> >
> > Do you have an Infiniband card in the machine where you are
> > running the program?
> > (Open Fabrics / OFED is the software support for Infiniband.
> > I guess you need the same version installed in all machines.)
> >
> > Does the directory referred to in the error message actually
> > exist on your machine (i.e., /dev/infiniband)?
> >
> > Are you running it on the same machine where you installed OpenMPI?
> >
> > What output do you get from:
> > /usr/sbin/ibstat
> > ?
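The two hardware questions above (does /dev/infiniband exist, and does the system see an adapter) can be checked together. The sketch below assumes the usual Linux layout, where a loaded driver lists each adapter under /sys/class/infiniband and creates device nodes under /dev/infiniband. The `has_infiniband` function name is made up for illustration.

```shell
#!/bin/sh
# has_infiniband [SYSFS_DIR] [DEV_DIR]
# Succeeds only if SYSFS_DIR exists and is non-empty AND DEV_DIR exists.
# The defaults are the conventional Linux locations (an assumption about
# your system, not something Open MPI guarantees).
has_infiniband() {
    sysfs_dir="${1:-/sys/class/infiniband}"
    dev_dir="${2:-/dev/infiniband}"
    [ -d "$sysfs_dir" ] &&
        [ -n "$(ls -A "$sysfs_dir" 2>/dev/null)" ] &&
        [ -d "$dev_dir" ]
}

if has_infiniband; then
    echo "Infiniband adapter and device nodes found"
else
    echo "No usable Infiniband found; consider: mpirun -mca btl ^openib ..."
fi
```

A negative result here is consistent with the "can't read ABI version" and "unable to open /dev/infiniband/rdma_cm" messages, but it does not say whether the card is absent or only the kernel modules are missing.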
> >
> > Did you compile the programs with the mpicc, mpiCC, mpif77
> > from the same OpenMPI that you built?
> > (Some Linux distributions and compilers come with
> > their own flavors of MPI, or you may also
> > have installed MPICH or MVAPICH, so it is not uncommon to mix them up.)
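One quick way to spot such a mix-up is to confirm that the wrapper compilers and mpirun found in PATH live in the same directory. The sketch below does only that; `same_install_dir` is a hypothetical helper, and it does not follow symlinks or inspect libraries.

```shell
#!/bin/sh
# same_install_dir TOOL...
# Prints the common directory if every TOOL resolves to the same directory
# in PATH. Returns 1 on a mismatch and 2 if a tool is not found.
same_install_dir() {
    first_dir=""
    for tool in "$@"; do
        path=$(command -v "$tool") || {
            echo "$tool: not found in PATH" >&2
            return 2
        }
        dir=$(dirname "$path")
        if [ -z "$first_dir" ]; then
            first_dir="$dir"
        elif [ "$dir" != "$first_dir" ]; then
            echo "$tool is in $dir, expected $first_dir" >&2
            return 1
        fi
    done
    echo "$first_dir"
}

# Typical use (not run here):
#   same_install_dir mpirun mpicc mpiCC mpif77
```

If the directories match, `mpicc --showme` (which prints the underlying compiler command line) and `ompi_info` can confirm which Open MPI build is actually in use.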
> >
> > Have you tried to suppress the use of Infiniband, i.e.:
> >
> > mpirun -mca btl ^openib -n 8 hello_cxx
> >
> > (Well, "openib" is the OpenMPI support for Infiniband.
> > The "^" means "don't use it")
> >
> > I hope this helps,
> > Gus Correa
> >
> > Daofeng Li wrote:
> > Dear Jeff,
> > Actually, I did not understand this. Can you or anyone tell me what to do?
> > Thx.
> > Best.
> >
> > On Fri, Dec 3, 2010 at 9:41 PM, Jeff Squyres (jsquyres) <jsquyres_at_[hidden]> wrote:
> >
> > It means that you probably have a version mismatch with your
> > OpenFabrics drivers and/or you have no OpenFabrics hardware and you
> > should probably disable those drivers.
> > Sent from my PDA. No type good.
> > On Dec 3, 2010, at 4:56 AM, "Daofeng Li" <lidaof_at_[hidden]> wrote:
> >
> > Dear list,
> > I am currently trying to use the OpenMPI package.
> > I installed it in my home directory:
> > ./configure --prefix=$HOME --enable-mpi-threads
> > make
> > make install
> > and then I added ~/bin to PATH and ~/lib to
> > LD_LIBRARY_PATH in my .bashrc file.
> > Everything seems normal, as I can run the example programs:
> > mpirun -n 8 hello_cxx
> > mpirun -n 8 hello_f77
> > mpirun -n 8 hello_c
> > etc...
> > but error messages appear:
> > $ mpirun -n 8 hello_cxx
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > --------------------------------------------------------------------------
> > [[32727,1],1]: A high-performance Open MPI point-to-point messaging module
> > was unable to find any relevant network interfaces:
> > Module: OpenFabrics (openib)
> > Host: localhost.localdomain
> > Another transport will be used instead, although this may result in
> > lower performance.
> > --------------------------------------------------------------------------
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > CMA: unable to open /dev/infiniband/rdma_cm
> > librdmacm: couldn't read ABI version.
> > librdmacm: assuming: 4
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > libibverbs: Fatal: couldn't read uverbs ABI version.
> > Hello, world! I am 1 of 8
> > Hello, world! I am 0 of 8
> > Hello, world! I am 3 of 8
> > Hello, world! I am 5 of 8
> > Hello, world! I am 7 of 8
> > Hello, world! I am 4 of 8
> > Hello, world! I am 6 of 8
> > Hello, world! I am 2 of 8
> > [localhost.localdomain:30503] 7 more processes have sent help message help-mpi-btl-base.txt / btl:no-nics
> > [localhost.localdomain:30503] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages
> > I am wondering whether I installed OpenMPI the right way.
> > Could anyone give some suggestions?
> > thanks in advance.
> > Best Regards.
> > -- Daofeng Li
> > College of Biological Science
> > China Agricultural University
> > Beijing
> > China
> >
> > _______________________________________________
> > users mailing list
> > users_at_[hidden] <mailto:users_at_[hidden]>
> >
> > http://www.open-mpi.org/mailman/listinfo.cgi/users
> >
> >
> >
> >
> > --
> > Daofeng Li
> > College of Biological Science
> > China Agricultural University
> > Beijing
> > China
> >
> >
> > --
> > Daofeng Li
> > College of Biological Science
> > China Agricultural University
> > Beijing
> > China
>
>

-- 
Daofeng Li
College of Biological Science
China Agricultural University
Beijing
China