Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] Open MPI (not quite) on Cray XC30
From: Ralph Castain (rhc_at_[hidden])
Date: 2013-01-25 20:45:15


On Jan 25, 2013, at 5:12 PM, Paul Hargrove <phhargrove_at_[hidden]> wrote:

> Ralph,
>
> Those are the result of the missing -lnuma that Nathan identified earlier in BOTH 1.7 and trunk.
> I see MORE missing symbols, which include ones from libxpmem and libugni.

Alright, let me try to be clearer. We are missing -lnuma as well as the required include file, and both are needed to fix the issue.
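As a quick way to check which symbols are involved, the linker output quoted further down can be reduced to a unique list. This is only a minimal sketch: `undefined_symbols` is a hypothetical helper, not part of Open MPI, and it assumes ld quotes symbol names in the same style as that log.

```shell
#!/bin/sh
# Hypothetical helper (not part of Open MPI): print each symbol that ld
# reported as undefined, once. Assumes the quoting style used in the log,
# i.e. a backtick before the symbol name and a single quote after it.
undefined_symbols() {
    sed -n "s/.*undefined reference to \`\([^']*\)'.*/\1/p" | LC_ALL=C sort -u
}

# Example use (linker errors are written to stderr, hence 2>&1):
# mpicc -g hello_c.c -o hello_c 2>&1 | undefined_symbols
```

Run against the quoted log, this leaves get_mempolicy, mbind, migrate_pages and set_mempolicy. All four are declared in numaif.h and provided by libnuma, which is consistent with the missing -lnuma. Assuming libnuma is installed, appending -lnuma to the mpicc command line is one way to test that by hand.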

I find the xpmem and ugni libraries *are* correctly included in both 1.7 and trunk. It could be that the configury is not finding them on your system, but we are finding them *and* correctly including them on the XE6.

HTH
Ralph

>
> -Paul
>
>
> On Fri, Jan 25, 2013 at 4:59 PM, Ralph Castain <rhc_at_[hidden]> wrote:
>
> On Jan 25, 2013, at 4:53 PM, Ralph Castain <rhc_at_[hidden]> wrote:
> > The repeated libs are something we obviously should fix, but all the libs are there - including lustre. I guess those were dropped due to the shared lib setting, so we probably should fix that in the platform file.
> >
> > Perhaps that is the cause of Nathan's issue? shrug...regardless, apps build and run just fine using mpicc for me.
>
> Correction - turns out I misspoke. I find apps *don't* build correctly with this setup:
>
> mpicc -g hello_c.c -o hello_c
> /usr/aprojects/hpctools/rhc/build/lib/libopen-pal.a(topology-linux.o): In function `hwloc_linux_set_area_membind':
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1116: undefined reference to `mbind'
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1135: undefined reference to `mbind'
> /usr/aprojects/hpctools/rhc/build/lib/libopen-pal.a(topology-linux.o): In function `hwloc_linux_get_area_membind':
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1337: undefined reference to `get_mempolicy'
> /usr/aprojects/hpctools/rhc/build/lib/libopen-pal.a(topology-linux.o): In function `hwloc_linux_find_kernel_max_numnodes':
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1239: undefined reference to `get_mempolicy'
> /usr/aprojects/hpctools/rhc/build/lib/libopen-pal.a(topology-linux.o): In function `hwloc_linux_set_thisthread_membind':
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1183: undefined reference to `set_mempolicy'
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1194: undefined reference to `migrate_pages'
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1206: undefined reference to `set_mempolicy'
> /usr/aprojects/hpctools/rhc/build/lib/libopen-pal.a(topology-linux.o): In function `hwloc_linux_get_thisthread_membind':
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1284: undefined reference to `get_mempolicy'
> /usr/aprojects/hpctools/rhc/build/lib/libopen-pal.a(topology-linux.o): In function `hwloc_linux_find_kernel_max_numnodes':
> /lscratch1/rcastain/openmpi-1.9a1/opal/mca/hwloc/hwloc151/hwloc/src/topology-linux.c:1239: undefined reference to `get_mempolicy'
> collect2: ld returned 1 exit status
> make: *** [hello_c] Error 1
>
> So it looks like hwloc is borked when built static.
>
> Sigh
> Ralph
>
>
> _______________________________________________
> devel mailing list
> devel_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
>
>
>
> --
> Paul H. Hargrove PHHargrove_at_[hidden]
> Future Technologies Group
> Computer and Data Sciences Department Tel: +1-510-495-2352
> Lawrence Berkeley National Laboratory Fax: +1-510-486-6900