Open MPI Development Mailing List Archives

Subject: Re: [OMPI devel] r27078 and OMPI build
From: Eugene Loh (eugene.loh_at_[hidden])
Date: 2012-08-23 18:09:40


Thanks for the suggestion, but it didn't seem to help. The build still
fails on the same problem.

On 8/23/2012 2:14 PM, Shamis, Pavel wrote:
> Evgeny,
> I'm wondering if the issue is somehow related to the fact that these functions are inline. Can you please try the attached patch and see what happens?
>
> On Aug 23, 2012, at 12:59 PM, Eugene Loh wrote:
>
> On 8/23/2012 8:58 AM, Shamis, Pavel wrote:
> > Did you have a chance to make progress on issue #2? I'm wondering how we want to proceed from here.
> First of all, thanks for putting back the fixes for issue #1. That
> build is now successful.
>
> Issue #2? No. I don't know what to look at even if I had time to spend
> on this. It appears that mca/coll/ml is being pulled into libmpi. I
> tried comparing this component to others that aren't pulled in (e.g.,
> mca/coll/sm) or builds on this system (happens to be SPARC/Solaris, but
> I don't know what the key distinction is) versus other systems where
> mca/coll/ml is not pulled in. Nothing jumped out at me. So, I'm stuck
> (lack of ideas and lack of time).
>
> What would make an MCA component get pulled into libmpi? Again, many
> other components are not getting pulled in and this problem appears only
> on one system.
> On Aug 21, 2012, at 2:19 PM, Eugene Loh wrote:
>
> On 8/21/2012 9:31 AM, Ralph Castain wrote:
> The library issue sounds like something isn't right in the Makefile.am
> - perhaps the syntax has a typo there as well?
>
> I don't know. This is the part where I could use help. I took a quick
> peek at some Makefile.am files. I can't see what the essential
> difference is between, say, coll/ml/Makefile.am and, say,
> coll/sm/Makefile.am (which behaves all right). Nor do I see why there
> would be a difference in coll/ml between one system (happens to be
> SPARC, though I don't know that's significant) and another.
>
>
> On Aug 21, 2012, at 11:36 AM, Eugene Loh wrote:
>
> r27078 (ML collective component) broke some Solaris OMPI builds.
>
> 2) http://www.open-mpi.org/mtt/index.php?do_redir=2089 Somehow,
> coll_ml is getting pulled into libmpi.so. E.g., this doesn't look right:
>
> % nm ompi/.libs/libmpi.so | grep mca_coll_ml
> [13161] | 2556704| 172|FUNC |LOCL |0 |11 |mca_coll_ml_alloc_op_prog_single_frag_dag
> [13171] | 2555488| 344|FUNC |LOCL |0 |11 |mca_coll_ml_buffer_recycling
> [13173] | 2555392| 92|FUNC |LOCL |0 |11 |mca_coll_ml_err
> [23992] | 0| 0|FUNC |GLOB |0 |UNDEF |mca_coll_ml_memsync_intra
>
> The UNDEF is causing a problem, but I'm guessing all that mca_coll_ml_
> stuff shouldn't be in there at all in the first place. This is on one
> Solaris system, while another doesn't see the problem and builds fine.
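
For anyone repeating the check above, here is a minimal sketch of a filter that keeps only the undefined symbols with a given prefix. It assumes the pipe-delimited Solaris nm layout shown in the quoted output, with the section index in the seventh field and the symbol name in the eighth. The function name undefined_syms is made up for illustration, and the awk -v option assumes a POSIX awk (on older Solaris systems nawk or /usr/xpg4/bin/awk may be needed).

```shell
# Hypothetical helper: read Solaris-style nm output on stdin and print
# the names of UNDEF symbols that start with the given prefix.
undefined_syms() {
    prefix=$1
    awk -F'|' -v prefix="$prefix" '
        NF >= 8 {
            shndx = $7                  # section index column (e.g. 11 or UNDEF)
            name  = $8                  # symbol name column
            gsub(/[ \t]/, "", shndx)    # strip column padding
            gsub(/[ \t]/, "", name)
            if (shndx == "UNDEF" && index(name, prefix) == 1)
                print name
        }'
}
```

It would be used as "nm ompi/.libs/libmpi.so | undefined_syms mca_coll_ml_". This only narrows down which symbols are unresolved; it does not explain why the component's objects end up in libmpi.so in the first place.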