On Fri, Oct 21, 2005 at 06:26:32PM -0500, Troy Benjegerdes wrote:
> On Fri, Oct 21, 2005 at 04:12:05PM -0500, Andrew Friedley wrote:
> > I just committed a fix to the trunk to fix your original segfault down
> > in opal_show_help() - this is the same problem Ken posted. This fix
> > should make it into the v1.0 branch eventually. Even so, you are going
> > to run into the real problem you were handling - this fix is just for
> > proper error handling/output.
> > The error below looks like a word size mismatch - one thing is compiled
> > 64bit, the other is compiled 32bit. Make sure everything is compiled
> > either 32bit or 64bit.
> Another note.. I think I may have had some problems because I built with
> 'make -j16'.. has anyone else tried parallel make builds?
> I have a working mpirun now.
> Now I'm back to having NetPIPE segfault when I run it..
And here's a backtrace:
0x00002aaaab6fb365 in malloc_usable_size () from /lib/libc.so.6
#0 0x00002aaaab6fb365 in malloc_usable_size () from /lib/libc.so.6
#1 0x00002aaaaaecb016 in opal_mem_free_free_hook ()
#2 0x00002aaaaac0c663 in ompi_convertor_cleanup ()
#3 0x00002aaaaeb41dbe in mca_pml_ob1_match_completion_cache ()
#4 0x00002aaaaf179c7b in mca_btl_sm_component_progress ()
#5 0x00002aaaaee5eefe in mca_bml_r2_progress ()
#6 0x00002aaaaeb3dd4e in mca_pml_ob1_progress ()
#7 0x00002aaaaaeb5c4a in opal_progress () from
#8 0x00002aaaaeb3c265 in mca_pml_ob1_recv ()
#9 0x00002aaaaf6a0936 in mca_coll_basic_barrier_intra_lin ()
#10 0x00002aaaaac1f3b8 in PMPI_Barrier () from
---Type <return> to continue, or q <return> to quit---#11
0x0000000000403016 inSync ()
#12 0x0000000000401ef8 in main ()