Open MPI User's Mailing List Archives

Subject: Re: [OMPI users] 64-bit version of openmpi-1.6.5a1r28554 hangs
From: Ralph Castain (rhc_at_[hidden])
Date: 2013-05-24 10:12:46


Could you please clarify: are you mixing 32-bit and 64-bit versions in the runs that have a problem? If so, I doubt that would work, so please check.

If that isn't the case, then the error message is telling you that the system thinks you are mixing optimized and debug versions - i.e., one node is using an optimized version of OMPI and another is using a debug version. This also isn't allowed.

If you check and find that neither condition applies, then I suspect you are hitting the Solaris "bit rot" problem that we've talked about before, which we are unlikely to be able to fix any time soon.

On May 24, 2013, at 12:02 AM, Siegmar Gross <Siegmar.Gross_at_[hidden]> wrote:

> Hi
>
> I installed openmpi-1.6.5a1r28554 on "openSuSE Linux 12.1", "Solaris 10
> x86_64", and "Solaris 10 sparc" with gcc-4.8.0 and "Sun C 5.12" in 32-
> and 64-bit versions. Unfortunately, I have a problem with the 64-bit
> version if I build Open MPI with gcc. The program hangs and I have
> to terminate it with <Ctrl-c>.
>
>
> sunpc1 hello_1 144 mpiexec -mca btl ^udapl -np 4 \
> -host sunpc1,linpc1,rs0 hello_1_mpi
> [sunpc1:15576] [[16182,0],0] ORTE_ERROR_LOG: Buffer type (described vs
> non-described) mismatch - operation not allowed in file
> ../../../../../openmpi-1.6.5a1r28554/orte/mca/grpcomm/bad/grpcomm_bad_module.c
> at line 841
> ^Cmpiexec: killing job...
>
> sunpc1 hello_1 145 which mpiexec
> /usr/local/openmpi-1.6.5_64_gcc/bin/mpiexec
> sunpc1 hello_1 146
>
>
> I have no problems with the 64-bit version if I compile Open MPI
> with Sun C. Both 32-bit versions (compiled with "cc" or "gcc") work
> as expected as well.
>
> sunpc1 hello_1 106 mpiexec -mca btl ^udapl -np 4 \
> -host sunpc1,linpc1,rs0 hello_1_mpi
> Process 2 of 4 running on rs0.informatik.hs-fulda.de
> Process 0 of 4 running on sunpc1
> Process 3 of 4 running on sunpc1
> Process 1 of 4 running on linpc1
> Now 3 slave tasks are sending greetings.
> Greetings from task 3:
> message type: 3
> msg length: 116 characters
> message:
> hostname: sunpc1
> operating system: SunOS
> release: 5.10
> processor: i86pc
> ...
>
> sunpc1 hello_1 107 which mpiexec
> /usr/local/openmpi-1.6.5_64_cc/bin/mpiexec
>
>
>
> sunpc1 hello_1 106 mpiexec -mca btl ^udapl -np 4 \
> -host sunpc1,linpc1,rs0 hello_1_mpi
> Process 2 of 4 running on rs0.informatik.hs-fulda.de
> Process 3 of 4 running on sunpc1
> Process 0 of 4 running on sunpc1
> Process 1 of 4 running on linpc1
> ...
>
> sunpc1 hello_1 107 which mpiexec
> /usr/local/openmpi-1.6.5_32_gcc/bin/mpiexec
>
>
> I would be grateful if somebody could fix the problem for the
> 64-bit version with gcc. Thank you very much in advance for any
> help.
>
>
> Kind regards
>
> Siegmar
>