Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] SM failure with mixed 32/64-bit procs on the samemachine
From: Katz, Jacob (jacob.katz_at_[hidden])
Date: 2010-06-04 14:18:05


This would be a quite serious limitation from my point of view. I'm a library developer, and my library is used in heterogeneous environment. Since 32b executables regularly work on 64b machines, they get intermixed by the users with 64b executables on the same machine. Switching to another BTL would incur serious performance issues...

I noticed an SM bug report that looks similar to mine and was reportedly fixed in 1.4.2. I'm going to check that version. If it still fails, what would be the effort to fix this?

--------------------------------
Jacob M. Katz | jacob.katz_at_[hidden] | Work: +972-4-865-5726 | iNet: (8)-465-5726

-----Original Message-----
From: users-bounces_at_[hidden] [mailto:users-bounces_at_[hidden]] On Behalf Of Jeff Squyres
Sent: Friday, June 04, 2010 17:26
To: Open MPI Users
Subject: Re: [OMPI users] SM failure with mixed 32/64-bit procs on the samemachine

I doubt that we have tested this kind of scenario much (specifically with shared memory). I guess I'm not too surprised that it doesn't work -- to my knowledge, you're the first person to ask for heterogeneous *on the same server*. As such, I don't know if we'll do much work to support it (there could be some gnarly issues with address ranges inside shared memory).

But your point is noted that we should not hang/crash in such a scenario. I'll file a bug to at least detect this scenario and indicate that we do not support it.

On Jun 3, 2010, at 10:29 AM, Katz, Jacob wrote:

> Hi,
> I have two processes, one a 32bit and another a 64bit, running on the same 64bit machine. When running with TCP BTL everything works fine, however with SM BTL it's not.
> In one application the processes just got stuck - one in Send and the other in Recv. In another application I even saw a segfault inside the MPI libraries in one of the processes.
>
> Is such scenario officially supported by SM BTL?
>
> Open MPI: 1.3.3
> Heterogeneous support: yes
>
> Thanks.
> --------------------------------
> Jacob M. Katz | jacob.katz_at_[hidden] | Work: +972-4-865-5726 | iNet: (8)-465-5726
>
> ---------------------------------------------------------------------
> Intel Israel (74) Limited
>
> This e-mail and any attachments may contain confidential material for
> the sole use of the intended recipient(s). Any review or distribution
> by others is strictly prohibited. If you are not the intended
> recipient, please contact the sender and delete all copies.
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users

-- 
Jeff Squyres
jsquyres_at_[hidden]
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/
_______________________________________________
users mailing list
users_at_[hidden]
http://www.open-mpi.org/mailman/listinfo.cgi/users
---------------------------------------------------------------------
Intel Israel (74) Limited
This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.