Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] OpenIB error messages: reporting the default or telling you what's happening?
From: Jeff Squyres (jsquyres_at_[hidden])
Date: 2011-09-12 12:13:20


FWIW, the default for the ib_timeout is 20 in both v1.4.x and v1.5.x.

As Ralph said, ompi_info will show the current value -- not the default value. Of course, the current value will be the default value, unless it has been overridden. In OMPI v1.5, ompi_info should indicate where the value came from:

-----
% ompi_info --param btl openib | grep ib_timeout
                 MCA btl: parameter "btl_openib_ib_timeout" (current value: <20>, data source: default value)
[snip]
% setenv OMPI_MCA_btl_openib_ib_timeout 13
% ompi_info --param btl openib | grep ib_timeout
                 MCA btl: parameter "btl_openib_ib_timeout" (current value: <13>, data source: environment or cmdline)
-----

On Sep 12, 2011, at 10:25 AM, Shamis, Pavel wrote:

>>
>> * btl_openib_ib_retry_count - The number of times the sender will
>> attempt to retry (defaulted to 7, the maximum value).
>> * btl_openib_ib_timeout - The local ACK timeout parameter (defaulted
>> to 10). The actual timeout value used is calculated as:
>>
>
> Actually I'm surprised that default value is 10. I think it used to be 20. Anyway I would recommend to bump it up.
>
> Pasha
>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users

-- 
Jeff Squyres
jsquyres_at_[hidden]
For corporate legal information go to:
http://www.cisco.com/web/about/doing_business/legal/cri/