On 17 December 2010 14:45, Gilbert Grosdidier
<Gilbert.Grosdidier_at_[hidden]> wrote:
> Bonjour,
> About this issue, for which I got NO feedback ;-)
Gilbert, as you have an SGI cluster, have you filed a support request to SGI?
Also, which firmware do you have installed?
I have Firmware version: 2.5.0
http://www.openfabrics.org/downloads/OFED/ofed-1.4/OFED-1.4-docs/mlx4_release_notes.txt
Features that are enabled with FW 2.5.0 only:
- Send with invalidate and Local invalidate send queue work requests.
- Resize CQ support.
I recently spotted
> into btl_openib.c code, that this error message could come from
> some missing ConnectX HCA ibv_resize_cq function. Well ...
> I was unable yet to figure out why/how this could occur, but I have
> a now a closely related question about ConnectX Infiniband HCA :
> does anybody know which other unimplemented IB functionalities
> could be lacking for this ConnectX HCA ?
> This could allow me to patch appropriately by hand the OpenMPI code,
> since I currently believe these functionalities are going
> undetected as missing by the configure step.
> Thanks, Best, G.
>
> Le 15 déc. 10 à 08:59, Gilbert Grosdidier a écrit :
>
> Bonjour,
>
> Running with OpenMPI 1.4.3 on an SGI Altix cluster with 2048 cores, I got
> this error message on all cores, right at startup :
>
> btl_openib.c:211:adjust_cq] cannot resize completion queue, error: 12
>
> What could be the culprit please ?
> Is there a workaround ?
> What parameter is to be tuned ?
>
> Thanks in advance for any help, Best, G.
>
>
>
>
>
>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
|