Open MPI logo

Open MPI User's Mailing List Archives

  |   Home   |   Support   |   FAQ   |   all Open MPI User's mailing list

Subject: Re: [OMPI users] Infiniband Error
From: Yevgeny Kliteynik (kliteyn_at_[hidden])
Date: 2011-09-12 09:01:43


This means that you have some problem on that node,
and it's probably unrelated to Open MPI.
Bad cable? Bad port? FW/driver in some bad state?
Do other IB performance tests work OK on this node?
Try rebooting the node.

-- YK

On 12-Sep-11 7:52 AM, Ahsan Ali wrote:
> Hello all
>
> I am getting following error during an application run which causes it to crash.
>
> *[[36944,1],41][btl_openib_component.c:3227:handle_wc] from compute-01-19.private.dns.zone to: compute-01-04 error polling LP CQ with status RETRY EXCEEDED ERROR status number 12 for wr_id 167703304 opcode 128 vendor error 129 qp_idx 3*
>
> I removed that particular node and then the error was removed.Please suggest me what could be the solution to this. Thanking you in advance.
>
> --
> Syed Ahsan Ali Bokhari
> Electronic Engineer (EE)
>
> Research & Development Division
> Pakistan Meteorological Department H-8/4, Islamabad.
> Phone # off +92518358714
> Cell # +923155145014
>
>
>
> _______________________________________________
> users mailing list
> users_at_[hidden]
> http://www.open-mpi.org/mailman/listinfo.cgi/users