Dear list,
One of our users faces problems running his application (large CP2K cases)
Cluster:
OpenMPI 1.4.2, SLES 9, gcc 4.1.2, OFED 1.4 on Intel Nehalem (5350)
The message is:
[[45776,1],214][btl_openib_component.c:2951:handle_wc] from node140 to:
node400 error polling LP CQ with status LOCAL QP OPERATION ERROR status
number 2 for wr_id 250502144 opcode 1 vendor error 103 qp_idx 0
OpenMPI has been compiled using the following flags:
./configure --prefix=/som/prefix/dir --enable-branch-probabilities
--enable-mem-debug --enable-mem-profile --enable-picky --enable-peruse
--enable-per-user-config-files --enable-cxx-exceptions
--enable-mpi-threads --enable-openib-ibcm --enable-openib-rdmacm --with-sge
Any idea why and/or if something is wrong in the configuration ? Any fix ?
Thanks in advance
Best regards
Vince
--
---------------------------------------------------
Dr. Vincent KELLER
Universität Zürich
http://www.hpcn.uzh.ch
ADDRESS: Winterthurstrasse 190
CH - 8057 Zürich
Switzerland
PHONE : + 41 (0) 44/635'40'37
FAX : + 41 (0) 44/635'45'05
|