librdmacm - Unable to get RDMA device list

ffan
Posts: 66
Joined: Thu Jul 24, 2014 3:23 pm

Post by ffan »

Hi,

I compiled CS 4.0.4 on a cluster machine. The MPI library on the system is MVAPICH2. The build and installation went fine. When I ran a case, it printed the following (please also see the attachment):
librdmacm: Warning: couldn't read ABI version.
librdmacm: Warning: assuming: 4
librdmacm: Fatal: unable to get RDMA device list

Although the last message says "Fatal: unable to get RDMA device list", the run continued to completion. What do these librdmacm messages mean, and how can I fix them? Thank you.

- ffan
Attachments
librdmacm - Unable to get RDMA device list.png
Yvan Fournier
Posts: 4075
Joined: Mon Feb 20, 2012 3:25 pm

Re: librdmacm - Unable to get RDMA device list

Post by Yvan Fournier »

Hello,

This seems to be a driver issue. If the computation runs normally, it is probably just a warning, though you might not be getting the maximum performance in some cases. In Code_Saturne, we mostly use MPI point-to-point and collective operations rather than one-sided operations, so RDMA issues are probably not a problem. That said, I am not knowledgeable enough about the low-level intricacies of the various MPI libraries, so this is best checked with your system administrators or application support.
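
Before contacting the administrators, it may help to record what the compute node actually exposes. The following is only a rough sketch, not an official diagnostic. It assumes a Linux node where the RDMA drivers publish their state under /sys: the two ABI-version paths and the class/infiniband device directory are assumptions that can differ between kernel and driver versions, and rdma_report is a hypothetical helper name.

```python
import os

# Assumed sysfs locations (relative to the sysfs root); they may differ
# between kernel and driver versions.
ABI_FILES = (
    "class/misc/rdma_cm/abi_version",
    "class/infiniband_ucma/abi_version",
)
DEVICE_DIR = "class/infiniband"


def rdma_report(sysfs_root="/sys"):
    """Return what RDMA information is visible under sysfs_root."""
    abi_version = None
    for rel in ABI_FILES:
        try:
            with open(os.path.join(sysfs_root, rel)) as f:
                abi_version = f.read().strip()
            break  # first readable file wins
        except OSError:
            continue  # missing or unreadable: try the next candidate

    try:
        devices = sorted(os.listdir(os.path.join(sysfs_root, DEVICE_DIR)))
    except OSError:
        devices = []  # directory absent: no RDMA devices registered

    return {"abi_version": abi_version, "devices": devices}


if __name__ == "__main__":
    report = rdma_report()
    print("RDMA CM ABI version:", report["abi_version"] or "not readable")
    print("RDMA devices:", ", ".join(report["devices"]) or "none found")
```

Running this on the node where the job executes, rather than on the login node, is more informative, since the two can have different hardware and drivers. An empty device list would be consistent with the "unable to get RDMA device list" message, but it does not by itself say whether the interconnect is missing or just not configured.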

Regards,

Yvan