1.libhns: Support reporting wc as software mode
2.libhns: return error when post send in reset state
3.libhns: separate the initialization steps of lock
4.libhns: assign doorbell to zero when allocate it
5.libhns: Fix missing reset notification
Signed-off-by: Juan Zhou <zhoujuan51@h-partners.com>
According to Annex17_RoCEv2 (A17.4.5.2), for RoCEv2 UD, a CQE should
carry a flag that indicates if the received frame is an IPv4, IPv6 or
RoCE packet. But currently, the values of the flag corresponding to
these packet types haven't been defined yet in WC.
In UCX, 'sl' in ibv_wc for UD is used as the packet type flag, and the
packet type values have already been defined in the UCX patch of
ed28845b88
Therefore, to adapt UCX, add a create flag to hnsdv_create_qp() to allow
users to choose whether they use 'sl' in ibv_wc as service level or
packet type for UD. For the latter, obtain and translate the packet type
from CQE and fill it to 'sl' in ibv_wc.
Singed-off-by: Juan Zhou <zhoujuan51@h-partners.com>
(cherry picked from commit e102d4c9aa2992c125b26ad5cc237ae002bc6541)
1.Fix the owner bit error of sq in new io
2.Fix incorrect post-send with direct wqe of
3.Add a judgment to the congestion control algorithm
Singed-off-by: Juan Zhou <zhoujuan51@h-partners.com>
(cherry picked from commit 092143ba858a7aba0630fadd416faa2a4e7eaf06)
1. The hns hardware logic requires wr->num_sge to be 1 when
performing atomic operations. The code does not judge this
condition, and the current patch adds this constraint.
2. In the sq inline scenario, when num_sge in post_send is not 1, sge
array appears in the for loop without rotation and directly copy
out of bounds.
3. Currently local invalidate operation don't work properly.
Disable it for the time being.
HIP08 and HIP09 hardware does not support this feature, so
delete the associated code.
Signed-off-by: Juan Zhou <zhoujuan51@h-partners.com>
Some Kunpeng SoCs do not support the DWQE through NEON
instructions. In this case, the IO path works normally,
but the performance will deteriorate.
For these SoCs that do not support NEON DWQE, they support
DWQE through SVE instructions. This patch supports SVE DWQE
to guarantee the performance of these SoCs. In addition, in
this scenario, DWQE only supports acceleration through SVE's
ldr and str instructions. Other load and store instructions
also cause performance degradation.
Signed-off-by: Juan Zhou <zhoujuan51@h-partners.com>
Added the use of direct verbs to implement QP-level
user-configurable congestion control algorithms. Among them,
the user mode driver mainly provides interfaces for users to
choose, and the kernel mode driver is responsible for filling
the resources of different algorithms and providing the
supported algorithm types for user mode.
At the same time, provide a direct verbs interface for users to
query the type of congestion control algorithm.
Signed-off-by: Juan Zhou <zhoujuan51@h-partners.com>
Add an interface to the user space, which is used to receive
the kernel reset state. After receiving the reset flag, the
user space stops sending db.
Signed-off-by: Yixing Liu <liuyixing1@huawei.com>
(cherry picked from commit 8b56ab8b70262183cac5995dcb395ec099cc89a4)
DCA(Dynamic context attachment) support many RC QPs to share the WQE
buffer in a memory pool, this help reducing the memory consumption
when there are many QPs are inactive.
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
(cherry picked from commit b88a370b79cd9b93f9a587c1e4e0d9c89d0b1d1e)
These patches support running the roce function in hns roh mode
Signed-off-by: Guofeng Yue <yueguofeng@hisilicon.com>
(cherry picked from commit 648d17f1efa1ca1c458618421ab01a0c4b8872af)
(cherry picked from commit 08d265affcb38d014d69436b5200692612baa394)
Backport patches from rdma-core 41.1.
And bugfix patches reported by #I5Q3S5 has also been included.
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
(cherry picked from commit 6f27f67e51b251ea5185fa61a635ab9fdb64d467)
Update rdma-core version from 35.1 to 41.0.
Version 41.0 is the latest version in community until
2022/7/27. It includes some new bugfixes and new features,
we choose this version to facilitate future development.
The patches added to this repo has already included in the
new version, so remove them.
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
HIP06 is no longer supported. So, backport these patches from
rdma-core v41 to remove support for HIP06.
Signed-off-by: zhengfeng luo <luozhengfeng@h-partners.com>
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
Add support for hns CQ_EX and all its dependent patches are also added.
Signed-off-by: zhengfeng luo <luozhengfeng@h-partners.com>
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
These patches are mainly related to IO operations.
Signed-off-by: zhengfeng luo <luozhengfeng@h-partners.com>
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
1.Adding logging API for rdma-core.
2.Adding hns cleanups in rdma-core v37.
Signed-off-by: zhengfeng luo <luozhengfeng@h-partners.com>
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
Bugfix for hns SRQ and SGE. And also added all related cleanups and
refactorings.
Signed-off-by: zhengfeng luo <luozhengfeng@h-partners.com>
Signed-off-by: Chengchang Tang <tangchengchang@huawei.com>
(cherry picked from commit 1725c906505414af1774a0e787b286c6ced8c027)
update rdma-core version from 28.1 to 35.0, version 35.0 is
the latest version in community until 2021/7/5. It includes
some bugfixes and new features, we choose this version to
facilitate future development.
Signed-off-by: Yangyang Li <liyangyang20@huawei.com>