ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Shuhei Matsumoto	0a61427ecc	nvme_rdma: Start qpair after resolving address and route when poll group is used Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0b0f314c98368247582f2dfcaf69f78e24d715f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11366 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	531c1b0f04	nvme_rdma: Make nvme_rdma_process_event() asynchronous Separate nvme_rdma_process_event() into nvme_rdma_process_event_start() and nvme_rdma_process_event_poll(). Use nvme_rdma_process_event_start() and nvme_rdma_process_event_poll() in nvme_rdma_process_event() to ensure compatibility. Change-Id: Idc960fab2540efec612dcf22f156acabd2e2874e Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10594 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	791ee7deb4	nvme_rdma: nvme_rdma_process_events() returns negated errno It will be convenient for the following patches to return negated errno directly. Change-Id: Ic80181b2ee449946dd60ad0c97a325fd48b92231 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10990 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	cf7f253302	nvme_rdma: Add callback to nvme_rdma_process_event() Change-Id: I66aa89dc54d5aaedbe2f06239cbf04aeeb2c739e Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11359 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	bcf0845727	nvme_rdma: Make CM event operations callback functions Change-Id: I9f2551a07187400dd9ef624348cd465e64557e1b Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11138 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	e5927c02e9	nvme_rdma: Remove cm_channel param from process_event() nvme_rdma_poll_events() gets the cm_channel pointer itself. Before calling nvme_rdma_process_event(), we check that the rctrlr is valid. Hence we do not have to pass the cm_channel pointer to nvme_rdma_process_event() via a parameter. This simplifies the code and makes the following patches a little easier. Change-Id: I03f095833469c5b64592264d63a592106d49e13b Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11167 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	29974dc882	nvme_rdma: Make fabric_qpair_connect() asynchronous Replace nvme_fabric_qpair_connect() by nvme_fabric_qpair_connect_async() and nvme_fabric_qpair_connect_poll(). The details are as follows. Define a state for nvme_rdma_qpair, held by each rqpair. Initialize rqpair->state to INVALID in nvme_rdma_ctrlr_create_qpair(). _nvme_rdma_ctrlr_connect_qpair() sets rqpair->state to FABRIC_CONNECT_SEND instead of calling nvme_fabric_qpair_connect(). Then the new function nvme_rdma_ctrlr_connect_qpair_poll() calls nvme_fabric_qpair_connect_async() at FABRIC_CONNECT_SEND and nvme_fabric_qpair_connect_poll() until it returns 0 at FABRIC_CONNECT_POLL. nvme_rdma_qpair_process_completions() or nvme_rdma_poll_group_process_completions() calls nvme_rdma_ctrlr_connect_qpair_poll() if qpair->state is CONNECTING. This pattern follows the TCP transport. Change-Id: I411f4fa8071cb5ea27581f3820eba9b02c731e4c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11334 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:45 +00:00
Alexey Marchuk	94494579ce	nvme_rdma: Update reporting of RDMA responder resources The responder_resources parameter of rdma cm tells the remote side how many outstanding RDMA_READ or atomic operations the local side can handle. Previously it was adjusted to the queue depth, but that was not correct since these parameters do not depend on each other. Even with qdepth=1 the remote side may send several RDMA_READ operations per 1 IO request. With this change we report responder_resources equal to the maximum supported by the RDMA device. The Linux kernel nvme rdma driver reports this value in the same way. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I77e5c2ead6269da44c32a75a9188429f50d32ae4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11698 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-25 08:18:37 +00:00
Shuhei Matsumoto	6a89f75ec7	nvme_rdma: Remove handling stale connect The feature will be redesigned and restored in the following patches. For the NVMe bdev module, it can reconnect by itself without relying on the feature. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2d9c0437f7ad8412ad8cf40d11e574723b735bee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	0c77cf90bf	nvme_rdma: Consolidate fail_qpair() calls into a single place For nvme_rdma_qpair_process_completions(), consolidate the operations to call nvme_rdma_fail_qpair() and return -ENXIO into a single place. Besides, shorten pointer references for nvme_rdma_qpair_process_completions() and nvme_rdma_poll_group_process_completions(). These will make the following patches a little easier. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iaf72cfca0b5b3ba223d86e267da8069d43a15292 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11439 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	cfe11bd1db	nvme: Factor out operations done after disconnect qpair completes This is a preparation to make nvme_transport_ctrlr_disconnect_qpair() asynchronous. For nvme_transport_ctrlr_disconnect_qpair(), factor out operations after returning from transport's specific ctrlr_disconnect_qpair() into a helper function nvme_transport_ctrlr_disconnect_qpair_done(). Then move nvme_transport_ctrlr_disconnect_qpair_done() into the end of the transport specific ctrlr_disconnect_qpair(). Additionally remove the operation to overwrite the qpair state to DISCONNECTED from nvme_transport_connect_qpair_fail() because this is duplicated and nvme_transport_ctrlr_disconnect_qpair() is responsible to make the qpair disconnected even after it completes asynchronously. Change-Id: I9c8faa7039d306d3e31a8f51826755ce8840a8aa Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10851 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	486f46e867	nvme_rdma: Call disconnected_qpair_cb when qpair is in disconnected_qpairs list We want to call disconnected_qpairs_cb only if qpair is actually disconnected. When we disconnect qpair asynchronously, for qpairs in the group->disconnected_qpairs list, we want to poll them until actually disconnected and then call disconnected_qpairs_cb for them. As a preparation, call disconnected_qpair_cb only for qpairs which are in the group->disconnected_qpairs list. For TCP and PCIe transports, disconnecting qpair will continue to be synchronous for now. So we change only RDMA transport. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ifaf6157e1e02fa13f52a66409c9e60fc814d71dd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11495 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-15 09:05:09 +00:00
Evgeniy Kochetov	5c80b1e5ab	nvme/rdma: Limit max_sges by command capsule size According to NVMe over Fabrics spec number of SGLs supported by the controller is reported in MSDBD. But it is also implicitly limited by command capsule size (IOCCSZ) since SGL are passed in capsule. This patch adjusts max_sges to capsule size if required. Adjustment to MSDBD is also moved to transport layer because it is fabrics specific parameter and is not valid for PCIe transport. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I44918eb949345c61242ca50a524d21d04b6ac058 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11669 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-02-25 08:18:32 +00:00
Shuhei Matsumoto	7594030409	nvme: Set dnr to zero for abort_reqs() including a fix of degradation The patch nvme: Set dnr to zero for nvme_qpair_abort_reqs() `1b3172f726` did the change stated in the title. However, Revert "nvme/rdma: Correct qpair disconnect process" `c8f986c7ee` destroyed it for RDMA transport. Additionally, we had still set DNR to 1 in nvme_qpair_init(). This patch fixes both. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iee60ac24aa7e04cce0f394014c9d9afc9d2b56ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11644 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-02-24 14:56:03 +00:00
Evgeniy Kochetov	834e3c5a0e	nvme: Fix submission queue overflow SPDK can submit more commands to remote NVMf target than allowed by negotiated queue size. SPDK submits up to SQSIZE commands, but only SQSIZE-1 are allowed. Here is a relevant quote from NVMe over Fabrics rev.1.1a ch.2.4.1 “Submission Queue Flow Control Negotiation”: If SQ flow control is disabled, then the host should limit the number of outstanding commands for a queue pair to be less than the size of the Submission Queue. If the controller detects that the number of outstanding commands for a queue pair is greater than or equal to the size of the Submission Queue, then the controller shall: a) stop processing commands and set the Controller Fatal Status (CSTS.CFS) bit to ‘1’ (refer to section 10.5 in the NVMe Base specification); and b) terminate the NVMe Transport connection and end the association between the host and the controller. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ifbcf5d51911fc4ddcea1f7cde3135571648606f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11413 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-02-10 15:22:08 +00:00
Evgeniy Kochetov	486426529d	nvme/rdma: Remove queue depth adjustment to crqsize According to NVMe over Fabrics specification (rev.1.1a) HSQSIZE sent in RDMA_CM_REQUEST private data (ch.7.3.6.4) shall be the same as SQSIZE later sent in Connect command (ch.3.3). SPDK NVMe RDMA initiator adjusts SQSIZE to CRQSIZE received from target in RDMA_CM_ACCEPT private data. Target is allowed to send CRQSIZE < HSQSIZE if RNR retries are used. So, it is possible that SQSIZE sent by SPDK will be lower than previously sent HSQSIZE. There are targets validating this match and they reject connection from SPDK. Linux kernel NVMe initiator doesn't perform such adjustments and connects well to such targets. This patch aligns SPDK behavior with specification and Linux kernel implementation. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I01968d1c07d284396fa5939932d85841351d7a45 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11350 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-02-10 15:22:08 +00:00
Jaylyn Ren	3e937f07eb	test/accel&rdma: Fix unittest_accel and unittest_nvme_rdma failure Valgrind reports errors about an uninitialised value created by a stack allocation when running unittest_accel and unittest_nvme_rdma. Change-Id: I4b48b472cc7c189cbcaf8ca772830a23118e7e17 Signed-off-by: Jaylyn Ren <jaylyn.ren@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10559 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-02-09 22:22:04 +00:00
Shuhei Matsumoto	fc48cf8681	nvme_rdma: Check only if Soft RoCE receive normal completion after disconnect We saw this unexpected behavior by the current SPDK master. Add the check to clarify this behavior occurs only when we use Soft RoCE. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3a5eaa9064a0601c65139e7868898545926d0dbf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11225 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-01-26 08:09:15 +00:00
Shuhei Matsumoto	c8f986c7ee	Revert "nvme/rdma: Correct qpair disconnect process" This reverts commit `eb09178a59`. Reason for revert: This caused a degradation for adminq. For adminq, ctrlr_delete_io_qpair() is not called until ctrlr is destructed. So necessary delete operations are not done for adminq. Reverting the patch is practical for now. Change-Id: Ib55ff81dfe97ee1e2c83876912e851c61f20e354 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10878 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-26 08:09:15 +00:00
Shuhei Matsumoto	194dc9e2f9	Revert "nvme_rdma: Continue even if we receive a normal WC when qpair is disconnected" This reverts commit `b9518a5540`. Reason for revert: Fix a degradation for adminq Change-Id: I0e2c5e48a5ca34171fa98fa68216da4354b5d262 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10879 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-26 08:09:15 +00:00
Shuhei Matsumoto	728e3721a4	nvme_rdma: Remove a guard for recursive calls from poll_group_disconnect_qpair() nvme_poll_group_disconnect_qpair() is called only by a single place now. We do not need the flag poll_group_disconnect_in_progress any more. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I8f9c0f14baa8fcb9b0637635a5bb3d34a8b11af5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	4c8ccb5403	nvme: Remove poll_group_disconnect_qpair() call from poll_group_remove() spdk_nvme_poll_group_remove() is available only for disconnected qpairs now. Hence spdk_nvme_poll_group_remove() does not have to check if qpair is connected and call nvme_ctrlr_disconnect_qpair(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3b05246c4be6adfa3392b8f0e5ecaf274a8a7795 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10846 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	1b3172f726	nvme: Set dnr to zero for nvme_qpair_abort_reqs() This is necessary to failover another path when multipath is configured. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0b6bcf63501e38f75efb4b0d6bec58abb4b67aef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10250 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	b9518a5540	nvme_rdma: Continue even if we receive a normal WC when qpair is disconnected We recently improved qpair disconnect process and added assert if we get a completion without any error when a qpair is disconnected. However unexpectedly we saw this case very often when we ran the test test/nvmf/host/multipath.sh for the real hardware in the test pool. So we remove the assert and change the ERRLOG to INFOLOG. Fixes one of the issues in #2300 Signed-off-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Change-Id: Iedbf7e0afa5025da6a810043ba95348ba5b856b3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10901 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-12-29 02:19:58 +00:00
Alexey Marchuk	eb09178a59	nvme/rdma: Correct qpair disconnect process In the current implementation the RDMA qpair is destroyed right after disconnect. That is not a graceful qpair shutdown process since there can be requests submitted to HW and we may receive completions for an already destroyed/freed qpair. To avoid this, only disconnect qpair in ctrlr_disconnect_qpair transport callback, all other resources will be released in ctrlr_delete_io_qpair cb. This patch is useful when nvme poll groups are used since in that case we use shared CQ, if the disconnected qpair has WRs submitted to HW then qpair's destruction will be deferred to poll group. When nvme poll groups are not used, this patch doesn't change anything, in that case destruction flow is still ungraceful. However since CQ is destroyed immediately after qpair, we shouldn't receive any requests which point to released resources. A correct solution for the non-poll group case requires an async disconnect API which may lead to significant rework. There is a bug when Soft RoCE is used - we may receive a completion with "normal" status when qpair is already disconnected and all nvme requests are aborted. Added a workaround for it. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0680d9ef9aaa8737d7a6d1454cd70a384bb8efac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10327 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-23 08:44:40 +00:00
Josh Soref	cc6920a476	spelling: lib Part of #2256 * accessible * activation * additional * allocate * association * attempt * barrier * broadcast * buffer * calculate * cases * channel * children * command * completion * connect * copied * currently * descriptor * destroy * detachment * doesn't * enqueueing * exceeds * execution * extended * fallback * finalize * first * handling * hugepages * ignored * implementation * in_capsule * initialization * initialized * initializing * initiator * negotiated * notification * occurred * original * outstanding * partially * partition * processing * receive * received * receiving * redirected * regions * request * requested * response * retrieved * running * satisfied * should * snapshot * status * succeeds * successfully * supplied * those * transferred * translate * triggering * unregister * unsupported * urlsafe * virtqueue * volumes * workaround * zeroed Change-Id: I569218754bd9d332ba517d4a61ad23d29eedfd0c Signed-off-by: Josh Soref <jsoref@gmail.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10405 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-03 08:12:55 +00:00
Alexey Marchuk	2db77dc9c7	nvme: Explicitly disconnect qpair before destroy spdk_nvme_ctrlr_free_io_qpair can be called when qpair is already disconnected. In that case qpair's state is changed to NVME_QPAIR_DESTROYING and transport's ctrlr_delete_io_qpair callback is called. RDMA and TCP transports call nvme_transport_ctrlr_disconnect_qpair in the callback and since qpair's state is not DISCONNECTED or DISCONNECTING, qpair is disconnected for the second time. If spdk_nvme_ctrlr_free_io_qpair is called when qpair is in ENABLED state then nothing changes, qpair will be disconnected before destroy. PCIE/vfio_user don't implement transport disconnect callback, so they are not affected. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I23e11856ecafb51669acf4a3118be049c11eecda Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10326 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-23 09:01:05 +00:00
Alexey Marchuk	64fa301f67	rdma: Update for memory map Add a parameter which determines the owner of the map - target or initiator. This allows setting different access flags when creating Memory Regions. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0016847fe116e193d0954db1c8e65066b4ff82bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10283 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-11-19 08:29:59 +00:00
Alexey Marchuk	2696886c75	dma: Update translation result to hold iovec pointer In some cases a single virtually contiguous memory buffer can be translated to several chunks of memory. To make such translation possible, update structure spdk_memory_domain_translation_result to use a pointer to iovec. Add a single iov structure for cases where translation is always 1:1; this makes the translation callback easier to implement. For RDMA transport translation of address is always 1:1, so treat iovcnt other than 1 as an error. Change-Id: I65605575d43a490490eba72c1eb19f3a09d55ec6 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9779 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-20 22:55:52 +00:00
Alexey Marchuk	549bcdc0a4	dma: Update memory domain context structure Instead of a union with domain type specific parameters, store an opaque pointer to user context. Depending on the memory domain type, this context can be cast to a specific struct, e.g. to spdk_memory_domain_rdma_ctx for RDMA memory domains. This change provides more flexibility to applications to create and manage custom memory domains Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Change-Id: Ib0a8297de80773d86edc9849beb4cbc693ef5414 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9778 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-10-20 22:55:52 +00:00
Alexey Marchuk	9381d8d399	nvme: Update spdk_nvme_ctrlr_get_memory_domain Allow to return more than one memory domain. This change aligns bdev and nvme API and provides more flexibility for custom transports. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ica9b12ad8463c361be6cb62ee2c0513eec0b486d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9546 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-24 07:37:45 +00:00
Alexey Marchuk	abc45c4642	nvme/rdma: Don't log error for WC Flush Error This type of error is not fatal and can be observed when qpairs are disconnected. The same approach is used on the target side. Change-Id: Ic3c7b1731c0cbd2e98d776f0f0c5d82464b3d556 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9416 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-10 16:00:33 +00:00
Alexey Marchuk	ca5ce67f6e	nvme/rdma: Ignore completion when we can't find qpair When poll_group is used, several qpairs share the same CQ and it is possible to receive a completion with error (e.g. IBV_WC_WR_FLUSH_ERR) for an already disconnected qpair. That happens because the qpair is destroyed while there are submitted but not completed send/receive Work Requests. To avoid this situation, we should not destroy the ibv qpair until we reap completions for all submitted send/receive work requests. That requires some rework in the rdma transport and will be implemented later. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Idb6213d45c2a7954b9ab280f5eb5e021be00505f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9056 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-08 08:04:04 +00:00
Alexey Marchuk	00277bc5d5	nvme/rdma: Fix searching for qpair by qp_num Poll group holds lists of qpairs in different states and when we get an rdma completion with error, we iterate these lists to find a qpair whose qp_num matches. qp_num is stored inside of ibv_qp which belongs to spdk_rdma_qp structure. When nvme_rdma_qpair is disconnected, the pointer to spdk_rdma_qp is cleared but the qpair may still exist in a poll group list, and when we start searching for qpair by qp_num we may dereference a NULL pointer. This patch adds a check that the pointer to spdk_rdma_qp is valid before dereferencing it. To minimize boilerplate code, wrap all checks in a macro. Add unit test to verify this fix. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I1925f93efb633fd5c176323d3bbd3641a1a632a9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9050 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-08 08:04:04 +00:00
Alexey Marchuk	110335f192	nvme: Add functions spdk_nvme_ns_cmd_readv/writev_ext These functions accept extendable structure with IO request options. The options structure contains a memory domain that can be used to translate or fetch data, metadata pointer and end-to-end data protection parameters Change-Id: I65bfba279904e77539348520c3dfac7aadbe80d9 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6270 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2021-08-20 07:26:10 +00:00
Alexey Marchuk	a422d8b06f	nvme: Add API to get SPDK memory domain per nvme controller Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I6db64c7075b1337b1489b2716fc686a6bed595e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7239 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2021-08-20 07:26:10 +00:00
Alexey Marchuk	d06b6097e3	nvme/rdma: Create memory domain per Protection Domain Add a global list of memory domains with reference counter. Memory domains are used by NVME RDMA qpairs. Also refactor ibv_resize_cq in nvme_rdma_ut.c to stub Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie58b7e99fcb2c57c967f5dee0417e74845d9e2d1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8127 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2021-08-20 07:26:10 +00:00
Konrad Sztyber	98b483a35e	nvme/rdma: use timeout when destroying qpairs Replaced poll cycle count with a timeout when destroying a qpair that is part of a poll group. Tracking the time instead of a poll count is more stable, as the number of poll cycles can vary based on the application's behavior when destroying a qpair. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I7445bc1b411f2905aab7bf3dc7b2d3344712e1eb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9200 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-08-18 08:11:51 +00:00
Monica Kenguva	771f65bb1f	nvme: asynchronous create io qpair async_mode option is currently supported in PCIe transport layer to create io qpair asynchronously. User polls the io_qpair for completions, after create cq and sq completes in order, pqpair is set to READY state. I/O submitted before the qpair is ready is queued internally. Currently other transports only support synchronous io qpair creation. Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Ib2f9043872bd5602274e2508cf1fe9ff4211cabb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8911 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-08-13 07:27:07 +00:00
Ben Walker	ea0aaf5e85	nvme: Transports now set qpair state to NVME_QPAIR_CONNECTED inside .ctrlr_connect_qpair Previously this was assumed to be a synchronous process so the generic layer transport code updated the state after .ctrlr_connect_qpair returned. In preparation for making this support asynchronous mode, shift that responsibility down into the individual transports. While none of the transports actually do this asynchronously, insert a busy wait in nvme_transport_ctrlr_connect_qpair to wait for the qpair to exit from the CONNECTING state. None of the upper layer code can actually correctly handle a transport doing this asynchronously, so the busy wait will cover that. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I3c1a5c115264ffcb87e549765d891d796e0c81fe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8909 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-07-28 07:04:00 +00:00
Jim Harris	59c8bb527b	nvme: do not try to resubmit requests on error If the transport returns an error when polling for completions, it gets converted to a uint32_t and we end up trying to resubmit all of the requests that are currently queued. But that's not correct - if the transport returns an error we shouldn't be trying to resubmit requests at all. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9198e3e2d71875cc1e46e0ac928338bb983487f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8395 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-06-17 09:02:14 +00:00
Jim Harris	d6f6ffd274	nvme: add NVME_CTRLR_STATE_CONNECT_ADMINQ Connect the adminq as part of controller initialization instead of controller construction. We never actually 'connected' the adminq for PCIe or vfio-user transports, since its a nop. But their connect_qpair transport ops function is also a nop for the adminq, so it's fine to generically connect the adminq across all transports. Note that we cannot read registers (cc or csts) during controller initialization now until after the adminq has been connected since reading fabrics registers depends on a connected adminq. This gets special cased for now, but eventually reading cc and csts will need to be part of the state machine itself to make it asynchronous. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia5566d7c549d78d24b94ea253df51e697da6237f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8079 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-06-01 07:43:12 +00:00
Jim Harris	f5ba8a5ef5	nvme: add NVME_CTRLR_STATE_READ_CAP Read CAP (Capabilities) register as part of controller initialization instead of controller construction. For now, still read CAP in the pcie and vfio-user controller construction, since they need the dstrd (doorbell stride) to construct the admin queue. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I000fe880f2ec0d6de1d565c883d7ea0ae1ac2c81 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8078 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2021-05-28 08:14:06 +00:00
Jim Harris	df01076f70	nvme: add NVME_CTRLR_STATE_READ_VS Read VS (Version) register as part of controller initialization instead of controller construction. This prepares for upcoming changes to make controller attach fully asynchronous. Since reading fabrics registers is an asynchronous operation, it will be easier to read the VS register as part of controller initialization which operates as an asynchronous state machine. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I771386dbdf5902633e0d9f91b3b20be98f26fdc3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8076 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2021-05-28 08:14:06 +00:00
Alexey Marchuk	3fcda8e779	nvme: Add transport interface to get/free stats The two new API functions allow getting and freeing stats per poll group. A new function to get the transport name has been added to report not only the transport type but also the name. For now only the RDMA transport reports statistics; other transports will be added later. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I2824cb474fde5fa859cf8196dabac2c48c05709c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6299 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-13 21:30:52 +00:00
Alexey Marchuk	50569293ef	nvme/rdma: Add poller statistics New statistics include the number of poller calls, the number of idle polls and the total number of completions. These statistics make it possible to estimate the percentage of idle polls and the number of completions per poll. Since the nvme_rdma_cq_process_completions function returns the number of completed NVMF requests and each NVMF request consumes 2 RDMA completions (send+recv), this function was extended to return the number of RDMA completions. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ifdc1e2e467f645adb5d66d39ff2a379e161fbd77 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6298 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-04-13 08:41:39 +00:00
Alexey Marchuk	527f406b6b	nvme/rdma: Use RDMA statistics These statistics make it possible to estimate WR batching efficiency. The number of send WRs equals the total number of submitted NVMe commands. Change-Id: I96c9836cd6b9070cf5f62e43b4d2738506866e94 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6297 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-13 08:41:39 +00:00
Jim Harris	6156777bd4	nvme: assert if user tries to delete NULL tcp qpair It is invalid to try to delete a NULL qpair, so do not check for it in nvme_tcp_ctrlr_delete_io_qpair and return an error when NULL. Just change it to an assert instead. This makes it consistent with pcie and rdma. While here, add an assert in rdma as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic2f76deecb21b78749dac85e33fb1fa0d14a1239 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6917 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com>	2021-03-18 14:41:44 +00:00
Alexey Marchuk	47afb9280f	nvme/rdma: Use RDMA provider API to post recv WRs Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I47cc1a21af1104f681519e542edaf66e363bb214 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6296 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-03-09 08:54:12 +00:00
Mao Jiang	6b3ec9683e	nvme/rdma: Fix rdma ctrlr creating qpair memory leak Change-Id: Ie94cacac0b8dcf90b0243e8d568bb728dc7d3045 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6126 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2021-02-04 08:41:33 +00:00
Ziye Yang	74b2916c4a	nvme/rdma: Only wait for the RDMA event if spdk_rdma_qp_disconnect return 0 If rdma_qp_disconnect is not correctly sent out, we will not wait for the event. Change-Id: I99701e421dc93909d481ccf35e9bfd8004e60da8 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6163 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com>	2021-02-04 08:37:38 +00:00
Alexey Marchuk	b6efb964cd	nvme/rdma: Use RDMA provider memory translation Change-Id: Ie0995a55d252c0167b82ef54aaf7c7b8e5fd75d0 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5122 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-01-14 16:19:48 +00:00
Mao Jiang	6fd1459493	nvme/rdma: Fix rdma allocation return unique pointer Allocating memory with a zero count or size may return a unique pointer rather than NULL. Add a check before calling the common allocation APIs. Change-Id: I83e07cab5145035e705bc32364652be90f238633 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5809 Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-08 09:35:43 +00:00
yidong0635	846bb4b9e1	nvme/nvme_rdma: Fix possible used uninitialized value. In file included from nvme_rdma_ut.c:36: /home/clear/spdk/lib/nvme/nvme_rdma.c:651:22: note: ‘bad_send_wr’ was declared here 651 \| struct ibv_send_wr *bad_send_wr; \| ^~~~~~~~~~~ In file included from /home/clear/spdk/lib/nvme/nvme_rdma.c:41, from nvme_rdma_ut.c:36: /home/clear/spdk/lib/nvme/nvme_rdma.c: In function ‘nvme_rdma_poll_group_process_completions’: /home/clear/spdk/include/spdk/log.h:132:2: error: ‘bad_send_wr’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 132 \| spdk_log(SPDK_LOG_ERROR, __FILE__, __LINE__, __func__, __VA_ARGS__) \| ^~~~~~~~ cc1: all warnings being treated as errors. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I38ae36756b4bacef7e89f0f1737684c8b8981b12 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4696 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-10-16 08:15:57 +00:00
Tomasz Zawadzki	2172c432cf	log: simplify SPDK_LOG_REGISTER_COMPONENT This patch removes the string from register component. Removed are all instances in libs or hardcoded in apps. Starting with this patch literal passed to register, serves as name for the flag. All instances of SPDK_LOG_* were replaced with just * in lowercase. No actual name change for flags occur in this patch. Affected are SPDK_LOG_REGISTER_COMPONENT() and SPDK_*LOG() macros. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I002b232fde57ecf9c6777726b181fc0341f1bb17 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4495 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Mellanox Build Bot Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI	2020-10-14 08:00:35 +00:00
Alexey Marchuk	eb78b90ca8	nvme/rdma: Check that SGL descriptors fit into ICD The issue happens when the SPDK RDMA initiator is connected to a remote target that reports a rather small (or zero) ICD and we try to send several SGL descriptors. Since SGL descriptors are located in ICD, we should check that their total length fits into ICD. Otherwise, sending such a command will cause RDMA errors (local length error). Change-Id: I8c0e8375dae799bc442ed2fab249cad2c4ccce51 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reported-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4131 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-16 07:58:13 +00:00
Seth Howell	316f92d118	lib/nvme: pass up fabric connect rc to app. This will allow applications to understand why they were unable to connect. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ic04c7e72098c6ec1823de7d6a07d90150ef5ac20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3836 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-26 09:47:24 +00:00
Alexey Marchuk	8bec9feb76	nvme/rdma: Remove unused spdk_nvme_send_wr_list nvme_rdma_qpair::sends_to_post is not used, remove it and spdk_nvme_send_wr_list structure Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: If9c42736d4e796a947bbfe80f59efd2fd7f77859 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3822 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Shuhei Matsumoto	f2bd635ecf	lib/nvme: Add qpair_iterate_requests() to iterate the common operation among transports To abort requests whose cb_arg matches, add child abort request greedily. Iterating all outstanding requests is unique for each transport but adding child abort is common among transports, and adding child abort is replaceable by other operations. Hence add qpair_iterate_requests() function to the function pointer table of transport, and pass the operation done in the iteration by a parameter of it. In each transport, the implementation of qpair_iterate_requests() uses TAILQ_FOREACH_SAFE() for potential future use cases. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic70d1bf2613fce2566eade26335ceed731f66a89 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2038 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-07-08 07:54:01 +00:00
Shuhei Matsumoto	aa2ea2bed5	nvme/rdma: Follow the fix in TCP transport and restore nvme_rdma_req_put() Recently two patches were merged but we should have gotten more reviews. The fix done in the TCP transport is better because we can keep the existing functions and keep the code change minimal. Restore nvme_rdma_req_put() and move removing rdma_req from rqpair->outstanding_reqs to nvme_rdma_req_complete(). One exception is the case where only nvme_rdma_req_put() is called. In that case, remove rdma_req from rqpair->outstanding_reqs before calling nvme_rdma_req_put(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3f68dbc88c60af6b8f4ecc3209fde9b763ac3189 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3073 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-07-08 07:54:01 +00:00
Jin Yu	d76951c7ba	nvme_rdma: fix the recvs_to_post.first to NULL nvme_rdma_qpair_submit_recvs is not judged in nvme_rdma_poll_group_process_completions path. If we do not clean the recvs_to_post.first we may get the wrong current_num_recvs when the rc is non-zero and call it again. Change-Id: If0046e711525dcfcb419132a01fed7a09db13ba0 Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3163 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-06 07:20:33 +00:00
Jin Yu	05805e54a0	nvme:disconnected state then destroying state Put the destroying state after the disconnected state. Because nvme_transport_ctrlr_disconnect_qpair will modify the state of qpair to disconnected, and in the path of rdma, it will postpone the deletion of qpair until the release of pg by judging the destroying state. So qpair is not deleted. Change-Id: Ica606905cddf67d0ffda14bd48cc5f4e424f01ee Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3136 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-06 07:20:26 +00:00
Jin Yu	19228a0602	nvme_rdma:fix current_num_sends to current_num_recvs Change-Id: I1a3067165c06db3fe7d7fd1c1ec149e845100b27 Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3162 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-06 07:20:26 +00:00
Alexey Marchuk	e762508854	nvme_rdma: Add check for keyed SGL length The length field of a keyed SGL data block descriptor is 3 bytes wide. Add a check to fail requests whose length does not fit in 3 bytes. Otherwise we can send an incorrectly formed SGL request with an invalid or zero length. Fixes issue #1450 Change-Id: I77cdaff5fbf4be5754a3ac6008b8ccd532ac5905 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3056 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-02 07:21:31 +00:00
Seth Howell	203ed4f673	lib/nvme: report rdma_connect errors up the stack. This will allow applications to discern specific connect behavior and make choices relative to it. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I46182c285367ceb8a72511defe4508b3592b4572 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3095 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-29 09:19:09 +00:00
Alexey Marchuk	8421f83973	rdma: Fix qpair destruction in error flow rdma_qp may not be initialized when qpair is not fully created. When such a qpair is being destroyed we may pass a NULL pointer to spdk_rdma_qp_disconnect or spdk_rdma_qp_destroy and hit an assert. This patch fixes this problem for NVMEoF target and initiator. Change-Id: I84787dc1b1211293c2a19f59d47727eaecd9d5a1 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3050 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-06-29 09:18:52 +00:00
Shuhei Matsumoto	465b2f8a6b	nvme/rdma: Inline nvme_rdma_req_put() nvme_rdma_req_complete() and nvme_rdma_req_put() are called in a row except a single case. Move clearing completion_flags and req of rdma_req from nvme_rdma_req_put() to nvme_rdma_req_complete(), and then inline nvme_rdma_req_put() because nvme_rdma_req_put() does only insert now. To do this, change the type of the second parameter of nvme_rdma_req_complete() from struct nvme_request to struct spdk_nvme_rdma_req. For the exceptional case that only nvme_rdma_req_put() is called, change nvme_rdma_req_init() to clear rdma_req->req if returned with error. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibf7e6d245f3a48fb895cd9e6d92596ef833f26d1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2876 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	a57aeac1fe	nvme/rdma: Dequeue request from outstanding list before calling completion Each request has a callback context as cb_arg, and the callback to nvme_complete_request() for the completed request may reuse the context to the new request. On the other hand, RDMA transport dequeues rdma_req from rqpair->outstanding_reqs after calling nvme_complete_request() for the request pointed by rdma_req. Hence while nvme_complete_request() is executed, rqpair->outstanding_reqs may have two requests which has the same callback context, the completed request and the new submitted request. The upcoming patch will search all requests whose cb_arg matches to abort them. In the above case, the search may find two requests by mistake. To avoid such error, move dequeueing rdma_req from rqpair->outstanding_reqs before calling nvme_request_complete(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia183733f4a4cd4f85de17514ef3a884693910a05 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2863 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Alexey Marchuk	268aacb24a	rdma: Add new API spdk_rdma_qp_accept This API is a wrapper for rdma_accept which allows to remove spdk_rdma_qp_init_attr::initiator_side. Change-Id: Iba2be5e74e537c498fb11c939c922b2bbda95309 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2908 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-06-18 07:28:04 +00:00
Seth Howell	1039254319	nvme/rdma: add cq resizing. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I6350d76b8c1e778c18e693b2dfbb10dd36b3e3d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1927 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-04 07:20:16 +00:00
Seth Howell	67b0dcfe29	nvme_rdma: add tracking for rdma objects in qpair. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I0b45aed21dc649888bb9d93c5937fb553f35eb27 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2568 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-04 07:20:16 +00:00
Seth Howell	8bef6f0bdf	lib/nvme: rdma poll group with shared cq. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ifde29f633f09cccbebfdcde5ab2f96d9590449f1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1167 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-04 07:20:16 +00:00
Seth Howell	1a9c19a954	lib/nvme: remove spdk prefix from internal headers. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Iccde5860b83217163428ff504cba87a1cf209720 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2444 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2020-06-01 13:07:30 +00:00
Seth Howell	6d18ea425b	lib/nvme: force qpair disconnect before aborting rdma requests. This is needed for shared completion queues which can still give us successful completions on aborted requests if the qpair hasn't been disconnected. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I85cf1a81ef563d8c02d684b09d2f7ad5008e38cb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1961 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-01 09:22:05 +00:00
Seth Howell	b4e060b560	lib/nvme: check that req is not null in RDMA. When a request has been aborted, it's possible to get a completion for an rdma request but the rdma_req->req object has already been cleared to NULL. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I5f7b1b96ff4be8c436aae9a7e2a7c9927d04e627 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1960 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-06-01 09:22:05 +00:00
Shuhei Matsumoto	f21f51bd81	lib/nvme: Remove inclusion of SPDK event library Remove inclusion of spdk/event.h and spdk_internal/event.h from SPDK NVMe library. Their dependency had been removed before. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ide3a0902b1cebb9c9033ade45d7488622e38696c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2688 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-01 09:20:41 +00:00
Seth Howell	63732d8880	lib/nvme: split cq completion processing to its own function. This helps create a separation between processing a qpair and processing a completion queue which can be shared across multiple qpairs. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I111dd16ec4327854f232988a96891a65813f00e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1166 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-05-28 07:13:44 +00:00
zkhatami88	fe3fab26bf	nvme/rdma: Using hooks in reg mr Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Change-Id: I9493fe82b5b758c0092d20ef18b79d652fefed85 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1905 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-28 07:11:39 +00:00
Seth Howell	fadfef63d1	lib/nvme: provide mechanism for tracking request completions Add wrappers around the request and response values and track those using the wr_id value. This will come in handy when we start doing poll group based completion processing. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Iaff75b03e41d49f53e55e0ce65d384567988fc9d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1165 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-05-21 09:21:27 +00:00
Alexey Marchuk	9b86f31a38	nvme/rdma: Handle failed send/recv as a fatal error Do not make attempt to resubmit failed send/recv WR, instead report and error to the upper layer (in case of new request) or fail a qpair (in case of active polling). In the case of failed ibv_post_send and disabled `delay_cmd_submit` nvme_rdma_qpair_submit_request returns an error to the caller. The caller completes failed request but RDMA layer still keeps it in a send queue. Later RDMA layer can send the corresponding WR and notify the upper layer about the completion of the request for the second time. Change-Id: I1260f215b8523d39157a5cc3fda39cd4bd87c8ec Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1662 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-20 12:03:50 +00:00
Alexey Marchuk	8c6a345534	nvme/rdma: Use RDMA provider API to send WRs Change-Id: I3dc87751d250da84d988b1c7a9c57112b5bd10b0 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1661 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-20 12:03:50 +00:00
Alexey Marchuk	daee62a05b	rdma: Add mlx5_dv RDMA provider The new RDMA provider can be enabled by passing --with-rdma=mlx5_dv parameter to configure script This provider uses "externally created qpair" functionality of rdma cm - it must move a qpair to RTS state manually Change-Id: I72484f6edd1f4dad15430e2c8d36b65d1975e8a2 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1658 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-20 12:03:50 +00:00
Alexey Marchuk	63c8cea783	rdma: Add API function to disconnect qpair This is a wrapper over RDMA CM rdma_disconnect function The wrapper is needed since in Mellanox Direct Verbs (aka DV) we must move qpair to error state manually before calling rdma_disconnect Change-Id: Ia8623c6989e7679591f2da56bafa7f4262eeebf9 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1975 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2020-05-20 12:03:50 +00:00
Alexey Marchuk	b4a9d7d318	nvme/rdma: Use RDMA provider API to create/destroy qpair This patch adds use of RDMA provider API to NVMEoF initiator. Makefiles have been updated with new RDMA lib dependency Change-Id: Ieaefeb12ee9681d3db2b618c5cf0c54dc52230af Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1657 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-05-20 12:03:50 +00:00
WANGHAILIANG	023e3624e7	lib/nvme: remove lkey and rkey's warnings in nvme_rdma.c One of these warnings, such as: /home/wanghailiang/spdk20200428/lib/nvme/nvme_rdma.c: In function ‘nvme_rdma_qpair_submit_request’: /home/wanghailiang/spdk20200428/lib/nvme/nvme_rdma.c:1512:29: warning: ‘lkey’ may be used uninitialized in this function [-Wmaybe-uninitialized] rdma_req->send_sgl[1].lkey = lkey; ^ /home/wanghailiang/spdk20200428/lib/nvme/nvme_rdma.c:1480:11: note: ‘lkey’ was declared here uint32_t lkey; ^ Change-Id: I67b25cb62c7a0d5b298ebfe7d2673b73261040ef Signed-off-by: WANGHAILIANG <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2197 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-07 10:44:02 +00:00
zkhatami88	58a8fe2eee	nvme/rdma: When RDMA hooks exist, prefer spdk_zmalloc for internal allocations Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Change-Id: I7f810ee78fecca7eb8a4387f6d63e1a952966e57 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1593 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-05 08:03:39 +00:00
Seth Howell	bf0561f741	nvme/nvme_rdma: assign rctrlr in each qpair->ctrlr check While in practice the qpair->ctrlr variable will not change within the disconnect function, when the code is built without debug enabled, gcc thinks that rctrlr may be uninitialized. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I355cd62f3a2baaba65d806e3746f615a0dc37f58 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2056 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-04-29 06:32:12 +00:00
Seth Howell	1b818a28b5	lib/nvme: add naive poll_group implementation for rdma. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I55bae6dddc887a95c3e37195fac821de5aa1ed89 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/631 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-04-24 16:36:03 +00:00
Seth Howell	fc86e792e4	lib/nvme: switch poll group to use connect/disconnect semantics. This makes more sense within the context of the nvme driver and helps us avoid the awkward situation of getting a failed_qp callback on a qpair that simply hasn't been connected. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ibac83c87c514ddcf7bd360af10fab462ae011112 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1734 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-04-22 19:06:26 +00:00
Seth Howell	6189c0ceb7	lib/nvme: abort all requests when disconnecting a qpair. By aborting all requests from every qpair when it is disconnected, we can completely avoid having to abort requests when we enable the qpair since nothing will be left enabled. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Iba3bd866405dd182b72285def0843c9809f6500e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1788 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-04-22 19:06:26 +00:00
Seth Howell	6338af34fc	lib/nvme: handle qpair state in transport layer. The state should be changed and checked by the transport layer. All transports should follow the same list of steps when disconnecting/reconnecting. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If2647624345f2c70f78a20bba4e2206d2762f120 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1853 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-04-22 19:06:26 +00:00
Seth Howell	e1c9185005	lib/nvme: always call the transport disconnect function. The qpair states should be maintained at the generic level. Always going through the transport disconnect function is one step in that direction. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I019b2b4a14fe192eff5293f918d633dde2c5400a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1851 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-22 19:06:26 +00:00
Seth Howell	9649ee09fa	lib/nvme: rename NVME_QPAIR_DISABLED This variable really indicates when a qpair is no longer connected. So NVME_QPAIR_DISCONNECTED is actually much more accurate. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ia480d94f795bb0d8f5b4eff9f2857d6fe8ea1b34 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1850 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-22 19:06:26 +00:00
Seth Howell	c3eac3435a	nvme/rdma: send an rdma_disconnect during disconnect. The rdma_disconnect call triggers an RDMA_CM_EVENT_DISCONNECTED message on the target side. The hope is that the target side will reply with the same message in a reasonable amount of time. If the target doesn't have that mechanism implemented, print an error message and continue with the process. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I164a3538714fa3adfc306ea0c88220ea710e7c39 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1879 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-04-20 07:40:31 +00:00
Alexey Marchuk	f11989385e	nvme/rdma: Clean pointer to nvme_request That is done to make sure that scenario described in github issue #1292 won't happen Change-Id: Ie2ad001da701e25ef984ae57da850fb84d51b734 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1771 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-14 11:33:39 +00:00
Alexey Marchuk	581e1bb576	nvme/rdma: Wait for completions of both RDMA RECV and SEND In some situations we may get a completion of RDMA_RECV before completion of RDMA_SEND and this can lead to a bug described in #1292. To avoid such situations we must complete nvme_request only when we received both RDMA_RECV and RDMA_SEND completions. Add a new field to spdk_nvme_rdma_req to store response idx - it is used to complete nvme request when RDMA_RECV was completed before RDMA_SEND. Repost RDMA_RECV when both RDMA_SEND and RDMA_RECV are completed. Side changes: change type of spdk_nvme_rdma_req::id to uint16_t, repack struct nvme_rdma_qpair. Fixes #1292 Change-Id: Ie51fbbba425acf37c306c5af031479bc9de08955 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1770 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-04-14 11:33:39 +00:00
Seth Howell	c998c6c69e	nvme: add API for qpair poll groups. This API will allow us to simplify the polling mechanism for qpairs on a single thread. It also will pave the way for doing transport specific aggregation of qpair polling to increase performance. The generic implementation is included. The transport specific calls have yet to be implemented. Change-Id: If07b4170b2be61e4690847c993ec3bde9560b0f0 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/579 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-07 08:38:40 +00:00
Alexey Marchuk	14425544a6	nvme/rdma: Factor out memory key translation Add function nvme_rdma_get_key to get either lkey or rkey, use it in request building functions Change-Id: Ic9e3429e07a10b2dddc133b553e437359532401d Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1462 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-04-06 07:49:48 +00:00
Alexey Marchuk	d2510a56f3	nvme/rdma: Simplify nvme_rdma_req_init Cache payload type and in-capsule data transfer support Change-Id: Id40a6e86d1f29235ca3e0189d7fbcf19baa30ffe Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1461 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-04-06 07:49:48 +00:00
yidong0635	20564d423b	nvme/nvme_rdma: Reduced the code lines. Here the controller destruct calls are in one function, so we can remove the duplicated code using goto. It can save several lines of code. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: Ibf3cb9fe2ea4bfc65d42603a7b13aaf575854580 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1638 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-04-03 06:31:52 +00:00

393 Commits