ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Shuhei Matsumoto	0e4b13dc53	nvme_rdma: Destroy qpair after it is disconnected and drained By the previous patches, a qpair is destroyed after it is actually disconnected. But after the qpair is destroyed, it is checked if drained by using rqpair->current_num_sends and rqpair->current_num_recvs. However, if the qpair is the last of a poller of a poll group, CQ is destroyed before checking if the qpair is drained. If CQ is destroyed, at least rqpair->current_num_recvs is not updated, and we may get one second timeout. This should be avoided. Hence, destroy the qpair after it is disconnected and drained. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibd6c83e8a3e7b6e11e9b45cee42669da6d42a621 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14278 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	1d58eb038b	nvme_rdma: Release poller from poll group when qpair is actually disconnected If the qpair being disconnected is the last one on a poller of a poll group, the CQ is destroyed and the poller is released before the qpair is actually disconnected. This patch destroys the CQ and releases the poller after the qpair is actually disconnected. One exception is when spdk_nvme_ctrlr_free_io_qpair() is called on a connected qpair. In this case, the qpair is removed from a poll group before the qpair is actually disconnected, so destroy the CQ and release the poller when the qpair is removed from the poll group. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Idf266bbb6dbb40f04ae6313db724fabf80865763 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14253 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	80d75fda06	nvme_rdma: Clean up releasing poller from poll group There are two cases in which nvme_rdma_poll_group_put_poller() is called. For consistency, make the two cases follow the same sequence. This will make the next patch easier. The next patch will release the poller from the poll group when the qpair is actually disconnected, wherever possible. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4178113d5277240e287e83a57e97cf32fd0f7457 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14252 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	4a6f858872	nvme_rdma: Set REUSEADDR to reuse source address among multiple CM IDs When we specify source address for admin and I/O qpairs, rdma_resolve_addr() succeeded only for admin qpair and failed for following all I/O qpairs because rdma_resolve_addr() returned -EADDRINUSE. To reuse source address among multiple qpairs, set the REUSEADDR option for each CM ID before executing rdma_resolve_addr() if source address is specified. We may miss something. Even if rdma_set_option() fails, execute rdma_resolve_addr(). Fixes issue #2604 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If03f82d4499cf83c0e428a62e91c9d9e6aad28e0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14229 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-29 11:41:17 +00:00
Shuhei Matsumoto	cd65512d08	nvme_rdma: Fix assertion for rqpair->current_num_sends/recvs assert() in nvme_rdma_queue_recv_wr() was wrong and assert() in nvme_rdma_cq_process_completions() was missing. This patch fixes both. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ied057d75dbfd9e54ce3c3671355b9ec3acad7ff5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13597 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	41bb31a36d	nvme_rdma: Replace rdma_dereg_mr() by ibv_dereg_mr() rdma_reg_msgs() was replaced by ibv_reg_mr() recently to support persistent PD per RDMA device. The difference between rdma_dereg_mr() and ibv_dereg_mr() is only return value and errno. For consistency, replace rdma_dereg_mr() by ibv_dereg_mr(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I55e0743690e74f9510863bfa122a75d0632dce4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13949 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	d75daea532	nvme_rdma: Use persistent protection domain for qpair Get a PD for the device from the PD pool managed by the RDMA provider when creating a QP, and put the PD when destroying the PD. By this change, PD is managed completely by the RDMA provider or the hooks. nvme_rdma_ctrlr::pd was added long time ago but is not referenced anywhere. Remove nvme_rdma_ctrlr::pd for cleanup and clarification. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If8dc8ad011eed70149012128bd1b33f1a8b7b90b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13770 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	4f2f1aa9c5	nvme_rdma: Use pd of rdma_qp instead of default pd of cm_id This is another preparation to create and use ibv_context and pd. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Id594fa1ccb2daf535b1aaaef0a397bda2ec98578 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13710 Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-02 07:39:41 +00:00
Shuhei Matsumoto	a3a51453b8	nvme_rdma: Pass pd instead of cm_id to nvme_rdma_reg_mr() The following patches will create and use ibv_context and pd explicitly instead of using default ibv_context and pd created by rdmacm. As a preparation, pass pd instead of cm_id to nvme_rdma_reg_mr(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ifdcd18ed363b8ba4a23a920bf3559237e38821c6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13599 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 07:39:41 +00:00
Evgeniy Kochetov	3b26e2c594	nvme/rdma: Create poller and CQ on demand Original implementation creates pollers and CQs for all discovered devices at poll group creation. Device (ibv_context) that has no references, i.e. has no QPs, may be removed from the system and ibv_context may be closed by rdma_cm. In this case we will have a CQ that refers to closed ibv_context and it may crash in ibv_poll_cq. With this patch pollers are created on demand when we create the first QP for a device. When there are no more QPs on the poller, we destroy the poller. This also helps to avoid polling CQs that don't have any QPs attached. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I46dd2c8b9b2902168dba24e139c904f51bd1b101 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13692 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-22 07:27:22 +00:00
Jim Harris	a6704e454c	nvme: put rdma req in nvme_rdma_req_complete All of the callers immediately put the req right after the nvme_rdma_req_complete call, so just move the put into that function instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic370cf689850924e0c902a6071af8b3a7ed58c0b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13527 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	e415bf0033	nvme: add cmd/cpl printing for rdma errors This follows similar logic in the pcie and tcp completion paths, including omitting error messages when aborting aers by adding a print_on_error parameter to the completion function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id558d0af2cdd705dfb60abb842bd567a0949ccce Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13525 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Ben Walker	8dd1cd2104	check_format: For C files only, fix return type breaks In SPDK, declarations have the return type on the same line. Definitions have the return type on a separate line. Astyle has an option for enforcing this. Unfortunately, it seems to have two bugs: 1) It doesn't work correctly at all on C++ files. 2) It often fails on functions that return enums, or long type names Deal with 1) by adjusting the check_format.sh script to only tell astyle to fix return type line breaks for C files and not C++. Deal with 2) by adding a few typedefs to work around the problem. Change-Id: Idf28281466cab8411ce252d5f02ab384166790c6 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13437 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-06-27 09:33:48 +00:00
Jim Harris	488570ebd4	Replace most BSD 3-clause license text with SPDX identifier. Many open source projects have moved to using SPDX identifiers to specify license information, reducing the amount of boilerplate code in every source file. This patch replaces the bulk of SPDK .c, .cpp and Makefiles with the BSD-3-Clause identifier. Almost all of these files share the exact same license text, and this patch only modifies the files that contain the most common license text. There can be slight variations because the third clause contains company names - most say "Intel Corporation", but there are instances for Nvidia, Samsung, Eideticom and even "the copyright holder". Used a bash script to automate replacement of the license text with SPDX identifier which is checked into scripts/spdx.sh. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaa88ab5e92ea471691dc298cfe41ebfb5d169780 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12904 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: <qun.wan@intel.com>	2022-06-09 07:35:12 +00:00
Or Gerlitz	9b5dabff7f	nvme/rdma: Always use spdk allocation scheme Use the conventional huge-pages based spdk allocation scheme for the initiator data-structures unconditionally. Change-Id: I5baee7614e3ac9b5497b3d771dfddfbaa7fdf65b Signed-off-by: Or Gerlitz <ogerlitz@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12687 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-25 07:42:47 +00:00
Ben Walker	813756e75e	nvme: Do not abort transport commands when disconnecting a qpair Make this a transport-level decision instead. TCP and RDMA do want to abort, but PCIe cannot because these commands may still be receiving DMA operations from the device. Change-Id: I305acddc3819c903eb3217e8f710d4216d0b3931 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11509 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2022-05-19 08:23:57 +00:00
Alexey Marchuk	622ceb7f07	nvme/rdma: Use rdma qpair as cm_id context It simplifies code and removes cast of nvme_qpair to rdma_qpair Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I363246cf9d8c9cbafd48b26facdb5cc37fdd8e67 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12701 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-18 00:34:29 +00:00
Alexey Marchuk	1003e28623	nvme/rdma: Fix qpair destroy/disconnect race When a qpair is attached to a poll group, the disconnect process is async: we wait for the DISCONNECTED event from rdmacm before destroying rdma resources. However, the user (nvme_perf) can destroy the qpair immediately, so the memory allocated for the qpair is freed while rdma resources are still allocated. That means that we may receive an rdmacm event (DISCONNECTED) for the destroyed qpair, which leads to use-after-free. To fix this problem, add a check for the internal qpair state when the qpair is destroyed; if the disconnect is not finished, we forcefully destroy rdma resources. Fixes issue #2515 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reported-by: Or Gerlitz <ogerlitz@nvidia.com> Change-Id: I7bfa53c9f6fe6ed787323a8941f1f2db17ea0c20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12700 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-18 00:34:29 +00:00
Alexey Marchuk	b0f4249c59	nvme/rdma: Add async set/get registers Now controller initialization with RDMA transport is fully async Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I26e857740d3137d0b0e987facc81fc5f6ef81f2b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10756 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	2c13441ba8	nvme_rdma: Destroy qpair after qpair is actually disconnected The RDMA transport can disconnect qpair asynchronously now. Previously, we tried to release the resource of the qpair after it was disconnected. However, it did not work because it was done when deleting the qpair. The admin qpair was not deleted in a ctrlr reset sequence. This patch tries to satisfy the same aim again but in a different way. Previously, we released the resource of the qpair before starting the actual disconnection process. This patch releases the resource of the qpair after the qpair is actually disconnected. The related patches are: `b9518a5540` `eb09178a59` Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id6a814895a35b1589b781a91744ef872b42aaa69 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11783 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	4b73223542	nvme_rdma: Wait until lingering qpair becomes quiet before completing disconnection The code to handle the lingering qpair when deleting it was really complicated. The RDMA transport can connect or disconnect qpair asynchronously. Then we can include the code to handle the lingering qpair into the code to disconnect qpair now. If the disconnected qpair is still busy, defer completion of the disconnection until qpair becomes idle. If poll group is not used, we can complete disconnection immediately because cq is already destroyed. The related data and unit test cases are not necessary anymore. So delete them in this patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic8f81143fcad0714ac9b7db862313aa8094eeefb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11778 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	20cf90801e	nvme_rdma: Handle stale connection asynchronously Include a finite number of delayed disconnect/connect retries in the state machine of asynchronous qpair connection. We do not need to call back to the common transport layer, but we need to do the following: clear rqpair->cq before starting disconnection if the qpair uses a poll group, and clear qpair->transport_failure_reason after it is disconnected. Additionally, locate the new state STALE_CONN before INITIALIZING because cq is not ready to use for admin qpair when the state is STALE_CONN. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibc779a2b772be9506ffd8226d5f64d6d12102ff2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11690 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	77c4657140	nvme_rdma: Factor out destroying rdma qpair operation Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I18e166a726cca69f13e7c5818eba57f478726286 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11689 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	aa36c18196	nvme_rdma: Pass callback to ctrlr_disconnect_qpair() via a parameter Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I06cbb9739286d1928ad9fc07de3715a449914d75 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11688 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	75d38a301d	nvme: poll_group_process_completions() returns -ENXIO if any qpair failed TCP transport already does it but was not documented clearly. RDMA and PCIe transports follow it and document it clearly. Then we can check each qpair's state if spdk_nvme_poll_group_process_completions() returns -ENXIO before disconnected_qpair_cb() is called. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2afe920cfd06c374251fccc1c205948fb498dd33 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11328 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	9717b0c3df	nvme_rdma: Connect and disconnect qpair asynchronously Add three states, INITIALIZING, EXITING, and EXITED to the rqpair state. Add async parameter to nvme_rdma_ctrlr_create_qpair() and set it to opts->async_mode for I/O qpair and true for admin qpair. Replace all nvme_rdma_process_event() calls by nvme_rdma_process_event_start() calls. nvme_rdma_ctrlr_connect_qpair() sets rqpair->state to INITIALIZING when starting to process CM events. nvme_rdma_ctrlr_connect_qpair_poll() calls nvme_rdma_process_event_poll() with ctrlr->ctrlr_lock if qpair is not admin qpair. nvme_rdma_ctrlr_disconnect_qpair() returns if qpair->async is true or qpair->poll_group is not NULL before polling CM events, or polls CM events until completion otherwise. Add comments to clarify why we do like this. nvme_rdma_poll_group_process_completions() does not process submission for any qpair which is still connecting. Change-Id: Ie04c3408785124f2919eaaba7b2bd68f8da452c9 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11442 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	0a61427ecc	nvme_rdma: Start qpair after resolving address and route when poll group is used Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0b0f314c98368247582f2dfcaf69f78e24d715f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11366 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	531c1b0f04	nvme_rdma: Make nvme_rdma_process_event() asynchronous Separate nvme_rdma_process_event() into nvme_rdma_process_event_start() and nvme_rdma_process_event_poll(). Use nvme_rdma_process_event_start() and nvme_rdma_process_event_poll() in nvme_rdma_process_event() to ensure compatibility. Change-Id: Idc960fab2540efec612dcf22f156acabd2e2874e Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10594 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	791ee7deb4	nvme_rdma: nvme_rdma_process_events() returns negated errno It will be convenient for the following patches to return negated errno directly. Change-Id: Ic80181b2ee449946dd60ad0c97a325fd48b92231 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10990 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	cf7f253302	nvme_rdma: Add callback to nvme_rdma_process_event() Change-Id: I66aa89dc54d5aaedbe2f06239cbf04aeeb2c739e Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11359 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	bcf0845727	nvme_rdma: Make CM event operations callback functions Change-Id: I9f2551a07187400dd9ef624348cd465e64557e1b Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11138 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	e5927c02e9	nvme_rdma: Remove cm_channel param from process_event() nvme_rdma_poll_events() gets the cm_channel pointer itself. Before calling nvme_rdma_process_event(), we check that the rctrlr is valid. Hence we do not have to pass the cm_channel pointer to nvme_rdma_process_event() via a parameter. This simplifies the code and makes the following patches a little easier. Change-Id: I03f095833469c5b64592264d63a592106d49e13b Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11167 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	29974dc882	nvme_rdma: Make fabric_qpair_connect() asynchronous Replace nvme_fabric_qpair_connect() by nvme_fabric_qpair_connect_async() and nvme_fabric_qpair_connect_poll(). The details are as follows. Define the state of the nvme_rdma_qpair; each rqpair holds it. Initialize rqpair->state to INVALID in nvme_rdma_ctrlr_create_qpair(). _nvme_rdma_ctrlr_connect_qpair() sets rqpair->state to FABRIC_CONNECT_SEND instead of calling nvme_fabric_qpair_connect(). Then the new function nvme_rdma_ctrlr_connect_qpair_poll() calls nvme_fabric_qpair_connect_async() at FABRIC_CONNECT_SEND and nvme_fabric_qpair_connect_poll() until it returns 0 at FABRIC_CONNECT_POLL. nvme_rdma_qpair_process_completions() or nvme_rdma_poll_group_process_completions() calls nvme_rdma_ctrlr_connect_qpair_poll() if qpair->state is CONNECTING. This pattern follows the TCP transport. Change-Id: I411f4fa8071cb5ea27581f3820eba9b02c731e4c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11334 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:45 +00:00
Alexey Marchuk	94494579ce	nvme_rdma: Update reporting of RDMA responder resources The responder_resources parameter of rdma cm tells the remote side how many outstanding RDMA_READ or atomic operations the local side can handle. Previously it was adjusted based on queue depth, but that was not correct since these parameters do not depend on each other. Even with qdepth=1 the remote side may send several RDMA_READ operations per 1 IO request. With this change we report responder_resources equal to the maximum supported by the RDMA device. The Linux kernel nvme rdma driver reports this value in the same way. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I77e5c2ead6269da44c32a75a9188429f50d32ae4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11698 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-25 08:18:37 +00:00
Shuhei Matsumoto	6a89f75ec7	nvme_rdma: Remove handling stale connect The feature will be redesigned and restored in the following patches. For the NVMe bdev module, it can reconnect by itself without relying on the feature. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2d9c0437f7ad8412ad8cf40d11e574723b735bee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	0c77cf90bf	nvme_rdma: Consolidate fail_qpair() calls into a single place For nvme_rdma_qpair_process_completions(), consolidate the operations to call nvme_rdma_fail_qpair() and return -ENXIO into a single place. Besides, shorten pointer references for nvme_rdma_qpair_process_completions() and nvme_rdma_poll_group_process_completions(). These will make the following patches a little easier. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iaf72cfca0b5b3ba223d86e267da8069d43a15292 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11439 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	cfe11bd1db	nvme: Factor out operations done after disconnect qpair completes This is a preparation to make nvme_transport_ctrlr_disconnect_qpair() asynchronous. For nvme_transport_ctrlr_disconnect_qpair(), factor out operations after returning from transport's specific ctrlr_disconnect_qpair() into a helper function nvme_transport_ctrlr_disconnect_qpair_done(). Then move nvme_transport_ctrlr_disconnect_qpair_done() into the end of the transport specific ctrlr_disconnect_qpair(). Additionally remove the operation to overwrite the qpair state to DISCONNECTED from nvme_transport_connect_qpair_fail() because this is duplicated and nvme_transport_ctrlr_disconnect_qpair() is responsible to make the qpair disconnected even after it completes asynchronously. Change-Id: I9c8faa7039d306d3e31a8f51826755ce8840a8aa Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10851 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	486f46e867	nvme_rdma: Call disconnected_qpair_cb when qpair is in disconnected_qpairs list We want to call disconnected_qpair_cb only if the qpair is actually disconnected. When we disconnect a qpair asynchronously, for qpairs in the group->disconnected_qpairs list, we want to poll them until they are actually disconnected and then call disconnected_qpair_cb for them. As a preparation, call disconnected_qpair_cb only for qpairs that are in the group->disconnected_qpairs list. For TCP and PCIe transports, disconnecting a qpair will continue to be synchronous for now. So we change only the RDMA transport. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ifaf6157e1e02fa13f52a66409c9e60fc814d71dd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11495 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-15 09:05:09 +00:00
Evgeniy Kochetov	5c80b1e5ab	nvme/rdma: Limit max_sges by command capsule size According to NVMe over Fabrics spec number of SGLs supported by the controller is reported in MSDBD. But it is also implicitly limited by command capsule size (IOCCSZ) since SGL are passed in capsule. This patch adjusts max_sges to capsule size if required. Adjustment to MSDBD is also moved to transport layer because it is fabrics specific parameter and is not valid for PCIe transport. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I44918eb949345c61242ca50a524d21d04b6ac058 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11669 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-02-25 08:18:32 +00:00
Shuhei Matsumoto	7594030409	nvme: Set dnr to zero for abort_reqs() including a fix of degradation The patch nvme: Set dnr to zero for nvme_qpair_abort_reqs() `1b3172f726` did the change stated in the title. However, Revert "nvme/rdma: Correct qpair disconnect process" `c8f986c7ee` destroyed it for RDMA transport. Additionally, we had still set DNR to 1 in nvme_qpair_init(). This patch fixes both. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iee60ac24aa7e04cce0f394014c9d9afc9d2b56ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11644 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-02-24 14:56:03 +00:00
Evgeniy Kochetov	834e3c5a0e	nvme: Fix submission queue overflow SPDK can submit more commands to remote NVMf target than allowed by negotiated queue size. SPDK submits up to SQSIZE commands, but only SQSIZE-1 are allowed. Here is a relevant quote from NVMe over Fabrics rev.1.1a ch.2.4.1 “Submission Queue Flow Control Negotiation”: If SQ flow control is disabled, then the host should limit the number of outstanding commands for a queue pair to be less than the size of the Submission Queue. If the controller detects that the number of outstanding commands for a queue pair is greater than or equal to the size of the Submission Queue, then the controller shall: a) stop processing commands and set the Controller Fatal Status (CSTS.CFS) bit to ‘1’ (refer to section 10.5 in the NVMe Base specification); and b) terminate the NVMe Transport connection and end the association between the host and the controller. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ifbcf5d51911fc4ddcea1f7cde3135571648606f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11413 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-02-10 15:22:08 +00:00
Evgeniy Kochetov	486426529d	nvme/rdma: Remove queue depth adjustment to crqsize According to NVMe over Fabrics specification (rev.1.1a) HSQSIZE sent in RDMA_CM_REQUEST private data (ch.7.3.6.4) shall be the same as SQSIZE later sent in Connect command (ch.3.3). SPDK NVMe RDMA initiator adjusts SQSIZE to CRQSIZE received from target in RDMA_CM_ACCEPT private data. Target is allowed to send CRQSIZE < HSQSIZE if RNR retries are used. So, it is possible that SQSIZE sent by SPDK will be lower than previously sent HSQSIZE. There are targets validating this match and they reject connection from SPDK. Linux kernel NVMe initiator doesn't perform such adjustments and connects well to such targets. This patch aligns SPDK behavior with specification and Linux kernel implementation. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I01968d1c07d284396fa5939932d85841351d7a45 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11350 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-02-10 15:22:08 +00:00
Jaylyn Ren	3e937f07eb	test/accel&rdma: Fix unittest_accel and unittest_nvme_rdma failure Valgrind reports errors about uninitialised values created by a stack allocation when running unittest_accel and unittest_nvme_rdma. Change-Id: I4b48b472cc7c189cbcaf8ca772830a23118e7e17 Signed-off-by: Jaylyn Ren <jaylyn.ren@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10559 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-02-09 22:22:04 +00:00
Shuhei Matsumoto	fc48cf8681	nvme_rdma: Check only if Soft RoCE receive normal completion after disconnect We saw this unexpected behavior by the current SPDK master. Add the check to clarify this behavior occurs only when we use Soft RoCE. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3a5eaa9064a0601c65139e7868898545926d0dbf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11225 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-01-26 08:09:15 +00:00
Shuhei Matsumoto	c8f986c7ee	Revert "nvme/rdma: Correct qpair disconnect process" This reverts commit `eb09178a59`. Reason for revert: This caused a degradation for adminq. For adminq, ctrlr_delete_io_qpair() is not called until ctrlr is destructed. So necessary delete operations are not done for adminq. Reverting the patch is practical for now. Change-Id: Ib55ff81dfe97ee1e2c83876912e851c61f20e354 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10878 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-26 08:09:15 +00:00
Shuhei Matsumoto	194dc9e2f9	Revert "nvme_rdma: Continue even if we receive a normal WC when qpair is disconnected" This reverts commit `b9518a5540`. Reason for revert: Fix a degradation for adminq Change-Id: I0e2c5e48a5ca34171fa98fa68216da4354b5d262 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10879 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-26 08:09:15 +00:00
Shuhei Matsumoto	728e3721a4	nvme_rdma: Remove a guard for recursive calls from poll_group_disconnect_qpair() nvme_poll_group_disconnect_qpair() is called only by a single place now. We do not need the flag poll_group_disconnect_in_progress any more. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I8f9c0f14baa8fcb9b0637635a5bb3d34a8b11af5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	4c8ccb5403	nvme: Remove poll_group_disconnect_qpair() call from poll_group_remove() spdk_nvme_poll_group_remove() is available only for disconnected qpairs now. Hence spdk_nvme_poll_group_remove() does not have to check if qpair is connected and call nvme_ctrlr_disconnect_qpair(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3b05246c4be6adfa3392b8f0e5ecaf274a8a7795 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10846 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	1b3172f726	nvme: Set dnr to zero for nvme_qpair_abort_reqs() This is necessary to failover another path when multipath is configured. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0b6bcf63501e38f75efb4b0d6bec58abb4b67aef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10250 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	b9518a5540	nvme_rdma: Continue even if we receive a normal WC when qpair is disconnected We recently improved qpair disconnect process and added assert if we get a completion without any error when a qpair is disconnected. However unexpectedly we saw this case very often when we ran the test test/nvmf/host/multipath.sh for the real hardware in the test pool. So we remove the assert and change the ERRLOG to INFOLOG. Fixes one of the issues in #2300 Signed-off-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Change-Id: Iedbf7e0afa5025da6a810043ba95348ba5b856b3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10901 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-12-29 02:19:58 +00:00


369 Commits