The length field of a keyed SGL data block is limited to 3 bytes.
Add a check to fail requests whose length cannot be represented in
3 bytes. Otherwise we could send an incorrectly formed SGL request
with an invalid or zero length.
Fixes issue #1450
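A minimal, self-contained sketch of the kind of check described; the
constant and function names here are illustrative, not necessarily the
symbols used in the patch:

    #include <stdint.h>

    /* The keyed SGL length field is 3 bytes (24 bits) wide, so the largest
     * encodable length is (1 << 24) - 1. */
    #define NVME_RDMA_MAX_KEYED_SGL_LENGTH ((1u << 24) - 1)

    static int
    nvme_rdma_check_keyed_sgl_length(uint32_t payload_size)
    {
            if (payload_size > NVME_RDMA_MAX_KEYED_SGL_LENGTH) {
                    /* the real patch logs the error and fails the request */
                    return -1;
            }
            return 0;
    }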
Change-Id: I77cdaff5fbf4be5754a3ac6008b8ccd532ac5905
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3056
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
This will allow applications to discern specific connect
behavior and make choices relative to it.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I46182c285367ceb8a72511defe4508b3592b4572
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3095
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
rdma_qp may not be initialized when a qpair is not fully created.
When such a qpair is being destroyed we may pass a NULL pointer to
spdk_rdma_qp_disconnect or spdk_rdma_qp_destroy and hit an assert.
This patch fixes this problem for the NVMe-oF target and initiator.
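A hedged sketch of the guard; the rqpair field names are assumptions:

    /* Skip disconnect/destroy when the qpair was never fully created and
     * rdma_qp is still NULL. */
    if (rqpair->rdma_qp != NULL) {
            spdk_rdma_qp_disconnect(rqpair->rdma_qp);
            spdk_rdma_qp_destroy(rqpair->rdma_qp);
            rqpair->rdma_qp = NULL;
    }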
Change-Id: I84787dc1b1211293c2a19f59d47727eaecd9d5a1
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3050
Community-CI: Broadcom CI
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
nvme_rdma_req_complete() and nvme_rdma_req_put() are called back to
back except in a single case. Move clearing completion_flags and req
of rdma_req from nvme_rdma_req_put() to nvme_rdma_req_complete(),
and then inline nvme_rdma_req_put() because it now only does the
TAILQ insert. To do this, change the type of the second parameter of
nvme_rdma_req_complete() from struct nvme_request to
struct spdk_nvme_rdma_req.
For the exceptional case where only nvme_rdma_req_put() is called,
change nvme_rdma_req_init() to clear rdma_req->req if it returns
with an error.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Ibf7e6d245f3a48fb895cd9e6d92596ef833f26d1
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2876
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>
Each request carries a callback context as cb_arg, and once
nvme_complete_request() has run for a completed request, the callback
may reuse that context for a new request.
On the other hand, the RDMA transport dequeues rdma_req from
rqpair->outstanding_reqs after calling nvme_complete_request() for
the request pointed to by rdma_req.
Hence while nvme_complete_request() is executing,
rqpair->outstanding_reqs may hold two requests with the same callback
context: the completed request and the newly submitted request.
The upcoming patch will search all requests whose cb_arg matches in
order to abort them. In the above case, the search may find two
requests by mistake.
To avoid this error, dequeue rdma_req from rqpair->outstanding_reqs
before calling nvme_complete_request().
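An illustrative sketch of the new ordering; the argument lists are
simplified and not the exact code:

    /* Dequeue first, so a request resubmitted from the completion callback
     * with the same cb_arg cannot appear twice on outstanding_reqs. */
    TAILQ_REMOVE(&rqpair->outstanding_reqs, rdma_req, link);
    nvme_rdma_req_complete(rdma_req->req, &cpl);
    nvme_rdma_req_put(rqpair, rdma_req);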
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Ia183733f4a4cd4f85de17514ef3a884693910a05
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2863
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI
Community-CI: Mellanox Build Bot
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>
This API is a wrapper for rdma_accept which allows us to remove
spdk_rdma_qp_init_attr::initiator_side.
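A hedged sketch of such a thin wrapper; the real spdk_rdma_qp_accept()
signature may differ:

    #include <assert.h>
    #include <rdma/rdma_cma.h>

    int
    spdk_rdma_qp_accept(struct spdk_rdma_qp *spdk_rdma_qp,
                        struct rdma_conn_param *conn_param)
    {
            assert(spdk_rdma_qp != NULL);
            assert(spdk_rdma_qp->cm_id != NULL);

            return rdma_accept(spdk_rdma_qp->cm_id, conn_param);
    }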
Change-Id: Iba2be5e74e537c498fb11c939c922b2bbda95309
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2908
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI
Community-CI: Mellanox Build Bot
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
This is needed for shared completion queues which can still give us
successful completions on aborted requests if the qpair hasn't been
disconnected.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I85cf1a81ef563d8c02d684b09d2f7ad5008e38cb
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1961
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
When a request has been aborted, it's possible to get a completion
for an rdma request whose rdma_req->req object has already been
cleared to NULL.
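An illustrative guard in the completion path, assuming the names used
in the commit text:

    if (rdma_req->req == NULL) {
            /* aborted request: nothing left to complete at the upper layer */
            nvme_rdma_req_put(rqpair, rdma_req);
            continue;
    }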
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I5f7b1b96ff4be8c436aae9a7e2a7c9927d04e627
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1960
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Remove the inclusion of spdk/event.h and spdk_internal/event.h from
the SPDK NVMe library. The dependency on them was removed earlier.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Ide3a0902b1cebb9c9033ade45d7488622e38696c
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2688
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This helps create a separation between processing a qpair and processing
a completion queue which can be shared across multiple qpairs.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I111dd16ec4327854f232988a96891a65813f00e1
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1166
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Add wrappers around the request and response values and track
those using the wr_id value.
This will come in handy when we start doing poll group based
completion processing.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: Iaff75b03e41d49f53e55e0ce65d384567988fc9d
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1165
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Do not attempt to resubmit a failed send/recv WR; instead report an
error to the upper layer (in the case of a new request) or fail the
qpair (in the case of active polling).
In the case of a failed ibv_post_send with `delay_cmd_submit`
disabled, nvme_rdma_qpair_submit_request returns an error to the
caller. The caller completes the failed request, but the RDMA layer
still keeps it in the send queue. Later the RDMA layer can send the
corresponding WR and notify the upper layer about the completion of
the request a second time.
Change-Id: I1260f215b8523d39157a5cc3fda39cd4bd87c8ec
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1662
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
The new RDMA provider can be enabled by passing the
--with-rdma=mlx5_dv parameter to the configure script.
This provider uses the "externally created qpair" functionality of
RDMA CM - it must move a qpair to the RTS state manually.
Change-Id: I72484f6edd1f4dad15430e2c8d36b65d1975e8a2
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1658
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
This is a wrapper over the RDMA CM rdma_disconnect function.
The wrapper is needed since in Mellanox Direct Verbs (aka DV) we
must move the qpair to the error state manually before calling
rdma_disconnect.
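A rough sketch of the wrapper's shape, not the exact patch code:

    #include <rdma/rdma_cma.h>

    int
    spdk_rdma_qp_disconnect(struct spdk_rdma_qp *spdk_rdma_qp)
    {
            /* mlx5 DV only: move the underlying QP to the error state
             * (IBV_QPS_ERR) here; the plain verbs provider skips this step. */
            return rdma_disconnect(spdk_rdma_qp->cm_id);
    }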
Change-Id: Ia8623c6989e7679591f2da56bafa7f4262eeebf9
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1975
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>
This patch adds use of the RDMA provider API to the NVMe-oF
initiator. Makefiles have been updated with the new RDMA lib
dependency.
Change-Id: Ieaefeb12ee9681d3db2b618c5cf0c54dc52230af
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1657
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This fixes compiler warnings such as:
/home/wanghailiang/spdk20200428/lib/nvme/nvme_rdma.c: In function ‘nvme_rdma_qpair_submit_request’:
/home/wanghailiang/spdk20200428/lib/nvme/nvme_rdma.c:1512:29: warning: ‘lkey’ may be used uninitialized in this function [-Wmaybe-uninitialized]
rdma_req->send_sgl[1].lkey = lkey;
^
/home/wanghailiang/spdk20200428/lib/nvme/nvme_rdma.c:1480:11: note: ‘lkey’ was declared here
uint32_t lkey;
^
Change-Id: I67b25cb62c7a0d5b298ebfe7d2673b73261040ef
Signed-off-by: WANGHAILIANG <hailiangx.e.wang@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2197
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
While in practice the qpair->ctrlr variable will not change within
the disconnect function, when the code is built without debug enabled,
gcc thinks that rctrlr may be uninitialized.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I355cd62f3a2baaba65d806e3746f615a0dc37f58
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2056
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
This makes more sense within the context of the nvme driver and
helps us avoid the awkward situation of getting a failed_qp callback
on a qpair that simply hasn't been connected.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: Ibac83c87c514ddcf7bd360af10fab462ae011112
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1734
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
By aborting all requests from every qpair when it is disconnected,
we can completely avoid having to abort requests when we enable the
qpair since nothing will be left enabled.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: Iba3bd866405dd182b72285def0843c9809f6500e
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1788
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
The state should be changed and checked by the transport
layer. All transports should follow the same list of steps
when disconnecting/reconnecting.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: If2647624345f2c70f78a20bba4e2206d2762f120
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1853
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
The qpair states should be maintained at the generic level.
Always going through the transport disconnect function is
one step in that direction.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I019b2b4a14fe192eff5293f918d633dde2c5400a
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1851
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This variable really indicates when a qpair is
no longer connected. So NVME_QPAIR_DISCONNECTED is
actually much more accurate.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: Ia480d94f795bb0d8f5b4eff9f2857d6fe8ea1b34
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1850
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
The rdma_disconnect call triggers an RDMA_CM_EVENT_DISCONNECTED
message on the target side. The hope is that the target side will
reply with the same message in a reasonable amount of time. If the
target doesn't have that mechanism implemented, print an error message
and continue with the process.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I164a3538714fa3adfc306ea0c88220ea710e7c39
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1879
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This is done to make sure that the scenario described in GitHub
issue #1292 won't happen.
Change-Id: Ie2ad001da701e25ef984ae57da850fb84d51b734
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1771
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: <dongx.yi@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
In some situations we may get a completion of RDMA_RECV before the
completion of RDMA_SEND, and this can lead to the bug described in
issue #1292.
To avoid such situations we must complete the nvme_request only when
we have received both the RDMA_RECV and RDMA_SEND completions.
Add a new field to spdk_nvme_rdma_req to store the response idx - it
is used to complete the nvme request when RDMA_RECV completed before
RDMA_SEND.
Repost RDMA_RECV when both RDMA_SEND and RDMA_RECV are completed.
Side changes: change the type of spdk_nvme_rdma_req::id to uint16_t
and repack struct nvme_rdma_qpair.
Fixes #1292
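An illustrative sketch with assumed flag and field names; the request
is completed only once both halves have finished:

    rdma_req->completion_flags |= NVME_RDMA_SEND_COMPLETED;
    if (rdma_req->completion_flags & NVME_RDMA_RECV_COMPLETED) {
            /* both send and recv are done: complete the nvme_request and
             * repost the recv that carried the response */
            nvme_rdma_req_complete(rdma_req->req,
                                   &rqpair->rsps[rdma_req->rsp_idx].cpl);
            nvme_rdma_qpair_submit_recv(rqpair, rdma_req->rsp_idx); /* assumed helper */
            nvme_rdma_req_put(rqpair, rdma_req);
    }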
Change-Id: Ie51fbbba425acf37c306c5af031479bc9de08955
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1770
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: <dongx.yi@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
This API will allow us to simplify the polling mechanism for qpairs on a single
thread. It also will pave the way for doing transport specific aggregation of
qpair polling to increase performance.
The generic implementation is included. The transport specific calls
have yet to be implemented.
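A hedged usage sketch; the exact spdk_nvme_poll_group_* prototypes may
differ between SPDK versions:

    struct spdk_nvme_poll_group *group = spdk_nvme_poll_group_create(NULL);

    spdk_nvme_poll_group_add(group, qpair);
    spdk_nvme_poll_group_process_completions(group, 0 /* per-qpair limit */,
                                             disconnected_qpair_cb);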
Change-Id: If07b4170b2be61e4690847c993ec3bde9560b0f0
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/579
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Add a function, nvme_rdma_get_key, to get either the lkey or the
rkey, and use it in the request building functions.
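A hedged sketch of the helper's idea; the real helper resolves keys
through SPDK's memory map rather than a raw ibv_mr:

    #include <infiniband/verbs.h>

    enum nvme_rdma_key_type {
            NVME_RDMA_MR_LKEY,
            NVME_RDMA_MR_RKEY,
    };

    static inline uint32_t
    nvme_rdma_get_key(struct ibv_mr *mr, enum nvme_rdma_key_type key_type)
    {
            return key_type == NVME_RDMA_MR_RKEY ? mr->rkey : mr->lkey;
    }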
Change-Id: Ic9e3429e07a10b2dddc133b553e437359532401d
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1462
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Cache payload type and in-capsule data transfer support
Change-Id: Id40a6e86d1f29235ca3e0189d7fbcf19baa30ffe
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1461
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Here controller destruction is consolidated into one function, and
we can remove the duplicated code using goto.
This saves several lines of code.
Signed-off-by: yidong0635 <dongx.yi@intel.com>
Change-Id: Ibf3cb9fe2ea4bfc65d42603a7b13aaf575854580
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1638
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
If nvme_rdma_qpair_submit_sends() returns -ENOMEM,
nvme_rdma_qpair_process_completions() returns immediately. In this
case, nvme_rdma_qpair_process_completions() does not poll the CQ.
However, nvme_rdma_qpair_process_completions() can poll the CQ even
when there is no free slot in the SQ.
Hence move nvme_rdma_qpair_submit_sends() and
nvme_rdma_qpair_submit_recvs() after the loop that polls the CQ.
nvme_rdma_qpair_submit_sends() and nvme_rdma_qpair_submit_recvs()
output an error log themselves, so checking their return codes is
not necessary and is removed in this patch.
This fixes part of GitHub issue #1271.
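An illustrative outline of the structure after the change;
nvme_rdma_poll_cq is an assumed name for the CQ polling loop:

    reaped = nvme_rdma_poll_cq(rqpair, max_completions);
    nvme_rdma_qpair_submit_sends(rqpair);
    nvme_rdma_qpair_submit_recvs(rqpair);
    return reaped;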
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Icf22879c69c3f84e6b1d91dc061b6f44237eedd1
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1342
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
This gives us a more standard path in the create_io_qpair path. Eventually
this will allow us to bring the connection commands out to the generic layer
in alloc_io_qpair. Then we can split the calls to create and connect at the
generic level making it possible to add rdma qpairs to a poll group in a meaningful
way.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: Ib1b125f834c3c39a2b5050ff4a9bc4a053b95c99
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1119
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
This allows it to fit on three cachelines instead of four.
Change-Id: I2510b50ffcefb77fa570e738b2c6588749f30a00
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1143
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Align rdma and tcp to respect opts. Reduce the default number of
entries for the admin queue so it becomes a memory optimization.
The Linux driver creates the admin queue with a depth of 32 by
default; there is no good reason to enlarge that queue by default
within the SPDK NVMe driver.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I97ceea8f350c52313021a63190fb0980f604c48e
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1110
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
This gets rid of some duplicate lines of code.
Change-Id: I24d4864921f6030672f3640b33f88f37a9e8175a
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1136
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Add a transport_ack_timeout parameter to the nvme controller opts.
This parameter allows configuring the RDMA ACK timeout according to
the formula 4.096 * 2^(transport_ack_timeout) usec.
The parameter should be in the range 0..31, where 0 means use the
driver-specific default value.
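A hedged usage sketch:

    struct spdk_nvme_ctrlr_opts opts;

    spdk_nvme_ctrlr_get_default_ctrlr_opts(&opts, sizeof(opts));
    opts.transport_ack_timeout = 10;    /* 4.096 * 2^10 usec, roughly 4.2 ms */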
Change-Id: I0c8a5a636aa9d816bda5c1ba58f56a00a585b060
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/502
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Users may only set the transport type, but for the actual probe
process the trstring field is mandatory, so set the trstring based
on the transport type first. Also remove the unnecessary
spdk_nvme_trid_populate_transport() call from each transport module.
Fixes #1228.
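A hedged sketch of the generic-layer step:

    /* Populate trstring from the enum before probing, so callers who only
     * set trid.trtype still end up with a valid trstring. */
    spdk_nvme_trid_populate_transport(&trid, trid.trtype);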
Change-Id: I2378065945cf725df4b1997293a737c101969e69
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1001
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This avoids calculating the ioccsz bytes on each request and removes
access to "cold" ctrlr structures in the data path.
Add a UT to check the validity of the calculation.
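An illustrative sketch of the cached calculation with assumed field
names; IOCCSZ is reported in 16-byte units and includes the 64-byte SQE:

    rctrlr->ioccsz_bytes = ctrlr->cdata.nvmf_specific.ioccsz * 16 -
                           sizeof(struct spdk_nvme_cmd);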
Change-Id: I55ceff99eb924156155e69a20f587a4f92b83f0b
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/519
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
If the transport doesn't define one, don't call it.
Change-Id: I8b83132f9fc0accbd4faa8fa0fc17a6bd11e543e
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/783
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
With the transport plugin system, this is no longer necessary.
Change-Id: Ia73878599658db84150603223ac811cb5a34ffba
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/713
Reviewed-by: Seth Howell <seth.howell5141@gmail.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
This allows configuring the desired retry_count instead of using a
hard-coded value.
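A hedged usage sketch, assuming transport_retry_count is the opts field
introduced here:

    struct spdk_nvme_ctrlr_opts opts;

    spdk_nvme_ctrlr_get_default_ctrlr_opts(&opts, sizeof(opts));
    opts.transport_retry_count = 4;    /* instead of the hard-coded value */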
Change-Id: I25c9601997ace916dfb735469a4b443c0cd2a96b
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482499
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
In the event that we have more than one event outstanding for a
qpair at the time of destruction, we need to ack all of the events.
Luckily the synchronization is already there in the form of the
ctrlr lock.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: Ib297598f2e28d9b9bd83e904f950795a61fa883a
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479171
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Not inlining all host to controller operations breaks the target within
the context of fused commands. This issue was discovered when enabling
the compare-and-write fused command. Only the write command buffer was
being inlined which caused the write to jump the compare in the
transport specific state machine on the target side before our fused
command checks in the generic code.
Change-Id: I9e52ae6160e01ffd36d20429ffc8459491c729ef
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482001
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Now that we have a more flexible function table strategy for
transports, we can get rid of some of the wrapping we were doing
to match the macro definitions exactly.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I12c868babfa7bd27dc8ed5e86d35e179f8ec984f
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478874
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
The trtype should be stored as both an enum and string. This is intended to
help pave the way for pluggable NVMe-oF transports.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I6af658d7a17c405e191ff401b80ab704c65497e7
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478744
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
nvme_rdma_register_rsps returned -ENOMEM for all failure cases, but
not all of them are directly related to a shortage of memory. Every
point of failure now sets a relevant return code.
Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com>
Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com>
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Change-Id: Ia340f6c6fd3a68d8c34acfefc2c9224ffcdcad3f
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/477302
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
RDMA work requests generated between two calls to the NVMe RDMA QP
processing function are chained into a list and then posted together
to a queue in the next call to the processing function.
Batching improves performance in scenarios with deep queues and
heavy load on the CPU, but it may increase latency under smaller
loads. Batching is configurable with RPC methods and the
configuration file.
Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com>
Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com>
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Change-Id: I600bce78427eb7e8ed819bbbe523ad318e2da32b
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462585
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
In some real data center deployments, 100ms is not enough. Increase
the timeout to 1 second.
Change-Id: I8195a1c1e987b7eff2d8541509f79381be32ed4b
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478638
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: yidong0635 <dongx.yi@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
To address the error message:
SPDK_ERRLOG("Unable to resubmit as many requests as we completed.\n");
Reason: the "reaped" variable is used to calculate the free slots of
rdma_reqs after calling nvme_transport_qpair_process_completions, and
we should only count a slot as free once the rdma_req is really put.
If we count more free slots than we will actually have, we trigger
the error print described above.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I269bdb63646eee6444d340b904882736c4cbca36
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/477913
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: qun wan <qun.wan@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
nvme_qpair_get_state fits more closely with the semantics in other
modules.
Change-Id: I6ea8e02abe27253d9b4d779a43ac1963be56356a
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476920
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
The qpair state transport_qpair_is_failed is actually equivalent to
NVME_QPAIR_IS_CONNECTED in the qpair state machine.
There are a couple of places where we check against
transport_qp_is_failed and then immediately check to see if we are in
the connected state. If we are failed, or we are not in the connected
state, we return the same value to the calling function.
Since the checks for transport_qpair_is_failed are not necessary, they
can be removed. As a result, there is no need to keep track of it and it
can be removed from the qpair structure.
Change-Id: I4aef5d20eb267bfd6118e5d1d088df05574d9ffd
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/475802
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
If the initiator dies without disconnecting a qpair, the target can
possibly retain the state of the connection. In this case, it will
inform us that the connection is stale, and we need to try again.
Change-Id: I4d349c634aee59ce9ea4af795b07dd8649db56b3
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/473063
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This is the first step in properly reconnecting after a hard power off
event.
Change-Id: I9739bffacd66ec6d9f8f1d376bf42291c84f90f2
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/473061
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This step is going to become more involved, so it's best to keep it in a
separate function entirely.
Change-Id: Iefa9860420edf28e858c4ed8aa932985c686cfd9
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/473060
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
If we disconnect qpairs without taking the lock, we run the risk of
trying to double free qpair resources before they have been marked as
NULL.
For example, polling on one thread and calling
nvme_rdma_qpair_disconnect from one thread while doing an
nvme_ctrlr_reset on another thread. nvme_ctrlr_reset will call down to
nvme_rdma_qpair_disconnect on the same qpair and without any locking it
can result in trying to destroy the qpair resources multiple times.
Change-Id: I9eef6f2f92961ef8e3f8ece0e4a3d54f3434cff8
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/472413
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Also, adds a field to the generic qpair for future use in other
transports.
Change-Id: Ie5a66e7f5ebfec1131155fc07e3c671be814fb9b
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/471414
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
The way these two functions were separated previously represented a
pretty serious bug when doing a controller reset.
If there were any outstanding requests in the rqpair, they would get
overwritten during the call to nvme_rdma_qpair_register_reqs and the
application would never get a completion for the higher level requests.
The only thing that we need to do in this function is assign the proper
lkeys.
Change-Id: I304c70646daf9b563cd00badba7141e5e8653aad
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/471659
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This function is identical between the two transports.
Change-Id: If50b781259f224eb2c21de7da14564e6ce487650
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/471778
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This wasn't being done in the previous case which meant that I/O qpairs
were not being moved to the connecting state when connecting for the
first time. However, to prepare the way for a coherent state machine for
nvme qpairs, we need to ensure that all qpairs go through the same
states.
Change-Id: I3cfe799a003acd926b24c107ab1461a96239c1bb
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/471753
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Leaving these on the stack outstanding list can cause unnecessary
buildup. If we fail to post the request to ibv, then the upper layer
request will be freed immediately for reuse, but we will keep that
request in the outstanding queue at the RDMA layer.
Change-Id: Ib422dc9fcb50344ce7c01749f3e20ea9310fd5cb
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470255
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
We were already passing up from each transport the number of completions
done during the transport specific call. So just use that return code
and batch all of the submissions together at one time in the generic
code.
This change and subsequent moves of code from the transport layer to the
generic layer are aimed at making reset handling at the generic NVMe
layer simpler.
Change-Id: I028aea86d76352363ffffe661deec2215bc9c450
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469757
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
The tailq and the requests all belong to the generic layer, might as
well put the queueing code there for better encapsulation.
Change-Id: Id5f08f798121b50a21044cfc61856999c50ca227
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469758
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Previously we would just sit forever, preventing us from properly
attempting reconnects and timing out.
Change-Id: Id7386ab95cf75fd9ac972b44afa2719aad412f49
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469021
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This enables us to create a single file descriptor and a single event
channel to poll for completions. With that accomplished, we can easily
poll for events on the admin qpair each time we check it for
completions.
Change-Id: I8b901252510744a956bef12594d1e045715e002e
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467549
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This prevents us from failing a reset and then trying to double put the
rqpair->cq which ends up causing seg faults.
Change-Id: If3e14a3d039b4b19cc587a7482157f4b23f8ee32
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469609
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
By splitting all cm_event handling into a single function, we can create
a single point of contact for cm_events, whether we want to process them
synchronously or asynchronously.
Change-Id: I053a850358605115362f424de55e66806a769320
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467546
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This is paving the way for additional changes to enable polling for
cm_events in the initiator.
For now, just present the same blocking API on top of the now polled
file descriptor. Later, we will change this API to be more useful.
Change-Id: I174dac028720f95c30100f6dc2ed49b5bb2a7e40
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467545
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This reverts commit 6129e78d26.
When the initiator sends the discovery log page, if the log page
exceeds the size of its data buffer, it will break it up into
multiple log page commands with appropriate offsets. However,
supporting offsets in log pages is an optional feature in NVMe
and reported by the EDLP bit in the identify data.
This commit changed the discovery process to no longer send an
identify command prior to doing the discovery log page command,
so the values in the identify data are always 0. If the discovery
log page exceeds the size of the data buffer (4k), it will then
fail to send the second log page with an offset because it
believes the controller does not support the feature.
Revert this change to fix it. An identify should always be sent
as part of the discovery process. A test case is included in a
follow-up patch that demonstrates the bug.
Reported-by: Zahra Khatami <zahra.k.khatami@oracle.com>
Reported-by: Akshay Shah <akshay.shah@oracle.com>
Change-Id: Iefd512a7521e0fea90541b3eb547671cfa816ea6
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466819
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
The SPDK NVMe-oF initiator driver could not transfer I/O larger than
128KiB even if the NVMe-oF target allows larger I/O, for both the
RDMA and TCP transports. Some use cases need to transfer I/O larger
than 128KiB.
For the RDMA transport, max_mr_size reported by ibv_query_device
indicates the maximum size of a single memory region and is
independent of the actual I/O size; it is very likely to be larger
than 2 MiB, which is the granularity at which we currently register
memory regions. Some RDMA NICs even return UINT64_MAX for
max_mr_size. Hence use UINT32_MAX and let the generic layer use the
controller data to moderate this value.
On the other hand, for the TCP transport there is no limit on the
maximum I/O size, so use UINT32_MAX as well.
Besides, for the RDMA transport, max_sges should be the minimum of
the max_sge obtained by querying RDMA devices and
NVME_RDMA_MAX_SGL_DESCRIPTORS. Make this change together in this
patch.
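A hedged sketch of the reported limits, with assumed field and constant
names:

    ctrlr->max_xfer_size = UINT32_MAX;  /* generic layer clamps this with MDTS */
    ctrlr->max_sges = spdk_min(dev_attr.max_sge, NVME_RDMA_MAX_SGL_DESCRIPTORS);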
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Idc813afd3e525bf5f370c0fcd2623f9c146a5528
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459218
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Seth Howell <seth.howell5141@gmail.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Previously, comparing the transport-supported value and the target
value was done in the RDMA transport layer. However, this comparison
should be done in the generic layer, like the maximum I/O transfer
size. Hence move the comparison to the generic layer in this patch.
Besides, for MSDBD the value 0 indicates no limit, but we had
mistakenly handled it as if the maximum number of SGL entries were 0.
This patch fixes that bug as well.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I54365cf114169b10180ec2c659f9c7302672674c
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459574
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
In some cases we have the qpair already when calling
this function. So pass the qpair to avoid having
to get it from the request. This shows about a 3%
performance improvement for high IOPs single core
tests.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I22fcca560492f4e7cf5ffedd252e41a027d0dd79
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455286
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
The RDMA transport was the only one implementing this
function, and it only does a connect - not a disconnect
followed by a connect.
A later patch will add a matching disconnect function.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: Ib68eb0ff2f8e59f437d6d8831bb37dfddf83e9a4
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453929
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
This function returns a pointer to the PCIe I/O registers for a controller
or NULL if unsupported for this transport.
Used for PCIe only, other transports return NULL.
Use with caution.
Signed-off-by: James Bergsten <jamesx.bergsten@intel.com>
Change-Id: I849f9de9ad259a65b1eef9c1237345eb7195b9bf
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452927
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
This transport function is a complete nop now, so
remove it.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I5cc6ac75795a3cf5311f24e2ac293fb53d4b9f8c
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453487
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
This will allow us to move more of the reset-related
functionality to the common layer, as part of enabling
resets for fabrics controllers.
The transport qpair_enable and qpair_fail functions
acted similarly - so those are both removed now and
replaced with this new qpair_abort_reqs function.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I9486630ad5b807239b0b5bcde50e8cfd313695d3
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453486
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
We submit AERs to all controllers - both pcie and
fabrics. But currently we only manually abort the
aers when disabling the qpair for pcie. Make this
common instead by creating a new transport function
for aborting aers.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I1e926b61b8035488cdc6e8cb4336b373732f985e
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453482
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
This better explains what the function is doing,
and makes the name more general so we can use it
for the adminq as well.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I6b55761cb141a9a79cdef876be47995d8813b312
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453480
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This moves us towards not freeing and reallocating
this memory if and when we reconnect the qpair.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: Ic20d3c221442f6206d161760a8bfa7f9b8989d4c
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453479
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This will simplify some upcoming changes to reconnect
a qpair. In these cases we only need to re-register
the memory - we shouldn't have to allocate it again.
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: Id8adff313f191fbf11d7502127a2b961f2ca2f6e
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453478
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
In order to truly support multi-sgl inline requests in the RDMA
transport, we would need to increase the size of the
spdk_nvme_rdma_req object dramatically. This is because we would need
enough ibv_sge objects in it to support up to the maximum number of SGEs
supported by the target (for SPDK that is up to 16). Instead of doing
that or creating a new pool of shared ibv_sge objects to support that
case, just send split multi-sgl requests through the regular sgl path.
Change-Id: I78313bd88f3ed1cea3b772d9476a00087f49a4dd
Signed-off-by: Seth Howell <seth.howell@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452266
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>