ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Seth Howell	825cac2720	rdma.c: Create a single point of entry for qpair disconnect Since there are multiple events/conditions that can trigger a qpair disconnection, we need to funnel them to a single point of entry. If more than one of these events occurs, we can ignore all but the first since once a disconnect starts, it can't be stopped. Change-Id: I749c9087a25779fcd5e3fe6685583a610ad983d3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-12 20:39:44 +00:00
Seth Howell	b6b0a0ba59	rdma: adjust I/O unit based on device SGL support For devices that support fewer SGE elements than our default values, we need to adjust the I/O unit size so that we don't ever try to submit more SGLs than we are allowed to. Change-Id: I316d88459380f28009cc8a3d9357e9c67b08e871 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/442776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-12 18:46:57 +00:00
Seth Howell	92f5548a91	rdma: properly account num_outstanding_data_wr This value was not being decremented when we got SEND completions for write operations because we were using the recv send to indicate when we had completed all writes associated with the request. I also erroneously made the assumption that spdk_nvmf_rdma_request_parse_sgl would properly reset this value to zero for all requests. However, for requests that return SPDK_NVME_DATA_NONE rom spdk_nvmf_rdma_request_get_xfer, this funxtion is skipped and the value is never reset. This can cause a coherency issue on admin queues when we request multiple log files. When the keep_alive request is resent, it can pick up an old rdma_req which reports the wrong number of outstanding_wrs and it will permanently increment the qpairs curr_send_depth. This change decrements num_outstanding_data_wrs on writes, and also resets that value when the request is freed to ensure that this problem doesn't occur again. Change-Id: I5866af97c946a0a58c30507499b43359fb6d0f64 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443811 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-12 18:43:44 +00:00
Seth Howell	ceb32abbd8	nvmf: don't set qpair->group to NULL. The typical rdma qpair disconnect function goes through the function _nvmf_rdma_disconnect_retry. When this function was introduced, it was discovered that we could receive a qpair disconnect event for a given qpair before that qpair had been assigned to a poll group. In order to ensure that the disconnect procedure completed properly, we waited on the current thread in _nvmf_rdma_disconnect_retry for the qpair to be assigned a poll group before we finally disconnected. see rdma.c:2250. Since _nvmf_rdma_disconnect_retry was not necessarily called from the poll group's thread, we relied upon the assumption that the group variable would never be set back to NULL. See the comment on rdma.c: 2243. However, in _spdk_nvmf_qpair_destroy we were setting the group back to NULL. This operation can result in the following set of operations across multiple threads that prevent a qpair from ever being fully destroyed. 1. thread 1: receive a disconnect event - call nvmf_rdma_disconnect 2. thread 1: from nvmf_rdma_disconnect call spdk_nvmf_rdma_qpair_inc_refcnt - setting rqpair->refcnt to 1. 3. thread 2: call spdk_nvmf_rdma_poller_poll. 4. thread 2: in spdk_nvmf_rdma_poller_poll reap a completion with an error status which causes us to call spdk_nvmf_qpair_disconnect - rdma:2846 5. thread 2: spdk_nvmf_qpair_disconnect calls _spdk_nvmf_qpair_destroy which sets qpair->group = NULL 6. thread 1: from nvmf_rdma_disconnect we call _nvmf_rdma_disconnect_retry which checks if qpair->group == NULL. If that is the case, we assume that the qpair has not been assigned a group yet and send ourself a message to call _nvmf_rdma_disconnect_retry again. see rdma.c:2253 7. thread 2: from _spdk_nvmf_qpair_destroy we call spdk_nvmf_transport_qpair_fini which results in a call to spdk_nvmf_rdma_close_qpair. which sends dummy send and recvs to the qpair. 8. thread 2: we call poller_poll and get completions for both the send and recv dummy requests. This results in a call to spdk_nvmf_rdma_qpair_destroy. 9. thread 2: spdk_nvmf_rdma_qpair_destroy checks rqpair->refcnt and when it sees that it does not = 0 (see step 2 above) it returns without freeing the resources. see rdma.c:629 10. thread 1: we keep churning in _nvmf_rdma_disconnect_retry sending ourselves messages because rqpair->group is going to be null. Thread 1 never reaches line 2257 where it sends a message to call _nvmf_rdma_qpair_disconnect. _nvmf_rdma_qpair_disconnect is the function that decreases the rqpair->refcnt and allows us to make forward progress on destroying the qpair. I encountered this issue while trying to disconnect from our target using the kernel initiator with an x722 NIC. I think the timing on this bug comes out with that specific configuration because come of the calls in the disconnect path on thread 1 fail causing it to take longer giving a chance to the second thread to delete the qpair. There are really two issues at play here. We don't have a single point of entry for disconnecting RDMA qpairs, and we rely on the qpair->group variable never being set back to NULL. This patch addresses the second issue, and the next patch in the series addresses the first. Change-Id: I65395d0bbb67edfa7bad2ddc70906606c3d83781 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443304 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-02-11 19:25:51 +00:00
Ben Walker	7a4d6af182	nvmf/tcp: Stay in AWAIT_PDU_READY state until atleast 1 byte arrives This doesn't fix any bug, but it makes more sense to leave the qpair in the NVME_TCP_PDU_RECV_STATE_AWAIT_PDU_READY state until it receives at least one byte. Change-Id: Ic5f34a733a80b58f65a1334fae7e07dbded2b3d0 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/441811 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-02-08 16:35:12 +00:00
Ben Walker	63de221bf6	nvmf/tcp: Eliminate management channel in favor of poll group The management channel was used in the RDMA transport prior to the introduction of poll groups and made its way over to the TCP transport when it was written. Eliminate it in favor of just using the poll group. Change-Id: Icde631dd97a6a29190c4a4a6a10a0cb7c4f07a0e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442432 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-02-06 16:02:43 +00:00
Seth Howell	41cd5ff4fb	rdma: fix max_read_depth_definition. max_read_depth should be based on max_qp_init_read_atomic, or the maximum number of read values that the initiator will accept as outstanding. The device attributes object contains values for both the initiator (remote side) and the target (local side). All attributes with the name init in them are meant to correspond to the initiator. The qp_read_atomic value represents the number of reads and atomic operations that can have this device as the target. qp_init_read_atomic represents how many read operations the initiator has said that we can have outstanding that have the initiator's rdma device as the target. Since this number represents how many outstanding reads we will send to the initiator at once, we should use the qp_init_read_atomic value. Change-Id: Iacc044e8321080de8accd9128ac3777bbb948afc Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/442409 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-05 18:04:04 +00:00
Ben Walker	9521d11bdb	nvmf/rdma: Remove stray spdk_nvmf_rdma_wr Wasn't used. Change-Id: I5b440e18a0a6cbb9b6137b7074a0312e51f41b95 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/441592 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 19:14:54 +00:00
Ben Walker	608d80a033	nvmf/rdma: Eliminate management channel This is a holdover from before poll groups were introduced. We just need a per-thread context for a set of connections, so now that a poll group exists we can use that instead. Change-Id: I1a91abf52dac6e77ea8505741519332548595c57 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442430 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:20:13 +00:00
Ben Walker	4e614b3127	nvmf/rdma: Capitalize SEND in code comment for consistency The READ and ATOMIC in the comment above are capitalized, so make this all caps too. Change-Id: I49fae2ceb826b22953d9b26d42b95f17e2dac617 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442427 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:12:31 +00:00
Ben Walker	a4d666fd7a	nvmf: Collapse request.c into ctrlr.c request.c didn't have much code, so let's collapse it into ctrlr.c and make that the place where all software emulator of the NVMe controller, including request handling, is done. Change-Id: Id7c98010cb222a414a5aa0b78bfb299a0ffc418f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/440592 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:11:33 +00:00
Ben Walker	1b6b6cc440	nvmf: Move spdk_nvmf_ctrlr_process_io_cmd into ctrlr.c Previously, all I/O commands were implemented by simply passing them to the bdev layer. Now, some I/O commands will be emulated. Prepare for that by moving the code for this function to ctrlr.c, where the emulation will occur. Change-Id: Id34e5549e5ce216d602fb347b4506fbd324eed4e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/440591 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:11:33 +00:00
Ben Walker	5f0df58532	nvmf: Refactor ctrlr_bdev_dsm_cmd to prepare for more dsm commands This was previously very unmap specific. Make at least the top level DSM call more general purpose by eliminating the unmap_ctx. Change-Id: I9c044263e9b7e4ce7613badc36b51d00b6957d3a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/440590 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:11:33 +00:00
Ben Walker	f52f6aee0e	nvmf: Change some "virtual" names to "bdev" These are left over from the removal of virtual mode over a year ago. Change-Id: Ia797c4570bf9090346ff22ab9c7d719a78d023d0 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/440589 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-02-04 18:11:33 +00:00
Ben Walker	2b59852b65	nvmf/tcp: Rename nvme_tcp_qpair to spdk_nvmf_tcp_qpair Naming consistency. Change-Id: Ia044a41fa9939c17b52d306c2a053ffc56f03d56 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442441 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-02-04 16:24:00 +00:00
Ben Walker	55e12a6cdb	nvmf/tcp: Remove tqpair pointer from pdu This was only used by the target, and it didn't actually need it. Change-Id: Ibcef410165efdc16077da24419580ed51b087d70 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 16:24:00 +00:00
Ben Walker	c57bafed51	nvmf/tcp: Rename nvme_tcp_req to spdk_nvmf_tcp_req Naming consistency. Change-Id: I9a5ca6fb22fd80f818c4e2223a90af4257140fac Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442439 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 16:24:00 +00:00
Ben Walker	d3e3f7622b	nvmf/tcp: Remove forward declaration of nvme_tcp_req from nvme_tcp.h This type was actually two entirely different types for the initiator and the target, so just make it void. Change-Id: I15512d9d4efd790dce0fa4323b7230de66144bc6 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442438 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 16:24:00 +00:00
Ben Walker	2d07fa1532	nvmf/tcp: Rename spdk_nvme_tcp_term_req_fes_str Switch nvme to nvmf Change-Id: Ibc2540018b7f6d062d2ad6c4ffa8337b94d22614 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442436 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-02-04 16:24:00 +00:00
Ben Walker	e1dd85a5b7	nvmf: Don't increment current_recv_depth for dummy RECV When a connection goes to close and has no I/O outstanding, the current_recv_depth was being decremented beyond 0 and rolling over. If the poll group then finds a successful receive completion on the next poll (for a command that arrived prior to starting the disconnect but hadn't been processed yet), it would trip the max queue depth check added recently and start another disconnect process. If only one command arrives in this window, everything actually works out ok. However, if there are two receive completions sitting in the completion queue after the disconnect process is started, the first one does the double disconnect and the second one does another disconnect which ends up dereferencing a null pointer. Since there is always a special reserved slot for the dummy recv, don't do decrements or increments of the current_recv_depth for the dummy recv. This allows the code to still enforce the actual max_queue_depth on recvs without underflowing or overflowing the counter. Change-Id: I56c95b2424e956a3b007b25c50cbf47262245b8f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442642 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-30 19:03:46 +00:00
zkhatami88	8e2f0cdb01	nvmf: Add mechanism to override nvmf pd/mr behavior Change-Id: I8d3abfcd1934bbab5bf8dacae08e8a7f29992b93 Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Reviewed-on: https://review.gerrithub.io/c/433977 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com>	2019-01-30 19:03:35 +00:00
Seth Howell	1d0a8e1cec	rdma: split PENDING_DATA_TRANSFER into two states. Since we have different requirements for submitting RDMA read and write operations, we should track them separately so that we don't block writes when the device does not have enough resources for read operations. Change-Id: I5d6424c0e26f2f5362866d1bb21eb46700c245da Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441794 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-01-28 16:58:50 +00:00
Seth Howell	158dc9470d	rdma: Make sure we don't submit too many WRs Before, the number of WRs and the number of RDMA requests were linked by a constant multiple. This is no longer the case so we need to make sure that we don't overshoot the limit of WRs for the qpair. Change-Id: I0eac75e96c25d78d0656e4b22747f15902acdab7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439573 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-28 16:58:50 +00:00
Seth Howell	dfdd76cf21	rdma: track outstanding data work requests directly. This gives us more realistic control over the number of requests we can submit. Change-Id: Ie717912685eaa56905c32d143c7887b636c1a9e9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441606 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-25 19:12:17 +00:00
Seth Howell	7289d370f7	rdma: fix rw_depth to read_depth: rw_depth was a misinterpretation of the spec. It is based on the value of max_qp_rd_atom which only governs the number of read and atomic operations. However, we were using rw_depth to block both read and write operations which is an unnecessary restriction. write operations should only be governed by the number of Work Requests posted to the send queue. We currently guarantee that we will never overshoot the queue depth for Work requests since they are embedded in the requests and limited to a size of max_queue_depth. Change-Id: Ib945ade4ef9a63420afce5af7e4852932345a460 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441165 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-25 19:12:17 +00:00
Seth Howell	5301be93cd	rdma: set wr opcodes while parsing the SGL. Change-Id: I88fdf0b48653997f790cf5de6774d1c16621a9c1 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441605 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-25 19:12:17 +00:00
Seth Howell	1f9ac1179e	rdma: add num_outstanding_data_wr tracker to req This will be necessary later on when we need to throttle send and recv requests in software. Change-Id: Ifb25eaabd15e101fbfc2959a08a321f80857b280 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441604 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-25 19:12:17 +00:00
Changpeng Liu	faacc87811	nvmf: set default KAS value to 10 seconds Both initiator and target are using the minium 10 seconds timeout value, so set it in kas field when initializing the controller. Change-Id: Idda68bdfe27613ebaf706a0de497145d3f9ed766 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/441995 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-25 18:52:45 +00:00
Ziye Yang	81faea1b2d	nvmf/tcp: remove the timeout handling code Currently, the code does not comply with the spec, so remove such code for 19.01 and will add the code which complies with the spec for 19.04 Change-Id: Icd3b2573fbc46dc2fa7a00c6672c23ea01ffe0ee Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/441985 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-25 16:38:13 +00:00
Ziye Yang	9dd9adda38	nvmf: To correctly handle the socket read error. If there is socket read error, we should directly disconnect the socket instead of set the tqpair into RECV_ERROR state. When it is in ERROR_RECV state, it does not mean that we should close the socket immediately. Change-Id: I975906653c13eb3fa5195799c517015435176785 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/441830 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-25 07:24:16 +00:00
Xiaodong Liu	db5c3ce362	nvmf/rdma: dynamically enlarge CQ size Assigned CQ size when creating CQ may run over due to heavy workload with too many qpairs. Enlarge it dynamically can prevent IBV_EVENT_CQ_ERR caused by CQ's runover. This patch fixes issue #498: https://github.com/spdk/spdk/issues/498 Change-Id: I6c2d7194d4147d812d49d4fe787fcba5c6bbede9 Signed-off-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/440853 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-24 21:51:09 +00:00
Seth Howell	4620386417	nvmf: abort I/O from pg queued list when destroying qp This change was provided by GitHub user vikasbrcm to fix issue 562. I am uploading his change to facilitate testing of the issues and possibly get it merged before the 19.01 window closes. Change-Id: I58fb1058f68c6c02006ceed6e577be627e6dbc09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441611 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-24 20:27:21 +00:00
JinYu	b8769cdb08	nvmf: Add the Keep Alive feature The controller shall treat a Keep Alive Timeout in the same manner as connection loss. If the Keep Alive feature is in use and the timer expires, then the controller shall: 1, stop processing commands and set the Controller Fatal Status (CSTS,CFS) bit to '1'; 2, terminate the NVMe Transport connection; 3, break the host to controller association; A timer poller is added to each subsystem to monitor timeout event. Change-Id: I001afab8a6764f30c39df37fa96384180d117486 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/439330 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-01-24 04:24:11 +00:00
Ziye Yang	c43cb6a706	nvmf/tcp: fix the issues of qpair resource recycling to avoid memory leak. This patch will solve the following two cases: 1 Free the pdu resources. Add the checkout of c2h_pdu_data_cnt of the qpair. 2 Do not recyecle the req accoriding to the pdu in the send_queue, but directly recylcing the reqs in TCP_REQUEST_STATE_TRANSFERRING_CONTROLLER_TO_HOST state. Change-Id: I5856c3421019ec49d576d3dae4c62fefbb3925ca Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/440847 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-21 07:45:09 +00:00
JinYu	a3c9ab66c8	nvmf/ctrlr: free ctrlr->qpair_mask when failure to create ctrlr Fix potential bug. In _spdk_nvmf_subsystem_add_ctrlr(), befor free( ctrlr) we should free ctrlr->qpair_mask. Because we set qpair->ctrlr = NULL, when destroy qpair the qpair_mask is not released. For the same reason, req->qpair->ctlr = ctrlr is placed at the bottom of the function. Change-Id: I38e268b532ff3ce87721c02f15ac4f674856d103 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/440858 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-01-21 03:52:02 +00:00
Seth Howell	cf73fb2f1f	nvmf/rdma: add a pool of request_data structs This change is related to enabling multi-sgl element support in the NVMe-oF target. For single SGL use cases, there is a 1:1 relationship between rdma_requests and ibv_wrs used to transfer the data associated with the request. In the ingle SGL case that ibv_wr is embedded inside of the spdk_nvmf_rdma_request structure as part of an rdma_request_data structure. However, with Multi-SGL element support, we require multiple ibv_wrs per rdma_request. Insted of embedding these structures inside of the rdma_request and bloating up that object, I opted to leave the first one embedded in the object and create a pool that requests can pull from in the Multi-SGL path. By leaving the first request_data object embedded in the rdma_request structure, we avoid adding the latency of requesting a mempool object in the basic cases. Change-Id: I7282242f1e34a32eb59b55f326a6c331d455625e Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/428561 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-18 20:43:46 +00:00
Ziye Yang	3c88819bc0	nvmf/tcp: Use the common buffer cache for each polling group Purpose: To avoid the buffer contention among different polling groups if there are multiple core configurations for NVMe-oF tcp transport. Change-Id: I1c1b0126f3aad28f339ec8bf9355282e08cfa8db Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/440444 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-18 19:21:58 +00:00
Seth Howell	caa06154bd	rdma: fix the poll_group_create error paths. It was possible to leak pollers if we had multiple devices in the transport. The new err_exit path fixes this. Change-Id: Iafd5643c67fae741113f10afe761af1988cb6a9b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439419 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-18 16:57:37 +00:00
Seth Howell	e6ddb7df3f	rdma: use the new common poll group data buffer cache. This change is aimed at addressing github issue #555 Change-Id: I5112ac38c59f2f0a17d0c560e7e2f640a11f58a9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440419 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-18 16:57:37 +00:00
Seth Howell	8cb172f2a9	nvmf/transport->add per-pg cache This is implemented at a generic level. Change-Id: Ibf8167e828f8da27cc26cd04e611c3f3c084319a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440418 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-18 16:57:37 +00:00
Ziye Yang	b62a1f9ef1	nvmf/tcp: dump the req state of the tqpair This patch is used to dump the requests state if the tqpair's resource is not freed. Change-Id: Ic4780662558d73267d4f1ebabfc22780fafec4ec Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/440846 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-18 01:35:47 +00:00
Seth Howell	e28605f47a	nvmf/transport: move buffer_pool to generic struct. This is shared between all currently valid transports. Just move it up to the generic structure. This will make implementing more shared features on top of this a lot easier. Change-Id: Ia896edcb7555903ba97adf862bc8d44228df2d36 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-17 19:42:01 +00:00
Seth Howell	e816c8fda8	nvmf: add a buffer_cache to transport opts This patch series is geared at solving github issue 555. Ultimately the goal of this series is to add a per-poll-group buffer cache to prevent starvation. Change-Id: I8ddaa47487665c2f9adce2109eb71b8fa71a7927 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439415 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-16 19:06:20 +00:00
Seth Howell	b17e0ae7db	rdma: process pending reqs before destroying qp This is an attempt to clean up requests sititng in the waiting_for_buffer state before destroying it for good. Change-Id: I8ae047e4d7fd01f30419ae346e4da49355dc033d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440127 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-15 18:11:41 +00:00
Seth Howell	e0280b1100	rdma: add drain argument to process_pending This allows us to drain all of the pending requests from the qpairs before we destroy them, preventing them from being picked up on subsequent process_pending polls. Change-Id: I149deff437b4c1764fabf542cdd25dd067a8713a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440428 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-15 18:11:41 +00:00
Ziye Yang	a13a359ebe	nvmf/tcp: fix the qpair disconnect handling. Due to qpair timeout handling refactoring, we removed the qpair destroying related code. And this patch is submitted to address this issue. With this patch, we can detect sock close of the fd from the initiator, and correctly free the qpair related resource (e.g., pid) managed by nvmf layer. Otherwise, the initatior thinks the qpair related source is freed, however it is not freed in the target side. Change-Id: Ia2de07bd849fa5d3bc0e0e0d4941464dfd16d266 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/440242 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-15 17:17:20 +00:00
Ziye Yang	2b787d487e	nvmf/rdma: remove the duplicated code in spdk_nvmf_rdma_request_free The purpose of this patch is to remove the duplicated code used in spdk_nvmf_rdma_request_free Change-Id: I3f74466a7ec788000eff9c2a75c9ea2cacaf5cc2 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/439942 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-14 03:56:28 +00:00
Changpeng Liu	a9c30bcceb	nvmf: save the NSID when adding a new Namespace The nsid field can be used for per namespace basis reservation notification. Change-Id: Ia7212020ec893ea367afe79933e1629895fe41b8 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/439930 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-14 03:35:04 +00:00
Ziye Yang	3dc3f4164b	nvmf: Update the subsystem state check during connect Observed some issues related with AER in the testpool, which states that the subsystem is not ready. So change the check, which will be more accurate. We only did not allow the subsystem in inactive state or deactivitating state. For others, we can still queue the requests. Change-Id: Ic041298dfc5f7d7bfab5f5e5314ade377273df32 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/439797 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-11 06:25:02 +00:00
Ziye Yang	cb1c3fae98	nvmf/rdma: fix the coredump issue when ctrlr + c target When the host connects the target and does the io related job, if we use ctrlr + c, it will be crash. The issue is that we found the rqpair->qpair.group is NULL. Change-Id: Id36cfac2be9abc707bf75a2e1ddb3f414610b6f1 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/437232 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-01-09 21:05:32 +00:00
Seth Howell	212fd2196f	rdma: Complete rdma_req when RDMA_READ op fails This operation is not attached to a send request so we need to put the request into the completed state right away since there is no send associated with it during the draining process. Change-Id: I294f99950b00a584d8940bb4f93ac046c478d3b3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439437 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-09 20:55:55 +00:00
JinYu	76675f6f60	rdma: check ibv state after rdma update it We found ibv state value may be unreasonable, so before we use the state value we do some judgement. The unreasonable state probably means hardware issue, so the process flow become unpredicatable. Fix GitHub issue #508. Change-Id: I213f4d684b103cce7bc072aecd591e2c491e0596 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/436920 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-09 08:29:43 +00:00
Seth Howell	fa757dc96d	rdma: dump outstanding requests from rqpairs If this happens, we have something going seriously wrong and we need as much debug information as we can get. Change-Id: I305512790461443316b9f231fa2afeb69593af1b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/438097 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-09 08:28:50 +00:00
Ziye Yang	f31096782c	nvmf: Only present subsystem if it is ready We do not want to present those subsystems which are not ready. Change-Id: I7f5c171fbac4c31d839421e37e93e62569c0e87a Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/437222 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-07 06:02:26 +00:00
Ziye Yang	0b20f2e552	nvmf/rdma: Remove data buffer num dependency on SPDK_NVMF_MAX_SGL_ENTRIES The least needed data buffer number should only be larger for completing one RDMA (read/write RDMA). Change-Id: I44eb51db279fc055f687eb78b6a642dbb5cb23f3 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437808 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-25 01:35:43 +00:00
Ziye Yang	58f1624497	nvmf: add the transport shared buffer num configuration option. Previously, we allocate the buffer size according to the MaxQueueDepth info, however this is not exactly a good way for customers to configure, we should provided a shared buffer number configuration for the transport. Change-Id: Ic6ff83076a65e77ec7376688ffb3737fd899057c Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437450 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-12-20 19:55:57 +00:00
Ziye Yang	94cd652b18	nvmf/tcp: Add a poller to check the timeout of each qpair This makes the timeout check for each qpair in the group efficient. If there are many qpairs in the group, we can scale. Change-Id: I75c29a92107dc32377a2ef7edb5ac92868f1c5df Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/435277 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-12-18 08:34:30 +00:00
Ziye Yang	9d11abfd0e	nvmf: Do not set the error state of the qpair Reason: I checked the code in different transport, the qpair is already freed, so we dot need to set any state. Change-Id: I3d78c259c3f79ea4426dc9408e5c3469bc171358 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437493 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-12-18 04:00:59 +00:00
Ziye Yang	04d09f9207	nvmf/tcp: Use generic transport options structure Remove the unnessary fields in spdk_nvmf_tcp_transport Change-Id: I632608ba654b30f3511f5e1d925c6743c9100365 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437271 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-12-17 04:25:25 +00:00
Evgeniy Kochetov	d722a1742d	nvmf: Improve error handling in spdk_nvmf_transport_poll_group_create At least in case of RDMA transport, poll_group_create (spdk_nvmf_rdma_poll_group_create) can return error (NULL). Change-Id: If1576b3515e7f9ede76af08bfa6b1c8399dcda09 Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/436887 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-12-14 16:15:48 +00:00
Evgeniy Kochetov	7da9f8faba	nvmf/rdma: Fix refcnt check on RDMA QP destroy Check for QP reference counter in RDMA QP destroy function was wrong and QP resources were never released. Change-Id: I6ab0ce39452e8263f89589d138c90f749516ebb1 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/436974 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-14 16:15:12 +00:00
Ziye Yang	ea8aa1bf0a	nvmf: check the qpair->ctrlr The ctrlr may be NULL, so we need to add a check here to present segment fault. Change-Id: I6c5361cc829af065082a95df0b8cc2f8d49a6002 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/436950 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-12-13 21:52:45 +00:00
Ziye Yang	527c825c81	nvmf: Re-add spdk_nvmf_transport_poll_group_remove For TCP/IP transport, we need to remove the socket from the polling group since we do not want to keep the tgroup info in the NVMe/TCP qpair, it should be general. Change-Id: I4b064d8378f66ea5d91ac554fe628d9ccebd07f4 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/434128 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-12-13 02:41:14 +00:00
Ziye Yang	5f03a9c1f3	nvmf/tcp: remove the unnecessary check. Since we already make the recv state handling in a correct way, so we do not need this check any more. Change-Id: Id71ab2e0ef60be302f8cf6ea776259d7312663ec Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/436896 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-12-12 20:45:32 +00:00
Seth Howell	a451c8385e	NVMe-oF: Add explicit reports for MR-split buffers: This is a failsafe for finding and reporting data buffers that span multiple Memory Regions. These errors should never be triggered, but finding and reporting them will help any debugging. Change-Id: I3c61e3cc510f5a36039fc1815ff0de45fce794d5 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/436054 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-12-10 23:47:38 +00:00
Seth Howell	a52fc70d51	nvmf: Discover commands use the nvmf_req->iov struct Discover commands previously blindly used the nvmf_req->data structure. This only works if the entire command fits in a single contiguous buffer. commit `1d9be84bfd` changed the default buffer size such that this would become a problem for as few as 8 subsystems. Fixes github issue 525 This change may also help prevent data corruption as we were copying up to nvmf_req->length data into the buffer. For requests with multiple data buffers this can cause us to copy off the end of that buffer. Change-Id: I788259da988b2458f57ee2795e1c5d3ced8803dd Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/435544 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-12-10 22:59:22 +00:00
Ziye Yang	408728025e	nvmf/tcp: Fix the recv state switch if no shared buffer available The purpose of this patch is to fix the issue when there is no data buffer allocated, the previous method is wrong to set the recv pdu state. The reason is that: 1 When there is no data buffer allocated, we still need to handle the incoming pdu. It means that we should switch the pdu recv state immedidately. 2 And when there is a buffer, we resume the req handling with the allocated buffer, that time we should not switch the pdu receving state of the tqpair. Change-Id: I1cc2723acc7b0a17407c3a2e6273313a4e612916 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/436153 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-12-10 20:21:41 +00:00
Ziye Yang	4c627d0af5	nvmf/tcp: Remove the queued_r2t_tcp_req list The usage of this list is duplicated with the state_queue[TCP_REQUEST_STATE_DATA_PENDING_FOR_R2T] list of tqpair, so remove it. Change-Id: I7a67a5c8049bb9492bf97e0d60e2040a29b0a7e4 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/436274 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-10 20:21:41 +00:00
Ziye Yang	d40be3da1a	nvmf/tcp: fix the error usages of list in spdk_nvmf_tcp_cleanup_all_states Change-Id: Iebfe412c684572c63e3b1b2d8c3237b0e6081880 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/436106 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-10 20:21:41 +00:00
Ziye Yang	71cd1ea7e7	nvme/tcp: Fix the term req data len calculation. Fix the issue in both target and host sides. Change-Id: I1bf31072b2164a3035b443fe6c5418a6a7829d81 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/436099 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-12-07 18:26:03 +00:00
Ziye Yang	a158309ce5	nvmf/tcp: Remove the hd_is_read field. Previously, this field is used to optimize the code. When we receive the capsule cmd pdu, we need to allocate the related buffer, if there is read or write request. If the related buffer is not valid, then we cannot enter the next pdu handling phase. So we use this field to mark. After carefully checking the code, I think that we use the tcp_req which is assoicated with the pdu, thus it is efficient. Change-Id: Ic1634d706dd40a706269bce199bf6031ea0462c0 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/435995 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-07 18:26:03 +00:00
Changpeng Liu	187e2dfbbf	nvmf: use spdk_uuid_copy() API instead of memcpy. For NVMeoF, extened host identifer is used which is exactly the same size as uuid, while here, use uuid data structure makes sense. For NVMeoF reservation features, host identifier need to be used with each registrant, using spdk_uuid_compare becomes straightforward. Change-Id: Ib6ffaa92fab5e0ae5037682be14fcc415f9714d7 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/436302 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: Jim Harris <james.r.harris@intel.com>	2018-12-06 22:25:09 +00:00
Ziye Yang	d40f805d54	nvmf: fix the error path for shared data buffer free. Since we use aligned buffer, I think that the error handling path here is not correct, the address is wrong. Change-Id: I5bcb7f050199496423f861fd6aea65e0fe48c804 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/435992 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-12-05 05:57:09 +00:00
Ziye Yang	1b7c0f54d0	nvmf/tcp: add an assert for transport destroy. Add a check, which will be required for the further unit test. Change-Id: Ib1987fef914e6546f2bdbacd23bf9bb6005b8155 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/435197 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-12-04 01:56:39 +00:00
Jim Harris	72f8c6a1f3	log: remove "trace" from internal API Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8b1c0d4b00d5d41aae89d3b33f18d1ae957567dc Reviewed-on: https://review.gerrithub.io/435344 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-12-03 19:50:15 +00:00
Seth Howell	5aca5cd71b	rdma: don't print a notice on QP state change. This notice was scaring a lot of people because every time we disconnect a qpair it tells the user that qpair is entering an error state. That is part of the normal state flow of qpairs during disconnect, but makes it seem like something is going wrong. Change-Id: I776e71db2b24fa963113fee88b5cf02c0820f171 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/435555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-03 09:44:06 +00:00
Jim Harris	942e02aa68	nvmf: add some instrumentation in error path Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I1b5fad59c76fb4dbb6fcedf3f5a1e24af2064c4d Reviewed-on: https://review.gerrithub.io/434271 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-11-30 19:58:14 +00:00
Liu Xiaodong	0e7ca66922	lib/trace: show specific usage of trace mask Previously, if want to know which mask bit is used for specific trace group, the only way is to check source code. Now list each trace group with its trace tpoint group mask bit in usage message Change-Id: I7a85fe9c0885f1919f6ffbdc97dab81f1986fb07 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/435448 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-30 14:52:28 +00:00
Liu Xiaodong	73a3e13280	lib/nvmf: realign tab for TRACE_GROUP_NVMF_XXX Change-Id: I7be0c7c417c84421e6abdbefb734cd0c05561194 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/435405 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-30 14:52:28 +00:00
Seth Howell	0e6a32deab	nvme_rdma/nvmf: add cb_fns to check mr contiguity This is necessary to confirm that a buffer that spans a 2_MB boundary is still in a single MR. Change-Id: If0d14e514ab2197a0d2e3af4f565f56d50591210 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/435179 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-11-29 15:16:13 +00:00
Changpeng Liu	d2525134e7	nvmf: check block size is 512 aligned for each Namespace NVMf target can't support extended LBA format for now, so print a error log for those NVMe backend devices with extended LBA format. Fix the issue #497. Change-Id: Idda76ba934dd0eb45f92ae22b0b71398b3ae69dd Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/432799 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-28 22:56:50 +00:00
Maciej Szwed	6569a529d6	nvmf: destroy mutex on controller destruction Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I0eb5c7891a8614313607cd006f23e00c75d7d789 Reviewed-on: https://review.gerrithub.io/434818 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2018-11-27 11:04:53 +00:00
Maciej Szwed	be0eb272d8	tcp: Initialize mutex only if everything else succeeded Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Ib0bb6b40852ca4b49d46c2cbeb603b7a2ec4c46f Reviewed-on: https://review.gerrithub.io/434080 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-26 07:06:42 +00:00
Ziye Yang	e956be96eb	nvme: Add the NVMe over fabrics TCP/IP transport support It is the first patch to follow the NVMe over fabrics spec and implmenent the NVMe/TCP transport. It can be divided into work in the host and target sides: Host side: Add the TCP/IP transport in nvme lib (lib/nvme). Target side: Add the TCP/IP transport in nvmf lib (lib/nvmf). Change-Id: Idc4f93750df676354f6c2ea8ecdb234e3638fd44 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/425191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-19 20:36:05 +00:00
Seth Howell	1180bf8343	rdma: clean up SGE definitions and properly set values We have historically conflated SPDK_NVMF_MAX_SGL_ENTRIES and the maximum number of SGEs associated with a wr data object. For now these are the same thing, but there should be nothing tying the number of NVMe request SGL elements to the number of rdma request wr sgl elements. Also, clarify the rx_sge and tx_sge enums to reflect the actual maximum number of SGEs associated with either the send and receive queues. This change doesn't actually modify these values, but sets us up to do things like split the data in an NVMe SGE into multiple WR SGEs in case the buffer associated with the NVMe SGE is not contained in a single RDMA mr. We also need to store these values in the qpair for later usage. Change-Id: Iff3756fc72787a4b72a99b2bdf90bf486a8010fa Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/433196 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-16 15:18:41 +00:00
Seth Howell	1d9be84bfd	nvmf/rdma: change the default buffer size. Having the buffers be the same size as the maximum xfer size doesn't do us any favors. Make these buffers a ratio of the maximum transfer size and the number of supported nvmf SGLs. Also configure the number of nvmf request iovs to correspond with this new ratio. Change-Id: I3147dcd86b599c74521ebfdf3bcdbcdee8871a3a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/428747 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-15 08:17:39 +00:00
Seth Howell	962ba4e89a	nvmf: remove tgt_opts from nvmf_tgt This option is deprecated. Also, rename the rpc and configuration options for setting the opts to reflect that they now only set the max number of subsystems Change-Id: Iaabcbf33dd0a0dc489d81233fda74e9e7f3e0d2e Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/430161 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-08 23:08:26 +00:00
Evgeniy Kochetov	90b4bd6cf9	nvmf/rdma: Fix QP shutdown procedure implementation This patch implements the following QP shutdown flow: 1. Move the QP to ERR state 2. Post dummy work requests to send and receive queues 3. Poll CQ until it returns dummy work requests (with WR Flush Error status) 4. Call ibv_destroy_qp and release resources In order to differentiate dummy and normal WRs new spdk_nvmf_rdma_wr structure was introduced which contains type of WR. Since now it is expected that wr_id field in ibv_recv/send_wr and ibv_wc always points to this structure. Based on WR type wr_id can be safely casted to correct container structure. In case of unsuccessful work completions 'opcode' can not be used for this purpose because it may be invalid (see "IB Architecture Specification Volume 1", ch. 11.4.2.1 "Poll for completion"). Change-Id: Ifb791e36114c619c71ad4d831f2c7972fe7cf13d Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/430754 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-08 21:20:25 +00:00
Seth Howell	7f128c757b	nvmf: don't implicitly create the transport in tgt listen. In order to prepare for multiple transports, the nvmf tgt should never implicitly create a transport when listen is called. Change-Id: If1286e7e3f7bce422a4acd66390852736113df7a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/430160 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-11-02 18:04:06 +00:00
Seth Howell	433a1e7b67	nvmf: add functions for iterating over transports Part of a larger series aimed at exposing NVMe-oF transports though rpc and spdkcli. This is in line with the goal of initializing all NVMe-oF options on a per-transport basis. Change-Id: I4f07d58d49b925cf51df3980d2e2161c50169cee Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/430622 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-02 18:04:06 +00:00
yidong0635	bb2486a468	nvmf: change the return type of calloc failed 1.nvmf: change the return type of calloc failed to -ENOMEM and keep consistency in this file. 2.thread: revise rc condition to ( rc!= 0),to deal with all abnormal return. Change-Id: I7cccb548f30448eaa1bac1a5904c3edcad9c1208 Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/431459 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-02 17:56:40 +00:00
Ben Walker	5941ab0351	nvmf/rdma: Simplify code that casts wr_id field We were previously doing lots of checks in debug mode to verify the validity of this field. Now we understand how it works, so these checks are never going to hit and are just making the code harder to read. Change-Id: Ic82d479ae34a8c7db06db62aee1cdf6e8bec126e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430866 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-02 16:39:37 +00:00
Ben Walker	91b9b4b2a1	nvmf: Simplify qpair states When we thought we could do error recovery we differentiated between inactive and erro states. However, that's not possible so collapse them back into one. Change-Id: I57622c400378f2d4c518efbc12fb52e665a9ba4c Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430627 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-02 16:39:37 +00:00
Ben Walker	50a438d3bc	nvmf/rdma: No longer rely on wr.opcode being valid on error The specification states that opcode is not valid when the status is not success. Instead, keep track of the operation type ourselves. Change-Id: I60af4b35e761c46f5f296a61cedfca198836197f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Co-authored-by: Evgeniy Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/430865 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-02 16:39:37 +00:00
Ben Walker	8e7295036b	nvmf/rdma: Remove error recovery of RDMA qps After some lengthy discussions with the RDMA experts, the only way forward on an RDMA qp error is to disconnect it. The initiator can create a new qp if it wants to later on. Remove all of the error recovery code and disconnect the qp any time an error is encountered. Change-Id: I11e1df5aaeb8592a47ca01529cfd6a069828bd7f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430389 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-02 16:39:37 +00:00
Ben Walker	d3fa0181e3	nvmf/rdma: Move cm event processing down near where it is referenced Code movement only. No other changes. Change-Id: I04cf179ecd57154172a9369926cbeaaa37e11a52 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430505 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-31 21:56:31 +00:00
Ben Walker	039c8341e3	nvmf/rdma: Remove handling for LAST_WQE_REACHED This event only occurs when using shared receive queues, which the target does not currently support. Change-Id: If155843610cf0e961b9783d4afd64b969b4316f4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430388 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-31 21:56:31 +00:00
yidong0635	decb59575b	nvmf/ctrlr: add debug log for volatile write cache Add debug log in set feature, spdk_nvmf_ctrlr_set_features_volatile_write_cache to indicate the volatile write cache is disabled or enabled according to the conditon. Change-Id: Idc0a7fb461e2bbf1371d4a3faf5d839c7370bb65 Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/428953 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-10-23 08:10:12 +00:00
GangCao	98e119f7a9	lib/nvmf: add the nvmf qpair to the available poll group In the case that the subsystem in the related poll group has NULL IO channel assigned due to some problem like out of resource, for example, the NVMe SSD hardware itself has limited number of IO qpairs. The subsystems in the particular poll group could have zero valid channels. In this case, the creation of assoicated poll group will fail and when adding the new qpair to the specified poll group, needs to have a check and pick the available poll group. Change-Id: Iedee2a6375e48eb7bf899cfb0542c565c7ebd231 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.gerrithub.io/423646 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-16 12:54:02 +00:00
Piotr Pelplinski	acca82acf2	nvmf: set noiob no larger than mdts Signed-off-by: Piotr Pelplinski <piotr.pelplinski@intel.com> Change-Id: I875cc9d6a6bd1e9e9ac25ca9103a2070226ac236 Reviewed-on: https://review.gerrithub.io/428877 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-15 17:29:30 +00:00

1 2 3 4 5 ...

943 Commits