ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Tomasz Zawadzki	8df52a0f4a	lib/nvme_tcp: assert tcp_req->req before it is dereferenced The value of tcp_req->req was asserted after it was already dereferenced. This patch fixes that. Change-Id: I5eb01e88be09d41fb8e632c49d5a7ccf2315788f Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462508 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-24 18:09:33 +00:00
Chunyang Hui	07f432641a	opal: Fix memory leakage Change-Id: I37f1468a41d568f7313143f0270f854f73bc4000 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461560 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: yidong0635 <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-22 04:32:59 +00:00
Chunyang Hui	8522624d03	opal: Add multiuser support Admin can enable user and add user to locking range. Then the user can lock/unlock his range. Change-Id: Ifc5a8cf5c6b5febeb59c86333981f0cf5b938500 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460891 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-22 04:32:59 +00:00
Changpeng Liu	e27421b344	nvme: fix req leaks There are many req leaks when a controller failure occurs during submitting IO. It must free all of the children before freeing the parent req. If a part of the child req has been sent to the back end and a part of the child req fails, removes the failed req from the parent req and the parent req must be retained, freeing the parent req after all of the submitted reqs return. Change-Id: Ieb5423fd19c9bb0420f154b3cfc17918c2b80748 Signed-off-by: Huiming Xie <xiehuiming@huawei.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461734 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-22 04:15:34 +00:00
Changpeng Liu	c4f7c1bc2a	nvme: put child I/O helper functions in nvme_internal.h Existing children split functions defined in nvme_ns_cmd.c can also be used in nvme_qpair.c to free children requests with error paths. Change-Id: I640b32884424709da67ee89ff780c2de45acc54c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461372 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-22 04:15:13 +00:00
James Bergsten	5acf617c6e	nvme: add functions to pretty-print commands and completions This change attempts to address the Trello request to decode I/O errors in NVMe hello_world example. See https://trello.com/c/MzJJw7hM/2-decode-io-errors-in-nvme-helloworld-example As part of this change, spdk_nvme_cpl_get_status_string was declared in nvme.h, and spdk_nvme_qpair_print_command and spdk_nvme_qpair_print_completion were renamed and added to nvme.h, allowing all three to used "externally." To test the failing paths, two compile time defines were added to force a write or read error (bad LBA) respectively. As the example does a read after write, if the write fails, the example fails. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: Ib94b4a02495eb40966e3f49517a5bdf64485538a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457076 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-15 07:47:03 +00:00
Richael Zhuang	d4cbbf1751	nvme: use atomic builtins for g_signal_lock The __sync builtin based implementation generates full memory barriers on some non-x86 platforms. Replace it with C11 atomic builtins can make: ·arm and ppc from full barrier to half barrier ·x86 code same as before Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: Ib6624ef8e45af497b9eced6ecfa7710bcc88a733 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461590 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-15 06:01:37 +00:00
yidong0635	ff0a7dfc42	nvme: Handle CQ polling failures by marking the controller as failed. nvme_transport_qpair_process_completions calls nvme_rdma_qpair_process_completions There are some cases return -1 due to failure of "CQ errors". Handle CQ polling failures by marking the controller as failed. That a completion with an error will be treated as controller failed. Requests will be aborted after retry counter exceeded. Otherwise, code will keep on reporting errors without recovery. This is to fix issue #850. Change-Id: I0b324232310e107bf7fd5722aca54d402a19b14d Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460569 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 01:43:02 +00:00
Andrey Kuzmin	fa6bfa80af	Nvme: check spdk_nvme_qpair_process_completions return value. nvme_tcp_qpair_process_completions returns -1 on socket I/O error. Unless the caller checks this return value (which spdk_nvme_wait_for_completion_robust_lock currently doesn't), on connection loss or any other fatal connection error spdk_nvme_wait_for_completion will never exit the completion check loop. Change-Id: I92bb349beb071db312e6c31b84db2a7b51ec486c Signed-off-by: Andrey Kuzmin <akuzmin@jetstreamsoft.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460657 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 00:27:54 +00:00
Shuhei Matsumoto	8b539eb553	nvme: Set appropriate value to max_xfer_size and max_sge SPDK NVMe-oF initiator driver could not transfer IO whose size is more than 128KiB even if NVMe-oF target allows IO whose size is more than 128KiB both for RDMA and TCP transport. Some use cases need to transfer IO larger than 128KiB. For RDMA transport, max_mr_size by ibv_query_device of RDMA devices indicates the maximum size of a single memory region and is independent from the actual I/O size, and is very likely to be larger than 2 MiB which is the granularity we currently register memory regions. Actually some RDMA NICs return UINT64_MAX for max_mr_size by ibv_query_device. Hence use UINT32_MAX and let the generic layer use the controller data to moderate this value. On the other hand, for TCP transport, there is no limit for maximum IO size and hence use UINT32_MAX. Besides, for RDMA transport, max_sges should be the minimum of max_sge got by querying RDMA devices and NVME_RDMA_MAX_SGL_DESCRIPTORS. Hence do this change together in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idc813afd3e525bf5f370c0fcd2623f9c146a5528 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459218 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Shuhei Matsumoto	cf3c54bc03	nvme: Ensure max_sges not to exceed what controller supports in generic layer Previously comparing the transport supported value and the target value was done in RDMA transport layer. However this comparison should be done in the generic layer like the maximum IO transfer size. Hence change the comparison to do in the generic layer in this patch. Besides, for MSDBD, the value 0 indicates no limit but we had handled this as maximum number of SGS entries was 0 by mistake. This patch fixes the bug together. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I54365cf114169b10180ec2c659f9c7302672674c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459574 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 06:35:41 +00:00
Chunyang Hui	fbd2f3fd2e	opal: add support for getting locking range info Change-Id: I8e3e39673c260f823a9703e86006b5334dedc987 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457576 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 02:23:28 +00:00
Chunyang Hui	505dbf59ff	Opal: Add locking range support Change-Id: I4974d4134aed3b63e204b79c9292ce940e32d40c Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455175 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-05 02:23:28 +00:00
Chunyang Hui	755b4390f9	Opal: Add activate locking SP method Change-Id: I4189bdefdb5a6651bb73bd32e61c16e899b2ae5a Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454211 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-05 02:23:28 +00:00
Shuhei Matsumoto	3ff1ff004e	nvme/tcp: Minor cleanups for SGL operations Using naming rules consistent with other related libraries is helpful to ensure the quality as verified by this patch series. This patch changes a few parts to use iov and iovcnt for SGL operations. Besides, name of an array points to the head of the array and is constant. So copying name of array to an another pointer is not necessary and can be removed. Change-Id: I2324f28126b3088098c1c767cf6c060f22c175c3 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455629 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-04 08:58:40 +00:00
Shuhei Matsumoto	3184884f9d	nvmf/tcp: Properly handle multiple iovecs in processing H2C and C2H NVMe/TCP target had assumed the size of each iovec was io_unit_size. Using nvme_tcp_pdu_set_data_buf() instead removes the assumption and supports any alignment transparently. Hence this patch moves nvme_tcp_pdu_set_data_buf() to include/spdk_internal/nvme_tcp.h and replaces the current code to use it. Besides, this patch simplifies spdk_nvmf_tcp_calc_c2h_data_pdu_num() because sum of iov_len of iovecs is equal to the variable length now. We cannot separate code movement (lib/nvme/nvme_tcp.c to include/ spdk_internal/nvme_tcp.h) and code replacement (lib/nvmf/tcp.c) because moved functions are static and compiler give warning if they are not referenced in lib/nvmf/tcp.c. The next patch will add UT code. Change-Id: Iaece5639c6d9a41bd35ee4eb2b75220682dcecd1 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455625 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 08:58:40 +00:00
Hailiang Wang	3a65c8729b	lib/nvme: fix a warning of spdk_pci_addr->domain Compilation Warning on fedora30. In file included from nvme_ut.c:42: /home/vagrant/spdk_repo/spdk/test/common/lib/test_env.c:517:17: warning: The left operand of '>' is a garbage value if (a1->domain > a2->domain) { ~~~~~~~~~~ ^ This is related to issue #822. Change-Id: I2b61e821130b89af04db3c475e81d2e91a380a90 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459923 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-01 13:07:48 +00:00
Shuhei Matsumoto	f62d5ccbe6	nvme/tcp: Properly handle multiple iovecs in nvme_tcp_pdu_set_data_buf nvme_tcp_pdu_set_data_buf() has been used to process C2H and H2C for NVMe/TCP initiator. In this case, NVMe/TCP cuts out the part of the input data buffer and transfers the part, and repeats these cut and transfers until the whole data buffer is transferred. NVMe/TCP uses two SGLs, and use one to parse from the offset datao to datao + datal and another to append from the offset 0 to datal. However, the current nvme_tcp_pdu_set_data_buf() had used data_length as not data length of this transfer but total length of the whole transfers by mistake. Recently DIF library updated to properly handle very similar cases, and so this patch takes DIF library as a reference and corrects the implementation. The next patch will add UT code to verify the bug will be fixed. The code size is pretty large and so UT code is separated. Change-Id: Ibeed4de182b8b8740566e874e2757280dc21f9e8 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455623 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-01 08:28:20 +00:00
Shuhei Matsumoto	a7b6d2ef00	nvme/tcp: Change parameters of nvme_tcp_pdu_set_data_buf to use in target This patch is the first patch of the patch series. The purpose of this patch series is to correct the bug of nvme_tcp_pdu_set_data_buf() when the multiple iovecs array is passed, to share nvme_tcp_pdu_set_data_buf() between NVMe/TCP initiator and target, and utilize nvme_tcp_pdu_set_data_buf() not only for C2H and H2C but also in-capsule data in NVMe/TCP target. This patch is necessary to satisfy the second requirement, to share nvme_tcp_pdu_set_data_buf() between NVMe/TCP initiator and target because struct nvme_tcp_req and struct spdk_nvmf_tcp_req are different. Four variables, iov, iovcnt, data_offset, and data_len are common, and hence this patch changes the parameters of nvme_tcp_pdu_set_data_buf() to accept them. The bug is fixed in the next patch and tested in after the next patch. Change-Id: Ifabd9a2227b25f4820738656e804d05dc3f874a5 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455622 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-01 08:28:20 +00:00
Darek Stojaczyk	f9a6588f57	nvme: switch to spdk_malloc(). spdk_dma_malloc() is about to be deprecated. Change-Id: I6c308ee546c28c479ceb903bc1749bf5209dc6fe Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448172 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <uma.willpower@gmail.com>	2019-06-27 04:34:50 +00:00
JinYu	77290bfe6b	nvme: fix the endless loop of aborting trackers The completion cb of outstanding_tr may submit new requeset to the outstanding_tr list of the qpair, it's an endless loop. We only abort the remaining outstanding trackers. Fix #819 Change-Id: I342f52f4d1836f8ef620ef9e3add0b1986727282 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457755 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-21 08:34:41 +00:00
Chunyang Hui	e3d21c7778	Opal: Optimize key creation and remove dev->dev_key Change-Id: Iaf20c8ec0d208e03269406b62608d981d84cc48c Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457775 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-19 00:28:57 +00:00
James Bergsten	8785d5052d	nvme: spdk_nvme_ctrlr_alloc_io_qpair extensions Adds fields to structure spdk_nvme_io_qpair_opts. These fields allow specifying the locations of memory buffers used for the submission and/or completion queues. By default, vaddr is set to NULL meaning SPDK will allocate the memory to be used. If vaddr is NULL then paddr must be set to 0. If vaddr is non-NULL, and paddr is zero, SPDK derives the physical address for the NVMe device, in this case the memory must be registered. If a paddr value is non-zero, SPDK uses the vaddr and paddr as passed. SPDK assumes that the memory passed is both virtually and physically contiguous. If these fields are used, SPDK will NOT impose any restriction on the number of elements in the queues. The buffer sizes are in number of bytes, and are used to confirm that the buffers are large enough to contain the appropriate queue. These fields are only used by PCIe attached NVMe devices. They are presently ignored for other transports. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: Ibfab3939eefe48109335f43a1167082dd4865e7c Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454074 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-18 12:19:41 +00:00
Chunyang Hui	dd26583316	Opal: Add opal_create_key function Change-Id: Id1705636e25fe3ad90ff60a57aca7b1e4c2ef687 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453972 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-06-11 01:12:24 +00:00
Chunyang Hui	9f988238fc	Opal: Refactor and clean functions Delete opal_next, introduce opal_add_tokens. Delete spdk_opal_cmd, seperate cmds to new APIs. Change-Id: Ide56817eec7fde7b110818966ebf10e65a952fc9 Signed-off-by: Chunyang Hui <chunyang.hui@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454433 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-11 01:12:24 +00:00
Ziye Yang	679257db88	nvme/tcp: Properly deal with supporting single r2t According to the TP 8000 spec in Page 26: Maximum Number of Outstanding R2T (MAXR2T): Specifies the maximum number of outstanding R2T PDUs for a command at any point in time on the connection. This patch makes the current host driver implementation support one r2t. We cleanup the code to do the right advertising to the target in the icreq and avoid attempts to deal with multiple rt2s. Reported-by: Or Gerlitz <ogerlitz@mellanox.com> Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: If06ad2e8bde31c2fd7e1c3739f651fb64040e3a9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455750 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Or Gerlitz <gerlitz.or@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-06 00:58:58 +00:00
Ziye Yang	fe2dddbbbc	nvme/tcp: Correct nvme_tcp_qpair_disconnect behavior The current nvme_tcp_qpair_disconnect behaviour is not exactly correct, we do not re-initialize the state of some data structures of the tqpair. And this caused the coredump. Purpose: Fixes #808. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I4d2cad8fc0712dbebfc2f3e52373cbe3b9908bf7 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456755 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-06-05 16:13:55 +00:00
Ziye Yang	31607f3f9e	nvme/tcp: fix the user iov length caculation in nvme_tcp_build_sgl_request The length should be no larger than the remaining_size. For example, The remaining_size(firstly, assigned by payload_size) is 128KB, and user's sgl length is 1MB. Since we already split the I/O, so we should not use the original length(1MB), but use the remaining_size. Fix issue reported by: https://github.com/spdk/spdk/issues/808 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I0a7d0f2282c8ad0e253d8de7091b6c5b87018e9a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456760 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-06-05 01:46:52 +00:00
Ziye Yang	5391b29c79	nvme/tcp: Fix the issue of handling send pdu failure Previously, if the return value of nvme_tcp_qpair_process_send_queue is not zero, we directly return but not continue receiving the pdu. But this is wrong, we should only handle the case when the return value is negative. Reported-by: Or Gerlitz <ogerlitz@mellanox.com> Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I83453733f5a3e3350a0461b4cb0bc409fde32fea Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455899 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-05 01:44:49 +00:00
Jim Harris	6550abbac1	nvme: prefetch stailq before freeing pcie request We will need to put the recently completed nvme_request object on the qpair's STAILQ. We don't reference any real data from the nvme_request in the completion path since we've already stashed the cb_fn and cb_arg in the nvme_tracker. But we will need to reference the STAILQ_ENTRY to put it back in the qpair's STAILQ, so prefetch that cacheline. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id76122afe4150c84a61fbe38bc874f10d606b3b3 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-04 00:01:35 +00:00
Jim Harris	b3d884b700	nvme: assign qpair when req is allocated There's no need to set this every time we allocate a request. While here, fix a typo near where we needed to modify the unit test to remove the qpair assertion. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8af41a6c483415950f625d1ed2ef46088b75a622 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456270 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-04 00:01:35 +00:00
lorneli	a5dfbc4daf	nvme: zero request->submit_tick in allocation Request may be submitted several times via nvme_qpair_submit_request function, such as request in queued_req queue being re-submitted. With enabling timeout feature, nvme_qpair_submit_request compares request->submit_tick to zero to check if this is the first submission for this request. If true, record submit_tick for this reuqest. So request->submit_tick needs to be set zero in allocation. Change-Id: Ie3f420aa337802c5ad3962c3fdcd680dec1ccdcb Signed-off-by: lorneli <lorneli@163.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456328 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-03 19:15:13 +00:00
Jim Harris	da366fd09f	nvme: explicitly mark _nvme_ns_cmd_rw as inline This is a small optimization. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib593908d3aeb17aac55be06b8e3be42e28a23061 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456268 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-03 03:11:08 +00:00
Jim Harris	d09874f3a2	nvme: remove avx optimizations when copying command Using AVX512 or AVX2 ends up being a small pessimization. I think AVX works better for copies when there are multiple cachelines to copy. I see a 2-3% improvement in high IOPs benchmarks when reverting to SSE. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3d70a1e359e98cec2a9da41ccf9af2de9baa5868 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456247 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-30 23:09:16 +00:00
Jim Harris	c85164bd69	nvme: add explicit "inline" keyword to a couple of functions Profiling showed these weren't getting inlined - so add the inline keyword to make sure it happens. This helps improve performance a bit. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia86edccc9163258efdcddcce6989a71fb180caf6 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456099 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-05-30 23:09:16 +00:00
Jim Harris	6c820f84cb	nvme: add tracker prefetching in completion path At 10M IO/s, we see a lot of CPU cycles wasted getting the next tracker into cache. If we only get one completion at a time, this is unavoidable, but when there are multiple completions pending, we can prefetch the second tracker while processing the completion for the first. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9de702bee3719e4494eec6f05b09be3672f1e0ac Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-05-30 23:09:16 +00:00
James Bergsten	f2d46446ca	nvme: add spdk_nvme_ctrlr_get_registers implementation Prior merge contained all of the code EXCEPT for the user-callable function. Signed-off-by: James Bergsten <jamesx.bergsten@intel.com> Change-Id: I1cb7105ab85ffae8ed4f600261fed86c9c778893 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456282 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-30 22:38:27 +00:00
Ziye Yang	804ca3e995	nvme/tcp: change the name of max_r2t to maxr2t Purpose: Make the variable definition consistent with the same variable in the target side. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ibc4ff92b6346f0a1ad803dcb79d041289f5648b2 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455807 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-30 21:38:02 +00:00
Jim Harris	f0dd2b789e	nvme: add spdk_nvme_ctrlr_get_transport_id() Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie32a1bb144c239b923b5cbb9e608a7dfc9c05208 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456076 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-05-29 20:27:10 +00:00
JinYu	11047d5b23	nvme: add vfio driver parse event In Fedora release 28, plug in nvme device and run setup.sh, the uevent is like this: UDEV [1060.112118] add /devices/virtual/vfio/81 (vfio) ACTION=add DEVNAME=/dev/vfio/81 DEVPATH=/devices/virtual/vfio/81 MAJOR=509 MINOR=1 SEQNUM=8544 SUBSYSTEM=vfio USEC_INITIALIZED=1060111894 UDEV [1060.122089] bind /devices/pci0000:d7/0000:d7:00.0/0000:d8:00.0 (pci) ACTION=bind DEVPATH=/devices/pci0000:d7/0000:d7:00.0/0000:d8:00.0 DRIVER=vfio-pci ID_MODEL_FROM_DATABASE=PCIe Data Center SSD (DC P3700 SSD [2.5" SFF]) ID_PCI_CLASS_FROM_DATABASE=Mass storage controller ID_PCI_INTERFACE_FROM_DATABASE=NVM Express ID_PCI_SUBCLASS_FROM_DATABASE=Non-Volatile memory controller ID_VENDOR_FROM_DATABASE=Intel Corporation MODALIAS=pci:v00008086d00000953sv00008086sd00003703bc01sc08i02 PCI_CLASS=10802 PCI_ID=8086:0953 PCI_SLOT_NAME=0000:d8:00.0 PCI_SUBSYS_ID=8086:3703 SEQNUM=8545 SUBSYSTEM=pci USEC_INITIALIZED=1060121805 Have tested several kernel versions such as v3.10, v4.10, v4.15, v4.19. We didn't see an event which is like this: ACTION=add DRIVER=vfio-pci Change-Id: I7299a2fb4d634edaa6bab3412ee8f363f66aae6f Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452053 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-05-29 02:36:41 +00:00
Shuhei Matsumoto	d6ec6850e2	nvme/tcp: Rename _iov_ctx to _nvme_tcp_sgl to match DIF library This is the same intention as the patch for iSCSI in this series. This change will be helpful to extract common part into a specific helper library if necessary in future. Change-Id: I1ce36b424ff2afb85f998149f4ef0d7153290802 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455621 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-24 23:19:24 +00:00
Shuhei Matsumoto	9315f02254	nvme/tcp: Unify array size and used count in SGL operation Recently DIF library refined SGL create operation by unifying size and used count into unused count. This patch applies the good practice in DIF library to create SGL in NVMe/TCP. The next patch refines names of related function and variables to be consistent in NVMe/TCP. Change-Id: I1e73310c0e3650ede53672d76071a6c37dba82c1 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455473 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-24 23:19:24 +00:00
Jim Harris	37184dd471	nvme: add nvme_free_request() variant that takes qpair This avoids dereferencing the request to get the qpair in cases where we already know the qpair. Adding a new variant instead of just modifying nvme_free_request() since there are 72 calls to this function and I don't want to change all of them. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifd6fd964e546bcd71ff180fd71d5bf5cbab79d4f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455287 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-22 14:51:01 +00:00
Jim Harris	ef1f844395	nvme: add qpair parameter to nvme_complete_request In some cases we have the qpair already when calling this function. So pass the qpair to avoid having to get it from the request. This shows about a 3% performance improvement for high IOPs single core tests. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I22fcca560492f4e7cf5ffedd252e41a027d0dd79 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455286 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-22 14:51:01 +00:00
Jim Harris	af38d200e6	nvme: add ctrlr option for logging errors Currently the nvme driver will always log any request completed with error status. Some applications may not want this behavior. So provide an option to disable it at the controller level. When this option is enabled, any failed requests from queues associated with that controller (including the admin queue) will not log the failed request. Of course the application will still receive the failed status code and can decide to do its own logging there. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia093fcd23cf321a820fd53183ee7e2dac4f9d378 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454081 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-14 13:51:44 +00:00
Jim Harris	bb01a08915	nvme: plumb disconnect/connect in reset path This will (finally) enable resets for fabrics controllers. Move some of the work previously done in enable_admin_queue up to this new disconnect/connect logic. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6239f0c0f36192db921d33f2322b1874b9382a01 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453939 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-14 13:49:19 +00:00
Jim Harris	5309873d39	nvme: add qpair is_connecting flag This will be used on the adminq, and set while the qpair is connecting. It allows the qpair_process_completions routine to know that it should still try to process completions, even if the controller is resetting. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I377b9c934295eb5f45f03efd90c2a268defb4bd4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453938 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00
Jim Harris	36d2149a70	nvme: allow admin queue fabrics cmds while resetting For fabrics controllers, the fabrics cmds are what gets the controller out of reset. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6804874e867466669a55dff11a0a865add8bbc99 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453937 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00
Jim Harris	79fad08a7e	nvme: add transport qpair_disconnect function Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9918f8fed0e559be5d865702b647566dd1e2ed18 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453936 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00
Jim Harris	3895ca40d9	nvme: break out tcp disconnect from destroy Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iacce5c98ebfb768bdd5babc39a3ebc6d1dfbc8ea Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453935 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-05-14 08:48:11 +00:00

1 2 3 4 5 ...

817 Commits