ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Evgeniy Kochetov	87ebcb08c1	nvmf/rdma: Handle completions for destroyed QP associated with SRQ IB Architecture Specification vol.1 rel.13. in ch.10.3.1 "QUEUE PAIR AND EE CONTEXT STATES" suggests the following destroy procedure for QPs associated with SRQ: - Put the QP in the Error State; - wait for the Affiliated Asynchronous Last WQE Reached Event; - either: * drain the CQ by invoking the Poll CQ verb and either wait for CQ to be empty or the number of Poll CQ operations has exceeded CQ capacity size; or * post another WR that completes on the same CQ and wait for this WR to return as a WC; - and then invoke a Destroy QP or Reset QP. Without the drain step it is possible that LAST_WQE_REACHED event is received and QP is destroyed before the last receive WR completion is polled from the CQ. In SPDK there is no risk of resource leakage in this case. So, instead of draining we can destroy QP and then just ignore receive completions without QP and post receive WRs back to SRQ. Fixes #903 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ice6d3d5afc205c489f768e3b51c6cda8809bee9a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465747 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-12 17:04:48 +00:00
Michal Ben Haim	62615117f7	SPDK: changing TREQ value from 'not specified' to 'not required'. Signed-off-by: Michal Ben Haim <michal.benhaim@kaminario.com> Change-Id: Ia7bda5b18db24df97172d4500a499c4635d592d5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467499 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-10 17:51:26 +00:00
Ben Walker	59e34aa865	nvmf/tcp: Don't set socket recvbuf size anymore The default behavior is to set it to 2MB, so this isn't required anymore. Change-Id: I62d7605cd4d5bc41347128f32f9a1aa373a15744 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466993 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-09-10 17:48:49 +00:00
Ziye Yang	24eb7a84b0	nvme/tcp: fix the iov vector count. Since we use pdu->data_iovcnt to build the iov in nvme_tcp_build_iovs, so send out pdu has the maximal iov number equals to: 2 + pdu->data_iovcnt, so we change the comparison. This makes sure that we can handle all the data owned by one pdu. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I2b9258cc5716d706c0fa38af609726c439708768 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467207 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-09-09 02:08:31 +00:00
Shuhei Matsumoto	9796768132	nvmf: Move pending_data_buf_queue to common struct spdk_nvmf_transport_poll_group This unifies buffer management among transports further and is a preparation to make buffer allocation asynchronous. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8c588eeac4081f50fe32605feb7352f72c628d95 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466847 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	cb5c661274	nvmf/fc: Move pending_data_buf_queue from fc_conn to fc_poll_group I/O buffer cache is per transport_poll_group now. Hence moving pending_data_buf_queue from struct spdk_nvmf_fc_conn to struct spdk_nvmf_fc_poll_group is reasonable and do it in this patch. This change is based on RDMA and TCP transport. Further unification among transports will be done in subsequent patches. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic857046be8da238cb3ff9e89b83cdac5f6349bcf Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466844 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	2ed1b6c253	nvmf/fc: Use transport pointer stored in transport_poll_group The pointer to transport is set to struct nvmf_transport_poll_group in nvmf_transport_poll_group_create() after returning nvmf_fc_poll_group_create(). Hence use it and remove ftransport pointer from struct nvmf_fc_poll_group. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9f2b2ade77afa18d0e97949fc0c2403eb000cdad Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467060 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	b913e01644	nvmf/fc: Rename pointer to nvmf_fc_transport from fc_transport to ftransport RDMA transport have used rtransport and TCP transport have used ttransport, respectively. So FC transport changes to use ftransport instead of fc_transport. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7d98eb2f6efbae7e2b4784f31b9de5e1a81bc2ac Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467059 Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	b9dc11f98d	nvmf/fc: Rename transport_poll_group instance in nvmf_fc_poll_group to group Both RDMA and TCP transport have uesd group for such case. Hence FC transport changes to use group instead of tp_poll_group. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic4b401179da506bb204c3ec48650db87f91fe72a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466843 Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	01df17d007	nvmf/fc: Use pointer stored in transport_poll_group and remove it from fc_poll_group The pointer to nvmf_poll_group is set in nvmf_transport_poll_group_create() after returning nvmf_fc_poll_group_create(). Hence holding it into struct spdk_nvmf_fc_poll_group is duplicated and can be removed. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7087c5cdb94b0b0c5f51b0b63b631c08266c90d0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466842 Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	99ea1d3612	nvmf/fc: Rename nvmf_fc_poll_group pointer held in struct to fgroup RDMA transport have used rgroup and TCP transport have used tgroup for such case. Hence FC transport changes to use fgroup instead of fc_poll_group. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I91b7ad6a1c6e45caf92801b0635b18d48b3c9810 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466841 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Seth Howell	20b35d769d	nvmf: don't keep a global discovery log page. Keeping a global discovery log page was meant to be a time saving mechanism, but in the current implementation, it doesn't work properly, and can cause undesirable behavior and potential crashes. There are two main problems with keeping a global log page. 1. Admin qpairs can be assigned to any SPDK thread. This means that when multiple initiators connect to the host and request the discovery log, they can both be running through the spdk_nvmf_ctrlr_get_log_page function at the same time. In the event that the discovery generation counter is incremented while these accesses are occurring, it can cause one or both of the threads to update the log at the same time. This results in both logs trying to free the old log page (double free) and set their log as the new one (possible memory leak). 2. The second problem is that each host is supposed to get a unique discovery log based on the subsystems to which they have access. Currently the code relies on whether the discovery log page offset in the request is equal to 0 to determine if it should load a new discovery log page or use the cached one. This is inherently faulty because it relies on initiator provided value to determine what information to provide from the log page. An initiator could easily send a discovery request with an offset greater than 0 on purpose to procure most of a log page provided to another host. Overall, I think it's safest to not cache the log page at all anymore and rely on a thread local fresh log page each time. Reported-by: Curt Bruns <curt.e.bruns@intel.com> Change-Id: Ib048e26f139927d888fed7019e0deec346359582 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466839 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-03 00:30:59 +00:00
Shuhei Matsumoto	0b068f8530	nvmf/rdma: Pass nvmf_request to nvmf_rdma_fill_buffers Most variables related with I/O buffer are in struct spdk_nvmf_request now. So we can pass nvmf_request instead of nvmf_rdma_request to nvmf_rdma_request_fill_buffers and do it in this patch. Additionally, we use the cached pointer to nvmf_request in spdk_nvmf_rdma_request_fill_iovs which is the caller to nvmf_rdma_request_fill_buffers in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia7664e9688bd9fa157504b4f5075f79759d0e489 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466212 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 16:56:46 +00:00
Shuhei Matsumoto	b4778363b4	nvmf/tcp: Pass nvmf_request to nvmf_tcp_req_fill_buffers Most variables related with I/O buffer are in struct spdk_nvmf_request now. So we can pass nvmf_request instead of nvmf_tcp_req to nvmf_tcp_req_fill_buffers and do it in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I00eff578a98891e99fcb9a3aafa3d99126d6f1c1 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466089 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 16:56:46 +00:00
Shuhei Matsumoto	90a2be2006	nvmf/fc: Pass nvmf_request to nvmf_fc_request_fill_buffers Most variables related with I/O buffer are in struct spdk_nvmf_request now. So we can pass nvmf_request instead of nvmf_fc_request to nvmf_fc_request_fill_buffers and do it in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibe87e7641e5c364b20a6d877ce7928c612b0b83a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466088 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 16:56:46 +00:00
Shuhei Matsumoto	9412a8370d	nvmf/fc: Use STAILQ for pending_data_buf_queue This is a small performance optimization and an effor to unify I/O buffer management further among transports. it is ensured that the request is the first of STAILQ when nvmf_fc_request_execute() completes successfully. Hence change TAILQ_REMOVE to STAILQ_REMOVE_HEAD for the case. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: If982842bf53ba00426a854a18eaadf8a1b8d642d Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466676 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 16:56:46 +00:00
Shuhei Matsumoto	6c8b297262	nvmf/fc: Rename pending_queue to pending_data_buf_queue This is an effort to unify I/O buffer management further among transports. RDMA and TCP transport have named pending_queue pending_data_buf_queue. So FC transport follows RDMA and TCP transport. The next patch will change pending_data_buf_queue to use STAILQ instead of TAILQ. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I57c3c678a1e92ec262eb8940418529a62b6768c3 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466675 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 16:56:46 +00:00
Shuhei Matsumoto	2bc819dd52	nvmf/tcp: Use STAILQ for queued_c2h_data_tcp_req and pending_data_buf_queue This is a small performance optimization and an effort to unify I/O buffer management further among transports. It is ensured that the request is the first of STAILQ when spdk_nvmf_tcp_send_c2h_data() is called or the case TCP_REQUEST_STATE_NEED_BUFFER is executed in spdk_nvmf_tcp_req_process(). Hence change TAILQ_REMOVE to STAILQ_REMOVE_HEAD for these two cases. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I0b195874ac22a8d5ecfb283a9865d2615b7d5912 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466637 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-30 16:56:46 +00:00
Ziye Yang	5e7b8d18f3	nvmf/tcp: Remove the potential pdu hdr memory copy. In this patch, we directly point the hdr_p to the memory owned by the pdu_recv_buf to avoid memory copy. Change-Id: Iee0dd98058928f429bf7ad22103cd4826226400f Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465158 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 02:25:22 +00:00
Shuhei Matsumoto	8a80461ac6	nvmf/tcp: execute buffer allocation only if request is the first of pendings RDMA transport executes spdk_nvmf_rdma_request_parse_sgl() only if the request is the first of the pending requests in the case RDMA_REQUEST_STATE_NEED_BUFFER in the state machine spdk_nvmf_rdma_requests_process(). This made RDMA transport possible to use STAILQ for pending requests because STAILQ_REMOVE parses from head and is slow when the target is in the middle of STAILQ. On the other hand, TCP transport executes spdk_nvmf_tcp_req_parse_sgl() even if the request is in the middle of the pending request in the case TCP_REQUEST_STATE_NEED_BUFFER in the state machine spdk_nvmf_tcp_req_process() if the request has in-capsule data. Hence TCP transport have used TAILQ for pending requests. This patch removes the condition if the request has in-capsule data from the case TCP_REQUEST_STATE_NEED_BUFFER. The purpose of this patch is to unify I/O buffer management further. Performance degradation was not observed even after this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idc97fe20f7013ca66fd58587773edb81ef7cbbfc Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466636 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	0f73c253b5	nvmf/fc: Replace FC specific get/free_buffers by common APIs Use spdk_nvmf_request_get_buffers() and spdk_nvmf_request_free_buffers(), and then remove nvmf_fc_request_free_buffers() and nvmf_fc_request_get_buffers(). Set fc_req->data_from_pool to false after spdk_nvmf_request_free_buffers(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I046a642156411da3935bc2fa2c2816fc2e025147 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465877 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	9968035884	nvmf/tcp: Replace TCP specific get/free_buffers by common APIs Use spdk_nvmf_request_get_buffers() and spdk_nvmf_request_free_buffers(), and then remove spdk_nvmf_tcp_request_free_buffers() and spdk_nvmf_tcp_request_get_buffers(). Set tcp_req->data_from_pool to false after spdk_nvmf_request_free_buffers(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I286b48149530c93784a4865b7215b5a33a4dd3c3 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465876 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	85b9e716e9	nvmf/rdma: Replace RDMA specific get/free_buffers by common APIs Use spdk_nvmf_request_get_buffers() and spdk_nvmf_request_free_buffers(), and then remove spdk_nvmf_rdma_request_free_buffers() and nvmf_rdma_request_get_buffers(). Set rdma_req->data_from_pool to false after spdk_nvmf_request_free_buffers(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ie1fc4c261c3197c8299761655bf3138eebcea3bc Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465875 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	cc4d1f82cc	nvmf: Add spdk_nvmf_request_get/free_buffers() usable among transports This patch adds new APIs spdk_nvmf_request_get_buffers() and spdk_nvmf_request_free_buffers() to be used among transports. Subsequent patches will replace transport specific APIs by them. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ib153e2c5806b7276915a0aa91179fe9dbcb2a1f0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465874 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	005b053a02	nvmf: Move data_from_pool flag to common struct spdk_nvmf_request This is a prepration to unify buffer management among transports. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6b1c208207ae3679619239db4e6e9a77b33291d0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466002 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	04ae83ec93	nvmf: Move allocated buffer pointers to common struct spdk_nvmf_request This is a preparation to unify buffer management among transports. struct spdk_nvmf_request already has SPDK_NVMF_MAX_SGL_ENTRIES (16) * 2 iovecs. Hence incresing the number of buffers twice will be no problem. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idb525abbf35dc9f4b8547b785b5dfa77d106d8c9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465873 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-29 18:17:38 +00:00
Evgeniy Kochetov	01887d3c96	nvmf/rdma: Fix data WR release One of stop conditions in data WR release function was wrong. This can cause release of uncompleted data WRs. Release of WRs that are not yet completed leads to different side-effects, up to data corruption. The issue was introduced with send WR batching feature in commit `9d63933b7f`. This patch fixes stop condition and contains some refactoring to simplify WR release function. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie79f64da345e38038f16a0210bef240f63af325b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466029 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-29 18:09:14 +00:00
Ziye Yang	d50736776c	nvmf/tcp: Use a big buffer for PDU receving. Purpose: Reduce the recv/readv system call. Method: Use a big recv buffer to conduct the read. Though it will introduce addtional buffer copy, we hope that the overhead introduced by buffer copy will be smaller compared with frequent recv/readv system call overhead. And the design is to make a trade off between them. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I9286fd9cec0b512cea8e3f2c335c5bf862b98573 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464842 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-28 15:38:02 +00:00
Ziye Yang	ea5ad0b286	nvme/tcp: Change hdr in nvme_tcp_pdu to pointer Purpose: Prepare the further optimnization in the target side whening receving pdu headers, we expect to use zero copy. Change-Id: Iae7f9106844736d7160d39d0af1f5941084422ec Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465380 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-28 15:38:02 +00:00
Shuhei Matsumoto	eab7360bcb	nvmf/tcp: Factor out getting and filling buffers from nvmf_tcp_req_fill_iovs This follows the practice of RDMA transport and is a preparation to unify buffer allocation among transports. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ib85625f2a0eca01ef4028685dd838d6c41faad7b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465872 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	72c10f7094	nvmf/tcp: Use spdk_mempool_get_bulk in nvmf_tcp_req_fill_iovs This follows the practice of RDMA transport and a preparation to unify buffer management among transports. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4e9b81b2bec813935064a6d49109b6a0365cb950 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465871 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	8aac212005	nvmf/tcp: Pass number of alloc buffers s as param to nvmf_tcp_request_free_buffers This is a preparation to the next patch to use spdk_mempool_get_bulk. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I28a5ad941004f139c9032d85c2ef92680081f1ce Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465870 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	5437470cdc	nvmf/fc: Factor out getting and filling buffers from nvmf_fc_request_alloc_buffers This follows the practice of RDMA transport and is a preparation to unify buffer allocation among transports. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3cd4377ae31e47bbde697837be2d9bc1b1b582f1 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465869 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	71ae39594f	nvmf/fc: Use buffer cache in nvmf_fc_request_alloc/free_buffers FC transport can use buffer cache as same as RDMA and TCP transport now. The next patch will factor out getting buffers and filling buffers to iovs in nvmf_fc_request_alloc_buffers(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I0d7b4552f6ba053ba8fb5b3ca8fe7657b86f9984 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465868 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	fbb0f0faf9	nvmf/fc: Pass transport and num_buffers as params to nvmf_fc_request_free_buffers This is a preparation to the next patch to use buffer cache in FC transport. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I116b064ea0b0a437f9a3293a6f3d46a0e5fc8ecf Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465867 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	e3b8c31d03	nvmf/fc: Use spdk_mempool_get_bulk in nvmf_fc_request_alloc_buffers This follows the practice of RDMA transport and a preparation to unify buffer management among transport types. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic7dc8e6b826baf7f471d192630e8a048a35056ac Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465866 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	c5b15dde18	nvmf/fc: Use common buffer pool for FC transport NVMe-oF FC transport have used its own buffer pool and have not used common buffer pool yet. It looks that there is no particular reason to prevent FC transport from using the common buffer pool. This patch removes FC transport specific buffer pool and changes FC transport to use common buffer pool instead. Add transport as a parameter to nvmf_fc_request_free_buffers() because similar APIs of RDMA and TCP transport do that. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iae3a117466c21eaddbe78a8e8023d80ef37bb3e9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465865 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	cdf80adccc	nvmf/fc: Check if buffer came from pool prior to nvmf_fc_request_free_buffers() NVMe-oF FC transport have used its own buffer pool and have not used common buffer pool yet. It looks that there is no particular reason to prevent FC transport from using the common buffer pool. This patch extract checking fc_req->data_from_pool from nvmf_fc_request_free_buffers() to make the transition easier. fc_req->req.iovcnt and fc_req->req.data should be cleared regardless of fc_req->data_from_pool. Hence extract them into callees. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I36420f0e573d1ec3f9f3a75f6b2ced82ade89dd3 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465864 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-26 19:04:24 +00:00
Shuhei Matsumoto	cbd3500019	nvmf/fc: Use common setting to FC specific data buffer pool NVMe-oF FC transport have used its own buffer pool and have not used common buffer pool yet. It looks that there is no particular reason to prevent FC transport from using the common buffer pool. This patch adjust the setting of the FC transport specific buffer pool to the common buffer pool to make the transition easier. Large alignment requirement consumes more memory but is acceptable. Cache size calculation looks dated. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id3224b65f39187c4d8e99c00cf54b1cfdd902250 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465863 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-26 19:04:24 +00:00
Seth Howell	f8433aad23	rpc/nvmf: add tgt_name options to relevant RPCs. All of the RPCs in lib/nvmf/nvmf_rpc.c rely on knowing which nvmf_tgt they should work with. They have historically relied on the assumption that there will only be a single target in a given application. This is true for the example application in the spdk repo, but it is not necessarily true generally, By adding an option tgt_name parameter to the RPCs we enable them for multi-target NVMe-oF applications. We also further reduce the coupling between the library and the example application. Change-Id: I03b6695da05a42af3024842ed87d2ce2c296f33f Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465442 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-08-21 17:20:28 +00:00
Seth Howell	a54a6a266c	lib/nvmf: extract RPCs from the subsystem directory There are one or two RPCs that deal with application specific configuration. We can leave these there for now. Change-Id: I9c40aa3403d32d3e2214c8c904fb1c414ad99967 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465365 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-21 17:20:28 +00:00
Seth Howell	79d876716c	nvmf: add spdk_nvmf_get_tgt function This function will allow applications (and RPCs) to obtain an spdk_nvmf_tgt pointer by name. Change-Id: I82792e06a819e06d9fddb5429830008653d92cd1 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465349 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-21 17:20:28 +00:00
Seth Howell	8d6d26bd29	nvmf: add a name entry to the spdk_nvmf_tgt struct This will provide a unique identifier which can be used to provide get and set methods within the RPCs. Change-Id: Idd144e99e49b8d26530f60530d2e908b18fa251b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465330 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-20 19:15:04 +00:00
Seth Howell	7d6d95db3c	nvmf: change the function signature of spdk_nvmf_tgt_create This is necessary to allow the spdk_nvmf_tgt structure to evolve over time without having to further change the target API. Change-Id: Ib0f0f9b1f190913feff0229c96df4e84b1bf35f7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465363 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-08-20 19:15:04 +00:00
Seth Howell	0ac5050624	lib/nvmf: add a global list of targets As part of moving the nvmf rpc code to the library, we will need to make it more inclusive of use cases outside of the example spdk nvmf_tgt application. That application only supports a single nvmf target structure. As such, many of the RPCs have this assumption built into them. In order to enable the multi-target use case, we need to configure a way to translate between user supplied RPCs and actual target objects in the library. Change-Id: I5d3745afe9c2ca1c33f6e1a1bcc2b8bb3196ccd6 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465329 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-08-20 19:15:04 +00:00
Ben Walker	1e82ec0640	nvmf: Delay sending AER until subsystem resumes Change-Id: Id5152a793c6b530cb1419c559ac3ed71ee042037 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464614 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-08-14 21:24:27 +00:00
Ziye Yang	1917d3b413	nvmf: move the assigment of pdu outside the switch Purpose: To reduce the duplicated code. And one minor fix: add an empty line between two functions Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I12c9ddba6526c094cd2bd945e14f9d8bf5209adf Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464504 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-09 07:37:12 +00:00
Jacek Kalwas	8a14af685b	nvmf/rdma: fix missing destory qp From rdma_cma.h "Users must destroy any QP associated with an rdma_cm_id before destroying the ID." Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I5ed0c25221c5401cdde8b31a4e217b9d79e7caaa Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464290 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-08 20:07:11 +00:00
Ziye Yang	73d9cef8c5	nvmf/tcp: add nvme_tcp_pdu_cal_psh function. Purpose: 1 Do not caculated the psh_len every time. 2 Small fix, for ch_valid_bypes, and psh_valid_bytes, we do not need to use uin32_t. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I9b643da4b0ebabdfe50f30e9e0a738fe95beb159 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464253 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-07 01:46:54 +00:00
Seth Howell	59a3afa0ff	nvmf/rdma: pass iov_base to spdk_mem_map_translate We should be checking directly against the base of the iov when doing memory map translations. The current behavior is to check against the starting address of the buffer which is a close address, but not exactly the same. Change-Id: I7f65224a6836a814708438f2866d84ae22882216 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463893 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: <jiandong.zheng@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-02 07:15:36 +00:00
Jacek Kalwas	db0c7f6a4f	nvmf/rdma: fix missing return statement In case of failure during resource allocation within poll_group_create there is a lack of return statement which could lead to NULL ptr dereference. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I84abe64a1843117d76b97e62656bdfc4fe2b35d8 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463195 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-02 03:55:32 +00:00
Shuhei Matsumoto	cf95d4a24f	sock: Fix return value of spdk_sock_group_poll to return number of events spdk_sock_group_poll() and spdk_sock_group_poll_count() had returned 0 on success. The implementation didn't match the specification described in the header file, and couldn't be used to collect stats correctly because 0 means idle. This patch fixes the return value of spdk_sock_group_poll() and spdk_sock_group_poll_count() to return number of events and the callers not to overwrite the return value by 0. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7e2a17187fc74ea44d3acf2f35d63f5e5a254eda Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463710 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-02 00:19:43 +00:00
Evgeniy Kochetov	c9c80e6932	nvmf/rpc: Fix io channel reference counting in NVMf statistics NVMf statistics functions use spdk_get_io_channel function to get a poll group. It increases reference counter in io channel and causes problems on application exit. spdk_put_io_channel calls were added to release the channel. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I832d1eae346c3bc3858ed0ed063ff7a7a897a2f5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463389 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-29 18:05:09 +00:00
Anil Veerabhadrappa	ed56a3d482	NVMe-oF Target: Add FC transport. - New files and updates to existing SPDK files to add the NVMf-FC transport. - Depends on an existing low level driver library. This driver is not part of SPDK repository. - Makefile updates to build FC transport (using CONFIG_FC) - Update configure script for FC build. - New FC unit test for FC-LS commands. - Update unittest.sh to run FC unit test (when built). Signed-off-by: John Barnard <john.barnard@broadcom.com> Signed-off-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Change-Id: If31d4d25feab76c2dbe90a7faf71d465c2c3a354 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450077 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 22:17:17 +00:00
Ziye Yang	6d4f580e79	nvmf/tcp: Remove spdk_nvmf_tcp_qpair_process_pending Phenomenon: Test case: Using the following command to test ./test/nvmf/target/shutdown.sh --iso --transport=tcp without this patch, it will cause coredump. The error is that the NVMe/TCP request in data buffer waiting list has "FREE" state. We do not need call this function in spdk_nvmf_tcp_qpair_flush_pdus_internal, it causes the bug during shutdown test since it will call the function recursively, and it does not work for the shutdown path. There are two possible recursive calls: (1)spdk_nvmf_tcp_qpair_flush_pdus_internal -> spdk_nvmf_tcp_qpair_process_pending -> spdk_nvmf_tcp_qpair_flush_pdus_internal -> >.. (2) spdk_nvmf_tcp_qpair_flush_pdus_internal-> pdu completion (pdu->cb) ->.. -> spdk_nvmf_tcp_qpair_flush_pdus_internal. And we need to move the processing for NVMe/TCP requests which are waiting buffer in another function to handle in order to avoid the complicated possbile recursive function calls. (Previously, we found the simliar issue in spdk_nvmf_tcp_qpair_flush_pdus_internal for pdu sending handling) But we cannot remove this feature, otherwise, the initiator will hang for waiting the I/O. So we add the same functionality in spdk_nvmf_tcp_poll_group_poll function. Purpose: To fix the NVMe/TCP shutdown issue. And this patch also reables the test for shutdown and bdevio. Change-Id: Ifa193faa3f685429dcba7557df5b311bd566e297 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462658 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-26 21:16:23 +00:00
Evgeniy Kochetov	fbe8f8040c	nvmf/rdma: Add request latency statistics This patch adds measurement of time request spends from the moment it was polled till completion. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I1fcda68735f2210c5365dd06f26c10162e4ddf33 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 20:30:00 +00:00
Evgeniy Kochetov	251db8144f	nvmf/rdma: Add NVMf RDMA transport pending statistics This patch adds statistics for pending state in NVMf RDMA subsytem which may help to detect lack of resources and adjust configuration correctly. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I9560d931c0dfb469659be42e13b8302c52912420 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452300 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 20:30:00 +00:00
Evgeniy Kochetov	38ab383a8f	nvmf/rdma: Add RDMA polling statistics RDMA polling statistics: number of polls and number of completion entries returned. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: Iabcf2cb6f6a35f595b89b58cdfcd177a637dda13 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445289 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 20:30:00 +00:00
Evgeniy Kochetov	43bb4e6b1f	rpc: Add NVMf transport statistics to nvmf_get_stats RPC method This patch adds transport part to nvmf_get_stats RPC method and basic infrastructure to report NVMf transport specific statistics. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: Ie83b34f4ed932dd5f6d6e37897cf45228114bd88 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452299 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-07-26 20:30:00 +00:00
Changpeng Liu	28439890e4	nvmf: always update discovery log page if the offset is zero Global tgt->discovery_log_page may contain old hostnqn log page, so we will update the discovery log page if the offset is zero. Change-Id: Iba24409b16626d157d2782c6813fe5a0c27f1082 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463123 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <shahar.salzman@kaminario.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-07-25 18:05:13 +00:00
Changpeng Liu	3fe300609e	nvmf: check HOSTNQN access right for discovery service Initiator can use `nvme discover` command to display all the subsystem's information, because we don't check the allowed HOSTNQN for Discovery service, so here adding this feature so that only return the log pages to the allowed hosts. Fix issue #576. Change-Id: I51e6770bd67ea0b41caf9de3a8899923377e6255 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: yidong0635 <dongx.yi@intel.com>	2019-07-24 11:25:59 +00:00
Changpeng Liu	234eb48bf6	nvmf: save hostnqn to controller data structure When creating a new controller in the NVMe-oF target, hostnqn is a must parameter, so we save the hostnqn to controller data structure, and it can be used to verify the access right of Discovery service. Change-Id: I86a6f50d3209d5bbb8ac85508288173d826ea216 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462439 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: yidong0635 <dongx.yi@intel.com>	2019-07-24 11:25:59 +00:00
Alexey Marchuk	f0b7a6e7d1	rdma: fix possible double free on qpair destruction Update rqpair->last_wqe_reached in the context of thread that owns qpair's poll group to avoid possible double free This patch fixes #858 Change-Id: If5422944b7928c2cc05af528fbcc4482aeef22df Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462012 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Lorne Li <lorneli@163.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-23 22:56:57 +00:00
Alexey Marchuk	5282edfd7b	rdma: fix double free of qpair struct in case of failed initialization qpair structure is freed and an error code is returned to the caller in the case of failed qpair initialization in function spdk_nvmf_rdma_qpair_initialize (e.g. bad return value of rdma_create_qp). The return code is handled by nvmf_tgt_poll_group_add function which destroys the qpair for the second time. This patch fixes #857 Change-Id: I0773652ecccbbd634ad272106e0a93c1e591d7d2 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462011 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Lorne Li <lorneli@163.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-23 22:55:43 +00:00
lorneli	ba323d44ca	nvmf/rdma: log spdk_nvmf_rdma_destroy_defunct_qpair Func spdk_nvmf_rdma_destroy_defunct_qpair is a "last chance option" to destroy qp manually if some driver/hardware doesn't drain qp's failed wr as expected. There's a probability that ibv_poll_cq polls wr of the destoryed qp after spdk_nvmf_rdma_destroy_defunct_qpair's execution. Although in practice the risk of this situation is minimal(if not non-existent), add a log here so that we could detect this situation easily. Change-Id: Ifa9534397513bcea34c18fbb8168eef8f53599c1 Signed-off-by: lorneli <lorneli@163.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462441 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-23 19:35:16 +00:00
lorneli	b4d3066890	nvmf/rdma: defer qp destruction until nvmf layer closes qp Currently rqpair will be destroyed directly in ibv_poll_cq path if it has been drained, regardless of whether there are outstanding I/Os issued to bdev layer. So after outstanding I/Os completing, spdk_nvmf_rdma_close_qpair will be called from nvmf layer, accessing a destroyed qp. This path defers qp destruction in nvmf_rdma_destroy_drained_qpair func until nvmf layer closes qp. Fixes 851 Change-Id: I8bcce66f8053ddb105702ac603d5d73af54bdcfc Signed-off-by: lorneli <lorneli@163.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461237 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-23 19:35:16 +00:00
Alexey Marchuk	0754417fa9	rdma: Use optimal ceiling integer division This form of the celinig division allows to remove an extra condition Change-Id: I8a2de792172ec9115563e7fb914745c476f16e8d Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462198 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-22 09:22:11 +00:00
Ziye Yang	9375616ae2	nvmf/tcp: code cleanup move the staement location of TCP request setting and remove the duplicated code. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ia659756185547ff4f8aa26c5bc01f63defe6c113 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462589 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-22 02:40:35 +00:00
Ziye Yang	6ad6a1131b	nvmf/tcp: Add a feature to allow set the sock priority of the connection. This priority is used to differentiate the sock priority on the TCP connections between NVMe-oF TCP target and other TCP based applications. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I6ee294e647420b56d1d91a07c2e37bf34ce24e03 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461801 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-19 06:30:19 +00:00
Darek Stojaczyk	96ec8bff78	nvmf/rdma: switch to spdk_malloc() spdk_dma_malloc() is about to be deprecated. Change-Id: I5bcac50baca785255eb068086e67c07d120b042f Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459432 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-17 01:28:57 +00:00
Darek Stojaczyk	36ccca2c08	nvmf/tcp: switch to spdk_malloc() spdk_dma_malloc() is about to be deprecated. Change-Id: Ic42db528bbae4b3ca2e91cb9ac46def99ecb5f28 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459431 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-17 01:28:57 +00:00
Jacek Kalwas	e95e4028c1	nvmf/rdma: exclude getaddrinfo from lock No need to have it under lock. Additionally in case of failure there was a lack of rdma_destroy_id(). This is addresed within this change as well. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Idbb36d51ad4ef7ef81051463f56efc87ef00c966 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462054 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-17 01:03:36 +00:00
Jacek Kalwas	0d4a5f7e69	nvmf/rdma: free list of devices In case of failure during pd or map allocation freeing list of devices was missing. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: If62f7b072f3894fd1a7e856c19b4ea51646dd20e Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462079 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-17 00:59:34 +00:00
Jacek Kalwas	114a067738	nvmf/rdma: pd null check In case of pd allocation by nvmf hooks there is a lack of null check as oposed to pd allocation by ibv_alloc_pd. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Iead6e0332bdee3da4adb6e657af298215c4e2196 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461576 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-16 01:29:03 +00:00
Evgeniy Kochetov	9d5037275d	nvmf: Add BDEV IO pending statistics This patch adds statistics for BDEV IO pending state in NVMf subsytem which may help to detect lack of resources and configure pool size correctly. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I6c60c27efe3efed194b2d2c46a707af7c2808fe9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445290 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Evgeniy Kochetov	da999b69b8	nvmf: Add queue pair counts statistics This patch adds number of admin and IO queue pairs per poll group in NVMf statistics. It can be useful to troubleshoot load sharing issues. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I2a9c0fc99cf5d0729eb130d30540ae52b5207fc9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445288 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Evgeniy Kochetov	fca6ff8f75	rpc: Add nvmf_get_stats RPC method This patch adds nvmf_get_stats RPC method and basic infrastructure to report NVMf global and per poll group statistics in JSON format. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I13b83e28b75a02bc1dcb7b95cbce52ae10ff0f7b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452298 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-12 12:46:29 +00:00
Ben Walker	88da8a91f9	nvmf: spdk_nvmf_subsystem_remove_ns is no longer asynchronous Now that the resume path can correctly handle the case where a namespace was removed and a new one added with the same nsid, this no longer needs to be asynchronous. Change-Id: I693045e66a7d4e75255b526d8f5ca5ef8695533e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459606 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-11 11:19:53 +00:00
Shuhei Matsumoto	7ee58b90e1	nvmf/tcp: Set DIF context to PDU when processing in-capsule, C2H, or H2C data Set DIF context of the corresponding request to PDU when - processing in-capsule data of the command, - processing data of C2H PDU, or - processing data of H2C PDU. Change-Id: I3a668a55be21dbe2ee6ecf26476290670bd7b4a8 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458929 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	e3e023cfd3	nvmf/tcp: Increase in-capsule buffer size to fill DIF fields When NVMe/TCP initiator transfers in-capsule data, NVMe/TCP has to process it as in-capsule data. If DIF insert/strip is enabled, in-capsule data size will be increased by NVMe/TCP target to insert metadata. However size of in-capsule data buffer had not been increased, and buffer overflow occurred when NVMe/TCP initiator transfers in-capsule data to NVMe/TCP target with DIF insert/strip being enabled. This patch increases size of in-capsule data buffer size to store metadata. 16 byte metadata per 512 byte data block is the current maximum ratio of metadata per block. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I88b127efd7a945bde167a95df19a0b9175cb8cd0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461333 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	9d4ee5f344	nvmf/tcp: Fix wrong data offset in nvmf_tcp_pdu_payload_insert_dif We updated readv_offset before generating DIF to avoid adding the temporary variable _rc in the previous patch, but that caused write error when inserting DIF. Fix the bug in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id0788280a83cbea2554c851db77751432fc00cba Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461116 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	2c9b0af271	nvmf/tcp: Get DIF context when handling capsule command header When handling the capsule command header, call spdk_nvmf_request_get_dif_ctx by passing the NVMf request and the reference to the DIF context, and set the flag dif_insert_or_strip of the NVMf/TCP request to true. spdk_nvmf_request_get_dif_ctx returns false immediately when the corresponding NVMf controller disables DIF insert/strip. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I16f6b322f2692d5f9653d011a490e7929ec37365 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458928 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	1c7f92f075	nvmf: Hide DIF setting of the backend bdev if DIF insert/strip is enabled When the NVMf controller's flag dif_insert_or_strip is enabled, DIF is inserted for write I/O and stripped for read I/O, and the corresponding NVMe-oF initiator should not be aware of the DIF setting of the backend bdev. Hence this patch hides the DIF setting of the backend bdev when the flag dif_insert_or_strip is enabled. Change-Id: I3c14880c2e94cba7f76b1bca78afb36bfe884e26 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456731 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	4ff3665ce9	nvmf: Check DIF insert/strip setting of NVMf controller when getting DIF context The first idea was that the caller of spdk_nvmf_request_get_dif_ctx() should check if the current transport enables DIF insert/strip before calling spdk_nvmf_request_get_dif_ctx(). But NVMf controller knows if DIF/insert/strip is enabled now by the previous patch. Hence spdk_nvmf_request_get_dif_ctx() checks if the NVMf controller enables DIF insert/strip at its head. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I78253d356b694800c3a9a9608514df58e0c631a6 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461314 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Shuhei Matsumoto	91da9aaafe	nvmf: Add a flag dif_insert_or_strip to struct spdk_nvmf_ctrlr Add a flag dif_insert_or_strip to struct spdk_nvmf_ctrlr that indicates whether DIF insert/strip is done. Copy the DIF insert/strip setting of the corresponding transport options to the flag at NVMf controller creation. The purpose of this patch is to make DIF insert/strip not per-transport option but per-controller option because we may want to be able to control DIF insert/strip per controller at some point. Besides this patch will clean the implementation. Besides align indent around the change. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I57f65960b430e55f4021ed514aacd85581ff9993 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461313 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-07-11 05:30:28 +00:00
Ziye Yang	750a4213ef	nvmf: add spdk_nvmf_get_optimal_poll_group This patch is used to do the following work: 1 It is optimized for NVMe/TCP transport. If the qpair's socket has same NAPI_ID, then the qpair will be handled by the same polling group. 2. We add a new connection scheduling strategy, named as ConnectionScheduler in the configuration file. It will be used to input different scheduler according to the customers' input. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ifc9246eece0da69bdd39fd63bfdefff18be64132 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454550 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 02:30:41 +00:00
Ziye Yang	960460f0d1	nvmf: add spdk_nvmf_transport_get_optimal_poll_group Add the optimal poll group get function. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ia9e57c6924a6563d79269cf535814883e83698cd Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454549 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 02:30:41 +00:00
Ben Walker	09ef0593d4	nvmf: Leverage bdev uuid to correctly detected remove+add ns while paused Change-Id: Idbf00956394f7ee7ff7e27f2627785cd7146b01f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459605 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	85e9760161	nvmf: Capture ns_info onto stack in poll_group_update_subsystem By capturing this pointer onto the stack, we inform the compiler that we don't expect it to change. That allows the compiler to generate more efficient code. Change-Id: I0f3ff9373662198e915269c4498e4902a2cdb808 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459754 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Ben Walker	ab3abc15aa	nvmf: Capture channel variable to stack when updating poll groups This signals to the compiler and analysis programs that this won't change during iteration, so it may produce better code. Change-Id: I478c0c9445d4ddf8a69ab1b3deaf628b82a0eaea Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459753 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2019-07-10 01:59:05 +00:00
Changpeng Liu	7b74274fbf	nvmf: add parameter check when loading reservation information from a JSON file Change-Id: Id217212fd82e57a4cfb32f62f11798c72187879e Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460794 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-10 01:40:26 +00:00
Shuhei Matsumoto	aa322721cb	nvmf: Add dif_insert_or_strip to transport options This is a place holder and subsequent patches will use the option dif_insert_or_strip and provide JSON RPCs to configure it. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7e3fbb1d49c47647a9a0a1a2149152801591b283 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456452 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-10 00:43:02 +00:00
Shuhei Matsumoto	ddb680ebab	nvmf: Add helper function to get DIF context from NVMf request Add a helper function to get DIF context when the passed NVMf request is for I/O queue, NVMe read, write, or compare command, and its NSID is valid. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I796c20607c7b64a8be85da5131c5ea95ffd9f8e4 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458713 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-07-10 00:43:02 +00:00
Shuhei Matsumoto	9b04e29173	nvmf: Add helper function to get DIF context from bdev and NVMe cmd Add a helper function to get necessary DIF information and set them into the passed DIF context and return. This function will be called only when the specific requirement is satisfied and the caller will be added in the next patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic435886ca936a211f34278b813f547ffa43b9000 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458712 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-10 00:43:02 +00:00
Shuhei Matsumoto	7bfbc388d7	nvmf/tcp: Pass extended LBA based length as I/O length to NVMf controller When DIF is inserted or stripped, - in the TCP transport layer, we can use LBA based length throughout, but - in the NVMf controller layer and BDEV layer, extended LBA based length must be used, and NVMf controller gets the length from tcp_req->req.length. Hence by adding and using two variables, elba_length and orig_length to struct spdk_nvmf_tcp_req, set the extended LBA length to tcp_req->req.length before calling spdk_nvmf_request_exec(), and then restore the original LBA based length to tcp_req->req.length after calling spdk_nvmf_tcp_req_complete(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9309b8923c6386644c4fd8ef3ee83a19f5d21ce5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458926 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 03:39:25 +00:00
Shuhei Matsumoto	51b643648c	nvmf/tcp: Increase buffer to insert/strip DIF in spdk_nvmf_tcp_req_parse_sgl If tcp_req->dif_insert_or_strip, increase the length from LBA based to extended LBA based by using its own DIF context. Change-Id: Ie9f5cf757328dda795b43a7b6c70a72259865115 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458925 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 03:39:25 +00:00
Shuhei Matsumoto	536bd70eb4	nvmf/tcp: Use cached length variable in spdk_nvmf_tcp_req_parse_sgl The next patch will extend the length from LBA based to extended LBA based and use it as buffer length to insert or strip DIF. So cache sgl.unkeyed.length at the top of spdk_nvmf_tcp_req_parse_sgl and use it throughout. Besides, one unrelated change-the-line to improve the readability is included. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2a1dc9379bb5671ec80b5b478504c9879a4f0fff Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458924 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-09 03:39:25 +00:00
Shuhei Matsumoto	975239c29d	nvmf/tcp: Insert DIF to the newly read data to create extended LBA payload Generate and insert DIF to each data block when reading more than a single byte. This update is very similar with the use case of spdk_dif_generate_stream in iSCSI target. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I063919a32153ac0daf6d6eb1836c0d5995b65d33 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459092 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-09 03:39:25 +00:00
Changpeng Liu	1edc5f0040	nvmf: restore the loaded reservation information to NS Load reservation information based on ptpl configuration file, and restore the information to NS data structure. Change-Id: I5f46d49a6d1e6e49aab93ca7cd654469a3a08659 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455912 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-08 08:21:03 +00:00
Shuhei Matsumoto	8448adaefa	nvmf/tcp: Verify DIF before sending C2H data in spdk_nvmf_tcp_send_c2h_data If DIF mode is local and C2H data is extended LBA payload, DIF should be verified just before sending the payload. Add a helper function nvmf_tcp_pdu_verify_dif and call it in spdk_nvmf_tcp_send_c2h_data after completing nvme_tcp_pdu_set_data_buf. When nvmf_tcp_pdu_verify_dif returns error, treat the error as fatal transport error because the error is caused by the target itself. Handle the fatal NVMe/TCP transport error by terminating the connection as described in the NVMe specification. On the other hand, data digest error is treated as a non-fatal transport error because the error is caused outside the target. This is reasonable. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9680af2556c08f5888aeaf0a772097e4744182be Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458921 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-08 03:33:07 +00:00
Ziye Yang	57efada508	nvmf/tcp: reorg the structure of struct spdk_nvmf_tcp_req I used pahole to see whether the alignment of the structure is reasonable. After reorgnization, we can saved 16 bytes and 1 cacheline according to the information by pahole. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I1347e7c582fe2b00707e2841690b87d53cc61e33 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/460572 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-05 04:18:41 +00:00
Shuhei Matsumoto	3ff1ff004e	nvme/tcp: Minor cleanups for SGL operations Using naming rules consistent with other related libraries is helpful to ensure the quality as verified by this patch series. This patch changes a few parts to use iov and iovcnt for SGL operations. Besides, name of an array points to the head of the array and is constant. So copying name of array to an another pointer is not necessary and can be removed. Change-Id: I2324f28126b3088098c1c767cf6c060f22c175c3 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455629 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-04 08:58:40 +00:00
Shuhei Matsumoto	127cfac020	nvmf/tcp: Use nvme_tcp_pdu_set_data_buf for incapsule data Previously we had used nvme_tcp_pdu_set_data() for incapsule data. This patch changes handling incapsule data to use nvme_tcp_pdu_set_data_buf() as same as H2C and C2H. This unification is necessary to support DIF insert and strip in NVMe/TCP target later. Change-Id: I02cae8db94e51cf79a354dd64ad45f0e491ec08e Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455920 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-07-04 08:58:40 +00:00
Shuhei Matsumoto	3184884f9d	nvmf/tcp: Properly handle multiple iovecs in processing H2C and C2H NVMe/TCP target had assumed the size of each iovec was io_unit_size. Using nvme_tcp_pdu_set_data_buf() instead removes the assumption and supports any alignment transparently. Hence this patch moves nvme_tcp_pdu_set_data_buf() to include/spdk_internal/nvme_tcp.h and replaces the current code to use it. Besides, this patch simplifies spdk_nvmf_tcp_calc_c2h_data_pdu_num() because sum of iov_len of iovecs is equal to the variable length now. We cannot separate code movement (lib/nvme/nvme_tcp.c to include/ spdk_internal/nvme_tcp.h) and code replacement (lib/nvmf/tcp.c) because moved functions are static and compiler give warning if they are not referenced in lib/nvmf/tcp.c. The next patch will add UT code. Change-Id: Iaece5639c6d9a41bd35ee4eb2b75220682dcecd1 Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455625 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 08:58:40 +00:00
Ziye Yang	b09bd95ad3	sock: update spdk_sock_group_add_sock And also add spdk_sock_group_get_ctx function Change-Id: I2a2a58b0588ff7d99d3538ea0a633a3b8c7a234b Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454538 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Maciej Szwed <maciej.szwed@intel.com>	2019-07-04 08:21:05 +00:00
Shuhei Matsumoto	12d6dce2aa	nvmf: Use not malloc'ed but fixed size string for host NQN Maximum size of NQN is already defined to be SPDK_NVMF_NQN_MAX_LEN, and hence use fixed size string whose size is SPDK_NVMF_NQN_MAX_LEN + 1 for spdk_nvmf_vhost::nqn. This change will reduce the potential malloc failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2b9c7cc21200b3e88b5485ebfdcd5040bc6e3589 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459742 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-04 00:30:22 +00:00
Changpeng Liu	af6ed1e94a	nvmf: update the reservation information for ACQUIRE/RLEASE commands Change-Id: Ibfebffa4d683da08ae8f9350cce144fafe6a5538 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455910 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-02 00:06:59 +00:00
Changpeng Liu	196d4f704a	nvmf: enable ptpl feature with reservation register command Add file based reservation information definition, the data structure can be used to store all the reservation information to a json based configuration file, and enable this feature with REGISTER command. Change-Id: Ic93cfc5934a4ad96f11b96ec77bacb877edf6c10 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455909 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-02 00:06:59 +00:00
Ziye Yang	cdc0170c1b	nvmf/tcp: Add a maximal PDU loop number In our previous code, we will handle all the PDU until there is no incoming data from the network if we can continue the loop. However this is not quite fair when we handling multiple connections in a polling group. And this change is setting a maximal NVME/TCP PDU we can handle for each conneciton, it can improve the performance. After some tuing, 32 should be a good loop number. Our iSCSI target uses 16. The following shows some performance data: Configuration: 1 Command used in the initiator side: ./examples/nvme/perf/perf -r 'trtype:TCP adrfam:IPv4 traddr:192.168.4.11 trsvcid:4420' -q 128 -o 4096 -w randrw -M 50 -t 10 2 target side, export 4 malloc bdev in a same subsystem Result: Before patch: Starting thread on core 0 ======================================================== Latency(us) Device Information : IOPS MiB/s Average min max TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51554.20 201.38 2483.07 462.31 4158.45 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51533.00 201.30 2484.12 508.06 4464.07 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51630.20 201.68 2479.30 481.19 4120.83 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 51700.70 201.96 2475.85 442.61 4018.67 ======================================================== Total : 206418.10 806.32 2480.58 442.61 4464.07 After patch: Starting thread on core 0 ======================================================== Latency(us) Device Information : IOPS MiB/s Average min max TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57445.30 224.40 2228.46 450.03 4231.23 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57529.50 224.72 2225.17 676.07 4251.76 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57524.80 224.71 2225.29 627.08 4193.28 TCP (addr:192.168.4.11 subnqn:nqn.2016-06.io.spdk:cnode1) from core 0: 57476.50 224.52 2227.17 663.14 4205.12 ======================================================== Total : 229976.10 898.34 2226.52 450.03 4251.76 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I86b7af1b669169eee2225de2d28c2cc313e7d905 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459572 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-28 12:28:54 +00:00
Or Gerlitz	6629202cbd	nvmf/tcp: Use the success optimization by default By now (5.1 is released), the Linux kernel initiator supports the success optimization and further, the version that doesn't support it (5.0) was EOL-ed. As such, lets open it up @ spdk by default. Doing so provides a notable performance improvement: running perf with iodepth of 64, randread, two threads and block size of 512 bytes for 60s ("-q 64 -w randread -o 512 -c 0x5000 -t 60") over the VMA socket acceleration library and null backing store, we got 730K IOPS with the success optimization vs 550K without it. IOPS MiB/s Average min max 549274.10 268.20 232.99 93.23 3256354.96 728117.57 355.53 175.76 85.93 14632.16 To allow for interop with older kernel initiators, we added a config knob under which the success optimization can be enabled or disabled. Change-Id: Ia4c79f607f82c3563523ae3e07a67eac95b56dbb Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457644 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-06-26 06:24:03 +00:00
Changpeng Liu	cf5c4a8a2e	nvmf: add ptpl activated flag to Namespace If users set the persist through power loss configuation file, that means the Namespace has the capability to support ptpl feature, here we added a ptpl_activated flag to indicate that the users enable the feature or not. Users can use Set features or Reservation Register commands to change the value. Change-Id: Iae3fd44085c5be5bf9574e49efa567e8212dee20 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455906 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-26 01:54:10 +00:00
Hailiang Wang	73a171a07c	rdma: assert ibv_send_wr is not NULL Vhost testing crashed from Nightly testing, because a member access within null pointer of type 'struct ibv_send_wr'. Change-Id: If8f34f23864883ea73516d2d1fe3b30137c04316 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458913 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-25 13:37:15 +00:00
Evgeniy Kochetov	9e3d841d3e	nvmf: Fix connect command SQ size validation for IO queues SQSIZE parameter validation in Connect command was broken because QID field in qpair was used before intialization. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I8a0b359937d661df3b9888e6084e7d0b4a9056ea Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455667 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-18 11:39:29 +00:00
Shuhei Matsumoto	c758dc088a	nvmf: Reject bdev with separate metadata to attach to subsystem NVMe bdev module support separate metadata now but NVMf subsystem cannot process bdev with separate metadata yet. Hence reject any bdev with separate metadata to be attached explicitly by this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I793c6c5f61deb766d7bf427ff67ccc57a48974cf Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/457167 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-06-13 00:48:11 +00:00
Changpeng Liu	3ec061800f	nvmf: add a persist through power loss configuration file when constructing NS For reservation feature in NVMoF, we can't support the persist through power loss feature, now we will add the configuration file parameter with Namespace, after users set the configuration file parameter with one NS, then the PTPL feature can be enabled. Change-Id: Id72699093f7e68318b9529f7bacc5c9804f7f86b Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455905 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-12 00:30:03 +00:00
Alexey Marchuk	53777de855	rdma: Unset IBV_SEND_SIGNALED flag for RDMA_WRITE operations Unsetting this flag will decrease the number of WRs retrieved during CQ polling and will decrease the oeverall processing time. Since RDMA_WRITE operations are always paired with RDMA_SEND (response), it is possible to track the number of outstanding WRs relying on the completed response WR. Completed WRs of type RDMA_WR_TYPE_DATA are now always RDMA_READ operations. The patch shows %2 better peformance for read operations on x86 machine. The performance was measured using perf with the following parameters: -q 16 -o 4096 -w read -t 300 -c 2 with nvme null device, each measurement was done 4 times avg IOPS (with patch): 865861.71 avg IOPS (master): 847958.77 avg latency (with patch): 18.46 [us] avg latency (master): 18.85 [us] Change-Id: Ifd3329fbd0e45dd5f27213b36b9444308660fc8b Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456469 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-11 18:07:28 +00:00
JinYu	8fc9ac7b0e	nvmf: complete all I/Os before changing sgroup to PAUSED For the nvme device, I/Os are completed asynchronously. So we need to check the outstanding I/Os before putting IO channel when we hot remove the device. We should be sure that all the I/Os have been completed when we change the sgroup->state to PAUSED, so that we can update the subsystem. Fix #615 #755 Change-Id: I0f727a7bd0734fa9be1193e1f574892ab3e68b55 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452038 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-06-11 01:51:56 +00:00
Ziye Yang	0bb626672b	nvmf/tcp: Support single r2t usage According to the TP 8000 spec in Page 26: Maximum Number of Outstanding R2T (MAXR2T): Specifies the maximum number of outstanding R2T PDUs for a command at any point in time on the connection. Note that by the spec, the target may only support single r2t (which is the minimum possible), it doesn't have to use multiple r2ts even if the initiator supports that. So remove the maxr2t and pending_r2t variable in the tcp qpair structure. In the original design, we think that maxr2t is the maximal active r2t numbers for each connection. So if the initiator sends out maxr2t=16, it means that all the commands of a qpair can use such number of R2T pdus. So we need to wait for the available R2Ts for the request when the maxr2t reaches the maximal value. But it is the wrong understanding of the spec. In fact, each command has its own number of maximal r2t numbers, then we do not need to use the wait method for R2T method anymore. So we remove the state TCP_REQUEST_STATE_DATA_PENDING_FOR_R2T. Futhermore, we adjust the related SPDK_TPOINT_ID definition. In current patch, the target will support one active R2T for each write NVMe command. Thus, we remove the function spdk_nvmf_tcp_handle_queued_r2t_req. Reported-by: Or Gerlitz <ogerlitz@mellanox.com> Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I7547b8facbc39139b4584637ccc51ba8b33ca285 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/455763 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Or Gerlitz <gerlitz.or@gmail.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-05 16:46:55 +00:00
Jim Harris	f758598c44	nvmf: fix assert in spdk_nvmf_tcp_req_fill_iovs It's OK for iovcnt to equal SPDK_NVMF_MAX_SGL_ENTRIES. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic95d04f5667858e7fbb025f469c027e2d47b8ba1 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456111 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2019-05-31 14:46:35 +00:00
Jim Harris	bf647c168a	nvmf: increase default max num qps to 128 This matches the Linux kernel target. Users can still decrease this default when creating the transport (i.e. -p option for nvmf_create_transport in rpc.py). Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Icad59350a2cd35cfc4ad76d06399345191680c05 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454820 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-22 14:50:05 +00:00
Seth Howell	61948a1ca7	rdma: add check for allocating too many SRQ. We could run into issues with this if we were using an arbitrarily large amount of cores to run SPDK. Change-Id: Ia7add027d7e6ef1ccb4a69ac328dbdf4f2751fd8 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452250 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-15 20:29:32 +00:00
Seth Howell	14777890a6	rdma: add an stailq for qpairs pending recv This will help us not iterate through the whole list of connections when only some of them have pending recvs. Change-Id: I681bc98befbdda4e77ef333b7a086c08b2708eb3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449266 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-13 22:09:55 +00:00
Seth Howell	c3884f943c	rdma: batch rdma recvs per poll. This will help save MMIO overhead. Especially in the SRQ case. Change-Id: I6fb70cf6de4763450f97961f41ccdce3acec2e63 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449265 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-13 22:09:55 +00:00
Seth Howell	b4dc10fbb7	rdma: create a list for qpairs pending send transfers By creating a list of qpairs, we can avoid looping over every connected qpair to process sends each time we poll. Change-Id: If24bbc363176f52fbfb756d56719edd885a21a11 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449264 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-10 22:24:35 +00:00
Seth Howell	9d63933b7f	rdma: batch rdma sends. By batching ibv sends each time we poll, we can reduce the number of MMIO writes that we do. Change-Id: Ia5a07b0037365abfa8732629c34d34a9ed49ac70 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449253 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-10 22:24:35 +00:00
Ben Walker	fbbbd6ab50	nvmf: Print a message out when a host is disconnecting due to keep alive It isn't obvious why hosts are being disconnected at the moment. Change-Id: I5515ba40883ccb20921d0da013b27670212bf649 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/453034 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-09 15:35:11 +00:00
Seth Howell	350e429a57	rdma: add a flag for disabling srq. There are cases where srq can be a detriment. Add a flag to allow users to disable srq even if they have a piece of hardware that supports it. Change-Id: Ia3be8e8c8e8463964e6ff1c02b07afbf4c3cc8f7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452271 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-06 18:11:13 +00:00
Jim Harris	a95fdad68f	nvmf: remove unnecessary size checks when creating transport The individual transports will adjust these sizes when necessary. In fact, we have to remove this check, since RDMA transport may adjust the io_unit_size based on the max number of SGEs - and can adjust it to a value that will fail this check if we reload the configuration. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2708c7f5aaa54a368ec932ec40dd6447f1a4fde0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452474 Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-02 14:44:57 +00:00
Jim Harris	b6206d657c	trace: shorten max name from 44 to 24 characters This restriction helps reduce the amount of padding when printing out the event trace, allowing it to fit in a small number of columns. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifa31e5a6967c7b9bc7028069effb71533f80596f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452736 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-02 08:41:56 +00:00
Jim Harris	617184be3b	trace: remove short_name This was not used by any of the trace register descriptions. Let's remove it rather keeping it around if we don't need it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Idda809e2911db5be555ff6aa13695484a14bf665 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452734 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-05-02 08:41:56 +00:00
Anil Veerabhadrappa	2061874474	lib/nvmf: Validate requested SQ size for both admin and IO queue During connect call based on queue type (AQ or IOQ), SQ size should be validated against max sq size for that particular queue type. Change-Id: I977d7556e4d04e37004d16c87efffd3b467fa62c Signed-off-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452376 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-01 18:51:28 +00:00
Seth Howell	6cc18a64aa	rdma.c: Don't set recv->qpair to NULL We can use the rpoller->srq to check if a qpair is valid when processing recv completions. Change-Id: I6aa360adc48a3312ddcf79f10e2a65b502a7314f Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452247 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-01 18:48:13 +00:00
Seth Howell	33f60621af	lib: resize key mempools Mempools are based off of a ring structure which allocates its elements as a power of two. It also only exposes n-1 elements to the user. So when we create a mempool with 2^n elements in it, we have to allocate a ring with 2^n+1 entries. By decreasing the number of elements in these key mempools by 1, we can save a decent amount of memory. Change-Id: I942c9dd4cf59096969bc2559fb46fd2084a07f09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448875 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-01 17:45:29 +00:00
Seth Howell	d05c553827	rdma: don't spam people with async event messages. It used to be that we would get async events very infrequently. However, with the introduction of SRQ, this number has gone up tremendously. Change the way we report our these events so that we don't spam/confuse people running the target. Change-Id: I33070281fa854cbc17784d61bbbb870196ca8780 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452159 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-26 18:10:56 +00:00
Seth Howell	ec47f92b9b	rdma: fix potential heap-use-after-free in srq shutdown If there are outstanding recvs for a qpair when it is destroyed, we need to clear the qpair from it before reposting it. Otehrwise, we have a potential heap-use-after-free of double free (depending on whether the recv completion is in error state or not). See github issues #730 Change-Id: Ic2009c761cbcc5e89174f62fbd0872d0489c67ca Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452122 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-04-26 11:16:22 +00:00
Seth Howell	3856d82b50	subsystem: check for NULL bufs in reservation ops. At the RDMA level, we allow processing requests that should contain a data transfer, but specify a length of zero to be passed up the stack without a data buffer. See spdk_nvmf_rdma_request_get_xfer. In the case of the reservation requests, we weren't checking whether req->data was NULL before trying to copy into it causing us to segfault if we got a malformed reservation request. Found when using the fuzzer. Change-Id: I320174ec72a8d298ab6ca44ef6a99691631f00ca Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451786 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-25 22:52:12 +00:00
Changpeng Liu	3f4426878a	nvmf: disable the protection if the backend doesn't contain valid type It's not an error if the NVMe hard drive was formatted to 512 + 8 but has no protection type, so we will also disable the protection for NVMoF target. Change-Id: I07e605cff9545f46c642f7ca783a4727a26abece Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451926 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-24 21:47:18 +00:00
Seth Howell	89d2efe07e	rdma: set the srq param in the initiator. We were setting this value in the target from our initiator, but it turns out the rdma_conn_params struct is responsible for setting the opposite side so we need to add it in the target side when accepting connections. Also, add a test to demonstrate target functionality when we overwhelm the SRQ. It is useful to note that performance really tanks when you start overwhelming the srq so it may be useful to use this test case to check performance gains in edge cases over time. Change-Id: Iac541bd9fc1d82eca9f21e7abc3f625663a6c460 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451678 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-24 09:22:16 +00:00
Jim Harris	b92c3d412d	nvmf: add tcp trace points for data read from socket Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib04abb64dd379dd73c7ff3c8318591124b4bb7dd Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451477 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-23 17:59:23 +00:00
Gregory Shapiro	14032a984c	NVMF: Add model number as parameter to construct_nvmf_subsystem (-d option). Change-Id: Ia1a458a0ac1c5a17d2955a3f31c6dfe77538eb17 Signed-off-by: Gregory Shapiro <gregory.shapiro@kaminario.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/438562 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-23 16:51:16 +00:00
Changpeng Liu	68bb3995aa	nvmf: trivial optimization to make the code more consistent Make the use of spdk_uuid_compare() to be consistent in the file, also change the SPDK_INFOLOG to SPDK_DEBUGLOG to avoid the repeated log messages for RESERVATION CONFLICT response. Change-Id: I72fefbd520cefcaf25182c3ca3d21e3d87d17e94 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450884 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-23 16:30:24 +00:00
Changpeng Liu	4fa486a1e3	nvmf: add asynchronous event for reservation notificaiton Now Host can get an asynchronous event notification when registrants were unregistered/preempted or reservation was released from the associate namespace, Host can send get log page to clear related log pages and reservation report to get the full overview of current reservation configuration. Change-Id: Idc57c19812490c7536503308989871515e9f2361 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/439935 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-23 16:30:24 +00:00
jiaqizho	b70e698465	rdma:fix core dump when rdma_create_qp return error. Signed-off-by: jiaqizho <jiaqi.zhou@intel.com> Change-Id: Ie900e01820f69fc5b2d5e30d519c6b619d7a7281 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449507 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-22 18:40:35 +00:00
Yair Elharrar	2b0ae30bf1	nvmf: fix segfault in case of multi-range unmap In case of a DSM Deallocate (unmap) with multiple ranges, individual bdev IOs are submitted for each range. If the bdev IO cannot be allocated, the request is queued on io_wait_queue; however previously submitted ranges may complete before memory is available for the next range. In such a case, the completion callback will free unmap_ctx, while the request is still queued for memory - causing a segfault when the request is dequeued. To fix, introduce a new field tracking the unmap ranges, and make sure the count is nonzero when the request is queued for memory. Signed-off-by: Yair Elharrar <yair@excelero.com> Change-Id: Ifcac018f14af5ca408c7793ca9543c1e2d63b777 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447542 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-22 15:42:51 +00:00
Jim Harris	4ff7949893	nvmf: remove unused tcp trace point Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8f2e26f46f8c37312c3201df8210b449279640d0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451476 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-04-22 15:25:37 +00:00
Seth Howell	7d7b44f2a6	rdma: decrement descriptor before checking SEND_WITH_INVAL We were incrementing over the end of the descriptor list and assigning undefined values to the rsp opcode in SEND_WITH_INVAL case. We were only hitting this error when mixing sgl and inline requests in the same workload. We were just by chance hitting a four bit value that was set to all 1s from the in capsule data from the last request. Change-Id: Ied06356f3d22fa34a2cd869dfad6bdca8720791d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450873 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-19 17:29:45 +00:00
Seth Howell	2cc6b0dfcb	rdma: set the number of wr sge_entries per I/O This was not being properly set in the multi-sgl path. Also add a verification step to the fio configuration file to prevent against future regressions. Change-Id: I510b6acd92bc2fbc9b6fbec1d59945cc53584ad3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-19 17:29:45 +00:00
Changpeng Liu	468c6c18bd	nvmf: enable get log page with reservation notification page Reservation notification log page can be returned via the get log page command with correct page number, users can get zeored page buffer if the controller didn't have any reservation notification log. Change-Id: I99f5e4b8917a6919eb68359628efa1bead4b21b5 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/439934 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: GangCao <gang.cao@intel.com>	2019-04-18 22:33:26 +00:00
Changpeng Liu	6025375024	nvmf: generate reservation notice log on controller's thread All the reservation commands are processed on subsystem's thread, however the reservation notice log are controller related, and the get log page command with reservation page will be processed on controller's thread, so we use the same thread for generating the log. Change-Id: Ie000320d74242b979f6638d703523f063347ec29 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449852 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-18 22:33:26 +00:00
Changpeng Liu	c596ea4bd5	nvmf: update subsystem's poll group information for register command Existing code only update the subsystem's poll group reservation information when unregistering the key, however, new registrant and update the key actions also need to be updated. Change-Id: Ib8db9eb457977757251403edb92eda073b846e59 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451274 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Liang Yan <liang.z.yan@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-18 22:33:26 +00:00

1 2 3 4 5 ...

1213 Commits