ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Evgeniy Kochetov	87ebcb08c1	nvmf/rdma: Handle completions for destroyed QP associated with SRQ IB Architecture Specification vol.1 rel.13. in ch.10.3.1 "QUEUE PAIR AND EE CONTEXT STATES" suggests the following destroy procedure for QPs associated with SRQ: - Put the QP in the Error State; - wait for the Affiliated Asynchronous Last WQE Reached Event; - either: * drain the CQ by invoking the Poll CQ verb and either wait for CQ to be empty or the number of Poll CQ operations has exceeded CQ capacity size; or * post another WR that completes on the same CQ and wait for this WR to return as a WC; - and then invoke a Destroy QP or Reset QP. Without the drain step it is possible that LAST_WQE_REACHED event is received and QP is destroyed before the last receive WR completion is polled from the CQ. In SPDK there is no risk of resource leakage in this case. So, instead of draining we can destroy QP and then just ignore receive completions without QP and post receive WRs back to SRQ. Fixes #903 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ice6d3d5afc205c489f768e3b51c6cda8809bee9a Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465747 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-12 17:04:48 +00:00
Michal Ben Haim	62615117f7	SPDK: changing TREQ value from 'not specified' to 'not required'. Signed-off-by: Michal Ben Haim <michal.benhaim@kaminario.com> Change-Id: Ia7bda5b18db24df97172d4500a499c4635d592d5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467499 Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-10 17:51:26 +00:00
Shuhei Matsumoto	9796768132	nvmf: Move pending_data_buf_queue to common struct spdk_nvmf_transport_poll_group This unifies buffer management among transports further and is a preparation to make buffer allocation asynchronous. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8c588eeac4081f50fe32605feb7352f72c628d95 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466847 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-09-09 00:42:22 +00:00
Shuhei Matsumoto	0b068f8530	nvmf/rdma: Pass nvmf_request to nvmf_rdma_fill_buffers Most variables related with I/O buffer are in struct spdk_nvmf_request now. So we can pass nvmf_request instead of nvmf_rdma_request to nvmf_rdma_request_fill_buffers and do it in this patch. Additionally, we use the cached pointer to nvmf_request in spdk_nvmf_rdma_request_fill_iovs which is the caller to nvmf_rdma_request_fill_buffers in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia7664e9688bd9fa157504b4f5075f79759d0e489 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466212 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-30 16:56:46 +00:00
Shuhei Matsumoto	85b9e716e9	nvmf/rdma: Replace RDMA specific get/free_buffers by common APIs Use spdk_nvmf_request_get_buffers() and spdk_nvmf_request_free_buffers(), and then remove spdk_nvmf_rdma_request_free_buffers() and nvmf_rdma_request_get_buffers(). Set rdma_req->data_from_pool to false after spdk_nvmf_request_free_buffers(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ie1fc4c261c3197c8299761655bf3138eebcea3bc Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465875 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	005b053a02	nvmf: Move data_from_pool flag to common struct spdk_nvmf_request This is a prepration to unify buffer management among transports. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6b1c208207ae3679619239db4e6e9a77b33291d0 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466002 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-08-29 18:17:38 +00:00
Shuhei Matsumoto	04ae83ec93	nvmf: Move allocated buffer pointers to common struct spdk_nvmf_request This is a preparation to unify buffer management among transports. struct spdk_nvmf_request already has SPDK_NVMF_MAX_SGL_ENTRIES (16) * 2 iovecs. Hence incresing the number of buffers twice will be no problem. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Idb525abbf35dc9f4b8547b785b5dfa77d106d8c9 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465873 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>	2019-08-29 18:17:38 +00:00
Evgeniy Kochetov	01887d3c96	nvmf/rdma: Fix data WR release One of stop conditions in data WR release function was wrong. This can cause release of uncompleted data WRs. Release of WRs that are not yet completed leads to different side-effects, up to data corruption. The issue was introduced with send WR batching feature in commit `9d63933b7f`. This patch fixes stop condition and contains some refactoring to simplify WR release function. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie79f64da345e38038f16a0210bef240f63af325b Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466029 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-29 18:09:14 +00:00
Jacek Kalwas	8a14af685b	nvmf/rdma: fix missing destory qp From rdma_cma.h "Users must destroy any QP associated with an rdma_cm_id before destroying the ID." Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I5ed0c25221c5401cdde8b31a4e217b9d79e7caaa Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464290 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-08 20:07:11 +00:00
Seth Howell	59a3afa0ff	nvmf/rdma: pass iov_base to spdk_mem_map_translate We should be checking directly against the base of the iov when doing memory map translations. The current behavior is to check against the starting address of the buffer which is a close address, but not exactly the same. Change-Id: I7f65224a6836a814708438f2866d84ae22882216 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463893 Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com> Reviewed-by: <jiandong.zheng@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-02 07:15:36 +00:00
Jacek Kalwas	db0c7f6a4f	nvmf/rdma: fix missing return statement In case of failure during resource allocation within poll_group_create there is a lack of return statement which could lead to NULL ptr dereference. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I84abe64a1843117d76b97e62656bdfc4fe2b35d8 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463195 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-08-02 03:55:32 +00:00
Evgeniy Kochetov	c9c80e6932	nvmf/rpc: Fix io channel reference counting in NVMf statistics NVMf statistics functions use spdk_get_io_channel function to get a poll group. It increases reference counter in io channel and causes problems on application exit. spdk_put_io_channel calls were added to release the channel. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I832d1eae346c3bc3858ed0ed063ff7a7a897a2f5 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463389 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-29 18:05:09 +00:00
Evgeniy Kochetov	fbe8f8040c	nvmf/rdma: Add request latency statistics This patch adds measurement of time request spends from the moment it was polled till completion. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I1fcda68735f2210c5365dd06f26c10162e4ddf33 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 20:30:00 +00:00
Evgeniy Kochetov	251db8144f	nvmf/rdma: Add NVMf RDMA transport pending statistics This patch adds statistics for pending state in NVMf RDMA subsytem which may help to detect lack of resources and adjust configuration correctly. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: I9560d931c0dfb469659be42e13b8302c52912420 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452300 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 20:30:00 +00:00
Evgeniy Kochetov	38ab383a8f	nvmf/rdma: Add RDMA polling statistics RDMA polling statistics: number of polls and number of completion entries returned. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: Iabcf2cb6f6a35f595b89b58cdfcd177a637dda13 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/445289 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-26 20:30:00 +00:00
Evgeniy Kochetov	43bb4e6b1f	rpc: Add NVMf transport statistics to nvmf_get_stats RPC method This patch adds transport part to nvmf_get_stats RPC method and basic infrastructure to report NVMf transport specific statistics. Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Change-Id: Ie83b34f4ed932dd5f6d6e37897cf45228114bd88 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452299 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-07-26 20:30:00 +00:00
Alexey Marchuk	f0b7a6e7d1	rdma: fix possible double free on qpair destruction Update rqpair->last_wqe_reached in the context of thread that owns qpair's poll group to avoid possible double free This patch fixes #858 Change-Id: If5422944b7928c2cc05af528fbcc4482aeef22df Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462012 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Lorne Li <lorneli@163.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-07-23 22:56:57 +00:00
Alexey Marchuk	5282edfd7b	rdma: fix double free of qpair struct in case of failed initialization qpair structure is freed and an error code is returned to the caller in the case of failed qpair initialization in function spdk_nvmf_rdma_qpair_initialize (e.g. bad return value of rdma_create_qp). The return code is handled by nvmf_tgt_poll_group_add function which destroys the qpair for the second time. This patch fixes #857 Change-Id: I0773652ecccbbd634ad272106e0a93c1e591d7d2 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462011 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Lorne Li <lorneli@163.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-23 22:55:43 +00:00
lorneli	ba323d44ca	nvmf/rdma: log spdk_nvmf_rdma_destroy_defunct_qpair Func spdk_nvmf_rdma_destroy_defunct_qpair is a "last chance option" to destroy qp manually if some driver/hardware doesn't drain qp's failed wr as expected. There's a probability that ibv_poll_cq polls wr of the destoryed qp after spdk_nvmf_rdma_destroy_defunct_qpair's execution. Although in practice the risk of this situation is minimal(if not non-existent), add a log here so that we could detect this situation easily. Change-Id: Ifa9534397513bcea34c18fbb8168eef8f53599c1 Signed-off-by: lorneli <lorneli@163.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462441 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-23 19:35:16 +00:00
lorneli	b4d3066890	nvmf/rdma: defer qp destruction until nvmf layer closes qp Currently rqpair will be destroyed directly in ibv_poll_cq path if it has been drained, regardless of whether there are outstanding I/Os issued to bdev layer. So after outstanding I/Os completing, spdk_nvmf_rdma_close_qpair will be called from nvmf layer, accessing a destroyed qp. This path defers qp destruction in nvmf_rdma_destroy_drained_qpair func until nvmf layer closes qp. Fixes 851 Change-Id: I8bcce66f8053ddb105702ac603d5d73af54bdcfc Signed-off-by: lorneli <lorneli@163.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461237 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-23 19:35:16 +00:00
Alexey Marchuk	0754417fa9	rdma: Use optimal ceiling integer division This form of the celinig division allows to remove an extra condition Change-Id: I8a2de792172ec9115563e7fb914745c476f16e8d Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462198 Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-22 09:22:11 +00:00
Darek Stojaczyk	96ec8bff78	nvmf/rdma: switch to spdk_malloc() spdk_dma_malloc() is about to be deprecated. Change-Id: I5bcac50baca785255eb068086e67c07d120b042f Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459432 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-07-17 01:28:57 +00:00
Jacek Kalwas	e95e4028c1	nvmf/rdma: exclude getaddrinfo from lock No need to have it under lock. Additionally in case of failure there was a lack of rdma_destroy_id(). This is addresed within this change as well. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Idbb36d51ad4ef7ef81051463f56efc87ef00c966 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462054 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-17 01:03:36 +00:00
Jacek Kalwas	0d4a5f7e69	nvmf/rdma: free list of devices In case of failure during pd or map allocation freeing list of devices was missing. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: If62f7b072f3894fd1a7e856c19b4ea51646dd20e Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462079 Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-07-17 00:59:34 +00:00
Jacek Kalwas	114a067738	nvmf/rdma: pd null check In case of pd allocation by nvmf hooks there is a lack of null check as oposed to pd allocation by ibv_alloc_pd. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Iead6e0332bdee3da4adb6e657af298215c4e2196 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461576 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-07-16 01:29:03 +00:00
Hailiang Wang	73a171a07c	rdma: assert ibv_send_wr is not NULL Vhost testing crashed from Nightly testing, because a member access within null pointer of type 'struct ibv_send_wr'. Change-Id: If8f34f23864883ea73516d2d1fe3b30137c04316 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458913 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-25 13:37:15 +00:00
Alexey Marchuk	53777de855	rdma: Unset IBV_SEND_SIGNALED flag for RDMA_WRITE operations Unsetting this flag will decrease the number of WRs retrieved during CQ polling and will decrease the oeverall processing time. Since RDMA_WRITE operations are always paired with RDMA_SEND (response), it is possible to track the number of outstanding WRs relying on the completed response WR. Completed WRs of type RDMA_WR_TYPE_DATA are now always RDMA_READ operations. The patch shows %2 better peformance for read operations on x86 machine. The performance was measured using perf with the following parameters: -q 16 -o 4096 -w read -t 300 -c 2 with nvme null device, each measurement was done 4 times avg IOPS (with patch): 865861.71 avg IOPS (master): 847958.77 avg latency (with patch): 18.46 [us] avg latency (master): 18.85 [us] Change-Id: Ifd3329fbd0e45dd5f27213b36b9444308660fc8b Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456469 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-11 18:07:28 +00:00
Jim Harris	bf647c168a	nvmf: increase default max num qps to 128 This matches the Linux kernel target. Users can still decrease this default when creating the transport (i.e. -p option for nvmf_create_transport in rpc.py). Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Icad59350a2cd35cfc4ad76d06399345191680c05 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454820 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-22 14:50:05 +00:00
Seth Howell	61948a1ca7	rdma: add check for allocating too many SRQ. We could run into issues with this if we were using an arbitrarily large amount of cores to run SPDK. Change-Id: Ia7add027d7e6ef1ccb4a69ac328dbdf4f2751fd8 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452250 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-15 20:29:32 +00:00
Seth Howell	14777890a6	rdma: add an stailq for qpairs pending recv This will help us not iterate through the whole list of connections when only some of them have pending recvs. Change-Id: I681bc98befbdda4e77ef333b7a086c08b2708eb3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449266 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-13 22:09:55 +00:00
Seth Howell	c3884f943c	rdma: batch rdma recvs per poll. This will help save MMIO overhead. Especially in the SRQ case. Change-Id: I6fb70cf6de4763450f97961f41ccdce3acec2e63 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449265 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-13 22:09:55 +00:00
Seth Howell	b4dc10fbb7	rdma: create a list for qpairs pending send transfers By creating a list of qpairs, we can avoid looping over every connected qpair to process sends each time we poll. Change-Id: If24bbc363176f52fbfb756d56719edd885a21a11 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449264 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-10 22:24:35 +00:00
Seth Howell	9d63933b7f	rdma: batch rdma sends. By batching ibv sends each time we poll, we can reduce the number of MMIO writes that we do. Change-Id: Ia5a07b0037365abfa8732629c34d34a9ed49ac70 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449253 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-10 22:24:35 +00:00
Seth Howell	350e429a57	rdma: add a flag for disabling srq. There are cases where srq can be a detriment. Add a flag to allow users to disable srq even if they have a piece of hardware that supports it. Change-Id: Ia3be8e8c8e8463964e6ff1c02b07afbf4c3cc8f7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452271 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-06 18:11:13 +00:00
Jim Harris	b6206d657c	trace: shorten max name from 44 to 24 characters This restriction helps reduce the amount of padding when printing out the event trace, allowing it to fit in a small number of columns. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifa31e5a6967c7b9bc7028069effb71533f80596f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452736 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-02 08:41:56 +00:00
Jim Harris	617184be3b	trace: remove short_name This was not used by any of the trace register descriptions. Let's remove it rather keeping it around if we don't need it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Idda809e2911db5be555ff6aa13695484a14bf665 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452734 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-05-02 08:41:56 +00:00
Seth Howell	6cc18a64aa	rdma.c: Don't set recv->qpair to NULL We can use the rpoller->srq to check if a qpair is valid when processing recv completions. Change-Id: I6aa360adc48a3312ddcf79f10e2a65b502a7314f Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452247 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-01 18:48:13 +00:00
Seth Howell	33f60621af	lib: resize key mempools Mempools are based off of a ring structure which allocates its elements as a power of two. It also only exposes n-1 elements to the user. So when we create a mempool with 2^n elements in it, we have to allocate a ring with 2^n+1 entries. By decreasing the number of elements in these key mempools by 1, we can save a decent amount of memory. Change-Id: I942c9dd4cf59096969bc2559fb46fd2084a07f09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448875 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-01 17:45:29 +00:00
Seth Howell	d05c553827	rdma: don't spam people with async event messages. It used to be that we would get async events very infrequently. However, with the introduction of SRQ, this number has gone up tremendously. Change the way we report our these events so that we don't spam/confuse people running the target. Change-Id: I33070281fa854cbc17784d61bbbb870196ca8780 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452159 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-26 18:10:56 +00:00
Seth Howell	ec47f92b9b	rdma: fix potential heap-use-after-free in srq shutdown If there are outstanding recvs for a qpair when it is destroyed, we need to clear the qpair from it before reposting it. Otehrwise, we have a potential heap-use-after-free of double free (depending on whether the recv completion is in error state or not). See github issues #730 Change-Id: Ic2009c761cbcc5e89174f62fbd0872d0489c67ca Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452122 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-04-26 11:16:22 +00:00
Seth Howell	89d2efe07e	rdma: set the srq param in the initiator. We were setting this value in the target from our initiator, but it turns out the rdma_conn_params struct is responsible for setting the opposite side so we need to add it in the target side when accepting connections. Also, add a test to demonstrate target functionality when we overwhelm the SRQ. It is useful to note that performance really tanks when you start overwhelming the srq so it may be useful to use this test case to check performance gains in edge cases over time. Change-Id: Iac541bd9fc1d82eca9f21e7abc3f625663a6c460 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451678 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-24 09:22:16 +00:00
jiaqizho	b70e698465	rdma:fix core dump when rdma_create_qp return error. Signed-off-by: jiaqizho <jiaqi.zhou@intel.com> Change-Id: Ie900e01820f69fc5b2d5e30d519c6b619d7a7281 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449507 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-22 18:40:35 +00:00
Seth Howell	7d7b44f2a6	rdma: decrement descriptor before checking SEND_WITH_INVAL We were incrementing over the end of the descriptor list and assigning undefined values to the rsp opcode in SEND_WITH_INVAL case. We were only hitting this error when mixing sgl and inline requests in the same workload. We were just by chance hitting a four bit value that was set to all 1s from the in capsule data from the last request. Change-Id: Ied06356f3d22fa34a2cd869dfad6bdca8720791d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450873 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-19 17:29:45 +00:00
Seth Howell	2cc6b0dfcb	rdma: set the number of wr sge_entries per I/O This was not being properly set in the multi-sgl path. Also add a verification step to the fio configuration file to prevent against future regressions. Change-Id: I510b6acd92bc2fbc9b6fbec1d59945cc53584ad3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-19 17:29:45 +00:00
JinYu	dd90ff7a21	nvmf/rdma: fix bugs in spdk_nvmf_rdma_qpair_destroy Rqpair qp and resources maybe not be created, if rqpair fail to initialise. For example, in function new_qpair, the code run to spdk_nvmf_qpair_disconnect, but rqpair is initialised in poll_group_add. Fix #557 segmentaion fault(core dump) Change-Id: I1892e6d13e2d53dd5a7c4856d775f9b3b85da961 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450986 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-18 21:47:57 +00:00
JinYu	c7395a1171	nvmf: fix the rqpair->current_send_depth If rsp->status.sc != SUCCESS and xfer == DATA_CONTROLLER_TO_HOST, We would not send the data WR, so clean the num_outstanding_data_wr. Fix #728 Change-Id: I32259788e495ed76f8f02a9d871bd56356d93dc4 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450726 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-16 14:42:03 +00:00
Seth Howell	1fb629c4d2	rdma: make the pending_data_buf_queue an STAILQ Should speed up operations, and allows us to remove the 16 byte link object from the request structure. Change-Id: Ie62df1f44d22580a7a7ae41c498295841d1e3064 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448080 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 21:34:55 +00:00
Seth Howell	9f7582c3a5	rdma: reorder qpair elements to plug hole Saves 8 bytes Change-Id: Icb429ba79d7a085978950dd3045aa9ef28351101 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448073 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	91105e2031	rdma: Don't store ibv_qp_attr in the qpair. We were only using one enum from this whole struct, so there is no need to store it. Plus the queries we use to update it are so infrequent and only occur during connect and disconnect so I think we can save quite a bit of space by removing this without compromising performance. Change-Id: Icf29977a3c10cb289564fa2760a0059f07a0f8cb Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448072 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	ab79560e65	rdma: simplify spdk_nvmf_rdma_poller_poll. There was a lot of duplicated code here between states. I'm trying to minimize the duplicated code without making it confusing. Change-Id: I13183431e554c8a9f501b3385bbd7b59e2c83161 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448066 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	a8169c37e0	rdma: add error path for fill_iovs_multi_sgl Catch an edge case where a multi sgl request is longer than the allowed transfer size. Change-Id: I79779050fe951d16f1240e2c3d8cf5037e576ea2 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/440766 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	6812b63c5f	rdma: always allocate buffers for requests upfront This is important to avoid thrash when we don't have enough buffers to satisfy a request. Change-Id: Id35fd492078b8e628c2118317f674f07e95d4dba Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449109 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-04-04 04:34:59 +00:00
Seth Howell	f4adbc79ce	rdma: optimize and move buffers variable. The buffers are really specific to the request and not the wr or data object. In the case of multiple wr requests, the maximum number of buffers per req is equal to the number of SGEs in the NVMe-oF request *2. Change-Id: Ic59498bfed461d180adb2fb9a481ac5b11fa9252 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449108 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-02 23:26:08 +00:00
Seth Howell	62700dac2e	nvmf/rdma: Add support for multiple sgl descriptors to sgl parser Enable parsing an nvmf request that contains an inline nvme_sgl_last_segment_descriptor element. This is the next step towards NVMe-oF SGL support in the NVMe-oF target. Change-Id: Ia2f1f7054e0de8a9e2bfe4dabe6af4085e3f12c4 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/428745 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-02 23:26:08 +00:00
Seth Howell	934775db43	rdma: make semantic changes to fill_buffers func Changing i to iovcnt in all references to the req->iov structure will be important when we start processing multi-sgl requests. Change-Id: I90a9b6d872b94f846ae7d29a45dd2703eafa6175 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449201 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-29 19:02:22 +00:00
Seth Howell	e70a759489	rdma: pull buffer assignment out of fill_iovs This will be used by the multi-sgl version of this function as well. Change-Id: Iafeba4836a77482fa2a158f86f1c17fe7fdeb510 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449104 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-29 19:02:22 +00:00
Seth Howell	a9fc7e1db8	rdma: use LAST_WQE_REACHED event in the SRQ path This event is generated by NICs utilizing the SRQ feature when the last RECV for that qpair is processed. I have confirmed this feature. Change-Id: Ib6d6b6d02987f789b4d5dd3daf734e3351ee1974 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448063 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-25 17:23:51 +00:00
yidong0635	fc43fbba04	rdma: fixed heap used after free issue. With ASAN to run this cases, it will report issue about heap used after free in spdk_nvmf_rdma_qpair_destroy. Resources have been released before, change the order to in this tailq to release resources. ERROR: AddressSanitizer: heap-use-after-free on address 0x6080000080e0 at pc 0x0000006e1e3f bp 0x7fd48b6c3df0 sp 0x7fd48b6c3de0 READ of size 8 at 0x6080000080e0 thread T3 (reactor_1) 0x6e1e3e in spdk_nvmf_rdma_qpair_destroy spdk/lib/nvmf/rdma.c:813 Change-Id: Ia1c12bca84955a2de60399e6b265c9b8901bb51e Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448534 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-21 18:00:04 +00:00
Seth Howell	e59ac513fb	rdma: remove reqs from read/write queues in error Not doing so can cause us to hit asserts during the shutdown path. This should fix an intermittent failure we are seeing on the test pool where we hit the assert rdma_req->state != RDMA_REQUEST_STATE_FREE in spdk_nvmf_rdma_request_process. Note that this problem doesn't cause any data corruption when debug is not enabled, it just causes us to probcess a subset of commands through the state machine one extra time suring qpair shutdown. Change-Id: Ibc36bfea87ec4089b8e2c7a915f48714fddb0b09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447843 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-19 18:18:45 +00:00
Seth Howell	33668b2254	rdma: change structure of drained_qpair to work w/ messages. This will become important later on. Change-Id: I94e5af03359e476afbc68664e43f44269ad5974c Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448074 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	7dd3cf441a	rdma: limit the completion queue based on the SRQ. When we have a shared receive queue, the number of outstanding items associated with a completion queue is deterministic, and limited by how many RECVs we have total in the SRQ. So, we can set the total size of the Completion queue at the beginning of time and never resize it. Change-Id: I787e4c5bbd52ac8948a323d1301f926f887cd91c Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447492 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	a5972c6245	rdma: consolidate common error paths in qpair_init Consolidating error paths is common practice in SPDK so do that here to make the function more uniform and save space. Change-Id: I98c5d5f7feeb688f1d8b24f4d2d3461a43d00c1d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448191 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	97a43680a9	rdma: move cq_resize to its own function. Change-Id: I07aef399320fd4a014f63760670ea765d2e18b4b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448190 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	fa79f64ad1	rdma: Keep a pointer to the SRQ in the qpair Change-Id: Id173038b6ad6b1564acf5d6886814f7d310964c7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447471 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	01201d3e87	rdma: remove compile time config for SRQ Change-Id: I44af3ee4dc6ec76045e1d0614910402487098a3d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447120 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	0d3fcd10e9	rdma: add function to create qpair resources. Change-Id: Id865e2a2821fe04c1f927038d6dd967848cd9a55 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446999 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-15 19:19:17 +00:00
Ben Walker	353fbcdaf0	nvmf/rdma: Create function to destroy rdma resources This unifies the clean up path between SRQ and normal operation. Change-Id: I396d7e3749579f27b5bb1e89b9d6761a77ba5beb Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446979 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-03-15 19:19:17 +00:00
Ben Walker	b25751d99d	nvmf/rdma: Add a structure to hold rqpair/rpoller resources Depending on whether SRQ is enabled, resources may be allocated to the rqpair or to the rpoller. Create a struct to hold these pointers that can be used in both locations to avoid duplicated code. Change-Id: I2c8fc59009201d9e41721e6462a81732b529a9e0 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446978 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Eugene Kochetov <evgeniik@mellanox.com>	2019-03-15 19:19:17 +00:00
Ben Walker	527be2bf4e	nvmf: Remove qpair_is_idle This wasn't used anywhere. Change-Id: I405af3c808be284d19218f3f04c1e90e33e31de8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446977 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-03-15 19:19:17 +00:00
Evgeniy Kochetov	ed0b611fc5	nvmf/rdma: Add shared receive queue support This is a new feature for NVMEoF RDMA target, that is intended to save resource allocation (by sharing them) and utilize the locality (completions and memory) to get the best performance with Shared Receive Queues (SRQs). We'll create a SRQ per core (poll group), per device and associate each created QP/CQ with an appropriate SRQ. Our testing environment has 2 hosts. Host 1: CPU: Intel(R) Xeon(R) CPU E5-2609 0 @ 2.40GHz dual socket (8 cores total) Network: ConnectX-5, ConnectX-5 VPI , 100GbE, single-port QSFP28, PCIe3.0 x16 Disk: Intel Optane SSD 900P Series OS: Fedora 27 x86_64 Host 2: CPU: Intel(R) Xeon(R) CPU E5-2630 v2 @ 2.60GHz dual-socket (24 cores total) Network: ConnectX-4 VPI , 100GbE, dual-port QSFP28 Disk: Intel Optane SSD 900P Series OS : CentOS 7.5.1804 x86_64 Hosts are connected via Spectrum switch. Host 1 is running SPDK NVMeoF target. Host 2 is used as initiator running fio with SPDK plugin. Configuration: - SPDK NVMeoF target: cpu mask 0x0F (4 cores), max queue depth 128, max SRQ depth 1024, max QPs per controller 1024 - Single NVMf subsystem with single namespace backed by physical SSD disk - fio with SPDK plugin: randread pattern, 1-256 jobs, block size 4k, IO depth 16, cpu_mask 0xFFF0, IO rate 10k, rate process “poisson” Here is a full fio command line: fio --name=Job --stats=1 --group_reporting=1 --idle-prof=percpu \ --loops=1 --numjobs=1 --thread=1 --time_based=1 --runtime=30s \ --ramp_time=5s --bs=4k --size=4G --iodepth=16 --readwrite=randread \ --rwmixread=75 --randrepeat=1 --ioengine=spdk --direct=1 \ --gtod_reduce=0 --cpumask=0xFFF0 --rate_iops=10k \ --rate_process=poisson \ --filename='trtype=RDMA adrfam=IPv4 traddr=1.1.79.1 trsvcid=4420 ns=1' SPDK allocates the following entities for every work request in receive queue (shared or not): reqs (1024 bytes), recvs (96 bytes), cmds (64 bytes), cpls (16 bytes), in_capsule_buffer. All except the last one are fixed size. In capsule data size is configured to 4096. Memory consumption calculation (target): - Multiple SRQ: core_num * ib_devs_num * SRQ_depth * (1200 + in_capsule_data_size) - Multiple RQ: queue_num * RQ_depth * (1200 + in_capsule_data_size) We ignore admin queues in calculations for simplicity. Cases: 1. Multiple SRQ with 1024 entries: - Mem = 4 * 1 * 1024 * (1200 + 4096) = 20.7 MiB (Constant number – does not depend on initiators number) 2. RQ with 128 entries for 64 initiators: - Mem = 64 * 128 * (1200 + 4096) = 41.4 MiB Results: FIO_JOBS kIOPS Bandwidth,MiB/s AvgLatency,us MaxResidentSize,kiB RQ SRQ RQ SRQ RQ SRQ RQ SRQ 1 8.623 8.623 33.7 33.7 13.89 14.03 144376 155624 2 17.3 17.3 67.4 67.4 14.03 14.1 145776 155700 4 34.5 34.5 135 135 14.15 14.23 146540 156184 8 69.1 69.1 270 270 14.64 14.49 148116 156960 16 138 138 540 540 14.84 15.38 151216 158668 32 276 276 1079 1079 16.5 16.61 157560 161936 64 513 502 2005 1960 1673 1612 170408 168440 128 535 526 2092 2054 3329 3344 195796 181524 256 571 571 2232 2233 6854 6873 246484 207856 We can see the benefit in memory consumption. Change-Id: I40c70f6ccbad7754918bcc6cb397e955b09d1033 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/428458 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-15 19:19:17 +00:00
Seth Howell	62266a72cf	rdma: allocate protection domains for devices up front. We were only using one pd per device anywas, and this is necessary for shared receive queue support. Change-Id: I86668d5b7256277fe50836863408af2215b5adf9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447385 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-12 21:37:51 +00:00
Seth Howell	bb3e441388	rdma: destroy qpairs based on num_outstanding_wr. Both Mellanox and Soft-RoCE NICs work with this approach. Change-Id: I7b05e54037761c4d5e58484e1c55934c47ac1ab9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446134 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-08 21:09:09 +00:00
Seth Howell	961cd6ab7e	rdma: register a poller to destroy defunct qpairs Not all RDMA drivers fail back the dummy recv and send operations that we send to them when destroying a qpair. We still need to free the resources from these qpairs to avoid eating up all of the system memory after multiple connect and disconnect events. Since we won't be getting any more completions, the best heuristic we can use is waiting a long time and then freeing the resources. qpair_fini is only called from the proper polling thread so we can safely call process_pending to flush the qpair before closing it out. Change-Id: I61e6931d7316d1e78bad26657bb671aa451e29f4 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/443057 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-04 19:12:48 +00:00
Seth Howell	59f0d22e40	rdma: Fix misordered assert and decrement. In the error path, we were first decrementing a variable and then asserting that it must be >0. These operations should occur in the opposite order. Change-Id: I6cec544faf17bb75cbfca3d3a3c173dc5db14f99 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/446440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: yidong0635 <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-28 21:20:38 +00:00
Seth Howell	756ce464f6	rdma: update default number of shared buffers. When the decision was made to uncouple the number of shared buffers from the queue depth and allow the user to decide for themselves, the default was also significantly lowered, which caused some issues when trying torun performance tests (See https://github.com/spdk/spdk/issues/699). While this is a user modifiable variable, it is still best to keep the higher default value. The original value was equivalent to max_queue_depth * SPDK_NVMF_MAX_SGL_ENTRIES * 2 with the defaults for max_queue depth and max_sgl_entries being 128 and 16 respectively. Hence 4096 fixes: `0b20f2e552` Change-Id: I809e97a10973093a2b485b85bca7160091166f70 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/446525 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-28 21:09:50 +00:00
Zahra Khatami	a55b2109bb	nvmf: remaning changes related to nvmf hooks Change-Id: I6780fa43cebd9f48d1ae0ea6fbeb92a95c4dfa15 Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Reviewed-on: https://review.gerrithub.io/c/443653 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-22 21:16:36 +00:00
Seth Howell	b38e3a60c6	rdma: change the logic of rdma_qpair_process_pending I think this simplifies the process a little bit. Change-Id: Icc87a59c9f6fd965ef35531975b7036d85c4bc95 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/445916 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-22 18:31:02 +00:00
Seth Howell	80eecdd881	rdma: use an stailq for incoming_queue Change-Id: Ib1e59db4c5dffc9bc21f26461dabeff0d171ad22 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/445344 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-22 18:31:02 +00:00
Seth Howell	bfdc957c75	rdma: remove the state_cntr variable. We were only using one value from this array to tell us if the qpair was idle or not. Remove this array and all of the functions that are no longer needed after it is removed. This series is aimed at reverting `fdec444aa8` which has been tied to performance decreases on master. Change-Id: Ia3627c1abd15baee8b16d07e436923d222e17ffe Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/445336 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-22 18:31:02 +00:00
Seth Howell	04ebc6ea28	RDMA: Remove the state_queues Since we no longer rely on the state queues for draining qpairs, we can get rid of most of them. We cn keep just a few, and since we don't ever remove arbitrary elements, we can use stailqs to perform those operations. Operations on Stailqs carry about half the overhead as operations on tailqs Change-Id: I8f184e6269db853619a3581d387d97a795034798 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/445332 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-22 18:31:02 +00:00
yidong0635	9d838d24ad	rdma: add return to avoid address points to the zero page Error logs in nvmf_rdma_dump_request lead to report error about address points to the zero page, add judgement to return. this issue occurs in heavy load fio testing. Change-Id: I50302be88b3af53f718e3800aa16df7c506ca4e8 Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/441110 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-02-15 04:29:40 +00:00
Seth Howell	b7651b681c	NVMe-oF: add asserts for SGE counts We should never be going over these limits in the respective transports, but add asserts to check this during testing. Change-Id: Ifcaa82ccf58546a38020b31df54ee5d1d9822b8b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/442777 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-12 23:34:20 +00:00
Seth Howell	145485769e	nvmf: remove qpair state activating. This intermediate state is unused and meaningless. the qpair transitions into this state right before calling a synchronous operation and then transitions to active as soon as that operation completes successfully. If the operation did not complete successfully, we were leaving qpairs in this weird intermediate state when for all intents and purposes they had reverted to an uninitialized state. Keeping qpairs in the uninitialized state until they have been added to a poll group creates a meaningful distinction between states that can be actionable from the transport level. Change-Id: I6de9bc424b393b6fff221aa2f4212aaa91488629 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443471 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-12 20:39:44 +00:00
Seth Howell	b952668186	rdma: destroy uninitialized qpairs immediately. Connections in the uninitialized state haven't been added to a poll group yet, so submitting dummy requests to them will be pointless since they will never be polled. We need to reject the connection and destroy the qpair immediately. Change-Id: Id5dd711882e1ae7c13ae32c06da2285186b00a1b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443470 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-12 20:39:44 +00:00
Seth Howell	825cac2720	rdma.c: Create a single point of entry for qpair disconnect Since there are multiple events/conditions that can trigger a qpair disconnection, we need to funnel them to a single point of entry. If more than one of these events occurs, we can ignore all but the first since once a disconnect starts, it can't be stopped. Change-Id: I749c9087a25779fcd5e3fe6685583a610ad983d3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-12 20:39:44 +00:00
Seth Howell	b6b0a0ba59	rdma: adjust I/O unit based on device SGL support For devices that support fewer SGE elements than our default values, we need to adjust the I/O unit size so that we don't ever try to submit more SGLs than we are allowed to. Change-Id: I316d88459380f28009cc8a3d9357e9c67b08e871 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/442776 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-12 18:46:57 +00:00
Seth Howell	92f5548a91	rdma: properly account num_outstanding_data_wr This value was not being decremented when we got SEND completions for write operations because we were using the recv send to indicate when we had completed all writes associated with the request. I also erroneously made the assumption that spdk_nvmf_rdma_request_parse_sgl would properly reset this value to zero for all requests. However, for requests that return SPDK_NVME_DATA_NONE rom spdk_nvmf_rdma_request_get_xfer, this funxtion is skipped and the value is never reset. This can cause a coherency issue on admin queues when we request multiple log files. When the keep_alive request is resent, it can pick up an old rdma_req which reports the wrong number of outstanding_wrs and it will permanently increment the qpairs curr_send_depth. This change decrements num_outstanding_data_wrs on writes, and also resets that value when the request is freed to ensure that this problem doesn't occur again. Change-Id: I5866af97c946a0a58c30507499b43359fb6d0f64 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/443811 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-12 18:43:44 +00:00
Seth Howell	41cd5ff4fb	rdma: fix max_read_depth_definition. max_read_depth should be based on max_qp_init_read_atomic, or the maximum number of read values that the initiator will accept as outstanding. The device attributes object contains values for both the initiator (remote side) and the target (local side). All attributes with the name init in them are meant to correspond to the initiator. The qp_read_atomic value represents the number of reads and atomic operations that can have this device as the target. qp_init_read_atomic represents how many read operations the initiator has said that we can have outstanding that have the initiator's rdma device as the target. Since this number represents how many outstanding reads we will send to the initiator at once, we should use the qp_init_read_atomic value. Change-Id: Iacc044e8321080de8accd9128ac3777bbb948afc Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/442409 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-05 18:04:04 +00:00
Ben Walker	9521d11bdb	nvmf/rdma: Remove stray spdk_nvmf_rdma_wr Wasn't used. Change-Id: I5b440e18a0a6cbb9b6137b7074a0312e51f41b95 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/441592 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 19:14:54 +00:00
Ben Walker	608d80a033	nvmf/rdma: Eliminate management channel This is a holdover from before poll groups were introduced. We just need a per-thread context for a set of connections, so now that a poll group exists we can use that instead. Change-Id: I1a91abf52dac6e77ea8505741519332548595c57 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442430 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:20:13 +00:00
Ben Walker	4e614b3127	nvmf/rdma: Capitalize SEND in code comment for consistency The READ and ATOMIC in the comment above are capitalized, so make this all caps too. Change-Id: I49fae2ceb826b22953d9b26d42b95f17e2dac617 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442427 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-04 18:12:31 +00:00
Ben Walker	e1dd85a5b7	nvmf: Don't increment current_recv_depth for dummy RECV When a connection goes to close and has no I/O outstanding, the current_recv_depth was being decremented beyond 0 and rolling over. If the poll group then finds a successful receive completion on the next poll (for a command that arrived prior to starting the disconnect but hadn't been processed yet), it would trip the max queue depth check added recently and start another disconnect process. If only one command arrives in this window, everything actually works out ok. However, if there are two receive completions sitting in the completion queue after the disconnect process is started, the first one does the double disconnect and the second one does another disconnect which ends up dereferencing a null pointer. Since there is always a special reserved slot for the dummy recv, don't do decrements or increments of the current_recv_depth for the dummy recv. This allows the code to still enforce the actual max_queue_depth on recvs without underflowing or overflowing the counter. Change-Id: I56c95b2424e956a3b007b25c50cbf47262245b8f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/442642 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-30 19:03:46 +00:00
zkhatami88	8e2f0cdb01	nvmf: Add mechanism to override nvmf pd/mr behavior Change-Id: I8d3abfcd1934bbab5bf8dacae08e8a7f29992b93 Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Reviewed-on: https://review.gerrithub.io/c/433977 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com>	2019-01-30 19:03:35 +00:00
Seth Howell	1d0a8e1cec	rdma: split PENDING_DATA_TRANSFER into two states. Since we have different requirements for submitting RDMA read and write operations, we should track them separately so that we don't block writes when the device does not have enough resources for read operations. Change-Id: I5d6424c0e26f2f5362866d1bb21eb46700c245da Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441794 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-01-28 16:58:50 +00:00
Seth Howell	158dc9470d	rdma: Make sure we don't submit too many WRs Before, the number of WRs and the number of RDMA requests were linked by a constant multiple. This is no longer the case so we need to make sure that we don't overshoot the limit of WRs for the qpair. Change-Id: I0eac75e96c25d78d0656e4b22747f15902acdab7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439573 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-28 16:58:50 +00:00
Seth Howell	dfdd76cf21	rdma: track outstanding data work requests directly. This gives us more realistic control over the number of requests we can submit. Change-Id: Ie717912685eaa56905c32d143c7887b636c1a9e9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441606 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-25 19:12:17 +00:00
Seth Howell	7289d370f7	rdma: fix rw_depth to read_depth: rw_depth was a misinterpretation of the spec. It is based on the value of max_qp_rd_atom which only governs the number of read and atomic operations. However, we were using rw_depth to block both read and write operations which is an unnecessary restriction. write operations should only be governed by the number of Work Requests posted to the send queue. We currently guarantee that we will never overshoot the queue depth for Work requests since they are embedded in the requests and limited to a size of max_queue_depth. Change-Id: Ib945ade4ef9a63420afce5af7e4852932345a460 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441165 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-25 19:12:17 +00:00
Seth Howell	5301be93cd	rdma: set wr opcodes while parsing the SGL. Change-Id: I88fdf0b48653997f790cf5de6774d1c16621a9c1 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441605 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-25 19:12:17 +00:00
Seth Howell	1f9ac1179e	rdma: add num_outstanding_data_wr tracker to req This will be necessary later on when we need to throttle send and recv requests in software. Change-Id: Ifb25eaabd15e101fbfc2959a08a321f80857b280 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/441604 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-25 19:12:17 +00:00
Xiaodong Liu	db5c3ce362	nvmf/rdma: dynamically enlarge CQ size Assigned CQ size when creating CQ may run over due to heavy workload with too many qpairs. Enlarge it dynamically can prevent IBV_EVENT_CQ_ERR caused by CQ's runover. This patch fixes issue #498: https://github.com/spdk/spdk/issues/498 Change-Id: I6c2d7194d4147d812d49d4fe787fcba5c6bbede9 Signed-off-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/c/440853 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-24 21:51:09 +00:00
Seth Howell	cf73fb2f1f	nvmf/rdma: add a pool of request_data structs This change is related to enabling multi-sgl element support in the NVMe-oF target. For single SGL use cases, there is a 1:1 relationship between rdma_requests and ibv_wrs used to transfer the data associated with the request. In the ingle SGL case that ibv_wr is embedded inside of the spdk_nvmf_rdma_request structure as part of an rdma_request_data structure. However, with Multi-SGL element support, we require multiple ibv_wrs per rdma_request. Insted of embedding these structures inside of the rdma_request and bloating up that object, I opted to leave the first one embedded in the object and create a pool that requests can pull from in the Multi-SGL path. By leaving the first request_data object embedded in the rdma_request structure, we avoid adding the latency of requesting a mempool object in the basic cases. Change-Id: I7282242f1e34a32eb59b55f326a6c331d455625e Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/428561 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2019-01-18 20:43:46 +00:00
Seth Howell	caa06154bd	rdma: fix the poll_group_create error paths. It was possible to leak pollers if we had multiple devices in the transport. The new err_exit path fixes this. Change-Id: Iafd5643c67fae741113f10afe761af1988cb6a9b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439419 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-18 16:57:37 +00:00
Seth Howell	e6ddb7df3f	rdma: use the new common poll group data buffer cache. This change is aimed at addressing github issue #555 Change-Id: I5112ac38c59f2f0a17d0c560e7e2f640a11f58a9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440419 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-18 16:57:37 +00:00
Seth Howell	e28605f47a	nvmf/transport: move buffer_pool to generic struct. This is shared between all currently valid transports. Just move it up to the generic structure. This will make implementing more shared features on top of this a lot easier. Change-Id: Ia896edcb7555903ba97adf862bc8d44228df2d36 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-17 19:42:01 +00:00
Seth Howell	e816c8fda8	nvmf: add a buffer_cache to transport opts This patch series is geared at solving github issue 555. Ultimately the goal of this series is to add a per-poll-group buffer cache to prevent starvation. Change-Id: I8ddaa47487665c2f9adce2109eb71b8fa71a7927 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439415 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-16 19:06:20 +00:00
Seth Howell	b17e0ae7db	rdma: process pending reqs before destroying qp This is an attempt to clean up requests sititng in the waiting_for_buffer state before destroying it for good. Change-Id: I8ae047e4d7fd01f30419ae346e4da49355dc033d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440127 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-15 18:11:41 +00:00
Seth Howell	e0280b1100	rdma: add drain argument to process_pending This allows us to drain all of the pending requests from the qpairs before we destroy them, preventing them from being picked up on subsequent process_pending polls. Change-Id: I149deff437b4c1764fabf542cdd25dd067a8713a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/440428 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-01-15 18:11:41 +00:00
Ziye Yang	2b787d487e	nvmf/rdma: remove the duplicated code in spdk_nvmf_rdma_request_free The purpose of this patch is to remove the duplicated code used in spdk_nvmf_rdma_request_free Change-Id: I3f74466a7ec788000eff9c2a75c9ea2cacaf5cc2 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/439942 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-14 03:56:28 +00:00
Ziye Yang	cb1c3fae98	nvmf/rdma: fix the coredump issue when ctrlr + c target When the host connects the target and does the io related job, if we use ctrlr + c, it will be crash. The issue is that we found the rqpair->qpair.group is NULL. Change-Id: Id36cfac2be9abc707bf75a2e1ddb3f414610b6f1 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/c/437232 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-01-09 21:05:32 +00:00
Seth Howell	212fd2196f	rdma: Complete rdma_req when RDMA_READ op fails This operation is not attached to a send request so we need to put the request into the completed state right away since there is no send associated with it during the draining process. Change-Id: I294f99950b00a584d8940bb4f93ac046c478d3b3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/439437 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-01-09 20:55:55 +00:00
JinYu	76675f6f60	rdma: check ibv state after rdma update it We found ibv state value may be unreasonable, so before we use the state value we do some judgement. The unreasonable state probably means hardware issue, so the process flow become unpredicatable. Fix GitHub issue #508. Change-Id: I213f4d684b103cce7bc072aecd591e2c491e0596 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/436920 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-01-09 08:29:43 +00:00
Seth Howell	fa757dc96d	rdma: dump outstanding requests from rqpairs If this happens, we have something going seriously wrong and we need as much debug information as we can get. Change-Id: I305512790461443316b9f231fa2afeb69593af1b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/438097 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-01-09 08:28:50 +00:00
Ziye Yang	0b20f2e552	nvmf/rdma: Remove data buffer num dependency on SPDK_NVMF_MAX_SGL_ENTRIES The least needed data buffer number should only be larger for completing one RDMA (read/write RDMA). Change-Id: I44eb51db279fc055f687eb78b6a642dbb5cb23f3 Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437808 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-25 01:35:43 +00:00
Ziye Yang	58f1624497	nvmf: add the transport shared buffer num configuration option. Previously, we allocate the buffer size according to the MaxQueueDepth info, however this is not exactly a good way for customers to configure, we should provided a shared buffer number configuration for the transport. Change-Id: Ic6ff83076a65e77ec7376688ffb3737fd899057c Signed-off-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-on: https://review.gerrithub.io/437450 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-12-20 19:55:57 +00:00
Evgeniy Kochetov	7da9f8faba	nvmf/rdma: Fix refcnt check on RDMA QP destroy Check for QP reference counter in RDMA QP destroy function was wrong and QP resources were never released. Change-Id: I6ab0ce39452e8263f89589d138c90f749516ebb1 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/436974 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-14 16:15:12 +00:00
Seth Howell	a451c8385e	NVMe-oF: Add explicit reports for MR-split buffers: This is a failsafe for finding and reporting data buffers that span multiple Memory Regions. These errors should never be triggered, but finding and reporting them will help any debugging. Change-Id: I3c61e3cc510f5a36039fc1815ff0de45fce794d5 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/436054 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-12-10 23:47:38 +00:00
Ziye Yang	d40f805d54	nvmf: fix the error path for shared data buffer free. Since we use aligned buffer, I think that the error handling path here is not correct, the address is wrong. Change-Id: I5bcb7f050199496423f861fd6aea65e0fe48c804 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.gerrithub.io/435992 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-12-05 05:57:09 +00:00
Seth Howell	5aca5cd71b	rdma: don't print a notice on QP state change. This notice was scaring a lot of people because every time we disconnect a qpair it tells the user that qpair is entering an error state. That is part of the normal state flow of qpairs during disconnect, but makes it seem like something is going wrong. Change-Id: I776e71db2b24fa963113fee88b5cf02c0820f171 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/435555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-12-03 09:44:06 +00:00
Jim Harris	942e02aa68	nvmf: add some instrumentation in error path Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I1b5fad59c76fb4dbb6fcedf3f5a1e24af2064c4d Reviewed-on: https://review.gerrithub.io/434271 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-11-30 19:58:14 +00:00
Liu Xiaodong	0e7ca66922	lib/trace: show specific usage of trace mask Previously, if want to know which mask bit is used for specific trace group, the only way is to check source code. Now list each trace group with its trace tpoint group mask bit in usage message Change-Id: I7a85fe9c0885f1919f6ffbdc97dab81f1986fb07 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/435448 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-30 14:52:28 +00:00
Liu Xiaodong	73a3e13280	lib/nvmf: realign tab for TRACE_GROUP_NVMF_XXX Change-Id: I7be0c7c417c84421e6abdbefb734cd0c05561194 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.gerrithub.io/435405 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-30 14:52:28 +00:00
Seth Howell	0e6a32deab	nvme_rdma/nvmf: add cb_fns to check mr contiguity This is necessary to confirm that a buffer that spans a 2_MB boundary is still in a single MR. Change-Id: If0d14e514ab2197a0d2e3af4f565f56d50591210 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/435179 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-11-29 15:16:13 +00:00
Seth Howell	1180bf8343	rdma: clean up SGE definitions and properly set values We have historically conflated SPDK_NVMF_MAX_SGL_ENTRIES and the maximum number of SGEs associated with a wr data object. For now these are the same thing, but there should be nothing tying the number of NVMe request SGL elements to the number of rdma request wr sgl elements. Also, clarify the rx_sge and tx_sge enums to reflect the actual maximum number of SGEs associated with either the send and receive queues. This change doesn't actually modify these values, but sets us up to do things like split the data in an NVMe SGE into multiple WR SGEs in case the buffer associated with the NVMe SGE is not contained in a single RDMA mr. We also need to store these values in the qpair for later usage. Change-Id: Iff3756fc72787a4b72a99b2bdf90bf486a8010fa Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/433196 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-11-16 15:18:41 +00:00
Seth Howell	1d9be84bfd	nvmf/rdma: change the default buffer size. Having the buffers be the same size as the maximum xfer size doesn't do us any favors. Make these buffers a ratio of the maximum transfer size and the number of supported nvmf SGLs. Also configure the number of nvmf request iovs to correspond with this new ratio. Change-Id: I3147dcd86b599c74521ebfdf3bcdbcdee8871a3a Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/428747 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-15 08:17:39 +00:00
Evgeniy Kochetov	90b4bd6cf9	nvmf/rdma: Fix QP shutdown procedure implementation This patch implements the following QP shutdown flow: 1. Move the QP to ERR state 2. Post dummy work requests to send and receive queues 3. Poll CQ until it returns dummy work requests (with WR Flush Error status) 4. Call ibv_destroy_qp and release resources In order to differentiate dummy and normal WRs new spdk_nvmf_rdma_wr structure was introduced which contains type of WR. Since now it is expected that wr_id field in ibv_recv/send_wr and ibv_wc always points to this structure. Based on WR type wr_id can be safely casted to correct container structure. In case of unsuccessful work completions 'opcode' can not be used for this purpose because it may be invalid (see "IB Architecture Specification Volume 1", ch. 11.4.2.1 "Poll for completion"). Change-Id: Ifb791e36114c619c71ad4d831f2c7972fe7cf13d Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/430754 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-08 21:20:25 +00:00
Ben Walker	5941ab0351	nvmf/rdma: Simplify code that casts wr_id field We were previously doing lots of checks in debug mode to verify the validity of this field. Now we understand how it works, so these checks are never going to hit and are just making the code harder to read. Change-Id: Ic82d479ae34a8c7db06db62aee1cdf6e8bec126e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430866 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-02 16:39:37 +00:00
Ben Walker	50a438d3bc	nvmf/rdma: No longer rely on wr.opcode being valid on error The specification states that opcode is not valid when the status is not success. Instead, keep track of the operation type ourselves. Change-Id: I60af4b35e761c46f5f296a61cedfca198836197f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Co-authored-by: Evgeniy Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/430865 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-11-02 16:39:37 +00:00
Ben Walker	8e7295036b	nvmf/rdma: Remove error recovery of RDMA qps After some lengthy discussions with the RDMA experts, the only way forward on an RDMA qp error is to disconnect it. The initiator can create a new qp if it wants to later on. Remove all of the error recovery code and disconnect the qp any time an error is encountered. Change-Id: I11e1df5aaeb8592a47ca01529cfd6a069828bd7f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430389 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-11-02 16:39:37 +00:00
Ben Walker	d3fa0181e3	nvmf/rdma: Move cm event processing down near where it is referenced Code movement only. No other changes. Change-Id: I04cf179ecd57154172a9369926cbeaaa37e11a52 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430505 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-31 21:56:31 +00:00
Ben Walker	039c8341e3	nvmf/rdma: Remove handling for LAST_WQE_REACHED This event only occurs when using shared receive queues, which the target does not currently support. Change-Id: If155843610cf0e961b9783d4afd64b969b4316f4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/430388 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-31 21:56:31 +00:00
Seth Howell	e6dac39cb0	nvmf/rdma: rename SPDK_NVMF_RDMA_DEFAULT_IO_UNIT_SIZE This value for the rdma transport at least is tied very closely to the size of the iover buffers. Changing the name makes it less confusing. Change-Id: I8a703f023c37f794323b7280228340aa587243fe Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/428746 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-10-12 16:30:24 +00:00
Ben Walker	aaa691b0ce	nvmf/rdma: Delay disconnect processing until connect processing is done If a disconnect occurs before connect processing has completed, delay handling the disconnect. Change-Id: Ibf91d7dc1f389be452ac6be8948c51e5dd3b9614 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425990 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Piotr Pelpliński <piotr.pelplinski@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-10-04 22:59:57 +00:00
Pawel Wodkowski	c4fee1e970	mk: don't use '-include spdk/config.h' Each file that need to check SPDK_CONFIG_* options need to include spdk/config.h explicitly. Change-Id: If9f2a91ac4c2b1a300dcf88ec3e2a12714ad344a Signed-off-by: Pawel Wodkowski <pawelx.wodkowski@intel.com> Reviewed-on: https://review.gerrithub.io/427221 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-10-02 23:13:32 +00:00
Seth Howell	5d57386885	env_dpdk: spdk_mem_map_translate informs user of translation size. This function will now check for whether or not a memory region is contiguous accross 2MB map entries and return the total length of that contiguous buffer up to the size specified by the user. Also includes unittests This series of changes is aimed at enabling spdk_mem_map_translate to report back to the user the length of the valid mem_map up to the function that requested the translation. This will be useful when retrieving memory regions associated with I/O buffers in NVMe-oF. For large I/O it will be possible that the buffer is split over multiple MRs and the I/O will have to be split into multiple SGLs. Change-Id: I2ce582427d451be5a317808d0825c770e12e9a69 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/425329 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-26 20:57:57 +00:00
Seth Howell	4e06bb5e6d	env: pass an spdk_mem_map_ops structure to mem_map_alloc This series of changes is aimed at enabling spdk_mem_map_translate to report back to the user the length of the valid mem_map up to the function that requested the translation. This will be useful when retrieving memory regions associated with I/O buffers in NVMe-oF. For large I/O it will be possible that the buffer is split over multiple MRs and the I/O will have to be split into multiple SGLs. Change-Id: I90da6d4d31c669a3bf046f7721923dd743c5ef21 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/425328 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-26 20:57:57 +00:00
Ben Walker	4d5f288c7d	nvmf/rdma: Fix double complete when RNIC goes offline A request could be completed twice, once for an error on an IBV_SEND operation and again on an outstanding IBV_RDMA_WRITE operation, if the RNIC goes offline while a complete + data transfer are occurring. This fixes GitHub issue #414 Change-Id: I2338b4d4582c5ee2512cfbd1e89048a10d3ecf1c Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425646 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-18 15:23:57 +00:00
Ben Walker	e7988759d0	nvmf/rdma: Improve behavior when unable to send response capsule Previously there was only an assert if it failed to send a response capsule. Now, release the resources associated with the request (and leave the assert in). This is a slight improvement. A full fix will likely involve forcibly terminating the connection. Change-Id: I62377078d0cb310042966a0eaca4c80c5f91f9f7 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425633 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-18 15:22:25 +00:00
Ben Walker	efe4c272f9	nvmf/rdma: Add run-time check for SEND_WITH_INVALIDATE support We were previously checking only if the version of libibverbs was suitable for SEND_WITH_INVALIDATE. However, the NIC itself also has to support it and that should be checked. Change-Id: Ia43eb761343ce4dbe0496f3c929cfb889eb5815d Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425631 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-09-18 15:22:25 +00:00
John Barnard	183d81d0c6	nvmf: Move target opts to transport opts (part 2) - Add independent functions to create transport with specific opts and add to target while maintaining backward compatibility with current apps and rpc configuration that still use the add listener method to create a transport. - Add new rpc function to create transport and add to target. + Update json reporting to include new rpc function. + Update python scripts to support new rpc function. + New nvmf test script (cr_trprt.sh) to test new rpc function. Change-Id: I12d0a42e34c9edff757755f18a78b722d5e1523e Signed-off-by: John Barnard <john.barnard@broadcom.com> Reviewed-on: https://review.gerrithub.io/423590 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-09-17 20:42:16 +00:00
Seth Howell	d288c41242	env_dpdk: change behavior of spdk_mem_map_translate The function now takes a pointer as it's last argument, and copies the size of the memory region for which the translation is validinto that pointer. For now, that will always be 2MB. However that behavior can change in the future. This series of changes is aimed at enabling spdk_mem_map_translate to report back to the user the length of the valid mem_map up to the function that requested the translation. This will be useful when retrieving memory regions associated with I/O buffers in NVMe-oF. For large I/O it will be possible that the buffer is split over multiple MRs and the I/O will have to be split into multiple SGLs. Change-Id: I8686c166ec956507f5ae55cf602341281482cb89 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/424888 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-09-15 00:28:23 +00:00
Maciej Szwed	44ab0033ba	nvmf: get qp_context only on QP related event This fixes #418 Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: I81516f0fc5720917fda24530613f8580582498ac Reviewed-on: https://review.gerrithub.io/425254 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-11 21:33:39 +00:00
Ben Walker	9b47c7e7cf	nvmf/rdma: Don't release qpair resources when messages pending If multiple notifications from ib events or cm events occur, don't release the qpair resources until all of the events have executed. Change-Id: Id569acc051819b0c76602601a7aa9b50661d2fab Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425019 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-11 16:39:09 +00:00
Ben Walker	f10a91ed0d	nvmf: Add function to get local addr for a qpair Change-Id: I19b9834c709bf97b1bbc1a9278b8c3b9350546e2 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425185 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-11 15:23:33 +00:00
Ben Walker	311ce0e2ee	nvmf: Add a function to get the listen addr for a qpair The function returns the transport ID describing the listen address on which the connection originated. Change-Id: Ib11cddb8ff2ceb04a5f3ce236ba96c68b7226773 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425023 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-11 15:23:33 +00:00
Ben Walker	1c34d1a448	nvmf/rdma: Correctly hint AI_NUMERICSERV to getaddrinfo The call seems to work out correctly without this, but the man page is clear that this hint should be provided if the service is a string containing a port number. Change-Id: I9eb966cbe3ccf310836167a5a48ac1b6bd679430 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425184 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-11 15:23:33 +00:00
Ben Walker	683c70c216	nvmf/rdma: Fix bug in get_peer_trid The port wasn't being converted from network to host byte order. Change-Id: I154349205ca09ceca932c44883ef3242acd87be3 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425183 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-11 15:23:33 +00:00
Ben Walker	e06896b94c	nvmf/rdma: On getting a wc error, force the qpair into the error state This initiates an error recovery instead of a disconnect. The error recovery may result in a disconnect if the qpair is not recoverable. This also resolves an issue where the disconnect may immediately release the resources associated with the rqpair, but upcoming wc entries may still reference it. Change-Id: I9d9e212a83129412e049c91c02725699ce2cac11 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/425010 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-10 16:44:33 +00:00
Ben Walker	8f64db180e	nvmf: Add a function to get the source address for a qpair Change-Id: I6ae1f380aebbcf090a0ff31ff96fc4592fc29591 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421173 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-09-07 16:03:06 +00:00
Ben Walker	8f5cd34671	nvmf/rdma: Pass a message to the owning thread on qpair disconnect This was the only usage of spdk_nvmf_qpair_disconnect that was not being called from the owning thread. Send a message here so that spdk_nvmf_qpair_disconnect can be simplified later. Change-Id: Ic5fae4503a95f7183079a02544812a9fc5d4def5 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/424592 Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-05 18:08:02 +00:00
Jim Harris	e8881867f8	nvmf: add tracepoints for ib async events While here, clean up the trace application output based on some debugging done with these tracepoints. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaf79f0ff8c80d0a6b9768ae0da213d57e98ec552 Reviewed-on: https://review.gerrithub.io/424286 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-05 18:03:43 +00:00
Jim Harris	82c3c30f44	trace: remove alias concept This was added a long time back for tracking an rte_mbuf whose buffer was a different rte_mbuf - all related to a userspace TCP stack that is no longer in development. The concept isn't useful now, so remove it to reduce the complexity of the tracing code. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I310e492eba7f55df242bb29d82fb19f6daee1f51 Reviewed-on: https://review.gerrithub.io/424565 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-05 18:03:43 +00:00
Ben Walker	c94020001a	thread: Add a name parameter to spdk_register_io_device This is a string name used for debugging only. Change-Id: I9827f0e6c83be7bc13951c7b5f0951ce6c2a1ece Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/424127 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-09-05 16:00:54 +00:00
Changpeng Liu	74ebeda461	nvmf: print a warning log when got completion WR error Change-Id: Ia728b4334a4f6abacdd94eecc45e27697e29522a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.gerrithub.io/424458 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-09-04 18:52:25 +00:00
Jim Harris	8bcbe397c1	nvmf: pass cmid as arg1 for spdk_trace_record This will allow us to filter tracepoints based on the connection that generated them. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3570c6613e477f4e14a85266b7e01f0fcb77f5db Reviewed-on: https://review.gerrithub.io/424280 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-09-04 17:09:25 +00:00
Ben Walker	28a61c2130	nvmf/rdma: Simplify event acknowledgement in disconnect path This no longer requires special handling - the event can be acknowledged like all of the others. Change-Id: Ib30cf35ec7aff45734ca6fe729e15d8fe41e3838 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423935 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-31 14:50:05 +00:00
Ben Walker	81d51948ad	nvmf/rdma: Move spdk_nvmf_process_cm_event by event handlers Keep the code together. This is only code movement. Change-Id: Ie52f1ab09e197192025f2b664df410ba6e1f06aa Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423934 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-31 14:50:05 +00:00
Ben Walker	e6b2caee51	nvmf/rdma: Immediately release resources for requests when killing qpair Previously, this would release resources for requests if there was an RDMA error on the qpair. Expand this case to include scenarios where the qpair is in the process of intentionally shutting down. Change-Id: Ib018f190389ee2df20eba3dddcc7dcffdbb4909d Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423745 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-31 14:50:05 +00:00
Ben Walker	764346697a	nvmf/rdma: Query qp state prior to acknowledging disconnect event This guarantees that the qpair memory still exists. Change-Id: I759197b90513f30488aa46bd26535c663e64dae6 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423744 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-31 14:50:05 +00:00
Ben Walker	9f6d509bf9	nvmf/rdma: Don't abort commands with pending RDMA ops until quiesced Don't abort commands in states indicating an RDMA operation is outstanding until an event indicates that all of the work items have completed. Change-Id: Ie2b83604bee142e383ffbcae088f4da0fd0fa658 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423413 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-31 14:50:05 +00:00
Ben Walker	745a54e420	nvmf/rdma: Handle successful requests on an errored queue pair Due to polling order, a request may have completed its previous operation successfully, but the queue pair may be in an error state. In this case, move the request directly to the completed state to release resources. Change-Id: Ic0a5ba036af246b1b6155169cf9682e943b73120 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423412 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-28 16:13:38 +00:00
Ben Walker	0d7d3a04e3	nvmf/rdma: RDMA operation errors now result in a qpair disconnect If an RDMA operation fails, initiate a queue pair disconnect. Make sure all of the resources are released appropriately. Change-Id: I8857ffc17b170279c7d30eb939fbe47da7bcdf5a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423410 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-28 16:13:38 +00:00
Ben Walker	b86bb376ff	nvmf/rdma: Avoid queryng the qp state as much as possible This call results in a syscall that should be avoided. We can often use our cached value instead. Change-Id: I11b5c5457ac2f68bfd46877d3bbc077a50dc9acb Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/423409 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Philipp Skadorov <philipp.skadorov@wdc.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-28 16:13:38 +00:00
John Barnard	8e8084903e	nvmf: Move target opts to transport opts (part 1) - Move most of the target opts from nvmf_tgt to nvmf_transport. - Update transport create functions to pass in transport opts. - When transport opts are NULL in transport create function, use target opts. (for backward compatiblity) - Part 1 of 2 patches. Part 2 (to follow after part 1 accepted) will allow independent creation of transport with specific opts while maintaining backward compatibility with current apps and rpc configuration that still use the add listener method to create a transport. Change-Id: I0e27447c4a98e0b6a6c590541404b4e4be879b47 Signed-off-by: John Barnard <john.barnard@broadcom.com> Reviewed-on: https://review.gerrithub.io/423329 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-08-27 20:43:53 +00:00
Chen Wang	6fa48bbf62	lib: fix typos in the lib directory Change-Id: Idcb60b79d2902bb316facc6f60e0a81e5cf847ed Signed-off-by: Chen Wang <chenx.wang@intel.com> Reviewed-on: https://review.gerrithub.io/423372 Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-08-24 17:15:12 +00:00
Maciej Szwed	242201d2c9	nvmf: update the IBV state only for QP related events qp_context is only available for QP related events. For other events we should not update ibv state as we try to access null object data field. Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Id8d2fee090d9a40c7e00c866914c2eb164e7587c Reviewed-on: https://review.gerrithub.io/422941 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-22 20:25:17 +00:00
Ben Walker	20f1342636	nvmf/rdma: Create pd and memory map at transport initialization Instead of waiting until the first listen address is added, create a protection domain and a memory map for every RDMA device in the system. This consumes more resources when there are RDMA devices that aren't used by the target, but it will simplify some order of operations issues when listen addresses and poll groups are added and removed at run time. Change-Id: Idfe6f8307decbf19e02765dbf67f03c2510a328f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/422602 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-21 17:02:31 +00:00
Seth Howell	1570c87f81	rdma: disbale send with inval on Soft-RoCE NICs Currently, the RXE kernel driver does not support send with invalidate. There is a change to the kernel making its way downstream that will enable this feature. At that point, we can conditionally enable send-with-invalidate based on the kernel version. Change-Id: I05c7bcbf8ec944be89c10bdf6ccc3229e4586914 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/422579 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-17 20:56:02 +00:00
Seth Howell	b4de8e1158	nvmf_tgt: add support for remote invalidate. Change-Id: I619421677ecc77c3b458c3b98fdc1cb27870a222 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/421258 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-17 20:56:02 +00:00
Leonid Ravich	eaea3f24cc	RDMA: fixing create qp failure due to not suppored send sge number, some vendorse support less send sge then SPDK_NVMF_MAX_SGL_ENTRIES. Change-Id: I5b550b537b6ff4ae5d7876a3f277f88cf06049e4 Signed-off-by: Leonid Ravich <Leonid.Ravich@dell.com> Reviewed-on: https://review.gerrithub.io/421012 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-08-16 03:35:12 +00:00
Seth Howell	e03aca3ce3	nvmf/rdma: don't delete queue pair until it is empty. Change-Id: I6ee2f9fd02292cc03db6ed16858a9d2cc9c4de05 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/421167 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-08-16 03:30:24 +00:00
Seth Howell	54c394c483	nvmf/rdma: cleanup qpairs and reqs on poll group deletion. Change-Id: I6dedf295b80148f37f75ebd5553f18dae76b2ab8 Signed-off-by: Seth Howell <seth.howell@intel.com> Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421166 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-08-13 18:57:45 +00:00
Ben Walker	ed60507d5e	nvmf: Queue pairs can no longer be removed from poll groups In RDMA, qpairs can't be removed from poll groups because the poll group defines the completion queue. So don't allow this operation anymore, even if it were theoretically possible on other transports. Change-Id: I69a3d1b336decd2d25e43ddea94f8b2095ef662f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421174 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-08-13 18:57:45 +00:00
Ben Walker	808b47c3aa	nvmf/rdma: Trigger error recovery on IBV_EVENT_SQ_DRAINED again After some other refactoring, we can now efficiently handle IBV_EVENT_SQ_DRAINED events during error cases again, so do that. Change-Id: Iba9ec59d9e6b72d8a6d8c7b74f3c3c532114a0a4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421045 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-08 16:40:21 +00:00
Ben Walker	b46fb4749b	nvmf/rdma: Rename spdk_nvmf_rdma_qp_drained to spdk_nvmf_rdma_qpair_recover Also clean up some print statements Change-Id: I67cfc9ea560298a310b1216d4542a981c0f1e8f3 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/420938 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-08 16:40:21 +00:00
Ben Walker	531fd76d10	nvmf/rdma: Treat nvmf qpair state as read-only Decide which action to take based on a combination of the nvmf qpair state and the RDMA qpair state. Change-Id: I338ace9dd66dd8dcf81aa30e51758aa81768d7f4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421162 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-08-08 16:40:21 +00:00
Ben Walker	3bec66015e	nvmf/rdma: Simplify spdk_nvmf_rdma_qp_drained No longer send an event to process the pending queue - just do it inline. Change-Id: I32716c9ecac3791de297c2a48529c15d220dbe6c Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421044 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-06 16:23:36 +00:00
Ben Walker	65a512c6cd	nvmf/rdma: Combine spdk_nvmf_rdma_qp_drained and spdk_nvmf_rdma_recover recover was only called by drained, and they're relatively small Change-Id: I65002cfe13d0045a37609be5b85be087402b4a65 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421043 Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com>	2018-08-06 16:23:36 +00:00
Ben Walker	12444f400d	nvmf/rdma: Only abort all requests when first entering error state There is no need to keep attempting to abort all requests later on, there won't be any in these other states. Change-Id: I7b12e10b87e0d0bb4a74fdf67fb278b443e70e8a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421042 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-06 16:23:36 +00:00
Ben Walker	d0d3dc4e8b	nvmf/rdma: Delay updating rdma qpair state until fully initialized The state of the RDMA qpair is not entirely initialized (RTS) until after the CM event is accepted. Delay caching the state until then. Change-Id: I39befb867fc6a01e94d7fc176071aaabb906bd07 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421041 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-06 16:23:36 +00:00
Ben Walker	a9b9f0952d	nvmf/rdma: Don't trigger error recovery on IBV_EVENT_SQ_DRAINED IBV_EVENT_SQ_DRAINED can occur during both error recovery and normal operation. We don't want to spend time sending a message to the correct qpair thread and then attempting to abort all I/O in the case where this wasn't triggered by an error. The case where this occurs during an error is very rare and only in response to a user forcing the state to err from the sqd state. For now, don't handle that case at all. Handle that corner case in a later patch. Change-Id: I16462ca52739b68f6b52a963f7344e12f7f48a55 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/420936 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-06 16:23:36 +00:00
Ben Walker	13a887f1e2	nvmf/rdma: Simplify spdk_nvmf_rdma_qp_drained This was the only call point of two very small static functions, so merge them into the main body. Change-Id: Ifdd3355ffd500ac5ad4fcf69feace65b35132906 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/420935 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-08-06 16:23:36 +00:00
Ben Walker	c3756ae387	nvmf: Eliminate spdk_nvmf_rdma_update_ibv_qp The update call was never used independently of the get call, so combine them Change-Id: Ibae622e5fd23203e79ceeae1aeccc5c7d9d1ebc0 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/420934 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-08-06 16:23:36 +00:00
Ben Walker	1cfff49fe9	nvmf/rdma: Fix formatting of spdk_nvmf_rdma_request_set_state Change-Id: Id6fb8a9f02a00f3a8e03f621b74f7505c549a345 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/421040 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2018-08-03 06:50:41 +00:00
Seth Howell	4bee4e03b6	nvmf: free AER resourcess before disconnecting qpair It is necessary to free the AER without sending a completion to ensure that the host does not attempt to send an additional AER upon receiving the first completion. Change-Id: I2b3f8f286d6396019d8ace97d2376547705b8d9d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/420661 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-07-27 20:50:36 +00:00
Seth Howell	388e310150	nvmf: add free_req function pointer. At times, it may be necessary to free requests without completing them. For example, when freeing a qpair, one needs to free the AER sent from the host before deleting the qpair. It is important not to send a completion for the AER because: 1. According to the spec, this will trigger the host to send another AER 2. No Asynchronous Events have occured, so we should not complete the AER. Change-Id: I92e163f0fed0ee2bc942569a647cb3c1967edec9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/419732 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-07-27 20:50:36 +00:00
Philipp Skadorov	4bfb557d80	nvmf/rdma: recover qp from fatal errors RDMA QP is attempted to recover after IBV_EVENT_QP_FATAL event is received from IBV asynchronous event API. RDMA QP is put into ERROR state and is not processing any inbound requests. The outstanding requests are only allowed to COMPLETED and FREE states, no outbound transfers are performed. IBV_EVENT_QP_LAST_WQE_REACHED or IBV_EVENT_SQ_DRAINED event is expected to follow IBV_EVENT_QP_FATAL, giving a go to draining of all outstanding requests and freeing the associated resources. The requests executed by block layer are gracefully allowed to complete, but no outbound transfers are made. Note, outstanding requests can not be reliably completed through polling the CQ, as WC's with failure status might not have all the fields valid. The failed WC's are dropped and the outstanding requests are fetched from the appropriate state's linked list. QP recovery is triggered when there is no more outstanding requests. If QP recovery is completed succesfully, the RDMA QP is put back into ACTIVE state, the QP disconnect is triggered otherwise. Change-Id: I45ee7feea067f80ccc6402518990014d691fbda3 Signed-off-by: Philipp Skadorov <philipp.skadorov@wdc.com> Reviewed-on: https://review.gerrithub.io/416879 Chandler-Test-Pool: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2018-07-18 20:58:50 +00:00
Philipp Skadorov	fdec444aa8	nvmf/rdma: track requests in any state Requests that are being put into IBV context are lost when IBV QP breaks and its SQ drains. In order to track NVMf/RDMA requests, RDMA QP has been reworked to track requests at any state with queues of requests for each state. This allowed to get rid of a few intermediate queues and request counters. A couple of states has been added to track outbound requests with and without data. They will be used by QP recovery for freeing resources assigned to outstanding requests. Change-Id: Ie84207325c38e5bb2c247cd6dcddb82dfad0d503 Signed-off-by: Philipp Skadorov <philipp.skadorov@wdc.com> Reviewed-on: https://review.gerrithub.io/416878 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-07-12 01:02:25 +00:00
Daniel Verkamp	5518a327a8	nvmf/rdma: fix error paths in spdk_nvmf_rdma_create Most of the error paths in this function leaked resources. Make them all use spdk_nvmf_rdma_destroy() so all resources are consistently freed. The spdk_io_device_register() call is moved to the top of the function so that the io_device is always valid when calling the destroy function. Change-Id: Ic92f09f157ee8245fb962d8bc3330aadd87b294a Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-on: https://review.gerrithub.io/418869 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <optimistyzy@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-07-11 16:04:43 +00:00
Daniel Verkamp	043e5edb1f	nvmf/rdma: check for rdma_get_devices() failure rdma_get_devices() may return NULL on failure; we need to check for this before dereferencing the returned pointer. Fixes GitHub issue #360. Change-Id: I9628e5865365d256f4b1887bf07ce8737b55d356 Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-on: https://review.gerrithub.io/418868 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com>	2018-07-10 23:53:57 +00:00
Senthil Kumar V	6138d3bc72	nvmf: Allow In-Capsule data size to be 0. Change-Id: I59f4f69ed695cc9a2b6d0b87052fdf50004ee1c7 Signed-off-by: Senthil Kumar V <senthil.kumar.veluswamy@wdc.com> Reviewed-on: https://review.gerrithub.io/418170 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-07-09 22:24:40 +00:00
shahar salzman	a0246f6553	lib: validate ib_verbs context is valid before using it Change-Id: I54793624e46a4e51b0c989ddfe933ccb5f035123 Signed-off-by: shahar salzman <shahar.salzman@kaminario.com> Reviewed-on: https://review.gerrithub.io/417858 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-07-09 19:06:36 +00:00
Seth Howell	09e3f4e3db	nvmf: give qpair_disconnect an asynchronous api. qpair_disconnect has previously presented an entirely synchronous API. However, it relies on other asynchronous operations to complete its task. By giving it an asynchronous API, we can avoid possible race conditions. Patch 1 of several. Change-Id: If9e26ee70ae5d6c0273750226b4408a8e4587e19 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/417345 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-07-06 22:49:39 +00:00
Ben Walker	f80001e2c6	nvmf/rdma: Unset poll group pointer when qpair is removed Change-Id: I2eb84490144c2e1f772c4094645e5067149d2862 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/415316 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-06-20 22:07:24 +00:00
Ben Walker	878185cf0e	nvmf: Rename spdk_nvmf_ctrlr_disconnect to spdk_nvmf_qpair_disconnect Change-Id: I0c6c410d120bec830ec17105de43ca62bf202b7b Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/415313 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2018-06-15 19:11:29 +00:00
Ben Walker	6a5ae72b47	nvmf: Add trace points for the RDMA state machine Remove the old trace points since they didn't actually work. More trace points should be added in the future. Change-Id: I1b658af8e309137882c31460723d7bb94d555b79 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/414280 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-06-12 20:01:33 +00:00
Ben Walker	a83f91c29a	thread: Replace #include of io_channel.h with thread.h Change-Id: I6babd4cf990bf19b510db88bdfb0ca81e29d9252 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/414700 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Madhu Pai <mpai@netapp.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-06-12 15:24:07 +00:00
zkhatami88	0cdb08b0e0	env: add size parameter to spdk_mem_map_translate Change-Id: I808101edaf4d75613baf19a950915f1d8e75b1af Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Signed-off-by: Daniel Verkamp <daniel.verkamp@intel.com> Reviewed-on: https://review.gerrithub.io/413154 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Zahra Khatami <zahra.k.khatami@oracle.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2018-06-05 18:36:00 +00:00
Philipp Skadorov	b6f90c527a	nvmf/rdma: monitor asynchronous events NVMf cnx acceptor poller is changed to check the asynchronous events from the RDMA devices. RDMA async events are polled together with RDMA CM events; the file descriptors are combined into a poll fd array and processed in a single poll syscall. The errors handler is an empty placeholder for this patch, it just prints the kind of event read from the IB device context. The work for implementing event handling is left for later. Signed-off-by: Philipp Skadorov <philipp.skadorov@wdc.com> Change-Id: Ib167990651b585090aceef1404a88d431a910226 Reviewed-on: https://review.gerrithub.io/412540 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-06-04 17:28:04 +00:00
Srikanth kaligotla	8580daa1ac	nvmf: SGL support for NVMF RDMA Driver. Change-Id: I447754c69de432b5a65dc8c1d9ae690926e88c51 Signed-off-by: John Meneghini <johnm@netapp.com> Signed-off-by: Srikanth kaligotla <kalis@netapp.com> Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/410302 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Daniel Verkamp <daniel.verkamp@intel.com>	2018-06-04 17:15:49 +00:00
Ben Walker	4a8b3adb44	nvmf: Simplify qpair disconnect code path This path works for disconnect events on qpairs at run time. Disconnects in response to killing the target have not been worked out yet. This path does not currently wait for outstanding I/O to complete. Change-Id: I8e476c8444b460c18e51601fb950b9132d12f67d Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/412076 Tested-by: SPDK Automated Test System <sys_sgsw@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2018-05-30 17:38:26 +00:00

... 2 3 4 5 6 ...

590 Commits