ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Hailiang Wang	73a171a07c	rdma: assert ibv_send_wr is not NULL Vhost testing crashed from Nightly testing, because a member access within null pointer of type 'struct ibv_send_wr'. Change-Id: If8f34f23864883ea73516d2d1fe3b30137c04316 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458913 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-06-25 13:37:15 +00:00
Alexey Marchuk	53777de855	rdma: Unset IBV_SEND_SIGNALED flag for RDMA_WRITE operations Unsetting this flag will decrease the number of WRs retrieved during CQ polling and will decrease the oeverall processing time. Since RDMA_WRITE operations are always paired with RDMA_SEND (response), it is possible to track the number of outstanding WRs relying on the completed response WR. Completed WRs of type RDMA_WR_TYPE_DATA are now always RDMA_READ operations. The patch shows %2 better peformance for read operations on x86 machine. The performance was measured using perf with the following parameters: -q 16 -o 4096 -w read -t 300 -c 2 with nvme null device, each measurement was done 4 times avg IOPS (with patch): 865861.71 avg IOPS (master): 847958.77 avg latency (with patch): 18.46 [us] avg latency (master): 18.85 [us] Change-Id: Ifd3329fbd0e45dd5f27213b36b9444308660fc8b Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Signed-off-by: Evgenii Kochetov <evgeniik@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/456469 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-06-11 18:07:28 +00:00
Jim Harris	bf647c168a	nvmf: increase default max num qps to 128 This matches the Linux kernel target. Users can still decrease this default when creating the transport (i.e. -p option for nvmf_create_transport in rpc.py). Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Icad59350a2cd35cfc4ad76d06399345191680c05 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/454820 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-22 14:50:05 +00:00
Seth Howell	61948a1ca7	rdma: add check for allocating too many SRQ. We could run into issues with this if we were using an arbitrarily large amount of cores to run SPDK. Change-Id: Ia7add027d7e6ef1ccb4a69ac328dbdf4f2751fd8 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452250 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-15 20:29:32 +00:00
Seth Howell	14777890a6	rdma: add an stailq for qpairs pending recv This will help us not iterate through the whole list of connections when only some of them have pending recvs. Change-Id: I681bc98befbdda4e77ef333b7a086c08b2708eb3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449266 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-05-13 22:09:55 +00:00
Seth Howell	c3884f943c	rdma: batch rdma recvs per poll. This will help save MMIO overhead. Especially in the SRQ case. Change-Id: I6fb70cf6de4763450f97961f41ccdce3acec2e63 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449265 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-13 22:09:55 +00:00
Seth Howell	b4dc10fbb7	rdma: create a list for qpairs pending send transfers By creating a list of qpairs, we can avoid looping over every connected qpair to process sends each time we poll. Change-Id: If24bbc363176f52fbfb756d56719edd885a21a11 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449264 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-10 22:24:35 +00:00
Seth Howell	9d63933b7f	rdma: batch rdma sends. By batching ibv sends each time we poll, we can reduce the number of MMIO writes that we do. Change-Id: Ia5a07b0037365abfa8732629c34d34a9ed49ac70 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449253 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-10 22:24:35 +00:00
Seth Howell	350e429a57	rdma: add a flag for disabling srq. There are cases where srq can be a detriment. Add a flag to allow users to disable srq even if they have a piece of hardware that supports it. Change-Id: Ia3be8e8c8e8463964e6ff1c02b07afbf4c3cc8f7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452271 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-06 18:11:13 +00:00
Jim Harris	b6206d657c	trace: shorten max name from 44 to 24 characters This restriction helps reduce the amount of padding when printing out the event trace, allowing it to fit in a small number of columns. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifa31e5a6967c7b9bc7028069effb71533f80596f Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452736 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-05-02 08:41:56 +00:00
Jim Harris	617184be3b	trace: remove short_name This was not used by any of the trace register descriptions. Let's remove it rather keeping it around if we don't need it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Idda809e2911db5be555ff6aa13695484a14bf665 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452734 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2019-05-02 08:41:56 +00:00
Seth Howell	6cc18a64aa	rdma.c: Don't set recv->qpair to NULL We can use the rpoller->srq to check if a qpair is valid when processing recv completions. Change-Id: I6aa360adc48a3312ddcf79f10e2a65b502a7314f Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452247 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-01 18:48:13 +00:00
Seth Howell	33f60621af	lib: resize key mempools Mempools are based off of a ring structure which allocates its elements as a power of two. It also only exposes n-1 elements to the user. So when we create a mempool with 2^n elements in it, we have to allocate a ring with 2^n+1 entries. By decreasing the number of elements in these key mempools by 1, we can save a decent amount of memory. Change-Id: I942c9dd4cf59096969bc2559fb46fd2084a07f09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448875 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-05-01 17:45:29 +00:00
Seth Howell	d05c553827	rdma: don't spam people with async event messages. It used to be that we would get async events very infrequently. However, with the introduction of SRQ, this number has gone up tremendously. Change the way we report our these events so that we don't spam/confuse people running the target. Change-Id: I33070281fa854cbc17784d61bbbb870196ca8780 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452159 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-26 18:10:56 +00:00
Seth Howell	ec47f92b9b	rdma: fix potential heap-use-after-free in srq shutdown If there are outstanding recvs for a qpair when it is destroyed, we need to clear the qpair from it before reposting it. Otehrwise, we have a potential heap-use-after-free of double free (depending on whether the recv completion is in error state or not). See github issues #730 Change-Id: Ic2009c761cbcc5e89174f62fbd0872d0489c67ca Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/452122 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2019-04-26 11:16:22 +00:00
Seth Howell	89d2efe07e	rdma: set the srq param in the initiator. We were setting this value in the target from our initiator, but it turns out the rdma_conn_params struct is responsible for setting the opposite side so we need to add it in the target side when accepting connections. Also, add a test to demonstrate target functionality when we overwhelm the SRQ. It is useful to note that performance really tanks when you start overwhelming the srq so it may be useful to use this test case to check performance gains in edge cases over time. Change-Id: Iac541bd9fc1d82eca9f21e7abc3f625663a6c460 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/451678 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-24 09:22:16 +00:00
jiaqizho	b70e698465	rdma:fix core dump when rdma_create_qp return error. Signed-off-by: jiaqizho <jiaqi.zhou@intel.com> Change-Id: Ie900e01820f69fc5b2d5e30d519c6b619d7a7281 Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449507 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-22 18:40:35 +00:00
Seth Howell	7d7b44f2a6	rdma: decrement descriptor before checking SEND_WITH_INVAL We were incrementing over the end of the descriptor list and assigning undefined values to the rsp opcode in SEND_WITH_INVAL case. We were only hitting this error when mixing sgl and inline requests in the same workload. We were just by chance hitting a four bit value that was set to all 1s from the in capsule data from the last request. Change-Id: Ied06356f3d22fa34a2cd869dfad6bdca8720791d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450873 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-19 17:29:45 +00:00
Seth Howell	2cc6b0dfcb	rdma: set the number of wr sge_entries per I/O This was not being properly set in the multi-sgl path. Also add a verification step to the fio configuration file to prevent against future regressions. Change-Id: I510b6acd92bc2fbc9b6fbec1d59945cc53584ad3 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-19 17:29:45 +00:00
JinYu	dd90ff7a21	nvmf/rdma: fix bugs in spdk_nvmf_rdma_qpair_destroy Rqpair qp and resources maybe not be created, if rqpair fail to initialise. For example, in function new_qpair, the code run to spdk_nvmf_qpair_disconnect, but rqpair is initialised in poll_group_add. Fix #557 segmentaion fault(core dump) Change-Id: I1892e6d13e2d53dd5a7c4856d775f9b3b85da961 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450986 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-18 21:47:57 +00:00
JinYu	c7395a1171	nvmf: fix the rqpair->current_send_depth If rsp->status.sc != SUCCESS and xfer == DATA_CONTROLLER_TO_HOST, We would not send the data WR, so clean the num_outstanding_data_wr. Fix #728 Change-Id: I32259788e495ed76f8f02a9d871bd56356d93dc4 Signed-off-by: JinYu <jin.yu@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/450726 Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-04-16 14:42:03 +00:00
Seth Howell	1fb629c4d2	rdma: make the pending_data_buf_queue an STAILQ Should speed up operations, and allows us to remove the 16 byte link object from the request structure. Change-Id: Ie62df1f44d22580a7a7ae41c498295841d1e3064 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448080 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 21:34:55 +00:00
Seth Howell	9f7582c3a5	rdma: reorder qpair elements to plug hole Saves 8 bytes Change-Id: Icb429ba79d7a085978950dd3045aa9ef28351101 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448073 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	91105e2031	rdma: Don't store ibv_qp_attr in the qpair. We were only using one enum from this whole struct, so there is no need to store it. Plus the queries we use to update it are so infrequent and only occur during connect and disconnect so I think we can save quite a bit of space by removing this without compromising performance. Change-Id: Icf29977a3c10cb289564fa2760a0059f07a0f8cb Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448072 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	ab79560e65	rdma: simplify spdk_nvmf_rdma_poller_poll. There was a lot of duplicated code here between states. I'm trying to minimize the duplicated code without making it confusing. Change-Id: I13183431e554c8a9f501b3385bbd7b59e2c83161 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448066 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	a8169c37e0	rdma: add error path for fill_iovs_multi_sgl Catch an edge case where a multi sgl request is longer than the allowed transfer size. Change-Id: I79779050fe951d16f1240e2c3d8cf5037e576ea2 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/440766 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-04 04:34:59 +00:00
Seth Howell	6812b63c5f	rdma: always allocate buffers for requests upfront This is important to avoid thrash when we don't have enough buffers to satisfy a request. Change-Id: Id35fd492078b8e628c2118317f674f07e95d4dba Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449109 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-04-04 04:34:59 +00:00
Seth Howell	f4adbc79ce	rdma: optimize and move buffers variable. The buffers are really specific to the request and not the wr or data object. In the case of multiple wr requests, the maximum number of buffers per req is equal to the number of SGEs in the NVMe-oF request *2. Change-Id: Ic59498bfed461d180adb2fb9a481ac5b11fa9252 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449108 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-04-02 23:26:08 +00:00
Seth Howell	62700dac2e	nvmf/rdma: Add support for multiple sgl descriptors to sgl parser Enable parsing an nvmf request that contains an inline nvme_sgl_last_segment_descriptor element. This is the next step towards NVMe-oF SGL support in the NVMe-oF target. Change-Id: Ia2f1f7054e0de8a9e2bfe4dabe6af4085e3f12c4 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/428745 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-04-02 23:26:08 +00:00
Seth Howell	934775db43	rdma: make semantic changes to fill_buffers func Changing i to iovcnt in all references to the req->iov structure will be important when we start processing multi-sgl requests. Change-Id: I90a9b6d872b94f846ae7d29a45dd2703eafa6175 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449201 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-29 19:02:22 +00:00
Seth Howell	e70a759489	rdma: pull buffer assignment out of fill_iovs This will be used by the multi-sgl version of this function as well. Change-Id: Iafeba4836a77482fa2a158f86f1c17fe7fdeb510 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/449104 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-29 19:02:22 +00:00
Seth Howell	a9fc7e1db8	rdma: use LAST_WQE_REACHED event in the SRQ path This event is generated by NICs utilizing the SRQ feature when the last RECV for that qpair is processed. I have confirmed this feature. Change-Id: Ib6d6b6d02987f789b4d5dd3daf734e3351ee1974 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448063 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-25 17:23:51 +00:00
yidong0635	fc43fbba04	rdma: fixed heap used after free issue. With ASAN to run this cases, it will report issue about heap used after free in spdk_nvmf_rdma_qpair_destroy. Resources have been released before, change the order to in this tailq to release resources. ERROR: AddressSanitizer: heap-use-after-free on address 0x6080000080e0 at pc 0x0000006e1e3f bp 0x7fd48b6c3df0 sp 0x7fd48b6c3de0 READ of size 8 at 0x6080000080e0 thread T3 (reactor_1) 0x6e1e3e in spdk_nvmf_rdma_qpair_destroy spdk/lib/nvmf/rdma.c:813 Change-Id: Ia1c12bca84955a2de60399e6b265c9b8901bb51e Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448534 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Seth Howell <seth.howell5141@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-21 18:00:04 +00:00
Seth Howell	e59ac513fb	rdma: remove reqs from read/write queues in error Not doing so can cause us to hit asserts during the shutdown path. This should fix an intermittent failure we are seeing on the test pool where we hit the assert rdma_req->state != RDMA_REQUEST_STATE_FREE in spdk_nvmf_rdma_request_process. Note that this problem doesn't cause any data corruption when debug is not enabled, it just causes us to probcess a subset of commands through the state machine one extra time suring qpair shutdown. Change-Id: Ibc36bfea87ec4089b8e2c7a915f48714fddb0b09 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447843 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-19 18:18:45 +00:00
Seth Howell	33668b2254	rdma: change structure of drained_qpair to work w/ messages. This will become important later on. Change-Id: I94e5af03359e476afbc68664e43f44269ad5974c Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448074 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	7dd3cf441a	rdma: limit the completion queue based on the SRQ. When we have a shared receive queue, the number of outstanding items associated with a completion queue is deterministic, and limited by how many RECVs we have total in the SRQ. So, we can set the total size of the Completion queue at the beginning of time and never resize it. Change-Id: I787e4c5bbd52ac8948a323d1301f926f887cd91c Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447492 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	a5972c6245	rdma: consolidate common error paths in qpair_init Consolidating error paths is common practice in SPDK so do that here to make the function more uniform and save space. Change-Id: I98c5d5f7feeb688f1d8b24f4d2d3461a43d00c1d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448191 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	97a43680a9	rdma: move cq_resize to its own function. Change-Id: I07aef399320fd4a014f63760670ea765d2e18b4b Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/448190 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	fa79f64ad1	rdma: Keep a pointer to the SRQ in the qpair Change-Id: Id173038b6ad6b1564acf5d6886814f7d310964c7 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447471 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	01201d3e87	rdma: remove compile time config for SRQ Change-Id: I44af3ee4dc6ec76045e1d0614910402487098a3d Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447120 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-18 23:32:21 +00:00
Seth Howell	0d3fcd10e9	rdma: add function to create qpair resources. Change-Id: Id865e2a2821fe04c1f927038d6dd967848cd9a55 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446999 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-15 19:19:17 +00:00
Ben Walker	353fbcdaf0	nvmf/rdma: Create function to destroy rdma resources This unifies the clean up path between SRQ and normal operation. Change-Id: I396d7e3749579f27b5bb1e89b9d6761a77ba5beb Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446979 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2019-03-15 19:19:17 +00:00
Ben Walker	b25751d99d	nvmf/rdma: Add a structure to hold rqpair/rpoller resources Depending on whether SRQ is enabled, resources may be allocated to the rqpair or to the rpoller. Create a struct to hold these pointers that can be used in both locations to avoid duplicated code. Change-Id: I2c8fc59009201d9e41721e6462a81732b529a9e0 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446978 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Eugene Kochetov <evgeniik@mellanox.com>	2019-03-15 19:19:17 +00:00
Ben Walker	527be2bf4e	nvmf: Remove qpair_is_idle This wasn't used anywhere. Change-Id: I405af3c808be284d19218f3f04c1e90e33e31de8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446977 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2019-03-15 19:19:17 +00:00
Evgeniy Kochetov	ed0b611fc5	nvmf/rdma: Add shared receive queue support This is a new feature for NVMEoF RDMA target, that is intended to save resource allocation (by sharing them) and utilize the locality (completions and memory) to get the best performance with Shared Receive Queues (SRQs). We'll create a SRQ per core (poll group), per device and associate each created QP/CQ with an appropriate SRQ. Our testing environment has 2 hosts. Host 1: CPU: Intel(R) Xeon(R) CPU E5-2609 0 @ 2.40GHz dual socket (8 cores total) Network: ConnectX-5, ConnectX-5 VPI , 100GbE, single-port QSFP28, PCIe3.0 x16 Disk: Intel Optane SSD 900P Series OS: Fedora 27 x86_64 Host 2: CPU: Intel(R) Xeon(R) CPU E5-2630 v2 @ 2.60GHz dual-socket (24 cores total) Network: ConnectX-4 VPI , 100GbE, dual-port QSFP28 Disk: Intel Optane SSD 900P Series OS : CentOS 7.5.1804 x86_64 Hosts are connected via Spectrum switch. Host 1 is running SPDK NVMeoF target. Host 2 is used as initiator running fio with SPDK plugin. Configuration: - SPDK NVMeoF target: cpu mask 0x0F (4 cores), max queue depth 128, max SRQ depth 1024, max QPs per controller 1024 - Single NVMf subsystem with single namespace backed by physical SSD disk - fio with SPDK plugin: randread pattern, 1-256 jobs, block size 4k, IO depth 16, cpu_mask 0xFFF0, IO rate 10k, rate process “poisson” Here is a full fio command line: fio --name=Job --stats=1 --group_reporting=1 --idle-prof=percpu \ --loops=1 --numjobs=1 --thread=1 --time_based=1 --runtime=30s \ --ramp_time=5s --bs=4k --size=4G --iodepth=16 --readwrite=randread \ --rwmixread=75 --randrepeat=1 --ioengine=spdk --direct=1 \ --gtod_reduce=0 --cpumask=0xFFF0 --rate_iops=10k \ --rate_process=poisson \ --filename='trtype=RDMA adrfam=IPv4 traddr=1.1.79.1 trsvcid=4420 ns=1' SPDK allocates the following entities for every work request in receive queue (shared or not): reqs (1024 bytes), recvs (96 bytes), cmds (64 bytes), cpls (16 bytes), in_capsule_buffer. All except the last one are fixed size. In capsule data size is configured to 4096. Memory consumption calculation (target): - Multiple SRQ: core_num * ib_devs_num * SRQ_depth * (1200 + in_capsule_data_size) - Multiple RQ: queue_num * RQ_depth * (1200 + in_capsule_data_size) We ignore admin queues in calculations for simplicity. Cases: 1. Multiple SRQ with 1024 entries: - Mem = 4 * 1 * 1024 * (1200 + 4096) = 20.7 MiB (Constant number – does not depend on initiators number) 2. RQ with 128 entries for 64 initiators: - Mem = 64 * 128 * (1200 + 4096) = 41.4 MiB Results: FIO_JOBS kIOPS Bandwidth,MiB/s AvgLatency,us MaxResidentSize,kiB RQ SRQ RQ SRQ RQ SRQ RQ SRQ 1 8.623 8.623 33.7 33.7 13.89 14.03 144376 155624 2 17.3 17.3 67.4 67.4 14.03 14.1 145776 155700 4 34.5 34.5 135 135 14.15 14.23 146540 156184 8 69.1 69.1 270 270 14.64 14.49 148116 156960 16 138 138 540 540 14.84 15.38 151216 158668 32 276 276 1079 1079 16.5 16.61 157560 161936 64 513 502 2005 1960 1673 1612 170408 168440 128 535 526 2092 2054 3329 3344 195796 181524 256 571 571 2232 2233 6854 6873 246484 207856 We can see the benefit in memory consumption. Change-Id: I40c70f6ccbad7754918bcc6cb397e955b09d1033 Signed-off-by: Evgeniy Kochetov <evgeniik@mellanox.com> Signed-off-by: Sasha Kotchubievsky <sashakot@mellanox.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/428458 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-15 19:19:17 +00:00
Seth Howell	62266a72cf	rdma: allocate protection domains for devices up front. We were only using one pd per device anywas, and this is necessary for shared receive queue support. Change-Id: I86668d5b7256277fe50836863408af2215b5adf9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/447385 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-03-12 21:37:51 +00:00
Seth Howell	bb3e441388	rdma: destroy qpairs based on num_outstanding_wr. Both Mellanox and Soft-RoCE NICs work with this approach. Change-Id: I7b05e54037761c4d5e58484e1c55934c47ac1ab9 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/446134 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-03-08 21:09:09 +00:00
Seth Howell	961cd6ab7e	rdma: register a poller to destroy defunct qpairs Not all RDMA drivers fail back the dummy recv and send operations that we send to them when destroying a qpair. We still need to free the resources from these qpairs to avoid eating up all of the system memory after multiple connect and disconnect events. Since we won't be getting any more completions, the best heuristic we can use is waiting a long time and then freeing the resources. qpair_fini is only called from the proper polling thread so we can safely call process_pending to flush the qpair before closing it out. Change-Id: I61e6931d7316d1e78bad26657bb671aa451e29f4 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/443057 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2019-03-04 19:12:48 +00:00
Seth Howell	59f0d22e40	rdma: Fix misordered assert and decrement. In the error path, we were first decrementing a variable and then asserting that it must be >0. These operations should occur in the opposite order. Change-Id: I6cec544faf17bb75cbfca3d3a3c173dc5db14f99 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/446440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: yidong0635 <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2019-02-28 21:20:38 +00:00
Seth Howell	756ce464f6	rdma: update default number of shared buffers. When the decision was made to uncouple the number of shared buffers from the queue depth and allow the user to decide for themselves, the default was also significantly lowered, which caused some issues when trying torun performance tests (See https://github.com/spdk/spdk/issues/699). While this is a user modifiable variable, it is still best to keep the higher default value. The original value was equivalent to max_queue_depth * SPDK_NVMF_MAX_SGL_ENTRIES * 2 with the defaults for max_queue depth and max_sgl_entries being 128 and 16 respectively. Hence 4096 fixes: `0b20f2e552` Change-Id: I809e97a10973093a2b485b85bca7160091166f70 Signed-off-by: Seth Howell <seth.howell@intel.com> Reviewed-on: https://review.gerrithub.io/c/446525 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2019-02-28 21:09:50 +00:00

1 2 3 4 5 ...

415 Commits