Call nvmf_ctrlr_abort_request() if a request whose CID matches
is found and its state is executing.
nvmf_tcp_qpair_abort_request() returns immediately if rc is
SPDK_NVMF_REQUEST_EXEC_STATUS_ASYNCHRONOUS and calls
spdk_nvmf_request_complete() otherwise.
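The dispatch described above looks roughly like this (a sketch only;
the CID extraction and the by-CID lookup helper are illustrative, and
the state-machine name follows the tcp transport's conventions):

    static void
    nvmf_tcp_qpair_abort_request(struct spdk_nvmf_qpair *qpair,
                                 struct spdk_nvmf_request *req)
    {
        uint16_t cid = req->cmd->nvme_cmd.cdw10 >> 16; /* CID in cdw10[31:16] */
        struct spdk_nvmf_tcp_req *tcp_req;
        int rc;

        tcp_req = tcp_req_find_by_cid(qpair, cid);     /* hypothetical helper */
        if (tcp_req != NULL && tcp_req->state == TCP_REQUEST_STATE_EXECUTING) {
            rc = nvmf_ctrlr_abort_request(req);
            if (rc == SPDK_NVMF_REQUEST_EXEC_STATUS_ASYNCHRONOUS) {
                return; /* completion happens later */
            }
        }
        spdk_nvmf_request_complete(req);
    }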
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I1abceecc211ee79d8ac18a82dc63b13d313a6f27
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3008
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>
The state machine differs among NVMe-oF transports and is
encapsulated away from the transport-neutral NVMe-oF controller and
NVMe-oF qpair.
To implement the abort operation for each NVMe-oF transport,
add a function pointer qpair_abort_request to struct spdk_nvmf_transport_ops
and a stub nvmf_transport_qpair_abort_request() that hides
which transport is used.
The following patches will implement qpair_abort_request for each
transport. Each qpair_abort_request() is responsible for calling
spdk_nvmf_request_complete() for the abort request.
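In shape, the new op and its transport-neutral stub look like this
(sketch; the surrounding ops are elided):

    struct spdk_nvmf_transport_ops {
        /* ... existing ops ... */
        void (*qpair_abort_request)(struct spdk_nvmf_qpair *qpair,
                                    struct spdk_nvmf_request *req);
    };

    void
    nvmf_transport_qpair_abort_request(struct spdk_nvmf_qpair *qpair,
                                       struct spdk_nvmf_request *req)
    {
        qpair->transport->ops->qpair_abort_request(qpair, req);
    }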
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I2beac959ed428c5108cf33691226b7fae5cd24d6
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3007
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
A poller should return a status > 0 when it did some work
(CPU was used for some time), marking its call as busy
CPU time.
Active pollers should return the BUSY status only if they
did meaningful work beyond checking some conditions
(e.g. processing requests or performing complex operations).
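A sketch of the convention (the poller body and helper are
illustrative; SPDK_POLLER_BUSY/SPDK_POLLER_IDLE are the named
constants for the two return values, equivalent to returning 1/0):

    static int
    transport_poll(void *arg)
    {
        struct poll_group *group = arg;          /* illustrative */
        int count;

        count = process_pending_requests(group); /* illustrative */

        /* Busy only when requests were actually processed, not when
         * we merely checked a condition and found nothing to do. */
        return count > 0 ? SPDK_POLLER_BUSY : SPDK_POLLER_IDLE;
    }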
Signed-off-by: Maciej Szwed <maciej.szwed@intel.com>
Change-Id: Id4636a0997489b129cecfe785592cc97b50992ba
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2164
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI
Community-CI: Mellanox Build Bot
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
It should return NVME_TCP_PDU_FATAL. I think this issue was
introduced when we moved the data copy from the tcp transport
layer to the socket layer. Returning NVME_TCP_PDU_FATAL here
makes the code consistent with the rest of the logic in the same
function.
With this patch, large I/O writes from the initiator work again.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: Ide018adb603eb13d002fc98886258dd1e2424f7c
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3122
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
With this, the transport can decide particular ctrlr attributes
per ctrlr rather than globally.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: Ia3fb0d4e576cb9f8ce6df75f775e2fd5727d7f48
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2757
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI
Community-CI: Mellanox Build Bot
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This parameter describes the number of admin and IO
qpairs, but the admin qpair always exists and should not
be configured explicitly.
Introduce a new parameter `max_io_qpairs_per_ctrlr`
which configures the number of IO qpairs only.
The internal structure of the NVMF transport is not changed;
both RPC parameters configure the same nvmf transport parameter.
Deprecate max_qpairs_per_ctrlr in spdkcli as well.
Side change: update the dif_insert_or_strip description -
it can be used by the TCP and RDMA transports.
Config file parsing is not changed since it is deprecated.
Fixes #1378
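Roughly, both RPC parameters end up in the same transport opt
(sketch; the params structure and warning text are illustrative):

    if (params->max_qpairs_per_ctrlr_set) {      /* deprecated path */
        SPDK_WARNLOG("max_qpairs_per_ctrlr is deprecated, "
                     "please use max_io_qpairs_per_ctrlr\n");
        opts.max_qpairs_per_ctrlr = params->max_qpairs_per_ctrlr;
    }
    if (params->max_io_qpairs_per_ctrlr_set) {
        /* +1 accounts for the always-present admin qpair */
        opts.max_qpairs_per_ctrlr = params->max_io_qpairs_per_ctrlr + 1;
    }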
Change-Id: I8403ee6fcf090bb5e86a32e4868fea5924daed23
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2279
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Community-CI: Broadcom CI
I missed a few files in this library the first time.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I2ad55355e6348eaa10384a148dd45deb9f68fc2b
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2442
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
In the SPDK NVMe/TCP target, when initializing the socket, the low
watermark is set to sizeof(struct spdk_nvme_tcp_c2h_term_req_hdr),
which is 24 bytes. In our testing, a very small data packet (as
small as 16 bytes) may sometimes be sent on the wire. After this,
if no more data is sent to the same socket, this small packet won't
be received by the NVMe/TCP controller qpair thread because its
size hasn't reached the low watermark. The qpair thread then waits
for more data to come in while the initiator waits for the IO
request to be completed. Hence the delay.
As the minimum data that allows the target to determine the PDU
type is sizeof(struct spdk_nvme_tcp_common_pdu_hdr), which is 8
bytes, we changed the low watermark setting as below. With the
change, the problem was gone immediately.
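The change amounts to (sketch; spdk_sock_set_recvlowat() is the
sock-layer wrapper around SO_RCVLOWAT):

    /* before: 24-byte watermark */
    spdk_sock_set_recvlowat(tqpair->sock,
                            sizeof(struct spdk_nvme_tcp_c2h_term_req_hdr));

    /* after: the 8-byte common header suffices to learn the PDU type */
    spdk_sock_set_recvlowat(tqpair->sock,
                            sizeof(struct spdk_nvme_tcp_common_pdu_hdr));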
Change-Id: I14ccc4c84b77e33a617726e7455304aca29d5d57
Signed-off-by: Wenhua Liu <liuw@vmware.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2138
Community-CI: Mellanox Build Bot
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Virtual controller capabilities can be overridden at the transport-
specific layer. The current behavior shall be preserved.
This can be useful to limit or extend the default based on transport
type.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I754f0d957a46f219adc1e55f792e79c7546ddb43
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1274
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Purpose: To set the priority of the NVMe-oF connection, especially
for TCP connections.
For example, the previous example can be:
trtype:TCP adrfam:IPv4 traddr:10.67.110.181 trsvcid:4420
With the change, it could be:
trtype:TCP adrfam:IPv4 traddr:10.67.110.181 trsvcid:4420 priority:2
The priority is optional. We chose to change
spdk_nvme_transport_id rather than spdk_nvme_ctrlr_opts, since
the opts in spdk_nvme_ctrlr_opts apply to every nvme ctrlr,
which lacks flexibility.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Signed-off-by: Sudheer Mogilappagari <sudheer.mogilappagari@intel.com>
Change-Id: I1ba364c714a95f2dbeab2b3fcc832b0222b48a15
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1875
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Since the related feature is already contained in
spdk_sock_listen and spdk_sock_connect functions,
we no longer need this function.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I1eafff0d139fa266a355fbee2bf0fc3947db69fc
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1876
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Creating a descriptive name for each poller by hand would take a
large effort. Replacing spdk_poller_register with the macro
SPDK_POLLER_REGISTER provides a better name than a function address
with minimal effort.
Follow-up patches may further improve the function names for clarity.
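The macro just captures the function name at the call site;
essentially:

    #define SPDK_POLLER_REGISTER(fn, arg, period_microseconds) \
            spdk_poller_register_named(fn, arg, period_microseconds, #fn)

    /* usage: same as spdk_poller_register(), but the poller is named */
    group->poller = SPDK_POLLER_REGISTER(nvmf_tcp_poll_group_poll, group, 0);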
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: If862a274c5879065c3f7cb04dcb5ca7844523e68
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1781
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Maciej Szwed <maciej.szwed@intel.com>
Community-CI: Broadcom CI
The code used to do this but it was removed when the buffering was
shifted down to the posix layer. Add a way for users of sockets
to still properly size the buffers.
This also means that by default, the receive buffering is not enabled
on sockets. That matches the behavior of the previous release.
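A sketch of how a socket user sizes the buffer now (the sizing
policy itself is up to the caller):

    /* Ask the posix layer to buffer up to buf_size bytes of received
     * data. Implementations may treat this as a hint. */
    rc = spdk_sock_set_recvbuf(sock, buf_size);
    if (rc < 0) {
        /* not fatal: continue with the default (unbuffered) behavior */
    }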
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Change-Id: I20ce875be2efd841fe3a900047b4655a317d7799
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1560
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
It's actually faster to process them until you run out of data.
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Change-Id: I9e81babdb9bdc405a8dbf03b2f701fe50bcc70f6
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1559
Community-CI: Broadcom CI
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This reverts commit ea5ad0b286.
This code is moving from the nvmf target to the posix sock
layer in this series.
Change-Id: I333bdf325848e726ab82a9e6916e1bbdcd34009c
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/446
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This reverts commit d50736776c.
This code is moving from the nvmf target to the posix sock
layer in this series.
Change-Id: I7cf477333f2a3fa4a0089394d5fa28142b262a7f
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/445
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This reverts commit 5e7b8d18f3.
This code is moving from the nvmf target down into the posix
sock layer in this series.
Change-Id: Iea9a7cef5bedd6a34edf7b4c87825897279829c3
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/444
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This was recently made asynchronous to support virtualized transports.
However, we're moving to add a new call to transports to associate a
listener with a subsystem, and the operation that needed to be
asynchronous will actually be performed there. For simplicity, make
this synchronous again.
Change-Id: Ie98136a19c58f0f9bba0d140476de3bbb38e12d7
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/881
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This is a memory optimisation, as the transport id is 'heavy'. As a
side effect, handling of listen and stop_listen on the transport-
specific layer becomes simpler.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I4e9d0e0c5eee2d570ec4ac9079270c32d5afb8db
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/626
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This defines the official interface that NVMe-oF target
transports may use. For now, all code is just copied
from elsewhere. Eventually we'll want to add doxygen
comments.
Change-Id: I0cd9368607544be18c7c49188d071e38ceb59b8f
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/412
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Some functions performed an incorrect header/data digest
support check; align them with the NVMe-oF spec. Use a table
to check whether a PDU supports digest, depending on its type.
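The table-driven check looks roughly like this (a sketch; which PDU
types allow HDGST/DDGST is defined by the spec, and the entries and
names here are illustrative):

    static const struct {
        bool hdgst; /* header digest allowed for this PDU type */
        bool ddgst; /* data digest allowed for this PDU type */
    } g_pdu_digest[] = {
        [SPDK_NVME_TCP_PDU_TYPE_IC_REQ]      = { false, false },
        [SPDK_NVME_TCP_PDU_TYPE_CAPSULE_CMD] = { true,  true  },
        [SPDK_NVME_TCP_PDU_TYPE_H2C_DATA]    = { true,  true  },
        /* ... remaining PDU types ... */
    };

    static bool
    pdu_type_supports_hdgst(uint8_t pdu_type)
    {
        return g_pdu_digest[pdu_type].hdgst;
    }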
Change-Id: I6170dd19ace017f37fda0a923f604732799460b9
Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/483375
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
When a command arrives and no requests are available, the socket
recv state machine sits in the RECV_STATE_AWAIT_REQ state until another
network event occurs. If this I/O was the last one sent, this leaves the
target hung. To fix this, when a request is completed, kick the state
machine to make forward progress.
In practice, this can only occur once the pdu send acknowledgements are
asynchronous relative to arriving commands. That only begins happening
with the use of MSG_ZEROCOPY. When MSG_ZEROCOPY is turned on, it's
possible to receive the next PDU in a chain for a command prior to
seeing the acknowledgement that the response that triggered that PDU
was actually sent.
Change-Id: I556f31ad56970d36aa3538cfde375d35f3d4e551
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/480002
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Previously, the R2T was sent and if an H2C arrived prior
to seeing the R2T ack, it was processed anyway. Serialize
this process.
In practice, if the H2C arrives with a correctly functioning
initiator, that means the R2T already made it to the initiator.
But because the PDU hasn't been released yet, immediately processing the
PDU requires an extra PDU associated with the request. Basically, making
this change halves the worst-case number of PDUs required per
connection.
In the current sock layer implementations, it's not actually possible
for the R2T send ack to occur after that H2C arrives. But with the
upcoming addition of MSG_ZEROCOPY and other sock implementations, it's
best to fix this now.
Change-Id: Ifefaf48fcf2ff1dcc75e1686bbb9229b7ae3c219
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479906
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This function was only called from one spot.
Change-Id: I856f564d3ef6c6157be7a32a2cd812c702516a8d
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482003
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This seems like a more descriptive name
Change-Id: Ia616865b3fb36d8f9ccc5fb2ca6185bdd8543cf8
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482002
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
With our target design, there's no advantage to sending
multiple R2T PDUs per nvme command. This patch starts by
setting up the math so that at most 1 R2T PDU is required
per request. This can be guaranteed because the maximum
data transfer size (MDTS) is pre-negotiated in NVMe-oF
to a reasonable size at start up.
It then proceeds to simplify all of the logic around mapping
requests to PDUs. It turns out that the mapping is now always
1:1. There are two additional cases where there is no request
object at all but a PDU is still needed - the connection response
and termination request. Put an extra PDU on the queue object
for that purpose.
This is a major simplification.
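The resulting ownership can be pictured as (sketch; field placement
is illustrative):

    struct spdk_nvmf_tcp_req {
        struct nvme_tcp_pdu *pdu;      /* the request's one and only PDU */
        /* ... */
    };

    struct spdk_nvmf_tcp_qpair {
        struct nvme_tcp_pdu *mgmt_pdu; /* for the connection response and
                                        * termination request, which have
                                        * no request object */
        /* ... */
    };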
Change-Id: I8d41f9bf95e70c354ece8fb786793624bec757ea
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479905
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
We can always accept up to the maximum I/O size in an H2C,
so eliminate the #define.
Change-Id: I349dab5f9b6ec482a7c580b1396e03c8d30a250b
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482278
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
The resources allocated to a queue pair do not need to be directly
correlated to the queue size requested by the initiator in NVMe-oF, as
long as enough resources are present. The RDMA transport, for instance,
does complex pooling of the resources behind the scenes when using a
shared receive queue.
Simplify the resource allocation for a TCP qpair to just always allocate
the max allowed queue size right away. This is a configurable parameter,
so system administrators can adjust for their needs. The initiator may
then request a queue size less than or equal to that, which will only be
enforced by queue depth counting and not impact the actual number of
resources allocated on the target.
This change relies on the MaxC2HSize being equal to the Maximum Data
Transfer Size (MDTS) reported. That is the default configuration, but
MDTS is configurable. Changing the MDTS with this patch to a value
larger than 128k will cause the target to break. This is addressed in
the next patch in this series.
Change-Id: Ibd4723785c6a4d8d444f9b7bbfa89f98de2320f5
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479733
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
These values do not need to be negative.
Change-Id: Id9f798cf1c9da354448f9c6fbb90e599f877bb32
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/482277
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
By releasing the just-completed PDU prior to calling the callback,
for flows that immediately submit another PDU inside the callback,
the just-released PDU can be immediately reused. This reduces the number
of PDUs required in the pool to continue forward progress to half of the
previous value, while also making it more CPU cache friendly.
Change-Id: I8031b8f9f57ac05f261d96433d9899fe5e31d318
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479904
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Or Gerlitz <gerlitz.or@gmail.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This was calling a callback for another function which
attempted to release the request. The code only worked because
in the r2t case the cb_arg was set to NULL, and that makes
the request free function do nothing.
Change-Id: Id9ec30ceb0eaa41deb67aa995da5d6f786d9b9f0
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479903
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Or Gerlitz <gerlitz.or@gmail.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This wasn't actually used. Every PDU only had a single reference.
Change-Id: I8adaa7edeca5fe175aa853c156df741170d76c10
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479902
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This allows responding to the add-listener RPC request even
when there are async calls in the transport-specific function.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I94a9f45b7ba9e8d46a60ae3785953cea12554732
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479511
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Make the common code part of the successful-return path.
In rdma, check first whether we are already listening.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: Ib0c87ac11db7daff00dc4042c9e0ab20eb7ffd0f
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478721
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Purpose: With this patch,
(1) we can support using different sock implementations together in
one application;
(2) for an IP address managed by the kernel, we can use different
methods to listen/connect, e.g., posix or uring. With this patch, we
can designate a specific sock implementation if impl_name is not NULL
and valid. Otherwise, spdk_sock_listen/connect will try the sock
implementations in the list in order.
Without this patch, the app will always use the same type of sock
implementation if the order is fixed. For example, if we have posix
and uring together, the first one will always be uring.
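Assuming the three-argument form this patch introduces, usage looks
like:

    /* Pin an implementation by name... */
    struct spdk_sock *lsock = spdk_sock_listen("192.168.1.10", 4420, "uring");

    /* ...or pass NULL and let the sock layer walk its implementation
     * list in order, using the first one that succeeds. */
    struct spdk_sock *csock = spdk_sock_connect("192.168.1.10", 4420, NULL);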
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: Ic49563f5025085471d356798e522ff7ab748f586
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478140
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
The trtype should be stored as both an enum and a string. This is
intended to help pave the way for pluggable NVMe-oF transports.
Signed-off-by: Seth Howell <seth.howell@intel.com>
Change-Id: I6af658d7a17c405e191ff401b80ab704c65497e7
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478744
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
It is faster for the kernel to pin memory in hugepages, so allocate
the pdu pool from hugepages. This will help more
with upcoming changes to leverage MSG_ZEROCOPY.
Change-Id: I9ce581acca9c6edb71bd8119258966e3b405db77
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/475801
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Or Gerlitz <gerlitz.or@gmail.com>
All transitions to the EXITING state now go through the disconnect
function.
Change-Id: Ia55816351b2998bfef26130b6ffdc4a1010567a1
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470533
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
This function can't actually return NULL. It aborts if we get
our math wrong.
Change-Id: Iaf77112addc3c14c70755a56043c5dba3427890d
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478911
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
We can use spdk_min to get the copy_len in spdk_nvmf_tcp_send_c2h_term_req.
It ensures copy_len is not larger than SPDK_NVME_TCP_TERM_REQ_ERROR_DATA_MAX_SIZE.
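The fix is essentially a one-line clamp (sketch; hlen stands for the
header length taken from the offending PDU):

    copy_len = spdk_min(hlen, SPDK_NVME_TCP_TERM_REQ_ERROR_DATA_MAX_SIZE);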
Signed-off-by: dongx.yi <dongx.yi@intel.com>
Change-Id: Id343928e1911e4ab77fca7463f3f0cc55889db30
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/479118
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Minor optimisation found by code analysis: both cmd and dif are
overwritten in TCP_REQUEST_STATE_NEW.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I6bae4ddae175035d029c0693f7e4351b95a296ab
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478604
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
It's not wrong; this just keeps consistency with other functions.
So remove these.
Signed-off-by: dongx.yi <dongx.yi@intel.com>
Change-Id: I833211ea8ee6c6b02c874ea340a3f936a0c4c00f
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/478684
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
The error message being printed is expected behavior when we use
asynchronous I/O. The real error message for failing to get a
tcp_req is located in spdk_nvmf_tcp_capsule_cmd_hdr_handle.
Change-Id: I1a608fbd3a04050eacb6cb68eafd50e5128925ab
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/477872
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
TCP_REQUEST_STATE_NEW is already set in spdk_nvmf_tcp_req_get.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: Ia835f3763cd74ef9b504901c719d9954317f49af
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476164
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
This eliminates the flushing logic, simplifying the tcp
transport.
This also happens to greatly improve performance, especially
on random read tests. The batching done in spdk_sock_writev_async
seems to be more effective than the previous batching logic in the
tcp transport.
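The write path then reduces to handing the PDU's iovecs to the sock
layer (sketch; the iovec-building helper and callback names are
illustrative, and it assumes the PDU embeds a struct
spdk_sock_request):

    static void
    tcp_pdu_write(struct spdk_nvmf_tcp_qpair *tqpair, struct nvme_tcp_pdu *pdu)
    {
        pdu->sock_req.iovcnt = pdu_build_iovs(pdu); /* hdr, digests, data */
        pdu->sock_req.cb_fn = pdu_write_done;       /* illustrative */
        pdu->sock_req.cb_arg = pdu;
        spdk_sock_writev_async(tqpair->sock, &pdu->sock_req);
    }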
Change-Id: Id980ac6073e380dc75f95df3f69cb224f50fb01b
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470532
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Community-CI: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
It is valuable to have a more detailed status instead of
SPDK_NVME_SC_INTERNAL_DEVICE_ERROR.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: Ifd003b490a7ae9af017645c97636ceaf2f93d4b0
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/476634
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Destroying the qpair in poll_group_add results in heap-use-after-free,
because the upper layer calls qpair_fini when poll_group_add returns
an error.
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I3e921a21b7ab5f7c15c80bc5919cb97cbda0b5d2
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/475858
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
We also need to update spdk_nvmf_tcp_poll_group_poll:
if the tqpair recv state is wait_for_req,
we may have already received the data, and there may be
no epoll event.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I9c5a202e47e57aaba63da143f954a20c135a98ae
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/473626
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Purpose: If we use asynchronous writev for pdu sending, the
writev callback may occur after new data arrives, which means
a free tcp request may not be available yet.
So we use the strategy of checking the state_cntr of all the
reqs in the TCP_REQUEST_STATE_TRANSFERRING_CONTROLLER_TO_HOST
state:
1 If the state_cntr > 0, we should queue the new request.
2 If the state_cntr == 0, there is no available slot for the
new tcp request, i.e., the new nvme command coming from the
initiator. Receiving a command in this case means the initiator
sent more requests than allowed, and we should reject it.
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: Ifbeb510e669082cb7b80faf2e7987075af31d176
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/472912
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Avoid obtaining ttransport in the sub-functions;
this makes the code more efficient.
Change-Id: Ie4c5a1755ddbecf10dc364ff811f74a7af5f9c3b
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/473003
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
When we use async writev (e.g., lib io_uring), the writev callback
may be executed after receiving new data from the initiator.
For example, when the NVMe-oF TCP target receives the ic_req from the
initiator and sends out the ic_resp, the tqpair state does not change
from invalid to running until the callback is executed. The ic_resp
data may already have been sent to the initiator and the next command
received, while the callback (i.e., spdk_nvmf_tcp_send_icresp_complete)
has still not executed. This can happen with lib io_uring; I faced
this issue when using it.
This patch fixes the issue.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I7f4332522866d475e106ac6d36a8ec715133f0dc
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/472770
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
The loop is intended to accept multiple socks when
available, but once accept returns NULL, there's no
reason to keep trying.
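i.e., the loop becomes (sketch; the connect handler name is
illustrative):

    struct spdk_sock *sock;

    while ((sock = spdk_sock_accept(port->listen_sock)) != NULL) {
        handle_connect(transport, port, sock); /* illustrative */
    }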
Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I896908d276da35bc3fff172c1c17e22abd2a5343
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/473234
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
It's important to be able to recover full context from just
the PDU in the future.
Change-Id: I3d1f3c326299b1237b42dbe33d340a282c3bc5bb
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470531
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
This is always the request pointer, so rename it for clarity.
Change-Id: Ifbda7db7787c65f0deb190a1e94f0676b2c0d99a
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470530
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Use whatever size the socket layer thinks is best. In practice,
this is the same size as before.
Change-Id: I4820e16d8da6e566d1f8f078a75d345399f64ab5
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470511
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Since we use a big buffer to read data, the incoming data may
already have been read when the req is waiting for a buffer.
So with the original state machine, no further read event would
be generated.
The quick solution is to restore the original code: for a req
with in-capsule data, we do not need to wait for a buffer from
the shared buffer pool.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: Ib195d57cc2969235203c34664115c3322d1c9eae
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/472047
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
It can be useful for passing additional information about the nvmf
target to a handler for new nvmf connections. Context can be
stored in globals as is currently done in nvmf code. However,
in the case of multiple targets or languages where accessing global
state is challenging (e.g. Rust), this becomes inconvenient.
Change-Id: Ia6a2fdba4601531822b3e5fda7ac5ab89d46f6c5
Signed-off-by: Jan Kryl <jan.kryl@mayadata.io>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469263
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Sasha Kotchubievsky <sashakot@mellanox.com>
Run the following command:
pahole ./app/nvmf_tgt/nvmf_tgt -R -C spdk_nvmf_tcp_req
It suggested changing the location of the bool definition
of dif_insert_or_strip.
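The kind of rearrangement pahole suggests, on an illustrative struct
(not the real spdk_nvmf_tcp_req layout):

    struct example_before {
        uint64_t a;
        bool     dif_insert_or_strip; /* 7-byte hole follows */
        uint64_t b;
        uint8_t  c;                   /* 7 bytes of tail padding */
    };                                /* 32 bytes */

    struct example_after {
        uint64_t a;
        uint64_t b;
        bool     dif_insert_or_strip; /* packs next to c */
        uint8_t  c;
    };                                /* 24 bytes */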
Change-Id: Ia43ab62bcc223a07e6415b2c769fe4af2b097f18
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/470401
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
By passing the pointer to struct spdk_nvmf_transport_poll_group
to spdk_nvmf_tcp_req_parse_sgl(), we can remove spdk_nvmf_tcp_req_fill_iovs()
and inline spdk_nvmf_request_get_buffers() into spdk_nvmf_tcp_req_parse_sgl().
The pointer to struct spdk_nvmf_request is used in many lines of
spdk_nvmf_tcp_req_parse_sgl(); caching and using it simplifies the
function and improves its readability a little.
We can pass a pointer to struct spdk_nvmf_transport rather than
struct spdk_nvmf_tcp_transport to spdk_nvmf_tcp_req_parse_sgl().
Ordering the pointer to struct spdk_nvmf_tcp_req first in the
parameters of spdk_nvmf_tcp_req_parse_sgl() matches the function name.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I9f0d33b48383800c3b0a738eb24b11ffed7e6e60
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469640
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This patch is close to the end of the effort to unify buffer allocation
among NVMe-oF transports.
Merge each transport's fill_buffers() into common
spdk_nvmf_request_get_buffers() of the generic NVMe-oF transport.
One noticeable change is to set req->data_from_pool to true not in
each specific transport but in the generic transport.
The next patch will add spdk_nvmf_request_get_multi_buffers() for
multi SGL case of RDMA transport.
This relatively long patch series is a preparation to support
zcopy APIs in NVMe-oF target.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Icb04e3a1fa4f5a360b1b26d2ab7c67606ca7c9a0
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/469205
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
The subsequent patches unify getting buffers, filling iovecs, and
filling WRs into a single API. This is a preparation.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I077c4ea8957dcb3c7e4f4181f18b04b343e9927d
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468953
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
This patch makes it possible for the multi-SGL case to call
spdk_nvmf_request_get_buffers() per WR.
This patch has an unrelated fix to clear req->iovcnt in
reset_nvmf_rdma_request() in the UT. We could do that fix in a
separate patch but include it here because it is very small.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: If6e5af0505fb199c95ef5d0522b579242a7cef29
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468942
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Alexey Marchuk <alexeymar@mellanox.com>
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
On a DIF verification error, fail the read command with a status code
of APPLICATION_TAG_CHECK_ERROR, GUARD_CHECK_ERROR, or
REFERENCE_TAG_CHECK_ERROR and a status code type of SCT_MEDIA_ERROR.
The state of the request is TCP_REQUEST_STATE_TRANSFERRING_CONTROLLER_TO_HOST
when a DIF verification error is detected. So dequeue the request
from C2H data queue, return the response PDU, and then send the command
response.
This was an item on the TODO list. The RDMA transport has done the
right thing from the start, and the TCP transport follows it with
this patch.
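The completion path then maps the DIF error to a media error status,
along these lines (sketch; the error-type names follow spdk/dif.h and
the equivalent mapping in the RDMA transport):

    rsp->status.sct = SPDK_NVME_SCT_MEDIA_ERROR;
    switch (err_blk.err_type) {
    case SPDK_DIF_GUARD_ERROR:
        rsp->status.sc = SPDK_NVME_SC_GUARD_CHECK_ERROR;
        break;
    case SPDK_DIF_APPTAG_ERROR:
        rsp->status.sc = SPDK_NVME_SC_APPLICATION_TAG_CHECK_ERROR;
        break;
    case SPDK_DIF_REFTAG_ERROR:
        rsp->status.sc = SPDK_NVME_SC_REFERENCE_TAG_CHECK_ERROR;
        break;
    }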
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I102bbd253cc8c1379d0937c9536bf2bfe04cbf6a
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468911
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
tcp_req->orig_length had been set just before I/O submission, but
the value is already known in spdk_nvmf_tcp_req_parse_sgl().
Hence move the setting of tcp_req->orig_length accordingly.
This follows the good practice of the RDMA transport.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I99f6e266d8f7027bce810864314f3ee24a1af10c
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/468910
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
The default behavior is to set it to 2MB, so this isn't
required anymore.
Change-Id: I62d7605cd4d5bc41347128f32f9a1aa373a15744
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466993
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Since we use pdu->data_iovcnt to build the iov in
nvme_tcp_build_iovs, a sent pdu has at most
2 + pdu->data_iovcnt iovs, so we change the comparison
accordingly.
This makes sure that we can handle all the data
owned by one pdu.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I2b9258cc5716d706c0fa38af609726c439708768
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/467207
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
This unifies buffer management among transports further and is a
preparation to make buffer allocation asynchronous.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I8c588eeac4081f50fe32605feb7352f72c628d95
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466847
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Most variables related to the I/O buffer are in struct spdk_nvmf_request
now, so we can pass nvmf_request instead of nvmf_tcp_req to
nvmf_tcp_req_fill_buffers; do that in this patch.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I00eff578a98891e99fcb9a3aafa3d99126d6f1c1
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466089
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
This is a small performance optimization and an effort to unify
I/O buffer management further among transports.
It is ensured that the request is the first of STAILQ when
spdk_nvmf_tcp_send_c2h_data() is called or the case
TCP_REQUEST_STATE_NEED_BUFFER is executed in spdk_nvmf_tcp_req_process().
Hence change TAILQ_REMOVE to STAILQ_REMOVE_HEAD for these two cases.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I0b195874ac22a8d5ecfb283a9865d2615b7d5912
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466637
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
In this patch, we point hdr_p directly at the memory
owned by the pdu_recv_buf to avoid a memory copy.
Change-Id: Iee0dd98058928f429bf7ad22103cd4826226400f
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465158
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
RDMA transport executes spdk_nvmf_rdma_request_parse_sgl() only if
the request is the first of the pending requests in the case
RDMA_REQUEST_STATE_NEED_BUFFER in the state machine
spdk_nvmf_rdma_requests_process().
This allowed the RDMA transport to use STAILQ for pending requests,
because STAILQ_REMOVE walks from the head and is slow only when the
target is in the middle of the STAILQ.
On the other hand, the TCP transport executes spdk_nvmf_tcp_req_parse_sgl()
even if the request is in the middle of the pending requests, in the
case TCP_REQUEST_STATE_NEED_BUFFER of the state machine
spdk_nvmf_tcp_req_process(), when the request has in-capsule data.
Hence the TCP transport has used TAILQ for pending requests.
This patch removes the in-capsule data condition from the case
TCP_REQUEST_STATE_NEED_BUFFER.
The purpose of this patch is to unify I/O buffer management further.
Performance degradation was not observed even after this patch.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Idc97fe20f7013ca66fd58587773edb81ef7cbbfc
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466636
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Use spdk_nvmf_request_get_buffers() and spdk_nvmf_request_free_buffers(),
and then remove spdk_nvmf_tcp_request_free_buffers() and
spdk_nvmf_tcp_request_get_buffers().
Set tcp_req->data_from_pool to false after spdk_nvmf_request_free_buffers().
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I286b48149530c93784a4865b7215b5a33a4dd3c3
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465876
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This is a preparation to unify buffer management among transports.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I6b1c208207ae3679619239db4e6e9a77b33291d0
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/466002
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This is a preparation to unify buffer management among transports.
struct spdk_nvmf_request already has SPDK_NVMF_MAX_SGL_ENTRIES (16) * 2
iovecs, hence doubling the number of buffers is no problem.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Idb525abbf35dc9f4b8547b785b5dfa77d106d8c9
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465873
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Purpose: Reduce the number of recv/readv system calls.
Method: Use a big recv buffer to conduct the read.
Though it introduces an additional buffer copy, we expect
the copy overhead to be smaller than the overhead of frequent
recv/readv system calls. The design makes a trade-off between
the two.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I9286fd9cec0b512cea8e3f2c335c5bf862b98573
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464842
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Purpose: Prepare for further optimization on the target side
when receiving pdu headers, where we expect to use zero copy.
Change-Id: Iae7f9106844736d7160d39d0af1f5941084422ec
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465380
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
This follows the practice of RDMA transport and is a preparation to
unify buffer allocation among transports.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Ib85625f2a0eca01ef4028685dd838d6c41faad7b
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465872
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This follows the practice of the RDMA transport and is a preparation
to unify buffer management among transports.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I4e9b81b2bec813935064a6d49109b6a0365cb950
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465871
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
This is a preparation to the next patch to use spdk_mempool_get_bulk.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I28a5ad941004f139c9032d85c2ef92680081f1ce
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/465870
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Purpose: To reduce the duplicated code.
And one minor fix: add an empty line between two functions.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I12c9ddba6526c094cd2bd945e14f9d8bf5209adf
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464504
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Purpose:
1 Do not calculate the psh_len every time.
2 Small fix: for ch_valid_bytes and psh_valid_bytes,
we do not need to use uint32_t.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I9b643da4b0ebabdfe50f30e9e0a738fe95beb159
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/464253
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Broadcom SPDK FC-NVMe CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
spdk_sock_group_poll() and spdk_sock_group_poll_count() had returned
0 on success. The implementation didn't match the specification
described in the header file, and couldn't be used to collect stats
correctly because 0 means idle.
This patch fixes the return value of spdk_sock_group_poll() and
spdk_sock_group_poll_count() to return the number of events, and
fixes the callers so that they do not overwrite the return value
with 0.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I7e2a17187fc74ea44d3acf2f35d63f5e5a254eda
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/463710
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Phenomenon:
Test case: Use the following command to test:
./test/nvmf/target/shutdown.sh --iso --transport=tcp
Without this patch, it causes a coredump. The error is that an
NVMe/TCP request in the data buffer waiting list has the "FREE"
state.
We do not need to call this function in
spdk_nvmf_tcp_qpair_flush_pdus_internal; it causes the bug during
the shutdown test since it calls the function recursively, and it
does not work for the shutdown path.
There are two possible recursive call chains:
(1) spdk_nvmf_tcp_qpair_flush_pdus_internal ->
spdk_nvmf_tcp_qpair_process_pending ->
spdk_nvmf_tcp_qpair_flush_pdus_internal -> ...
(2) spdk_nvmf_tcp_qpair_flush_pdus_internal ->
pdu completion (pdu->cb) -> ... ->
spdk_nvmf_tcp_qpair_flush_pdus_internal
We need to move the processing of NVMe/TCP requests which are
waiting for buffers into another function, in order to avoid
these complicated recursive calls. (We previously found a
similar issue in spdk_nvmf_tcp_qpair_flush_pdus_internal for
pdu send handling.)
But we cannot remove this feature, otherwise the initiator will
hang waiting for the I/O. So we add the same functionality in the
spdk_nvmf_tcp_poll_group_poll function.
Purpose: To fix the NVMe/TCP shutdown issue.
This patch also re-enables the shutdown and bdevio tests.
Change-Id: Ifa193faa3f685429dcba7557df5b311bd566e297
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462658
Reviewed-by: Seth Howell <seth.howell@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Move the statement that sets up the TCP request and remove
the duplicated code.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: Ia659756185547ff4f8aa26c5bc01f63defe6c113
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/462589
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
This priority is used to differentiate the sock priority of the TCP
connections between the NVMe-oF TCP target and other TCP-based
applications.
Signed-off-by: Ziye Yang <ziye.yang@intel.com>
Change-Id: I6ee294e647420b56d1d91a07c2e37bf34ce24e03
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461801
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
spdk_dma_*malloc() is about to be deprecated.
Change-Id: Ic42db528bbae4b3ca2e91cb9ac46def99ecb5f28
Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/459431
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Set DIF context of the corresponding request to PDU when
- processing in-capsule data of the command,
- processing data of C2H PDU, or
- processing data of H2C PDU.
Change-Id: I3a668a55be21dbe2ee6ecf26476290670bd7b4a8
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/458929
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ziye Yang <ziye.yang@intel.com>
When the NVMe/TCP initiator transfers in-capsule data, the NVMe/TCP
target has to process it as in-capsule data. If DIF insert/strip is
enabled, the in-capsule data size is increased by the NVMe/TCP
target to insert metadata. However, the size of the in-capsule data
buffer had not been increased, and a buffer overflow occurred when
the NVMe/TCP initiator transferred in-capsule data to the NVMe/TCP
target with DIF insert/strip enabled.
This patch increases the size of the in-capsule data buffer to store
the metadata. 16 bytes of metadata per 512-byte data block is the
current maximum ratio of metadata per block.
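Concretely, the worst case adds 16 bytes of metadata per 512-byte
block, so the buffer is sized along these lines (sketch; the macro
names are illustrative):

    #define MIN_BLOCK_SIZE   512
    #define MAX_MD_PER_BLOCK  16

    /* e.g. 4096 bytes of in-capsule data -> 4096 + (4096/512)*16 = 4224 */
    buf_len = icd_size + (icd_size / MIN_BLOCK_SIZE) * MAX_MD_PER_BLOCK;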
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: I88b127efd7a945bde167a95df19a0b9175cb8cd0
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461333
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>
We updated readv_offset before generating DIF to avoid adding
the temporary variable _rc in the previous patch, but that caused
a write error when inserting DIF.
Fix the bug in this patch.
Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>
Change-Id: Id0788280a83cbea2554c851db77751432fc00cba
Reviewed-on: https://review.gerrithub.io/c/spdk/spdk/+/461116
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>