ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Changpeng Liu	6a3cc9140f	nvme/quirks: remove NVME_QUIRK_READ_ZERO_AFTER_DEALLOCATE for Intel P55XX The P55XX serial SSDs can support dlfeat.read_value in the identify namespace data structure, we don't need to add this quirk for it, just remove it. Change-Id: I165d89085e246a570e80dbaf05f41dc331b93f0c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7526 Community-CI: Broadcom CI Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-04-21 23:19:59 +00:00
balaji	a2a82087b9	NVMe: Defer the notices processing in the AER callback untill all the CQE entries have been processed. When the format command is issued, the kioxia drives responds with "NS Attr change" notices. In the callback function of the notice, the CQ Head Doorbell is updated twice with the same value while issuing the Active NS list & identify NS commands. Fixes: #1701 Signed-off-by: G.Balaji <gbalajieie@gmail.com> Change-Id: I8cc80fba0a226c22753e605ef3129602a9313ce7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7149 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-04-21 23:19:05 +00:00
Changpeng Liu	b03129e710	nvme: return errno instead of assertion when mapping PRPs Also update the UT. Change-Id: I6086bf4cafca8a917a467490955d7df0ba8930d5 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7495 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-04-20 20:24:23 +00:00
Changpeng Liu	018f6a8f21	nvme: add the support to remap NVMe command with SGL Previously we can only remap NVMe command using PRP, now we add the SGL support. Change-Id: Iec352d858a07bdd3d5f261336d6fa1167ba7aa79 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7279 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-04-20 20:24:23 +00:00
Changpeng Liu	3a58a5f19c	nvme: add spdk_nvme_map_cmd API The API `spdk_nvme_map_prps` is used in nvmf/vfio-user to remap VM's NVMe command data buffer to local virtual address, and for command using PRP, there maybe multiple pages, when parsing the PRP list to local IOVs, we need a parameter to check that the maximum number of vectors can't exceed the IOVs, this API can't meet the requirement, while here, we add a new API `spdk_nvme_map_cmd` and with a new parameter `max_iovcnt` to fix this case, and it can also cover the command using SGL in the coming patches. Change-Id: I71063524bed16ee3434103867a556d3741e55326 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7278 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-04-20 20:24:23 +00:00
Changpeng Liu	1d3b01e261	nvme: refactor spdk_nvme_map_prps into two functions Change spdk_nvme_map_prps to a internal fucntion with a new parameter `max_iovcnt` to protect the IOVs. Also for the purpose to keep API compatibility, we still leave the API here. Change-Id: I9a638beb87aab20bba5f8a4fa0a9396110d56aff Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7335 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-04-20 20:24:23 +00:00
Ben Walker	42b47742de	nvme/tcp: Only flush socket if not part of poll group If the qpair is part of a poll group, the socket will get flushed as part of polling that group already. We only need to explicitly flush to handle the case where the qpair is not in a poll group. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Ib2a510b6d26d1622950437d81e0a40f6b15d6b54 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7049 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2021-04-19 12:54:24 +00:00
Ben Walker	6b86039fd9	nvme/tcp: Ensure qpair is polled when it gets a writev_async completion There was a fix for this that went into the posix layer, but the underlying problem is the logic in the nvme/tcp transport. Attempt to fix that instead. Change-Id: I04dd850bb201641d441c8c1f88c7bb8ba1d09e58 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6751 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-04-19 12:54:24 +00:00
G.Balaji	391d89b873	nvme: move AER handling to a separate function This prepares for an upcoming patch to fix issue #1701 which requires handling async events outside of the check completions loop. Fixes: #1701 Signed-off-by: G.Balaji <gbalajieie@gmail.com> Change-Id: I4985d814903143511383172b1a443580db33a78f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7416 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-04-16 15:01:33 +00:00
Changpeng Liu	9c7f460f55	vfio-user: optimize notice log to debug log Change-Id: I2e47b148209ce4c232dbdc5f20c90548be995e1a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7334 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-04-15 11:23:47 +00:00
Jim Harris	c2e0da84c6	nvme: improve comment for nvme_pcie_ctrlr_get_max_xfer_size NVME_MAX_PRP_LIST_ENTRIES has changed over time, so let's just remove the reference to the exact value here. Also explain a bit more why the max size isn't (NUM_ENTRIES + 1) * page_size. While here, do a small whitespace cleanup as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib75813788abdd3dbb43192f9fdc27f99b33aeadf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7328 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-04-14 15:35:08 +00:00
Alexey Marchuk	558be98fd3	nvme/pcie: Add statistics per poll group Check if qpair has a poll group during the connect process, use poll group's statistics or allocate own structure per qpair. That is done due to not all applications use poll groups and we want to avoid "if (qpair->group)" conditions in data path. Admin qpair always allocates its own statistics structure but the statistics are not reported since this qpair is not attached to a poll group. Statistics are reported by spdk_nvme_perf tool if --transport-stats and in bdev_nvme_transport_statistics RPC method. Change-Id: I58765be161491fe394968ea65ea22db1478b219a Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6304 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-13 21:30:52 +00:00
Alexey Marchuk	e966937625	nvme: Add functions to get/free poll group statistics These are interface functions that can be used by an application e.g. spdk_nvme_perf or bdev_nvme library. The next patches will add usage of these functions. Change-Id: I33b88e0e713c2ea5967f9241885e3257c5070577 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6300 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-13 21:30:52 +00:00
Alexey Marchuk	3fcda8e779	nvme: Add transport intrafce to get/free stats The new 2 API function allow to get and free stats per poll group. New function to get transport name have been added to report not only transport type but also the name. For now only RDMA transport reports statistics, other transports will be added later. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I2824cb474fde5fa859cf8196dabac2c48c05709c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6299 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-13 21:30:52 +00:00
Alexey Marchuk	50569293ef	nvme/rdma: Add poller statistics New statistics include number of poller calls, number of idle polls and total number of completions. These statistics allow to estimate % of idle polls and the number of completions per poll. Since nvme_rdma_cq_process_completions function returns number of completed NVMF requests and each NVMF request consumes 2 RDMA completions (send+recv), this function was extended to return the number of RDMA completions. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ifdc1e2e467f645adb5d66d39ff2a379e161fbd77 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6298 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-04-13 08:41:39 +00:00
Alexey Marchuk	527f406b6b	nvme/rdma: Use RDMA statistics These statistics allow to estimate WRs batching efficiency. The number of send WRs equals the total number of submitted NVME commands. Change-Id: I96c9836cd6b9070cf5f62e43b4d2738506866e94 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6297 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-13 08:41:39 +00:00
Jin Yu	59237d22b8	lib: add a hotplug lib for device The hotplug lib can be used for pcie devices such as nvme, virtio_blk and virtio scsi. For the sigbus handler, there is only one in a process and it should handle all the devices. And align nvme to the hotplug lib Add the ADD uevent support for allowing the device hotplug. Change-Id: I82cd3b4af38ca24cee8b041a215a85c4a69e60f7 Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5653 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-04-12 09:48:21 +00:00
GangCao	01f45ecd8c	NVMe: add the prefix of ctrlr identification for diagnosability Introduce new utilities NVME_CTRLR_ERRLOG, NVME_CTRLR_WARNLOG and so on to output the ctrlr's identification at different log levels. For RDMA and TCP, the subnqn will be output and for PCIe and custom, the traddr will be ouptput. Change-Id: I81a112463bf752999aa1fe4e0c867d88e09a2f64 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7057 Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-04-09 17:29:21 +00:00
GangCao	415d664029	NVMe: fix the potential issue of dereference NULL pointer Change-Id: Icb81239e64faa5e9c0daf361798b68e153effcee Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7154 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-04-09 17:29:21 +00:00
Alexey Marchuk	c7bb68aa3e	nvme: Handle errors returned by submit function When a request is submitted, it may have incorrect iov alignment that doesn't fit PRP requirements. In the current version an internal function fails such a request and returns a NULL pointer. This is mapped to -ENOMEM error which is returned to generic bdev layer where such a request is queued in a "nomem_io" queue and later can be resubmitted. That is incorrect and such a request must be completed immediately. To fail the request, we need to differentiate between -ENOMEM and other cases, so we pass a pointer to a result to local nvme functions Change-Id: I7120d49114d801497a71fca5a23b172732d088de Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7036 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-04-05 22:58:57 +00:00
Alexey Marchuk	a1f848b057	nvme: Add a function to handle error case in IO functions This function will be used the next patch, current behaviour remains unchanged Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie74c7395f67a08b0cac018eb5114f358a6b583cb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7092 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-04-05 22:58:57 +00:00
Changpeng Liu	04cd292237	nvme/quirk: add MDTS excludes interleaved metadata quirk The specification for Maximum Data Transfer Size (MDTS) says this field should include the length of metadata, if metadata is interleaved with the logical block data. However, some drives can support MDTS without counting the interleaved metadata, so for this case SPDK will only use data length without interleaved metadata length. Change-Id: I29920a25885699e2689be043b87122367be0e416 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6813 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-03-29 07:38:37 +00:00
Ziye Yang	a620cd198f	nvme/tcp: Fix the zero copy enablement issue. Remove the polling group check. Because at this moment, the qpair is not added into a polling group. If we do not remove it, we will never enable zcopy feature for I/O qpair. And in sock implementmentation, we already fixed the zero copy handling if a socket is not in a polling group. See posix_sock_flush function. So we can fix this issue if we directly remove this check. Reported by: Aleksey Marchuk <alexeymar@mellanox.com> Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I969936c4b6c7f13cbfa4d6eb479010c53f3e384a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7056 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2021-03-26 08:22:53 +00:00
Tomasz Zawadzki	38d13d06a8	lib/opal: remove deprecated spdk_opal_supported() This API has been deprecated since SPDK 20.07, see commit (`b2947f52`). Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Idb45906c81ea5682c6a67def0265910266d861b5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6629 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Mellanox Build Bot	2021-03-18 14:43:08 +00:00
Ziye Yang	f1f4f7d3bc	nvme/tcp: Use the async manner to send pdu when crc32c enabled. This patch refactor the pdu sending logic with the async manner, then if the group contains the accel engine, we can use it. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I2d669c0a3255d7a8898441e406906add2f3a3556 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6759 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-03-18 14:42:35 +00:00
Ziye Yang	6153b4aa8f	nvme: Add a new parameter in spdk_nvme_poll_group_create Purpose: To setup an accelerated function callback for created spdk_nvme_poll_group. In this patch, we just create the interface. The real usage of this call back will be provided in the other patch. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I0d936aa4eba4dbfcc0137942156b9f2919eb5b78 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6758 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-03-18 14:42:35 +00:00
Jim Harris	6156777bd4	nvme: assert if user tries to delete NULL tcp qpair It is invalid to try to delete a NULL qpair, so do not check for it in nvme_tcp_ctrlr_delete_io_qpair and return an error when NULL. Just change it to an assert instead. This makes it consistent with pcie and rdma. While here, add an assert in rdma as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic2f76deecb21b78749dac85e33fb1fa0d14a1239 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6917 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com>	2021-03-18 14:41:44 +00:00
Alexey Marchuk	47afb9280f	nvme/rdma: Use RDMA provider API to post recv WRs Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I47cc1a21af1104f681519e542edaf66e363bb214 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6296 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-03-09 08:54:12 +00:00
Jim Harris	38736d993d	nvme: zero shadow doorbell registers when freeing qpair The shadow registers need to be zero when the qpair is created. This happens automatically when a given qid is used for the first time, since the page is allocated with zmalloc. But if a qid is reused, we need to make sure its shadow registers are cleared before we create the qpair again with the same qid. So clear the registers in nvme_pcie_ctrlr_delete_io_qpair, just after the cq is deleted. Fixes issue #1795. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I08c30d1ea248559a01b802cd132dd57199b491b5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6752 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-03-09 08:53:36 +00:00
Changpeng Liu	2f579469b6	nvme/tcp: pass correct parameter to nvme_tcp_qpair_send_h2c_term_req_complete Previously the callback parameter for this function is NULL, this will cause segment fault, so pass the correct parameter here. Fix #1817 Change-Id: Ie768b7bf4a72862d16a44742ab3032803d0939a2 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6690 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Community-CI: Mellanox Build Bot	2021-03-05 08:33:18 +00:00
Evgeniy Kochetov	6005775622	nvme: Move nsdata_zns to namespace structure Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I3f7508d14075c23d9b850da20da6d86c36e08450 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6501 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-03-02 08:06:15 +00:00
Evgeniy Kochetov	3a8852d384	nvme: Move nsdata to namespace structure Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I6083331a24dbf90170096cb98e6371ef2d4e6f9d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6500 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-03-02 08:06:15 +00:00
Niklas Cassel	2910ba6c53	nvme: simplify controller statemachine For the following nvme controller statemachine states: NVME_CTRLR_STATE_IDENTIFY_NS NVME_CTRLR_STATE_IDENTIFY_ID_DESCS NVME_CTRLR_STATE_IDENTIFY_NS_IOCS_SPECIFIC The statemachine can either: - Jump to succeeding state - If active ns list is empty, jump directly to NVME_CTRLR_STATE_CONFIGURE_AER - In the unlikely case if NVMe completion error, jump to NVME_CTRLR_STATE_ERROR Simply this such that we either: - Jump to succeeding state - In the unlikely case if NVMe completion error, jump to NVME_CTRLR_STATE_ERROR This will help to reduce the complexity of the nvme controller statemachine, especially considering that there are new additional states (NVME_CTRLR_STATE_IDENTIFY_NS_DIRECTIVE and NVME_CTRLR_STATE_CONFIGURE_NS_STREAMS) currently on review that would continue with the bad habit of having three possible jump states instead of just two. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I3242052b1108afcd8adbe6d0378b1358fef58ec8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6521 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: sunshihao <sunshihao@huawei.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-02-25 10:27:21 +00:00
Ziye Yang	579a678a51	nvme/tcp: Move sock creation into nvme_tcp_ctrlr_create_qpair function. Purpose: To get the optimal group, we need the socket information. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I17b048a402fbf002307dd225f64b20a9f876d642 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3324 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Mellanox Build Bot	2021-02-25 10:26:08 +00:00
Niklas Cassel	1f49ee235f	nvme: only reassign start_valid if we fetched a new SGE For a SGL using PRPs, there is always an alignment check of the start address in the beginning of the loop. This is stored in start_valid. If the start is indeed valid, we might fetch a new SGE, and then perform a second alignment check on this new SGE. However, this second alignment check is done unconditionally, meaning that for the last SGE in a request, we check if the same start address is aligned twice. Only perform the second alignment check if we actually fetched a new SGE. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I9df8038c650b0879f838d1d9d895e8dd7172840d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6493 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-02-25 10:25:39 +00:00
Niklas Cassel	9d79d27e49	nvme: add function to get ZNS max active zones Add a function to get the number of max active zones for a zoned namespace. The value inside the identify namespace struct is a 0's based value, where 0xffffffff means unlimited. If unlimited, the addition will overflow and return 0, which is the intended value to represent unlimited for this API. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Ia09e3db157ca0afadbd3ca4032eedd7bcd88248c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6443 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: sunshihao <sunshihao@huawei.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-02-24 13:06:32 +00:00
Niklas Cassel	bb5330c06d	nvme: add function to get ZNS max open zones Add a function to get the number of max open zones for a zoned namespace. The value inside the identify namespace struct is a 0's based value, where 0xffffffff means unlimited. If unlimited, the addition will overflow and return 0, which is the intended value to represent unlimited for this API. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I4223146bc1ddf90486892a0af5fe5ce006dc5fd3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6442 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: sunshihao <sunshihao@huawei.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-02-24 13:06:32 +00:00
Alexey Marchuk	04474fec32	nvme/fabric: Make connect related logs more verbose Add logging of transport address and status codes Fixes issue #1790 Change-Id: I67c23f74dd4a4b39233f50c4c29efe25872e0777 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6338 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-02-17 10:18:05 +00:00
Jim Harris	5e37d3b2d8	nvme: add spdk_nvme_qpair_get_id Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I550d42806a84eb94effd1e1c4e26d9493cda7703 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6395 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-02-17 10:17:43 +00:00
Niklas Cassel	014baeb8ef	nvme: add support for ZNS zone append vector variant We already have support for spdk_nvme_zns_zone_append(), add support for spdk_nvme_zns_zone_appendv() (zone append with NVME_PAYLOAD_TYPE_SGL). _nvme_ns_cmd_rw() currently performs verification of the SGL, if the parameter check_sgl is set. This parameter is set for all calls with payload of type NVME_PAYLOAD_TYPE_SGL. In order to be able to perform the same check_sgl verfication on zone append vectors, we need to refactor _nvme_ns_cmd_rw() a bit. Setting check_sgl ensures that _nvme_ns_cmd_split_request_sgl() or _nvme_ns_cmd_split_request_prp() gets called. These functions will split an oversized I/O into several different requests. However, they also iterate the SGE entries, verifies that the total payload size, total SGE entries is not too many, and that buffers are properly aligned. A proper request will not get split. For zone append, splitting a request into several is not allowed, however, we still want the verification part to be done, such that (e.g.) a non first/last SGE which is not page aligned, will cause the whole request to be rejected. (In the case of spdk_nvme_ns_cmd_write(), a non first/last SGE which is not page aligned will instead cause the request to be split.) An alternative would be to try to rip out the verification part from _nvme_ns_cmd_split_request_sgl() and _nvme_ns_cmd_split_request_prp(). However, that is non-trivial, and would most likely end up with a lot of duplicated code, which would easily get out of sync. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I2728acdcadeb70b1f0ed628704df19e75d14dcca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6248 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-02-17 10:17:22 +00:00
Niklas Cassel	b1b4b8676f	nvme: use spdk_nvme_zns_ns_get_zone_size_sectors() where appropriate Use the new function spdk_nvme_zns_ns_get_zone_size_sectors() where it is appropriate (in comparison to the existing spdk_nvme_zns_ns_get_zone_size() variant). Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Ic929ffbc5a1f4a16ba6719a985c05ae625caed46 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6417 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-02-17 10:17:22 +00:00
Niklas Cassel	9a800c38f5	nvme: add spdk_nvme_zns_ns_get_zone_size_sectors() Sometimes it is more optimal to get the zone size in number of sectors, instead of in number of bytes. Therefore, add a new spdk_nvme_zns_ns_get_zone_size_sectors() function to get zone size in number of sectors. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I0fe67e00a3d74dd27acfc895ae97448d995b89a3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-02-17 10:17:22 +00:00
Krishna Kanth Reddy	48aed8a578	lib/nvme: Adding support for PMR Identification Identify application prints the PMR details if it is supported Signed-off-by: Krishna Kanth Reddy <krish.reddy@samsung.com> Change-Id: Iaba4c15e18e1402035b11a34b2defe8078855751 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6209 Community-CI: Broadcom CI Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-02-11 08:40:48 +00:00
Niklas Cassel	4249dc1010	nvme: account for PRACT when calculating max sectors per transfer There is a special case when using 8-byte metadata + PI + PRACT where no metadata is transferred to/from controller. Since _nvme_ns_cmd_rw() already calculates the proper sector size using _nvme_get_host_buffer_sector_size(), which takes PRACT into account, change the sectors_per_max_io calculation to also take PRACT into account. This will avoid certain requests that don't need splitting getting split. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I8d450d37c2458453701189f0e0eca4b8fe71173b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6247 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-02-10 08:44:54 +00:00
Niklas Cassel	b05b3d0a12	nvme: add assert in nvme_ns_cmd_zone_append_with_md() Zone append commands cannot be split. _nvme_ns_cmd_rw() should never cause a NVME_PAYLOAD_TYPE_CONTIG zone append request to be split. This is currently true, but add an assert to make sure that any refactoring to _nvme_ns_cmd_rw() does not break this promise. Also add error handling, since release builds are built with asserts disabled. Follow-up patches will refactor _nvme_ns_cmd_rw(). Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I5fd2440c4c9d6bd8d56f30354b208a9047b64729 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6246 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2021-02-09 11:30:51 +00:00
Jim Harris	e329ec45a6	nvme: add set_nvme_pcie_set_hotplug_filter This function allows applications to specify whether they wish to allow probing a newly attached NVMe PCIe SSD. The env layer will only even probe devices that have been allowed. By default, this is all devices, but if the user has specified some list of allowed PCI addresses (via spdk_env_opts pci_allowed) then newly attached PCIe devices are implicitly not allowed. This API allows applications to add device addresses to the allowed list after the application has started. This API will be useful for use cases where multiple SPDK processes are running on one server, and assignment of PCIe SSDs to those processes are based on some function of the SSD's PCIe address. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I919bc267f2ad9130ab5c875ff760a301028b047e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6184 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: <dongx.yi@intel.com>	2021-02-09 11:27:19 +00:00
Vasuki Manikarnike	d92c2f118a	lib/nvme: Remove qpair from all lists before freeing it. Fixes #1777. When a qpair cannot be allocated because the transport connection fails, the qpair was freed without unlinking it from the other structures. This was leading to a segfault when attempting to create and free other qpairs. Also added a unit test to cover this case. Change-Id: I74b78d1847f90117248b07203b43a11ff5cfa5d6 Signed-off-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6272 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-02-08 16:13:59 +00:00
Mao Jiang	6b3ec9683e	nvme/rdma: Fix rdma ctrlr creating qpair memory leak Change-Id: Ie94cacac0b8dcf90b0243e8d568bb728dc7d3045 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6126 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2021-02-04 08:41:33 +00:00
Ziye Yang	74b2916c4a	nvme/rdma: Only wait for the RDMA event if spdk_rdma_qp_disconnect return 0 If rdma_qp_disconnect is not correctly sent out, we will not wait for the event. Change-Id: I99701e421dc93909d481ccf35e9bfd8004e60da8 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6163 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com>	2021-02-04 08:37:38 +00:00
Ziye Yang	5206698e77	nvme/tcp: Add the implementation to get the optimal polling group Add the real support in nvme tcp transport. Change-Id: I2aa9b0284d6fe009925e67f602a055e787f77987 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5734 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-02-04 08:30:54 +00:00
Ziye Yang	e749b5d3ec	nvme: Add the interface to get the optimal polling group This patch is used to add spdk_nvme_poll_group_get_optimal public API. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Iee34c89e0e1ff1f81167b18e198c144ca28f71de Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3311 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-02-04 08:30:54 +00:00
Changpeng Liu	8b203d1994	nvme/pcie: move the common IO path APIs to nvme_pcie_common.c The data path for PCIe and vfio-user transports are almost same too, so move the code from nvme_pcie.c to nvme_pcie_common.c, so that these APIs can be reused by vfio_user. No logic change for this patch. Change-Id: I82f480bba3bae0ce35e2a98f29839081095f7d50 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6040 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-01-26 16:30:10 +00:00
Changpeng Liu	da5c9de277	nvme: add NO_SGL_FOR_DSM quirk for Intel P55XX SSDs Intel P55XX SSDs can support SGL feature but can't use Dataset Management command with SGL format, so add a quirk here for now, if the limitation was fixed in future, we can remvoe this. Also SPDK doesn't privoide scatter buffer API for DSM, so using PRP with DSM is totally fine. Change-Id: Ibe92f4deb5b8bc2077115f5b7244bc17be4f3b23 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5858 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-01-22 08:16:53 +00:00
yidong0635	73487b15d8	nvme/nvme_tcp: Remove unnecessary returns. No need these returns at the end of void functions. So remove them. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I8889745f3ef82af513d03259a77a33c1f4f536cb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6015 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-01-22 08:16:17 +00:00
Niklas Cassel	aa6767fb14	nvme: add support for ZNS zone append command The Zone Append command is an optional command in the Zoned Namespace Command Set. Zone Append differs from a regular write, in that the command is not given an exact LBA of where to write the data. Instead the user has to set the zslba field to the start of a zone, and the data will be appended to that zone. The actual LBA where the data was stored is returned in the spdk_nvme_cpl, where Dword0 contains 31:00 of the ALBA field, and Dword1 contains bits 63:32 of the ALBA field. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Iabae1b3456bfbb62c07b63d79afe9a14e460fe83 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6013 Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-22 08:15:55 +00:00
Niklas Cassel	c078941ca1	nvme: create _nvme_get_host_buffer_sector_size helper function Create a _nvme_get_host_buffer_sector_size helper function, to avoid the same code being duplicated in several functions. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I8c14683c683a44e03c97eefa186833831f754bcc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6035 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-22 08:15:55 +00:00
Niklas Cassel	33b5627a37	nvme: only set check_sgl if sending in SGL type It does not make sense to send in check_sgl == true, when we are calling _nvme_ns_cmd_rw() with a payload of type NVME_PAYLOAD_CONTIG. _nvme_ns_cmd_rw() simply cannot "check SGL" if the payload is not a SGL. Doing so regardless just makes the code harder to read. We still send in check_sgl == true, when we are calling _nvme_ns_cmd_rw() with a payload of type NVME_PAYLOAD_SGL. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I56d49a2abf7819d20cf5974c9e0df8f04f1ccd10 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6009 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-01-22 08:15:55 +00:00
Alexey Marchuk	74542bae77	tcp: Rename readv_offset to rw_offset in nvme_tcp_pdu In the next patch this member will be used to track both read and write offsets Change-Id: I852125ff35257f9821ddf4a641d96afb29ebf0a0 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5924 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-21 09:55:53 +00:00
Changpeng	3ff9c13614	NVMe/vfio-user: add initial version vfio-user transport to NVMe driver When NVMf target linked with vfio-user library, we can use vfio-user client library to connect to the target. Here is the three examples that can work with target: identify -r 'trtype:VFIOUSER traddr:/var/run/muser/domain/muser0/8' -g perf -r 'trtype:VFIOUSER traddr:/var/run/muser/domain/muser0/8' -g -q 1 -o 4096 -w read -t 10 reconnect -r 'trtype:VFIOUSER traddr:/var/run/muser/domain/muser0/8' -g -q 32 -o 4096 -w randrw \ -M 50 -t 10 -c 0xE You can run the following test script test/nvmf/target/nvmf_vfio_user.sh to have a quick test, currently enabled with NVMe Identify,Perf,Reconnect tools. Change-Id: Ieb9842b2f372184fffbf7f23e4aad26feb47c350 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3839 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-01-21 05:00:18 +00:00
Changpeng Liu	1b0ec0b7c2	nvme/pcie: add controller specific vtophys Libvfio-user assumes the memory translation is IOVA=VA mode, since SPDK CI is running inside a VM, the memory mode is IOVA=PA mode, so when testing NVMe vfio-user transport inside a VM spdk_vtophys doesn't work with libvfio-user, so here we add a function to return memory address based on TRTYPE. Change-Id: I11d1c87197f7bbfc243b6bf368795c9a74bd1303 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5958 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-01-21 05:00:18 +00:00
Changpeng Liu	349895a25f	nvme/pcie: set doorbell base when allocating the bar Change-Id: Id40f03f1bc4e90113dad6326dbeb9a7b5af5c1b1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5960 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com>	2021-01-21 05:00:18 +00:00
Changpeng Liu	6b4b2d2913	nvme: move common pcie transport code new source files There are some common data structures and APIs in pcie transport which can be used both for pcie and vfio-user transport, so move the common code into a new header and source file. No actual logic change just the code movement except remove the static function declarations. Change-Id: Ie9021e703a5780fdd6840f0e3cfea76a0017a811 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5923 Community-CI: Broadcom CI Reviewed-by: sunshihao <sunshihao@huawei.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-21 05:00:18 +00:00
Changpeng Liu	6308a24f56	nvme: add a new custom transport SPDK_NVME_TRANSPORT_VFIOUSER The new custom transport can enable NVMe driver running with NVMe over vfio-user target. Change-Id: I5f90e8516eaca08fc3eab658b29b760a03326ff7 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5996 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-01-21 05:00:18 +00:00
sunshihao520	5fba455d3e	lib/nvme: add two async API for Directive Send and Directive Receive. Add two async API for Directive Send and Directive Receive. spdk_nvme_ctrlr_cmd_directive_send; spdk_nvme_ctrlr_cmd_directive_receive; Signed-off-by: sunshihao <sunshihao@huawei.com> Change-Id: Icb6974f74902df1512a5ffa9835188132634291b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5803 Community-CI: Broadcom CI Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-20 16:27:34 +00:00
sunshihao520	e69af13dea	nvme: add directive definitions to nvme_spec.h Add data structure needed by directives to nvme_spec.h Signed-off-by: sunshihao <sunshihao@huawei.com> Signed-off-by: Weifeng Su <suweifeng1@huawei.com> Signed-off-by: xiehuiming <xiehuiming@huawei.com> Change-Id: I0cbc53fb4f7ca448ab0566d826a7ae8800bbf504 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5683 Community-CI: Broadcom CI Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-20 16:27:34 +00:00
sunshihao520	1572882a7f	lib/nvme: add function spdk_nvme_bytes_to_numd to calculate number of dwords According to kernel, use an inline function spdk_nvme_bytes_to_numd to transfer paload_size form bytes to numer of dwords. Signed-off-by: sunshihao <sunshihao@huawei.com> Change-Id: I8b9ded122bbf4a3c8e46988993ea52404783c0b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5926 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-20 08:45:43 +00:00
Changpeng Liu	3962f9047a	nvme: add quirks for new NVMe controller Change-Id: Ice9da4b539b6c5955e449ec4d1ea18d580595caf Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5857 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-01-14 16:20:38 +00:00
Michael Haeuptle	02d3d439be	nvme: New function to perform a NVMe subsystem reset This commit introduces spdk_nvme_ctrlr_subsystem_reset to perform a NVMe subsystem reset according to the NVMe spec. Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Change-Id: If4ffae1bd92d4d16a62ec2b6a01f7373223b5705 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5488 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-14 16:20:29 +00:00
Alexey Marchuk	b6efb964cd	nvme/rdma: Use RDMA provider memory translation Change-Id: Ie0995a55d252c0167b82ef54aaf7c7b8e5fd75d0 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5122 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-01-14 16:19:48 +00:00
suhua1	6d306ae59f	nvme: Fix the problem of callback failure during hot plugging Signed-off-by: suhua1 <suhua1@kingsoft.com> Change-Id: Iba8afdfcf4d90245ab1aa700b12cb9c375c4fe76 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5855 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-12 08:07:04 +00:00
Mao Jiang	6fd1459493	nvme/rdma: Fix rdma allocation return unique pointer Allocate memory with zero number or size, maybe return a unique pointer rather than NULL. Add a check before common allocation APIs. Change-Id: I83e07cab5145035e705bc32364652be90f238633 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5809 Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-01-08 09:35:43 +00:00
Alexey Marchuk	6ea36f46c4	nvme: Update CQ error logging Current version provides unclear output Change-Id: Ib044b00b5f91b1e363911f1b79c51c73c8a6920c Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5743 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-12-31 08:28:32 +00:00
yidong0635	10717b577c	nvme/nvme_transport: Unify returns in disconnect and connect. Here "return rc == -EINPROGRESS ? 0 : rc;" They are the same meaning in these two functions. Keep the comments here. This makes more clear to readers. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I8590de3f0fe27337163ee8b02ea63e166f1bbe7c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5689 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-12-28 13:29:35 +00:00
Shuhei Matsumoto	974c70e63f	lib/nvme: nvme_ctrlr_fail() return immediately if ctrlr is already failed As log shows 00:06:32.300 [2020-12-18 21:13:35.511014] nvme_ctrlr.c:1414:spdk_nvme_ctrlr_reset: ERROR: Controller reinitialization failed. 00:06:32.300 [2020-12-18 21:13:35.511104] nvme_ctrlr.c: 925:nvme_ctrlr_fail: ERROR: ctrlr 192.168.100.8 in failed state. 00:06:32.300 [2020-12-18 21:13:35.511132] bdev_nvme.c: 392:_bdev_nvme_reset_complete: ERROR: Resetting controller failed. 00:06:32.300 [2020-12-18 21:13:35.511240] nvme_ctrlr.c: 925:nvme_ctrlr_fail: ERROR: ctrlr 192.168.100.8 in failed state. 00:06:32.300 [2020-12-18 21:13:35.511511] bdev_nvme.c: 556:bdev_nvme_failover: NOTICE: Unable to perform reset, already in progress. if spdk_nvme_ctrlr_reset() failed, nvme_ctrlr_fail() is called, and then if spdk_nvme_ctrlr_process_admin_completions() failed, nvme_ctrlr_fail() is called. We don't know which one comes first but nvme_ctrlr_fail() should do nothing if the ctrlr is already failed. Hence we should avoid setting ctrlr->is_failed and calling nvme_transport_ctrlr_disconnect_qpair() twice. However we should set ctrlr->is_removed if the parameter hot_remove is true. We do these changes in this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iac37c892e054fb59d78e69346ca7f0575d596235 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5694 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-12-28 13:28:11 +00:00
Jim Harris	3c2190c214	nvme: capture ticks for timeout before checking state In some extreme use cases, an SPDK process could get swapped out for a long period of time just after we checked the state but before we called spdk_get_ticks(). So now we will only timeout if the timer expired before we checked the state and the state did not advance. It's possible we could just move the timeout check to before the ctrlr->state switch, but I was hesitant to change the flow for this case. Fixes issue #1720. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I95b1db3365b5d2d8a65e528f53c302a724d44460 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5596 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-12-28 13:02:54 +00:00
Jim Harris	17afd268df	nvme: call correct remove_cb when device is removed When a device is removed, we should use the remove_cb that was specified when the device was originally probed and attached, if one was set. Also add a new spdk_nvme_ctrlr_set_remove_cb API. This can be used for cases where a different remove_ctx is desired than was specified for the probe call. This also enables setting a remove_cb when using connect APIs which do not have a way currently to provide a remove_cb. This also requires fixing the bdev nvme module, which was depending on the previously errant behavior. Fixes issue #1715. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id848b39040099ff7a21fe57ea6b194a8c25ae015 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5510 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-12-14 14:34:42 +00:00
Ziye Yang	771fae3ee9	lib/nvme: Simplify the code related with nvme_ctrlr_opts init 1 Remove the duplicated code. 2 Add a static check when developers add a new field. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ie9f88d2505c52633cb2077ba565e429a3ec18abd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5532 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-12-14 14:30:18 +00:00
Evgeniy Kochetov	eb753b2515	examples/nvme: Add hostnqn parameter to perf and identify Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I5f8435d9c270fa87295d1fc9126bb0c91aaf055c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5467 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-12-11 11:10:28 +00:00
Changpeng Liu	e1aba43745	nvme: hide the 2MiB memory check in env API spdk_vtophys() already checks that, so we don't need to check it in the NVMe driver again. Change-Id: I74288ae8cab80e1be34583475fa02a3ae13e090c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5166 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-11-25 08:52:51 +00:00
Nick Connolly	b8c99a3aad	lib: Use PRId64 for portability POSIX defines PRId64/PRIu64/PRIx64 for printing 64-bit values in a portable way. Replace a few references to %ld to remove the assumption about the size of a long. Similarly, use %z with size_t arguments. Where the value being printed is an unsigned 64-bit value, use PRIu64 instead of %ld. Explicitly test for not __linux__ where that is the intent, rather than testing for __FreeBSD__. Cast pointer to uintptr_t before aligning it, rather than using a specific integer size which may not be large enough to store a pointer. Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Change-Id: Icfe219e1bbb2d06b3ef05710fac5b7091d340251 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5142 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-11-20 11:01:37 +00:00
Niklas Cassel	23a1916935	nvme: add function to get the max zone append data transfer size The zone append command, which is part of the Zoned Namespace Command Set, has a maximum data transfer size that can be less than or equal to mdts. Since zone append commands will not be allowed to be split, the user has to be able to get the maximum zone append data transfer size. Add a function that returns this limit. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I9da2672ea8a307ff62251c069a42f7540765e08b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5140 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-11-20 11:00:18 +00:00
Niklas Cassel	38d59d8b5a	nvme: add function to check if controller supports zone append Zone append is an optional command in the Zoned Namespace Command Set. Add a convenience function to check if the controller supports the zone append command. The ratified NVMe TP 4056 added a CSI field (in cdw14) to the Get Log Page command. However, since there already exist two public functions to get a log page (spdk_nvme_ctrlr_cmd_get_log_page() and spdk_nvme_ctrlr_cmd_get_log_page_ext()), avoid creating a third one for now, since nvme_ctrlr_get_zns_cmd_and_effects_log() itself can leverage one of the existing public functions. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I99516dbac8db6714488b4d6cabe64c27f46d6153 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5078 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-11-20 11:00:18 +00:00
Niklas Cassel	efedaf17e8	nvme: remove superfluous parentheses Remove superfluous parentheses around ctrlr->cdata.mdts. They provide no value while making the code harder to read. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I4342d87f0e33fd92fe76357eb0379fb1e9c8f98f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5138 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-11-20 11:00:18 +00:00
Niklas Cassel	d7345e08ae	nvme: group NVME_CTRLR_STATE_WAIT_FOR_* states together While I assume that the initial thought of having the NVME_CTRLR_STATE_WAIT_FOR_* state directly after the state which it is waiting for, was to make it clear for the reader in which order the states will be executed. However, it feels silly to have the same code copy pasted everywhere. Someone who needs to add a new state will still need to edit nvme_ctrlr_state_string() and enum nvme_ctrlr_state, which still defines the NVME_CTRLR_STATE_WAIT_FOR_* state directly after the state which it is waiting for. In one way, moving the NVME_CTRLR_STATE_WAIT_FOR_* states to the end of nvme_ctrlr_process_init(), when reading nvme_ctrlr_process_init(), it is actually easier to see the ordering of the states which actually do something of significance. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Ia95ea5ac3c44a53179edbdc65cba45bec94e469f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5115 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-11-18 11:28:11 +00:00
Weifeng Su	00a126bea8	lib/env_dpdk: Change the location of lock files from /tmp to /var/tmp Move the lock files from '/tmp' to '/var/tmp' cause user maybe delete files in /tmp or remount /tmp by mistake, And the JSON-RPC domain socket located in '/var/tmp' also. Signed-off-by: Weifeng Su <suweifeng1@huawei.com> Signed-off-by: Shihao Sun <sunshihao@huawei.com> Change-Id: I18d52f42462e8477fb35aeea9e38efc51610d17c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5096 Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-11-17 08:25:31 +00:00
Weifeng Su	eacc9199fc	lib/nvme: Add error handler for parse_event Add handler for 'strstr, strrchr' function in 'NULL' return that maybe cause memory access issue. Signed-off-by: Weifeng Su <suweifeng1@huawei.com> Signed-off-by: Shihao Sun <sunshihao@huawei.com> Change-Id: I2525fbcd9f8ce0a78383305c735b2d27575f4bfe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5071 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-11-17 08:25:31 +00:00
Jim Harris	0e47be4b41	nvme: add nqn prefix to hostnqn Section 7.9 of the NVMe spec says that all nqns must start with "nqn.". Fixes issue #1669. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7fd0e6a0a397e831c4fa2377126b6b1e1b127d88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/5017 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-11-06 08:11:54 +00:00
Alexey Marchuk	d296fcd8d9	nvme_tcp: Fix icreq/icresp handing with zcopy enabled. There is a problem with TCP zcopy enabled: 1. TCP initiator sends icreq and start polling a qpair. Polling of qpair actively calls nvme_tcp_read_pdu function 2. nvme_tcp_read_pdu: qpair is in NVME_TCP_PDU_RECV_STATE_AWAIT_PDU_CH state, it reads 8 bytes of common PDU header. It determines the type of the PDU and finds the size of PDU_PSH header. 3. nvme_tcp_read_pdu: qpair is in NVME_TCP_PDU_RECV_STATE_AWAIT_PDU_PSH state. It should read 120 bytes of icresp PDU. The number of bytes which needs to be read is pdu->psh_len - pdu->psh_valid_bytes. qpair receives 120 bytes (the full PDU) and calls nvme_tcp_pdu_psh_handle -> nvme_tcp_icresp_handle. Here we check that we haven't yet received buffer reclaim notification and simply return from this function. At the same time we continue to poll the qpair. 4. nvme_tcp_read_pdu: qpair is in NVME_TCP_PDU_RECV_STATE_AWAIT_PDU_PSH state and tries to read data from a socket again. The number of bytes is pdu->psh_len - pdu->psh_valid_bytes. But now pdu->psh_len == pdu->psh_valid_bytes, so we call nvme_tcp_read_data with zero length. readv with zero length is commonly used to check errors on the socket, but in our case there is no errors and readv returns 0. 5. nvme_tcp_read_data treats zero as error and return NVME_TCP_CONNECTION_FATAL. Fix is to handle icresp, but leave qpair in INITIALIZING state until we receive acknowledgement for icreqsend_ack. We also move qpair to NVME_TCP_PDU_RECV_STATE_AWAIT_PDU_READY recv_state so recv_pdu will be zerofied and qpair will try to read a common PDU header. But since it is not initialized yet, it won't receive anything from the target. Fixes issue #1633 Change-Id: I22cedefe530a8ac3b51495988ed6265d8fad15bb Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4969 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-10-30 09:05:35 +00:00
Michael Haeuptle	7fc48a5ffc	nvme: break completion loop when ctrlr is invalid This fixes #1423 where the completion loop never breaks when the NVMe ctrlr is no longer present. This condition can happen during a hot remove. Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Change-Id: Ia238c8aeae720832068de28ce4d34a9d233344fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4831 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-10-29 08:11:35 +00:00
Jim Harris	ddf86600bb	nvme: continue probing ctrlrs even if one fails It is possible that a single probe_ctx could be used to probe multiple newly attached nvme controllers. If one of those controllers is removed during this process, the rest of the controllers do not get probed and can even get stuck in a zombie state. It is better to just continue with probing the rest of the controllers. Fixes issue #1611. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I4156ee8b50e8d52cfeee7224f210a58bb773e939 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4945 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-10-29 08:11:17 +00:00
Alexey Marchuk	85fa43241b	nvmf/tcp: Support ICD for fabric/admin commands According to the SPEC we should support up to 8192 bytes of ICD for admin and fabric commands. Transport configuration parameter in_capsule_data_size is applied to all qpair types - admin and IO. Also we allocate resources when we get a connection request, so we don't know qpair type at this moment. Create a list of buffer in TCP poll group to support ICD up to 8192 bytes when configuration ICD is less than this value. The number of elements in this pool is hardcoded, it is planned to add a new configuration parameter later. Fixes issue #1569 Change-Id: I8589e3e2ea95d515f5503c6de7c1ee40aaf7b6da Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4754 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-27 08:40:12 +00:00
Alexey Marchuk	e385cafa72	nvme: Don't log an error when we can't resubmit all requests In TCP NVME initiator with zero copy enabled requests might be completed asynchronously - out of qpair_process_completions context. At the same time we calculate requests completed asynchronously so that generic NVME layer can resubmit queued requests after calling qpair_process_requests (or poll_group_process_requests). But there is a time gap between async request complete and qpair_process_completions and the user can submit new IO thereby decrease the number of free TCP requests. That means that there might be less free requests than we excpected when we try to resubmit queued requests. The solution is change ERRLOG to DEBUG log since it is not a fatal case. Change-Id: If045ecd331cc6693e8ef450d8e15432dfa5d8812 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4859 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2020-10-26 11:48:46 +00:00
Alexey Marchuk	c72a16431a	nvme/tcp: Fix check of completion number during icresp handling The current approach checks "rc == 0". It worked before adding polling of poll group since a single qpair should return 1 completion for its own icreq while poll group can return several completions for all qpairs attached to this poll group (but .e.g not for those qpair who is waiting for the completion). Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I60d05d8d6640e4e2bbaf3cd533d2f5a3637adea1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4768 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-10-21 20:45:13 +00:00
Niklas Cassel	6dec7623be	nvme: add support for ZNS zone management receive command Add support for the ZNS zone management receive command. An internal nvme_zns_zone_mgmt_recv() function is created that matches the parameters of the zone management receive function in the ZNS specification. Convenience functions are provided for the following Zone Receive Action: Report Zones. Zone Receive Actions not implemented: Extended Report Zones. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I23589a602336da5dffccec7230d07026a868e81b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4793 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-10-21 12:18:44 +00:00
Niklas Cassel	be3ff9c0b5	nvme: add function to get ZNS num zones Add a function to get the number of zones for a zoned namespace. Since the ZNS specification does not allow zone size == 0, divide by zero should not be possible on a spec compliant drive. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I15e0ac8a72e244e248b7fb44f7156b182ecd98b4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4792 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI	2020-10-21 12:18:44 +00:00
Niklas Cassel	6abee9a205	nvme: add function to get ZNS zone size Add a function to get the zone size of a zoned namespace. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I89429fa2fd36307a8b34bdf91e3251c68596a48c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4791 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Broadcom CI	2020-10-21 12:18:44 +00:00
Shuhei Matsumoto	d8f4bbeb43	lib/nvme: Add spdk_nvme_detach_async() and spdk_nvme_detach_poll_async() Add two new public APIs, spdk_nvme_detach_async() and spdk_nvme_detach_poll_async() to detach multiple controllers in parallel as a simple manner to users. Hold the target controller to nvme_ctrlr_detach_ctx because users will free any object which held it after returning spdk_nvme_detach_async(). spdk_nvme_detach_ctx holds all nvme_ctrlr_detach_ctx in a sequence by linked list. spdk_nvme_detach_ctx has a boolean variable polling_started to prevent the user from calling spdk_nvme_detach_async() while spdk_nvme_detach_poll_async() is called repeatedly. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>i Change-Id: Ib049c19f7ef24410b963fd5c777a21184f3012d1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4758 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-10-20 16:55:55 +00:00
Shuhei Matsumoto	ea1bfd84cd	lib/nvme: Make internal of spdk_nvme_detach() asynchronous Add two new helper functions, nvme_ctrlr_detach_async() and nvme_ctrlr_detach_poll_async() to make the internal of spdk_nvme_detach() asynchronous. Use callback function to remove controller from the attached list after completing shutdown and before freeing to avoid conflict between attach and detach. Update MOCKs in the corresponding unit test cases. The next patch will add two public APIs spdk_nvme_detach_async() and spdk_nvme_detach_poll_async() based on this patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ifbdfec2a1facde9354007c6248f280e245a36eed Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-10-20 16:55:55 +00:00
Shuhei Matsumoto	3806b2e1f9	lib/nvme: Make nvme_ctrlr_destruct() asynchronous Following the last patch, separate nvme_ctrlr_destruct() into nvme_ctrlr_destruct_async() and nvme_ctrlr_destruct_poll_async(), but keep nvme_ctrlr_destruct() by replacing the internal by nvme_ctrlr_destruct_async() and nvme_ctrlr_destruct_poll_async(). Add shutdown_complete to nvme_ctrlr_detach_ctx. If shutdown_complete is true, we can destruct the controller. The case that nvme_ctrlr_shutdown_async() failed sets shutdown_complete to true. The case that nvme_ctrlr_disable() is called sets shutdown_complete to true unconditionally. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3994e259f9d3ccf8fede3ac03aadef911eefb9dd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4415 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-10-20 16:55:55 +00:00
Shuhei Matsumoto	257fcb7352	lib/nvme: Make nvme_ctrlr_shutdown() asynchronous This patch is the first of the patch series to make spdk_nvme_detach() asynchronous. We have lengthy shutdown notification, i.e., we have to wait a long time until shutdown processing is completed, in some SSDs. If the running system has many such SSDs, we see large intolerable delay. SPDK provides a controller option, no_shn_notification as a workaround. We can use the workaround if the use case of the detach is to switch to the next application without system reboot. However, we cannot use the workaround if we want to do system reboot after detach. To mitigate such lengthy shutdown notification, we need to parallelize detachment among SSDs. Hence the patch series will introduce an asynchronous detach API and will use the API to parallelize detachment. This patch adds the following changes. Introduce a context structure and separate nvme_ctrlr_shutdown() itno nvme_ctrlr_shutdown_async() and nvme_ctrlr_shutdown_poll_async() using the context structure. Name the context structure as nvme_ctrlr_detach_ctx because it will be used only in internal APIs. The upcoming public APIs will support multiple detachment and will have the contest structure named as spdk_nvme_detach_ctx. Use TSC instead of counter because polling interval will be controlled by the caller. Use the convenient macro, SPDK_CEIL_DIV(), to round off the time value in milliseconds. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9e2355fd24b6d6a4d6c1813577d53822304d4f33 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4414 Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-10-20 16:55:55 +00:00
Niklas Cassel	67475a58f4	nvme: add support for ZNS zone management send command Add support for the ZNS zone management send command. An internal nvme_zns_zone_mgmt_send() function is created that matches the parameters of the zone management send function in the ZNS specification. Convenience functions are provided for the following Zone Send Actions: Close Zone, Finish Zone, Open Zone, Reset Zone, and Offline Zone. Zone Send Actions not implemented: Set Zone Descriptor Extension. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I9cc04776c249d2b9c515438b81a0f06aea4198fe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4682 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-20 12:18:56 +00:00
Niklas Cassel	3be2935701	nvme: move ZNS specific functions to their own header file Move ZNS specific functions to their own header file: include/spdk/nvme_zns.h. This is similar to how it's done for e.g. NVMe-oF with include/spdk/nvmf.h. This avoids cluttering the nvme.h header with ZNS specific functions. A SPDK program that wants to use the spdk_nvme_zns_* functions will need to include include/spdk/nvme_zns.h, however, this is similar to how e.g. spdk_nvmf namespaced functions work. This change is safe to do now, since there is no SPDK release with ZNS so far. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: If39fab4e7a6924cc3bd67aa0c03e9c6de5f8a324 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4681 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-20 12:18:56 +00:00
Niklas Cassel	198a3ad2fd	nvme: if no command set is requested, use the most reasonable default Since namespace types were introduced in NVMe, the CC.CSS register has a new value (SPDK_NVME_CC_CSS_IOCS) which indicates that all supported command sets should be selected/enabled. This possibly includes command sets other than NVM and ADMIN only. Therefore, if a SPDK application wants enable all the command sets that the controller supports, it has to explicitly set opts->command_set to SPDK_NVME_CC_CSS_IOCS. To avoid possibly a lot of SPDK applications having to set this parameter, check if the user requested a command set explicitly, if not, make SPDK automatically use the most reasonable default, based on the supported bits set by the controller. The most common case is that you want to enable (all) the command sets that the controller supports. A user will still be able to restrict the controller to only use the NVM command set (or ADMIN only), by setting opts->command_set to a specific value. Since the current default command set value specified by spdk_nvme_ctrlr_get_default_ctrlr_opts() is SPDK_NVME_CC_CSS_NVM, which is defined as 0, we cannot know if the user specified a command set explicitly or not. To solve this, change the default command set value specified by spdk_nvme_ctrlr_get_default_ctrlr_opts() to CHAR_BIT (0x8), which is larger than the largest value that can be set in CS.CSS (which is only 3 bits wide, thus 0x7). Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I45ec148d3667ab87c41fbfb6d6612a1e0e5c9d9c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4701 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-20 12:18:56 +00:00
Jim Harris	95e5339490	nvme: fix signed/unsigned error with cuse ioctls cuse defines the ioctl handler to take an int for the command, but the kernel uses unsigned int. So cast the int parameter to unsigned int when doing the switch. Fixes issue #1613. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7ef13afc755ee79d8a79b3c7a09dccee7bb7aa1c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4716 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-19 10:00:29 +00:00
Changpeng Liu	9ae19a413b	nvme: restore qid in free_io_qids bit array There is an error when do following sequences: 1. Allocate an I/O queue pair 2. Do controller reset via spdk_nvme_ctrlr_reset 3. Allocate an I/O queue pair becaues the free_io_qids was reset and didn't restore. Fix issue #1621. Change-Id: Icd533f171079c12fe03be07e659e8eed9b082384 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4698 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-10-19 09:54:54 +00:00
yidong0635	846bb4b9e1	nvme/nvme_rdma: Fix possible used uninitialized value. In file included from nvme_rdma_ut.c:36: /home/clear/spdk/lib/nvme/nvme_rdma.c:651:22: note: ‘bad_send_wr’ was declared here 651 \| struct ibv_send_wr *bad_send_wr; \| ^~~~~~~~~~~ In file included from /home/clear/spdk/lib/nvme/nvme_rdma.c:41, from nvme_rdma_ut.c:36: /home/clear/spdk/lib/nvme/nvme_rdma.c: In function ‘nvme_rdma_poll_group_process_completions’: /home/clear/spdk/include/spdk/log.h:132:2: error: ‘bad_send_wr’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 132 \| spdk_log(SPDK_LOG_ERROR, __FILE__, __LINE__, __func__, __VA_ARGS__) \| ^~~~~~~~ cc1: all warnings being treated as errors. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I38ae36756b4bacef7e89f0f1737684c8b8981b12 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4696 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-10-16 08:15:57 +00:00
Vasuki Manikarnike	ed65607cf9	lib/nvme: Hold ctrlr->ctrlr_lock before calling nvme_ctrlr_fail Fixes #1615 Signed-off-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Change-Id: Ib38f38a1086ec804222ccb94dfb3d41a85b62a8d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4608 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-10-15 08:24:47 +00:00
Tomasz Zawadzki	4e8e97c886	log: remove internal log.h header There is nothing left here, so remove it. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib947d42bc577dbebb4650b1be885e05a80f8f8cf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4541 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI	2020-10-15 08:23:39 +00:00
Tomasz Zawadzki	2172c432cf	log: simplify SPDK_LOG_REGISTER_COMPONENT This patch removes the string from register component. Removed are all instances in libs or hardcoded in apps. Starting with this patch literal passed to register, serves as name for the flag. All instances of SPDK_LOG_* were replaced with just * in lowercase. No actual name change for flags occur in this patch. Affected are SPDK_LOG_REGISTER_COMPONENT() and SPDK_*LOG() macros. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I002b232fde57ecf9c6777726b181fc0341f1bb17 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4495 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Mellanox Build Bot Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI	2020-10-14 08:00:35 +00:00
Alexey Marchuk	3a2148213f	nvme/tcp: Enable zcopy send when qpair is attached to poll group We can receive buffer reclaim notifications only when a qpair is attached to a poll group (so qpair's socket is connected to a socket poll group). The previous assumption that we enable zcopy only for IO qpairs was wrong since IO qpair might not use poll groups too (e.g. abort application). Fixes issue #1607 Change-Id: I67329d755d81da6606e65eddfeceb20839346d87 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4476 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-10-06 09:35:31 +00:00
Shuhei Matsumoto	b8fbb070f8	lib/nvme: nvme_ctrlr_probe() fails if the controller is being destructed When we support spdk_nvme_detach_async(), any controller may be destructed asynchronously. We will be able to know the case by ctrlr->is_destructed is true and ctrlr is queued in the attached list. nvme_ctrlr_probe() should fail if the found ctrlr satisfies these conditions. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I299c2e5ea3c16cc1239899c163bb9e0eb921ade5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4413 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-10-05 08:27:56 +00:00
Niklas Cassel	084a062e81	nvme: add getters for ZNS specific data structures Add getters to the ZNS specific data structures, so that an SPDK application, e.g. examples/nvme/identify/identify.c, has the ability to get and utilize the information in them. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I26056161093cc811acb6840ff7e2068e5f6058f6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4412 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-09-30 07:26:04 +00:00
Niklas Cassel	c4d1b7d57a	nvme: add new state to get Identify IOCS Specific Namespace data structs Add a new state in the SPDK NVMe state machine in order to fetch I/O Command Set Specific Namespace data structures. Right now there is only support for the Zoned Namespace Command Set Specific Identify Namespace data structure. The NVM Command Set Specific Identify Namespace data structure is all zeroes right now, reserved for future use. The Key Value Command Set Identify Namespace data structure is not all zeroes, however, adding support for Key Value is outside the scope of this patch. The new NVME_CTRLR_STATE_IDENTIFY_NS_IOCS_SPECIFIC state is added after the NVME_CTRLR_STATE_IDENTIFY_ID_DESCS state. This is because we need to have fetched the identifiers in the desc list in order to know which command set a namespace belongs to. A slightly nicer design might have been to refactor the NVMe state machine to first fetch the id desc list, then the identify namespace struct, and finally the identify IOCS specific namespace struct. However, since this would have required a lot of changes, it didn't really seem justified. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I62cbc533c2c3eec1ccf0ba9b1c414d5a70919cff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4368 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-09-30 07:26:04 +00:00
Niklas Cassel	64563ada5d	nvme: add new state to get Identify IOCS Specific Controller data structs Add a new state in the SPDK NVMe state machine in order to fetch I/O Command Set Specific Controller data structures. Right now there is only support for the Zoned Namespace Command Set Specific Identify Controller data structure. The NVM Command Set Specific Identify Controller data structure is all zeroes right now, reserved for future use. The Key Value Command Set Identify Controller data structure is also all zeroes right now, reserved for future use. The new NVME_CTRLR_STATE_IDENTIFY_IOCS_SPECIFIC state is added after the NVME_CTRLR_STATE_IDENTIFY state. That way, if support for the Zoned Namespace Command Set is enabled during probing, we will fetch the Zoned Namespace Command Set Specific Identify Controller data structure, regardless if any Zoned Namespaces are attached or not, and no additional steps will be needed once a Zoned Namespace is attached. Since we only have one command set to fetch, avoid creating NVME_CTRLR_STATE_IDENTIFY_IOCS_SPECIFIC substates, although that will probably be needed when support for another command set is added. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I95535b09b03b7ef2ee9a11eebdbd28aad66d65ba Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4367 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-09-30 07:26:04 +00:00
Niklas Cassel	b2570934cb	nvme: change the type of spdk_nvme_ctrlr::state from enum to int When adding an additional state to enum nvme_ctrlr_state, abidiff (1.6.0) will report that almost every public interface in the nvme library has been impacted, causing test/make/check_so_deps.sh to fail. While it is possible that by adding another state, the compiler decides to use a larger data type for representing enum nvme_ctrlr_state, abidiff shouldn't complain in the first place, since spdk_nvme_ctrlr is only ever exposed as an opaque handle. It can never be accessed directly. Jim Harris suggested to workaround this abidiff bug by changing the type of spdk_nvme_ctrlr::state from enum to int. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I8b85446580043e95cf791249d643907587e2f982 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4427 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-30 07:26:04 +00:00
Jim Harris	1deb6b9e6b	nvme: disable zero copy for client TCP sockets This seems to be causing some CI test failures. So disable zero copy in all cases for now for client sockets. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iceea09fe65fb90c7df15f500878a473f1ad4152c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4473 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-30 00:21:26 +00:00
Jim Harris	ed071dd9f9	build: cleanup LOCAL_SYS_LIBS for uuid LOCAL_SYS_LIBS is meant to define direct system library dependencies for a given library. libuuid is directly used by the SPDK util library and then other SPDK libraries use uuid indirectly through util. So only the util library should include uuid in LOCAL_SYS_LIBS. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia0d2d63f48e6f89891164cf2f9dc4c7a6476d4e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4366 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-09-29 09:36:25 +00:00
Alexey Marchuk	86865969ff	sock/posix: Enable send zero copy for client sockets In NVME TCP initiator zero copy is enabled for IO qpairs and disabled for admin qpairs Change-Id: Ibdf521dccde9b95ec5dd15a5eb2baed8fcf8b88e Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4211 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-29 09:35:47 +00:00
Alexey Marchuk	a910bc647d	nvme/tcp: Calculate requests completed asyncronously A preparation step for enabling zero copy in NVMEoF TCP initiator. With zero copy enabled, some requests might be completed out of "process_completions" call and we should take them into account to return the correct number of completions. Change-Id: Iba7973f6da815645bbfad0334619d46b66379226 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4209 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-29 09:35:47 +00:00
Alexey Marchuk	2ceff364e5	nvme/tcp: Add synchronization for icreq send ack and icresp A preparation step for enabling zero copy in NVMEoF TCP initiator. We should wait for both events to occur before continue qpair initialization. Add a new bit to nvme_tcp_qpair::flags to track receiving of icreq ack since icreq is sent without tcp_req and there is no way to apply existing synchronization mechanisms. Move tcp qpair to initializing state if we receive icresp before icreq ack, this state will be checked during handling of icreq ack to continue qpair initialization Change-Id: I7f1ec710d49fb1322eb0a7f133190220b9f585ab Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4207 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-29 09:35:47 +00:00
Alexey Marchuk	bc36528cda	nvme/tcp: Process poll_group when waiting for icresp A preparation step for enabling zero copy in NVMEoF TCP initiator. Since nvme_tcp_qpair_process_completions doesn't process poll group, we can't get asycn notification from kernel. 1. Add a qpair to poll group before we send icreq in order to be able to process buffer reclaim notification. 2. Check if qpair is connected to a poll group and call nvme_tcp_poll_group_process_completions instead of nvme_tcp_qpair_process_completions when waiting for icresp 3. Add processing of poll group to nvme_wait_for_completion_timeout and nvme_wait_for_completion_robust_lock since they are used to process FABRIC_CONNECT command Change-Id: I38d2d9496bca8d0cd72e44883df2df802e31a87d Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4208 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-29 09:35:47 +00:00
Alexey Marchuk	a85579d8ef	nvme/tcp: Refactor header/data digest using bitfields Currently host/data digest are bool members of nvme_tcp_qpair structure. Change the type of this members to bitfield, reserved bits will be used in the next patches to support zero copy. Change-Id: If0659bf2445901e45fe0816af5f4fca5f494b154 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4206 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-29 09:35:47 +00:00
Alexey Marchuk	7388e54de4	nvme/tcp: Complete request when all acks are received A preparation step for enabling zero copy in NVMEoF TCP initiator. Make sure that we complete a request (call user's callback) when all acknowledgements are received. For write operation - when we received send cmd ack, h2c ack and response from target. For read operation - when we received send cmd ack and c2h completed Since we can receive send ack after resp command, store nvme completion received in resp command in a new field added to tcp_req structure Change-Id: Id10d506a346738c7a641a979e1c8f86bc07465a4 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4204 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-29 09:35:47 +00:00
Alexey Marchuk	2d4af0c174	nvme/tcp: Add synchronization for subsequent R2T requests. A preparation step for enabling zero copy in NVMEoF TCP initiator. Some NVMEoF TCP targets can send several R2T requests. We should check that we finished the previous H2C (received buffer reclaim notification from kernel) before sending the next H2C. This patch adds a new ordering bit indicating the described case and 2 fields to nvme_tcp_req to store the values from the last R2T request which will be applied when send ack is received. Change-Id: Iaa5ad846712ca18a8382680baa02413c18c4eb37 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4203 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-29 09:35:47 +00:00
yidong0635	ee17068b88	nvme: fix a typo. Meanwhile, to verify an issue about git push unittest failure. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: Idac60e5832390eb8bdce68aee639be2e9ac6cff6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4373 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-25 11:42:56 +00:00
Shuhei Matsumoto	583fa214a3	lib/nvme: Add two public APIs to get ANA group ID and ANA state of the namespace Add spdk_nvme_ns_get_ana_group_id() and spdk_nvme_ns_get_ana_state() to getthe ANA group ID and the ANA state of the given namespace, respectively. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id5f1f7ee488a1eb2a7a77f9986a3bb89146628e0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4354 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-25 11:42:44 +00:00
Shuhei Matsumoto	cc2b6962f9	lib/nvme: Keep ANA state per namespace up-to-date Add ana_state and ana_group_id to struct spdk_nvme_ns and keep them up-to-date by updating when spdk_nvme_ctrlr is created or ANA change notice is received asynchronously. For both cases, struct spdk_nvme_ctrlr holds the latest ANA state. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I311fe1c8015c8b8ac9659c38661244706c04b3e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4287 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-09-25 11:42:44 +00:00
Shuhei Matsumoto	3befb518b0	lib/nvme: Add an internal API nvme_ctrlr_parse_ana_log_page() Add an internal API nvme_ctrlr_parse_ana_log_page() to parse an ANA log page and execute the specified callback function for each ANA group descriptor in the ANA log page. We will be able to copy the ANA group descriptor to the caller instead. To do that, we will need to inform the size of the descriptor first, but the size will not be constant. Passing parser to the API will be more convenient. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ifd8fda30a83965948017fb8ad992c0d889197cde Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4279 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-09-25 11:42:44 +00:00
Shuhei Matsumoto	a0befabdd4	lib/nvme: Controller holds the latest ANA log page When creating a controller, allocate a buffer to the controller and read ANA log page into the buffer. When receiving ANA change notice, read ANA log page into the buffer to keep the contents up to date. The next patch will provide a public API to get the contents of ANA log page the controller holds. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: If5c653f4e80d157e5120bb754e6660250b2b8fa1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4233 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-25 11:42:44 +00:00
Shuhei Matsumoto	06d25b7021	lib/nvme: Add an internal API nvme_wait_for_completion_robust_lock_timeout Add an internal API nvme_wait_for_completion_robust_mutex_lock_timeout() and related internal APIs just call it with adjusting parameters. nvme_wait_for_completion_robust_lock_timeout() will be usable for the current use cases of nvme_wait_for_completion_robust_lock() and future use cases. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2b499643930256a39ebe279f56a399f20a7a2fde Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4217 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-09-25 11:42:44 +00:00
Ben Walker	85ddcf6f8d	nvme/tcp: Clean up error message Fix some spelling and make the message clearer Change-Id: Ib291542a9735d6409db84f16c530e78567123f67 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4249 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-09-22 11:40:58 +00:00
Niklas Cassel	d1db020954	nvme: add a csi parameter to nvme_ctrlr_cmd_identify() With the introduction of namespace types, the identify command has gained an additional parameter: Command Set Identifier (CSI). This parameter is similar to the existing parameters NSID and CNTID, and is not used by all CNS values. Most notably, the CSI parameter is not used for the existing CNS values 00h (ID NS) and 01h (ID CTRL). There are new CNS values, e.g. 05h (ID IOCS specific NS), and 06h (ID IOCS specific CTRL), which do take the new CSI parameter. The new CNS values instead return Command Set Specific data structures, which is basically an additional data structure. Therefore, the CNS values 00h and 01h are very much still in use. (Even the NVM Command Set has a Command Set Specific data structure, even though all fields in that data structure are currently reserved.) Since the CSI parameter is unused by all the existing calls to nvme_ctrlr_cmd_identify() (since none of the calls send in a CNS value that uses CSI), simply send in 0 for all existing calls. No functional change intended. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Ia2b2324393a0707152b2f8511f0a22ad4a12bd46 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4309 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-22 11:39:53 +00:00
Niklas Cassel	726ac67c55	nvme: use the most appropriate is_active() function There are two different is_active() functions. spdk_nvme_ctrlr_is_active_ns() which iterates through the active_ns_list, and spdk_nvme_ns_is_active(), which simply checks the nsdata. There is an event callback that refreshes active_ns_list when a relevant events has occured. In nvme_ns_construct(), nvme_ctrlr_identify_ns() has just been called, so we know that nsdata is as fresh as possible. Hence, there is no reason to iterate through a less fresh active_ns_list. Since we know that the nvme_ctrlr_identify_ns() call was done through the same controller, we also know that the active/inactive is from the perspective of the correct controller, so that is not a reason to use the less efficient is_active() function. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I185f59b53e16e70163e33a3909f4b55ebf631cc4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4293 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-18 07:25:06 +00:00
Niklas Cassel	743271d92d	nvme: cache the command set identifier Since the command set identifier might be accessed at several different states in the nvme state machine, cache it so that we don't need to loop through the ns id desc list every time. This is similar to how other identify fields are cached using nvme_ns_set_identify_data(). None of the identifiers in the desc list (including the new CSI) can change over the life time of a namespace, so caching them should be safe. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Ie06180a4b3750dfa1a42f47afe0f7f9e3ec04ba9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4266 Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-18 07:25:06 +00:00
Niklas Cassel	a7de7dee23	nvme: remove superfluous else statement If the nvme completion was an error, the function will return, so there is no reason for an else statement. In fact, the else statement in nvme_ctrlr_identify_ns_async_done() differs from the coding style used in other nvme_ctrlr_identify_* functions, and arguably makes the code harder to read. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: If76b823b7ca04ab98abb2912927c344ee9f12314 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4265 Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-18 07:25:06 +00:00
Niklas Cassel	07525a91b5	nvme: clear the ns id desc list in nvme_ns_destruct() Clear the ns id desc list in nvme_ns_destruct(). Without this, someone can get stale data by calling e.g. spdk_nvme_ns_get_uuid() on a destructed namespace. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I965dd4cd6101d3a77eddbd582b9618b3436d39c8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4263 Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-18 07:25:06 +00:00
Jim Harris	792807a444	nvme: fix infinite loop when aborting queued reqs When we disconnect a qpair, part of the code path is calling _nvme_qpair_abort_queued_reqs. This takes care of aborting any requests that were queued waiting for slots to open on the submission queue. It walks the STAILQ one by one and manually completes them with ABORT status back to the caller. But if the callback path submits another request, this request may also get queued to the end of the queued_req TAILQ. This can result in an infinite loop. The solution is to use an STAILQ_SWAP to a local, empty STAILQ. Then we ensure we only abort the requests that were queued when _nvme_qpair_abort_queued_reqs() started executing. Fixes issue #1588. I used the multipath.sh test to reproduce this on my local system. If it ever dropped into the STAILQ loop in this function, we would hit the infinite loop. With this patch, I confirmed locally that now we safely avoid the infinite loop and the test passes. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I657db23efe5983bd8613c870ad62695a7fc7f689 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4284 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-09-17 21:34:58 +00:00
Alexey Marchuk	e7c92b2426	nvme/tcp: Rename r2t_recv, set this flag when send_ack is 0 Rename ordering bit r2t_recv to h2c_send_waiting_ack, that is more descriptive name. Change-Id: I6d6143ff4c1cccc74e11226b7974706808092f9a Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4202 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-16 07:58:59 +00:00
Alexey Marchuk	dc88d13129	nvme/tcp: Move tcp_req ordering bits to union This makes it easier to zerofy ordering bits. Change-Id: If5696bfedfff1bf75e41c1449eac7fccb469e98b Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4201 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-09-16 07:58:59 +00:00
Alexey Marchuk	eb78b90ca8	nvme/rdma: Check that SGL descriptors fit into ICD The issue happens when SPDK RDMA initiator is connected to a remote target and this target reports rather small (or zero) ICD and we try to send several SGL descriptors. Since SGL descriptors are located in ICD, we should check that their total length fits into ICD. In other case sending such a command will cause RDMA errors (local length error) Change-Id: I8c0e8375dae799bc442ed2fab249cad2c4ccce51 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reported-by: Or Gerlitz <ogerlitz@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4131 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-16 07:58:13 +00:00
Jacek Kalwas	4e06de69b8	nvme: expose functions to manage queue identifiers In cases where the SPDK nvme driver is being used as a validation/test vehicle, users may need to allocate a currently unused qid that can be used for creating queues using the raw interfaces. One example would be testing N:1 SQ:CQ mappings which are supported by PCIe controllers but not through the standard SPDK nvme driver APIs. These new functions fulfill this purpose, and ensure that the allocated qid will not be used by the SPDK driver for any future queues allocated through the spdk_nvme_ctrlr_alloc_io_qpair API. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I21c33596ec415c2816728a600972b242da9d971b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3896 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-15 07:13:46 +00:00
Changpeng Liu	fd5e781afc	nvme: don't loop MMIO reads if the controller is already in failed state This may happen when resetting a controller, if the ADMIN queue failed to reconnect, the controller is set to failed state, so for this case we don't need to loop until timeout, just exit. Change-Id: I2b37af5453086cd64f3609c41eb8f6475da55fd4 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4143 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: JinYu <jin.yu@intel.com>	2020-09-14 09:19:07 +00:00
Shuhei Matsumoto	ce7eee9861	lib/nvme: Enable Async ANA Change Notice if supported Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ida297aaf5b8212dde5abfe8e2d3df8d333f6c109 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4157 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-09-14 09:18:35 +00:00
Shuhei Matsumoto	6c6efee89d	lib/nvme: Set Add ANA log page to be supported by controller If ctrlr->cdata.cmic.ana_reporting is 1, set the corresponding field to true. Then use its API in the identify application. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4e74bc4c114883e4aecdbee7a6f1a02027db23a5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4156 Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-14 09:18:35 +00:00
Niklas Cassel	174a5fe140	nvme: add initial namespace types support Add support for getting the Command Set Identifier for a given namespace. The SPDK_NVME_CAP_CSS_IOCS feature can be implemented on top of an old NVMe specification. If the feature is set, retrieve the NS ID Descriptor List regardless of the NVMe specification version. The quirk is still respected. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I7b257115ecb0d813ba75201c0f48960c7070dcc9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4085 Community-CI: Broadcom CI Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-11 07:29:51 +00:00
Niklas Cassel	5c861295dd	nvme: warn if found UUID descriptor length is incorrect Warn if found UUID descriptor length differs from NIDL for NIDT_UUID. This will help identify non-compliant NVMe controllers. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: Idf0daff9996147f38413318d1cd7fc3f929c5ce4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4138 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-11 07:29:51 +00:00
Jim Harris	2431578990	nvme: increase size of uevent buffer to 1MB In production environments, there could be large numbers of uevents other than nvme hotplug events. We want to ensure we never lose an nvme uevent due to ENOBUFS (i.e. overflow). So allocate a bigger receive buffer for the netlink socket to ensure we never lose any events. We only allocate one netlink socket per SPDK application, so the extra memory consumption is not really a concern. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I663fbb093516a01a8980a1517245f92d8c76f7aa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4070 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-08 07:49:32 +00:00
Jin Yu	ba773a54c7	nvme: fix keep alive issues in host side There are two bugs: 1, When the target response 0, it means target does't support keep alive. 2, Change the interval time to us so when the keep alive timeout is 1ms then the interval is 500us. Fix github issue: #1565 Change-Id: I75707ab0e4e639209a9c50ef326492fae213044d Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4077 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-09-08 07:46:40 +00:00
Ben Walker	4aa0bb676b	nvme/pci: Check for command completions after deleting submission queue After a submission queue is deleted, the device is supposed to post completions for every command to the completion queue. Previously, we never looked and completed all commands with an ABORTED status. Instead, complete any commands in the completion queue with the status the drive gave them. Change-Id: If851a365d4f305cf4390454b6b26dd0f7c5b82ac Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3875 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-09-04 08:14:44 +00:00
Tomasz Zawadzki	ccdbd2ed51	nvme/cuse: fix nblocks and block_size in rw path nblocks is zero based, so read path was missing the increment. NVMe device that cuse represents can be of any block_size, so rather than hardcoding 512 - actually verify it. Both paths didn't request enough of a buffer from cuse. Reported-by: Niklas Cassel Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I228dc2572bc94ecbcb913e950d912a7ab5be9434 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4037 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-04 08:13:56 +00:00
Tomasz Zawadzki	ef3f0d97ab	nvme/cuse: refactor read/write to pass cuse_device and block_size This patch does not alter functionality, just moves around where cuse_device and block_size is determined. Next patch will fix both paths. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I5a827b5b4ab080b2aa0f76f5cdcbcb177b38b474 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/4036 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-09-04 08:13:56 +00:00
Seth Howell	58216dd07e	lib/nvme: fix mem leak in req submit. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If64c06177605a8f57d87ba22b86fe58ddebd6f7a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3921 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2020-09-02 07:38:38 +00:00
Jacek Kalwas	477d9500a3	env: fix pci driver registration Registration macro now generates function based on driver's name. It allows to have multiple registration within single source file. Similar pattern is used e.g. by SPDK_NVMF_TRANSPORT_REGISTER. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ied0887e8dae7fe9ca1517313be5eff8f218b7e98 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3895 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-27 08:36:43 +00:00
Seth Howell	309200661f	lib/nvme: pass up rc from failed connect in reset. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I8041728de7bb7bdb35fb1570f3c2fb20feef8664 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3877 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-26 09:47:24 +00:00
Seth Howell	316f92d118	lib/nvme: pass up fabric connect rc to app. This will allow applications to understand why they were unable to connect. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ic04c7e72098c6ec1823de7d6a07d90150ef5ac20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3836 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-08-26 09:47:24 +00:00
Jim Harris	178291a516	nvme: read all uevents Currently when the uevent processing code finds a non-uio/vfio uevent, it just stops its loops and returns. This means that if there are a lot of non-uio/vfio uevents, the netlink socket buffer can build up until its full because only one non-uio/vfio event gets drained per spdk_nvme_probe() call (which may be very infrequently). So modify parse_event so that it does not indicate error when a non-uio/vfio event is found. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic8a40f71ee89d597ce46129eac889fe5b7ef5171 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3876 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-24 07:38:33 +00:00
Ziye Yang	d4d2e317b5	nvme/tcp: Make the return value consistent. We should make nvme_tcp_ctrlr_connect_qpair always return negative value if this function fails. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I457e704e39d7a3acd298fd48e89e8ea51e2ed4ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3809 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-24 07:37:13 +00:00
Changpeng Liu	5adf099eed	nvme: don't send Identify NS ID Descriptor List to inactive NS The specification says it will return INVALID FIELD if the NS is in inactive state. Fix issue #1551. Change-Id: I1b32f023ed665d410f4705e439068699e2b2f8de Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3860 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-21 08:45:31 +00:00
Alexey Marchuk	8bec9feb76	nvme/rdma: Remove unused spdk_nvme_send_wr_list nvme_rdma_qpair::sends_to_post is not used, remove it and spdk_nvme_send_wr_list structure Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: If9c42736d4e796a947bbfe80f59efd2fd7f77859 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3822 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-21 08:24:43 +00:00
Seth Howell	ce83fc2aff	lib/nvme: remove qpair from ctrl list in connect fail path This is an oversight that can cause issues with looping through the list if we end up allocating the same qpair twice. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I513ea35398f4b724366c21be144531fbfbdb4347 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3835 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-21 08:24:18 +00:00
yidong0635	361cddfd63	nvme/nvme_pcie: Remove unused codes. pctrlr->cmb.mem_register_addr and pctrlr->cmb.mem_register_size are assigned after spdk_mem_register. if spdk_mem_register is failed , ctrlr_map_cmb hasn't been executed. they are not be used. So remove them. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I3d1996eee8b5260b79c4c3e0a2e1d376da2343b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3856 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:24:00 +00:00
Changpeng Liu	5d5a9077a3	nvme: don't assert on custom transport Change-Id: I2d425c127dc070f7bb508f5a61e6304f6042fdf7 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3857 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-21 08:23:19 +00:00
Ziye Yang	0d3cc15a62	nvme/tcp: Correct the incapsule data usage According to page35 in recent NVMe-oF spec ( NVMe-over-Fabrics-1.1-2019.10.22-Ratified), ioccsz is used to restrict the incapsule size of I/O command, so do not restrict the NVMe-oF OPC command and also the admin command. We accidently trigger an bug in kernel since we do not send the fabrics command with the incapsule and make the kernel coredump, though the kernel has bugs. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I869a2c8ab7b9c2ac1e5cc5b603920662591c2c64 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3837 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-20 09:26:06 +00:00
Ziye Yang	2031f8f70d	nvme: set the error code if we cannot send keep alive command. If the transport is broken, we should set errno code in spdk_nvme_ctrlr_process_admin_completions instead of keeping silence. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ie73763e1329e12a8c82a0223d360991f86c39be3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3773 Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-08-19 07:29:26 +00:00
Seth Howell	518a1e013a	lib/nvme: make fabrics connect timeout configurable. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: If829d399882ef948d95673c17e5689c91386c21d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3795 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Seth Howell	b3bb3a1bbf	lib/nvme: change timeout in wait_for_completions to usec This allows for much more granular control over the timeout. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ib23de21e60eec4207c55320579699edf284f4e16 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3794 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-08-19 07:29:19 +00:00
Tomasz Kulasek	549b9f31c6	lib/nvme: implement SPDK_NVME_DATA_NONE data transfer in CUSE Change-Id: Ifb2a53bdbaabd74b7f412923a97d79b44afde861 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1744 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com>	2020-08-12 10:39:29 +00:00
Seth Howell	a22f55658c	lib/nvme: add timeout to fabric connect commands. We can't afford to wait here blocking forever. Add a reasonable timeout (1 second) to this command. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I7e373c5515d42a46403f03d7d09e72225f8910ff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3563 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-08-10 11:27:16 +00:00
Ziye Yang	7bac9b06f1	nvme TCP: Make the control related pdu also allocated from the SPDK DMA memory Purpose: To make the pdu management consistent with other PDUs, then we can easily adapt our code into some hardware offloading solution. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Ic4a2847fd1b6cacda4cbaa52ff12c338f0394805 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3588 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-08-04 18:28:08 +00:00
Seth Howell	0d8f86f842	lib/nvme: don't submit request if qpair is disconnected. This becomes a problem when the qpair is reconnected. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I6677b396cf766684a4891ffbee93aa3e4e83374d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3391 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-31 08:23:44 +00:00
Seth Howell	0b1799cd98	nvme/transport: addd assert for transport. Silences a KW error. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ifd8d6088a22de7c230d48751be2b3991d0649778 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3553 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-29 07:37:26 +00:00
Jim Harris	751e2812bc	nvme: do not abort reqs in multi-process cleanup path When a process cleans up IO qpairs from another crashed process in a multi-process environment, we must not try to abort reqs for that IO qpair. Any reqs will contain callbacks for the crashed process which we must not try to execute in a different process. Fixes issue #1509. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5e58cce7bdb86e3feb4084733815c086901f867e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3536 Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-27 22:42:07 +00:00
Darek Stojaczyk	cdffd22581	env_dpdk: move NVMe PCI driver definition to the nvme lib Now that drivers can be registered from upper layers there's no need to keep them centralized inside env. (check_format.sh complains that spdk_pci_nvme_get_driver() shouldn't start with the spdk_ prefix - to workaround that we move the function declaration from one place in env.h to another - that's enough to convince check_format it really is a public function) Change-Id: If86aebd6c997349569c71430ec815b413eb44ef8 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3187 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-24 21:42:26 +00:00
Seth Howell	c9bc86c061	lib/nvme: add checks for new states in reconnect_io_qpair. This function hasn't kept up properly with the states that we use for tracking the qpair lifecycle. Add checks for NVME_QPAIR_DISCONNECTING and NVME_QPAIR_DESTROYING. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I51607d4f00e94937b08fca28e766163580d46461 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3359 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-07-24 15:43:07 +00:00
Darek Stojaczyk	45528bfef6	env: add spdk_pci_id->class_id This follows struct rte_pci_id which had class_id as well. We'll need it to make some additional DPDK APIs public through the env abstraction. Change-Id: I794a6cd6b17e48daf53b48fa5abe3d3dcfeaa403 Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3182 Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-23 20:48:47 +00:00
Darek Stojaczyk	64c4fafd32	nvme/pcie: dont initiate hotremoval on SIGBUS for VFIO devices This is an attempt to workaround a kernel data race spotted in #1275. We have to delay our VFIO interraction until after the kernel handles the PCIe hotremoval. Otherwise we risk having a deadlock inside the kernel. We could put a dummy delay in SPDK hotremove routine, but we don't know exactly how long we have to wait, so wait for the VFIO hotremove notification to arrive instead. From what I understand it is always sent after the kerner already releases the critical locks. This patch breaks hotremove for VFIO devices when running with DPDK version < 18.05, but those reached EOL already and their support is removed from SPDK later within this series. It would be nice to wait for this VFIO notification inside lib/env_dpdk e.g. inside the detach function, but we don't know there why the detach is happening. It could be triggered by a user RPC, not physical device hotremoval. Change-Id: I2727de923bb2e3e05a83202465330a32a77d7cfc Signed-off-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1732 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-07-23 20:48:47 +00:00
Ziye Yang	1da44e0604	nvme_tcp: Move the default buffer factor size in nvme_tcp.h 1 Change the default factor from 4 to 8, which can be used to improve the performance. 2 Change the base buffer size in nvme_tcp.c, we should not use sizeof(struct spdk_nvme_tcp_cmd), it is 72 bytes. Normally, the initiator will receive C2h pdus and R2T Pdus by most, so set the size of using sizeof(struct spdk_nvme_tcp_c2h_data_hdr) is enough. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I384f4cb026cb8d83e75b639f7256ee8cb8ed1df1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3283 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-07-22 12:21:07 +00:00
Shuhei Matsumoto	71da42e953	lib/nvme: Add spdk_nvme_ctrlr_cmd_abort_ext() to abort commands whose cb_arg matches A new API spdk_nvme_ctrlr_cmd_abort_ext() gets cmd_cb_arg as a parameter, and use it to abort requests whose cb_arg matches cmd_cb_arg. The caller can set the parameter qpair to NULL if it wants to abort requests on admin qpair. Hold ctrlr->ctrlr_lock throughout because abort request is submitted to admin qpair. The API is not critical for performance, and so initialize parent data first. The API is for a specific qpair, and so hold SQID in the parent, and it is copied to the children. On the other hand, CID is set to child when request to abort is found. Use an new helper function nvme_transport_qpair_iterate_requests() to add abort request for each outstanding request which has cmd_cb_arg as its callback context. The case is possible such that the request to abort is not outstanding yet but queued. Hence abort queued requests which has cmd_cb_arg as its callback context too, but it is done only if there is no error so far. If only queued requests are aborted and there is no outstanding request to abort, complete with success synchronously. If there is no outstanding or queued request to abort, return -ENOENT. When any abort request is submitted, the difference between success and failure is only bit 0 of CDW0 according to the NVMe specification. We cannot the existing helper functions nvme_request_add_child() and nvme_cb_complete_child() but can use nvme_request_remove_child(). nvme_qpair_submit_request() may use only nvme_request_remove_child() from these three helper functions. Hence we use req->parent as other types of request do. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3a271c6542f8e2e6b425b3bf6151f41e924bc200 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2039 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-14 07:54:42 +00:00
Shuhei Matsumoto	576a373d58	lib/nvme: Abort queued requests whose cb_arg matches Use another list dedicated to hold queued requests being aborted to avoid potential infinite recursive calls. Add a helper function nvme_qpair_abort_queued_req() to move requests whose cb_arg matches from qpair->queued_req to qpair->aborted_queued_req. Then nvme_qpair_resubmit_requests() aborts all requests in qpair->aborted_queued_req. The first idea was that nvme_qpair_abort_queued_req() aborts queued requests directly. However, this caused infinite recursive calls. Hence separate requesting abort to queued requests and actually aborting queued requests. The detail of the infinite recursive calls is as follows: Some SPDK tool submits the next request from the callback to the completion of a request in the completion polling loop. For such tool, if the callback submits a request and then aborts the request immediately, and the request could not be submitted but queued, it will create infinite recursive calls by request submit and abort, and it will not be able to get out of completion polling loop. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8196182b981bc52dee2074d7642498a5d6ef97d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2891 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-13 08:40:42 +00:00
Alexey Marchuk	e137881e4e	nvme/tcp: Insert free req at the head of the list lifo model is more cache friendly Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Id937ab0c1b8b4ce121136144c7d6013bbe5eb963 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3282 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-07-13 08:40:31 +00:00
Jacek Kalwas	4d9ab1e9a1	nvme: pretty print dptr Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I576878fbbafc3d17617ceeec99e40565be7d5d3d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3213 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-10 07:30:59 +00:00
Jacek Kalwas	64f05eb5c5	nvme: pretty print fabric cmd Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ib4bc28026cab208d45c8b876714fa525e5bb38f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3200 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-10 07:30:59 +00:00
Jacek Kalwas	9cd4723913	nvme: pretty print set/get features Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ib6f1811da9a6294983bce04cff01ba1fb5e45607 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3179 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-10 07:30:59 +00:00
Jacek Kalwas	41b360d54e	nvme: add missing cmds to admin opc string Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I6cdcf675ebc8ad31d88b5469f87e1eae066b2e3c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3178 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-10 07:30:59 +00:00
Jacek Kalwas	61668cc43e	nvme: introduce new set of cmd/cpl printers Having functions without qpair on the interface allows for wider usage e.g. by nvmf layer. Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I3a51ad53f00eb29e2ba2681ef4ff0cc2a197b65d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3176 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot	2020-07-10 07:30:59 +00:00
Ziye Yang	4c9aad0299	nvme/tcp: Report the free entries if sending_ack is set Previous we fix the same issue in this commit: `cb98b2ab3e` But we forget to fix it here. And we also need to update here, otherwise we will still face the same issue described in commit: `cb98b2ab3e` Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I3660dbb6e97c92ea4cb347cfce4bf23c6dfe97ab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3242 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-07-09 07:23:19 +00:00
Jacek Kalwas	42f2f01a0e	nvme: fix timeout log during ctrlr state change Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Icf6741f059ed92fdf0ee9fd0952acdd07711c69c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3226 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-07-08 07:54:29 +00:00
Shuhei Matsumoto	d80c9f6257	lib/nvme: Add underscore prefix to nvme_qpair_abort_queued_reqs() This is a preparation to the next patch. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I15356c69e676dc41d3af69caa6d12c1fcb282152 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3071 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-07-08 07:54:01 +00:00
Shuhei Matsumoto	f2bd635ecf	lib/nvme: Add qpair_iterate_requests() to iterate the common operation among transports To abort requests whose cb_arg matches, add child abort request greedily. Iterating all outstanding requests is unique for each transport but adding child abort is common among transports, and adding child abort is replaceable by other operations. Hence add qpair_iterate_requests() function to the function pointer table of transport, and pass the operation done in the iteration by a parameter of it. In each transport, the implementation of qpair_iterate_requests() uses TAILQ_FOREACH_SAFE() for potential future use cases. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic70d1bf2613fce2566eade26335ceed731f66a89 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2038 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-07-08 07:54:01 +00:00
Shuhei Matsumoto	aa2ea2bed5	nvme/rdma: Follow the fix in TCP transport and restore nvme_rdma_req_put() Recently two patches were merged but we should have get more reviews. The fix done in TCP transport will be better because we can keep the existing functions and make the code change minimum. Restore nvme_rdma_req_put() and move removing rdma_req from rqpair->outstanding_reqs to nvme_rdma_req_complete(). One exception is the case that only nvme_rdma_req_put() is called. For the case remove rdma_req from rqpair->outstanding_reqs before calling nvme_rdma_req_put(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3f68dbc88c60af6b8f4ecc3209fde9b763ac3189 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3073 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-07-08 07:54:01 +00:00
Shuhei Matsumoto	ad69e739e1	nvme/tcp: Dequeue request from outstanding list before calling completion Each request has a callback context as cb_arg, and the callback to nvme_complete_request() for the completed request may reuse the context to the new request. On the other hand, TCP transport dequeues tcp_req from tqpair->outstanding_reqs after calling nvme_complete_request() for the request pointe by tcp_req. Hence while nvme_complete_request() is executed, tqpair->outstanding_reqs may have two requests which has the same callback context, the completed request and the new submitted request. The upcoming patch will search all requests whose cb_arg matches to abort them. In the above case, the search may find two requests by mistake. To avoid such error, move dequeueing tcp_req from tqpair->outstanding_reqs before calling nvme_request_complete(). One exception is the case that only nvme_tcp_req_put() is called. For the case remove tcp_req from tqpair->outstanding_reqs before calling nvme_tcp_req_put(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I5f2ac292c60431ac1e27b8657db92b220860a0a8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2865 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-08 07:54:01 +00:00
Shuhei Matsumoto	e060285ea6	nvme/tcp: Change nvme_tcp_req_complete() to take tcp_req instead of req Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ida0ee76015821d7db54b273d14383a245a18047b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3058 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-07-08 07:54:01 +00:00
Ziye Yang	449dee3563	nvme/tcp: Fix the sending conflict between cmd and h2c pdu. As is well known, we may also handle the r2t data pdu receving earlier before calling the cb function of send_cmd due to the outof order execution of the lower layer uring socket interface.So we need to fix this issue, otherwise the data of the sending_pdu will be placed with the wrong data. And it will cause the issue shown in https://github.com/spdk/spdk/issues/1473 Fixes #1473 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: Idac1ad65761695f3a655b85003861c1d1f4f3875 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3215 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-07 07:31:53 +00:00
Jin Yu	d76951c7ba	nvme_rdma: fix the recvs_to_post.first to NULL nvme_rdma_qpair_submit_recvs is not judged in nvme_rdma_poll_group_process_completions path. If we do not clean the recvs_to_post.first we may get the wrong current_num_recvs when the rc is non-zero and call it again. Change-Id: If0046e711525dcfcb419132a01fed7a09db13ba0 Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3163 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-06 07:20:33 +00:00
Jin Yu	05805e54a0	nvme:disconnected state then destroying state Put the destroying state after the disconnected state. Because nvme_transport_ctrlr_disconnect_qpair will modify the state of qpair to disconnected, and in the path of rdma, it will postpone the deletion of qpair until the release of pg by judging the destroying state. So qpair is not deleted. Change-Id: Ica606905cddf67d0ffda14bd48cc5f4e424f01ee Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3136 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-06 07:20:26 +00:00
Jin Yu	19228a0602	nvme_rdma:fix current_num_sends to current_num_recvs Change-Id: I1a3067165c06db3fe7d7fd1c1ec149e845100b27 Signed-off-by: Jin Yu <jin.yu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3162 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-06 07:20:26 +00:00
Simon A. F. Lund	ae6c40a4a5	lib/nvme: fallback to NVM when then given command_set is not supported When the user requests a non-default command set configuration, via the probe_cb() supplied to spdk_nvme_probe(), which is not supported, then the controller initialization will not proceed. This patch changes that behavior into falling back to the NVM command set and continue with the controller initialization. It is done by assigning the NVM command set to opts.command_set such that the user knows in attach_cb() with which command_set the controller is configured/enabled. The fallback is needed since the user does not have access to the controller capabilities register. The strategy left for the user is thus to try. However, this is an issue, as the user only has one attempt, as subsequent calls to spdk_nvme_probe() will not trigger probe_cb() for the controllers whose initialization did not proceed. Signed-off-by: Simon A. F. Lund <simon.lund@samsung.com> Change-Id: Ia414628fcd7d56956649647775462e62d98c0a90 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2931 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-07-03 07:31:31 +00:00
Jim Harris	7ff9609f3d	nvme: add security send/receive quirk for Intel 900P SSDs These SSDs set the oacs.security bit but do not actually support OPAL. So do not set the controller flag indicating SECURITY_SEND_RECV support in this case. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7fcfeafcc8d9439a1c53c60a1aea1801923a2ce5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3156 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-03 07:31:20 +00:00
Jim Harris	16364fdc14	nvme: copy 8 bytes at a time into virtualized CMB QEMU won't emulate wide (>8 byte) instructions on an MMIO region. We tried to fix this previously but it wasn't sufficient. So this version adds an explicit copy function that will only copy 8 bytes at a time from the source cmd pointer into the SQ cmd point in the virtualized CMB. Fixes issue #1458. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I804005a2e8cf3f7365c1608674f82a3f843e59ba Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3135 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-02 07:21:38 +00:00
Alexey Marchuk	e762508854	nvme_rdma: Add check for keyed SGL length The length of a keyed SGL data block is limited by 3 bytes. Add a check to fail requests which length exceeds 3 bytes. In other case we can send an incorrectly formed SGL request with an invalid or zero length. Fixes issue #1450 Change-Id: I77cdaff5fbf4be5754a3ac6008b8ccd532ac5905 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3056 Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-07-02 07:21:31 +00:00
Ziye Yang	cb98b2ab3e	nvme/tcp: Report that we have free entries if send_ack is set. Without this patch, we will face the following warning code when compiled with (--with-uring --enable-debug) while testing big I/O size: 256KB, e.g., "nvme_qpair.c: 474:nvme_qpair_resubmit_requests: ERROR: Unable to resubmit as many requests as we completed" The reason is because the nvme_tcp_request structure is not freed yet if send_ack is not set, so there will be no entries when there are other requests submit again. And this patch can mitigate such issue. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I4c7616fbd3c82a883b4e9facd257a1a4f66e876d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3123 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-07-01 07:51:17 +00:00
Seth Howell	203ed4f673	lib/nvme: report rdma_connect errors up the stack. This will allow applications to discern specific connect behavior and make choices relative to it. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I46182c285367ceb8a72511defe4508b3592b4572 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3095 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-29 09:19:09 +00:00
Alexey Marchuk	8421f83973	rdma: Fix qpair desctruction in error flow rdma_qp may not be initialized when qpair is not fully created. When such a qpair is being destroyed we may pass a NULL pointer to spdk_rdma_qp_disconnect or spdk_rdma_qp_destroy and hit an assert. This patch fixes this problem for NVMEoF target and initiator. Change-Id: I84787dc1b1211293c2a19f59d47727eaecd9d5a1 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3050 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-06-29 09:18:52 +00:00
Michael Haeuptle	89013903fe	NVME: Fixes stuck IOs during hot remove (#1451 ) When a NVMe device is hot removed, subsequent calls to nvme_qpair_submit_request can fail with ENXIO. The failure path handling for ENXIO did not free the request which exhausts the qpair's free_req list eventually and all IOs are stuck going forward. This fix adds the same cleanup handling to nvme_qpair_submit_request for this error case as it is done in _nvme_qpair_submit_request. Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Change-Id: I5677d53965bdbd6d339c013483cdf42ce782099a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3018 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-06-29 09:18:29 +00:00
Ziye Yang	ceb07eb8f4	nvme/tcp: Fix send_cb and recv pdu function contention when there is R2T. When using uring socket, we see following assert nvme_tcp.c:1018: nvme_tcp_capsule_resp_hdr_handle: Assertion `tcp_req->state == NVME_TCP_REQ_ACTIVE' failed. Detailed info is in https://ci.spdk.io/results/autotest-per-patch/builds/19205/archive/nvmf-tcp-vg-autotest/build.log We face this issue, because there is also code execution ordering between "sending callback function" and "pdu receving function". We did not find it in physical machine testing, but finding it in vagrant machine in CI. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I5eb241d564c0fc42ce0601b7c85999a2550f0de3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3046 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-06-29 09:18:13 +00:00
Ziye Yang	2ac8d3ff5e	nvme/tcp: Allocate send_pdu with DMA allocated memory. Purpose: It will be used to leverage the uring acceleration later when we use io_uring_prep_write_fixed. Because for using the Registered buffers feature in I/O uring, we currently can register all the huge memories. And if we allocate send_pdus in DMA memory, we can leverage such feature. Change-Id: Id0ba5f7fe43202027c0378e9cbe74d861aad21e5 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/3002 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-06-24 08:22:17 +00:00
Shuhei Matsumoto	465b2f8a6b	nvme/rdma: Inline nvme_rdma_req_put() nvme_rdma_req_complete() and nvme_rdma_req_put() are called in a row except a single case. Move clearing completion_flags and req of rdma_req from nvme_rdma_req_put() to nvme_rdma_req_complete(), and then inline nvme_rdma_req_put() because nvme_rdma_req_put() does only insert now. To do this, change the type of the second parameter of nvme_rdma_req_complete() from struct nvme_request to struct spdk_nvme_rdma_req. For the exceptional case that only nvme_rdma_req_put() is called, change nvme_rdma_req_init() to clear rdma_req->req if returned with error. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibf7e6d245f3a48fb895cd9e6d92596ef833f26d1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2876 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	a57aeac1fe	nvme/rdma: Dequeue request from outstanding list before calling completion Each request has a callback context as cb_arg, and the callback to nvme_complete_request() for the completed request may reuse the context to the new request. On the other hand, RDMA transport dequeues rdma_req from rqpair->outstanding_reqs after calling nvme_complete_request() for the request pointed by rdma_req. Hence while nvme_complete_request() is executed, rqpair->outstanding_reqs may have two requests which has the same callback context, the completed request and the new submitted request. The upcoming patch will search all requests whose cb_arg matches to abort them. In the above case, the search may find two requests by mistake. To avoid such error, move dequeueing rdma_req from rqpair->outstanding_reqs before calling nvme_request_complete(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia183733f4a4cd4f85de17514ef3a884693910a05 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2863 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	299fce881a	nvme/pcie: Dequeue request from outstanding list before calling completion Each request has a callback context as cb_arg, and the callback to nvme_complete_request() for the completed request may reuse the context to the new request. On the other hand, nvme_pcie_qpair_complete_tracker() dequeues tr from pqpair->outstanding_tr after calling nvme_complete_request() for the request pointed by tr. Hence while nvme_complete_request() is executed, pqpair->outstanding_tr may have two requests which has the same callback context, the completed request and the new submitted request. The upcoming patch will search all requests whose cb_arg matches to abort them. In the above case, the search may find two requests by mistake. To avoid such error, change nvme_pcie_qpair_complete_tracker() to dequeue tr from pqpair->outstanding_tr before calling nvme_complete_request(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ie9efc200d06d02d8ee2be3cb8e9fd64591bc210d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2861 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	aacac0b433	lib/nvme: Get ctrlr simply from req->qpair->ctrlr at spdk_nvme_ctrlr_cmd_abort() req->qpair and qpair->ctrlr are set at their initialization, and hence we don't have to store ctrlr to req->user_buffer but we can get ctrlr simply from req->qpair->ctrlr. Besides, save a local variable by overwriting qpair when it is NULL. This is also a small clean up. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1cca763fbaae55079b38f7f16fcf779360232797 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2890 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	43f3a8d99f	lib/nvme: Factor out submit abort request into a helper function To reuse the code for the upcoming new function spdk_nvme_ctrlr_cmd_abort_ext(), factor out the operation to submit an abort request into an new helper function _nvme_ctrlr_submit_abort_request(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id00bed8fb8dc15b77e3c7f19105c1623588faa53 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2494 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	5322f30787	lib/nvme: Fix aborting queued abort requests at controller reset or destruct ctrlr->outstanding_aborts is counted only for submitted abort requests. However ctrlr->outstanding_aborts had been decremented for queued abort requests by mistake. Subsequent patches will use parent-children for abort requests but nvme_free_request() is not aware of such relationship. Queued abort requests had not been canceled or aborted when controller was destructed. Retry submitting queued abort requests had been repeated recursively and had caused stack overflow. This patch fixes all. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8ce0ae51ddd5ed3e1e8ac86329c8bdb7a9236b2f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	5cbc1d5cae	lib/nvme: Factor out retry queued abort into a helper function To reuse the code for the upcoming new function spdk_nvme_ctrlr_cmd_abort_ext(), factor out the operation to retry queued abort into an new helper function nvme_ctrlr_retry_queued_abort(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I56bdfefe2c54990c9fd0463303b3d65ccf6e8942 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2493 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	9d4751a550	lib/nvme: Fix Abort Command Limit to process as 0's based value SPDK NVMe driver had processed ACL as 1's based value by mistake, and SPDK NVMe-oF target sets ACL to 0. Hence If NVMe driver connects to SPDK NVMe-oF transport, spdk_nvme_ctrlr_cmd_abort() always queued abort request. Fix this bug to process ACL as 0's based value in spdk_nvme_ctrlr_cmd_abort(). Besides, initialize ACL explicitly to 0 in spdk_nvmf_ctrlr_identify_ctrlr() for clarification. Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id4f3a469776cdab88bcc6f41e7893885a7b78d8c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2513 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	10c4193363	lib/nvme: Set the parent to failure when submission of any children failed When one of the children is failed to submit, if any children is already submitted, the function can return success to wait for those children to complete, but the parent should be set to failure. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I2ea53856ee58da991bceca0058d1e1f55d42af37 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2492 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Shuhei Matsumoto	b3d0c369a8	lib/nvme: Fix wrong completion status when retrying queued abort command next->cpl should have been set if nvme_ctrlr_submit_admin_request() failed in spdk_nvme_ctrlr_cmd_abort_cpl(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4114294e398b18148d8a9ad798a656a7686c5a85 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2491 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2020-06-24 08:19:43 +00:00
Ziye Yang	3a1f5364d2	nvme/tcp: Fix nvme_tcp_req free conflict between cmd sending and incoming pdu receiving This patch tries to solve the out of order call back handling for cmd sending and the incoming pdu handling. Normally, the cmd call back will be called before receving the next PDU from the target if the application uses the sync manner. With the uring implementation, after sending the cmd to the target, we may have the following scenerio: (1) Firstly receive the incoming pdu(e.g., CapsuleResp pdu, C2hdata pdu) due to the group polling read event. (2) Secondly execute the callback function related with NVMe command sending. This means that the data from the initiator is really sent out to the target, and the target receives, then sending back the data to the initiator. But the uring io_uring_cqe event is not handled, thus if we execute (1) first, it will clean the data structures related with nvme_tcp_req, and the nvme_tcp_req will be used for other purpose. Then causes wrong behaviour like the following: "Rand Write test failed at QD=128 because fio hangs with the following error: nvme_tcp.c: 971:nvme_tcp_capsule_resp_hdr_handle: ERROR: no tcp_req is found with cid=66 for tqpair=0x7f23d8001710". And this patch can address this issue. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I5043aaa8adf5033d93dedac15f633f0850e0b9f2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2818 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-22 07:47:31 +00:00
Alexey Marchuk	268aacb24a	rdma: Add new API spdk_rdma_qp_accept This API is a wrapper for rdma_accept which allows to remove spdk_rdma_qp_init_attr::initiator_side. Change-Id: Iba2be5e74e537c498fb11c939c922b2bbda95309 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2908 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-06-18 07:28:04 +00:00
Alexey Marchuk	5f6ef934df	mk/nvme/nvmf: Add deps for RDMA libs These dependencies were removed in patches that added RDMA provider. It was incorrect change since it causes SEGFAULT when SPDK is built with shared libraries Change-Id: I15f4ff86a75b3d080e1c7c89d75af4959c4ed989 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2900 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-06-18 07:28:04 +00:00
Ziye Yang	18450e8b82	nvme: solve the spdk_nvme_connect compatibilty issue. This is used to make spdk_nvme_connect can support the old library for compatibility. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I49d92fb473c3cbabd8e1240785b920480202eee9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1998 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-05 09:04:11 +00:00
Ziye Yang	05cd548cf1	nvme: Fix the field initialization of struct spdk_nvme_ctrlr_opts Purpose: Make the initilaization in order. Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I66962073a02b6a4c2fc79ac343cdf5310075dd63 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2766 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-05 09:04:11 +00:00
Tomasz Kulasek	550d4867a0	lib/nvme: update io msg producers on ns manage Change-Id: I20adb92ae4e13e775b5e70617c705afd32e16c9e Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/681 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-06-05 09:01:44 +00:00
Tomasz Kulasek	11aca20442	nvme/cuse: add spdk_nvme_cuse_update api spdk_nvme_cuse_update_namespaces() API allows application to repopulate namespaces on NVMe CUSE devices. Change-Id: I5375b13bcc91b17cd26b4dc6f02d83dc33826542 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/658 Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-06-05 09:01:44 +00:00
Seth Howell	1039254319	nvme/rdma: add cq resizing. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I6350d76b8c1e778c18e693b2dfbb10dd36b3e3d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1927 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-04 07:20:16 +00:00
Seth Howell	67b0dcfe29	nvme_rdma: add tracking for rdma objects in qpair. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I0b45aed21dc649888bb9d93c5937fb553f35eb27 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2568 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-04 07:20:16 +00:00
Seth Howell	8bef6f0bdf	lib/nvme: rdma poll group with shared cq. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Ifde29f633f09cccbebfdcde5ab2f96d9590449f1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1167 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-04 07:20:16 +00:00
Maciej Szwed	a2606d4b5d	all: Fix warnings found by gcc 9.3.0 and 9.3.1 Signed-off-by: Maciej Szwed <maciej.szwed@intel.com> Change-Id: Ie7c5b7b1c2d59a1f927eb5602f4ea5a003eff287 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2433 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-06-02 09:51:57 +00:00
Seth Howell	6f97efb784	test/make: add leaf type changes check to check_so_deps. This is an important aspect of ABI versioning. There was a change to the spdk_bdev_opts struct which is accessible from the public API that results in an ABI change. There was also a change in the spdk_accel_module_if struct that will affect new modules so its major version needs to be revved. There was also a change in the publically accessible rmda_hooks structure which forced an ABI change in the NVMe library. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I5cc6886fe01b4adc2836b6e15995471f0361dc29 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2663 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-06-01 13:07:30 +00:00
Seth Howell	1a9c19a954	lib/nvme: remove spdk prefix from internal headers. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Iccde5860b83217163428ff504cba87a1cf209720 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2444 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2020-06-01 13:07:30 +00:00
Seth Howell	6d18ea425b	lib/nvme: force qpair disconnect before aborting rdma requests. This is needed for shared completion queues which can still give us successful completions on aborted requests if the qpair hasn't been disconnected. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I85cf1a81ef563d8c02d684b09d2f7ad5008e38cb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1961 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-06-01 09:22:05 +00:00
Seth Howell	b4e060b560	lib/nvme: check that req is not null in RDMA. When a request has been aborted, it's possible to get a completion for an rdma request but the rdma_req->req object has already been cleared to NULL. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I5f7b1b96ff4be8c436aae9a7e2a7c9927d04e627 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1960 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-06-01 09:22:05 +00:00
Tomasz Kulasek	c62887ad0b	nvme/cuse: add dynamic nvme cuse devices update This patch adds a posibility to update cuse devices after cuse device registration. During NVMe controller life cycle there's a posibility to create and delete namespaces with Namespace Manage feature from NVMe spec 1.4. To guarantee consistency, after changing the number of namespaces NVMe CUSE devices must be repopulated after Namespace Manage operation. This patch introduces cuse_nvme_ctrl_update() entry point to allow repopulate NVMe CUSE devices. Change-Id: I478faae040027965321af585abfc34eaf35c908c Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/650 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2020-06-01 09:21:48 +00:00
Tomasz Kulasek	a7567709a6	lib/nvme: implement SPDK_NVME_DATA_HOST_TO_CONTROLLER in cuse Change-Id: I07bc1771eec284f90d887a0a806d315fbc989428 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1218 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-01 09:21:48 +00:00
Tomasz Kulasek	8da21f69d3	lib/nvme: rework cuse admin command For now only controller to host transmition is implemented for CUSE. This patch separates cuse_nvme_admin_cmd_send() entry point to be used to implement another transmition directions. Change-Id: Ic9013a30c16cf71957c8b411ee00a43c7aa8bbb6 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1674 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Darek Stojaczyk <dariusz.stojaczyk@intel.com>	2020-06-01 09:21:48 +00:00
Shuhei Matsumoto	f21f51bd81	lib/nvme: Remove inclusion of SPDK event library Remove inclusion of spdk/event.h and spdk_internal/event.h from SPDK NVMe library. Their dependency had been removed before. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ide3a0902b1cebb9c9033ade45d7488622e38696c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2688 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-06-01 09:20:41 +00:00
Changpeng Liu	3b6622046e	nvme/pcie: use 16 bits width access API for PCI_COMMAND register PCI_COMMAND register with offset 0x4 is 16 bits width. Change-Id: I86483e58a3ac680efaae6ff7edaadde8efc289f6 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2589 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-05-29 14:59:24 +00:00
Tomasz Kulasek	c84b3cccdd	lib/nvme: fix do not use cuse session on fail Change-Id: I9946829183c43286da4be6bda887a4d13d93e398 Signed-off-by: Tomasz Kulasek <tomaszx.kulasek@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2612 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-29 14:58:03 +00:00
Jim Harris	89e47f6014	nvme: create netlink socket during nvme_driver_init This helps ensure thread safety on creation of the netlink socket, when probe is called from multiple threads at once. It is also a lot more clean - we just create it once, rather than checking every time probe is called to see if it has to be created. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I528cedc3ff44de6ea8ecaf6d2389226502ba408e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2681 Community-CI: Broadcom CI Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-29 09:26:05 +00:00
Jim Harris	18f79f2449	nvme: add mutex to nvme_driver_init This will allow spdk_nvme_probe and variants to be called from multiple threads in parallel. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I534db605c9e192b943afe973981b7b503d8b7e34 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2680 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-29 09:26:05 +00:00
Seth Howell	684b3a49f0	lib/nvme: split request resubmission into function. This will need to be done separately for poll groups. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I0e432493bdb02e13fe5c73a8a09911cef573307b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1664 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2020-05-28 07:13:44 +00:00
Seth Howell	63732d8880	lib/nvme: split cq completion processing to its own function. This helps create a separation between processing a qpair and processing a completion queue which can be shared across multiple qpairs. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I111dd16ec4327854f232988a96891a65813f00e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1166 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2020-05-28 07:13:44 +00:00
zkhatami88	fe3fab26bf	nvme/rdma: Using hooks in reg mr Signed-off-by: zkhatami88 <z.khatami88@gmail.com> Change-Id: I9493fe82b5b758c0092d20ef18b79d652fefed85 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1905 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-28 07:11:39 +00:00
GangCao	34430edda7	nvme: add the ctrlr traddr in the error message In the case, there are several ctrlr used, it's better to have the traddr to indicate which ctrlr has the issue to shutdown. Change-Id: Ie564bb70566ba5822938efc99125d063f7b4ae4a Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2588 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-05-27 07:36:58 +00:00
Seth Howell	fadfef63d1	lib/nvme: provide mechanism for tracking request completions Add wrappers around the request and response values and track those using the wr_id value. This will come in handy when we start doing poll group based completion processing. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: Iaff75b03e41d49f53e55e0ce65d384567988fc9d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1165 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-05-21 09:21:27 +00:00
Seth Howell	a3f72b2e5a	lib: net, notify, nvme, rocksdb remove spdk_ prefix. remove only the spdk_ prefix from static functions in the above libraries. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I59ce032c3312fa73f30c133fd62e603c1eee2859 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2365 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2020-05-21 09:19:00 +00:00
Seth Howell	d18e63206a	mk/lib: add a check that major and minor version is set for libs. Also, while we are here, consolidate setting SO_SUFFIX to one spot. Previously, it was possible for a library to slip through without an SO version. Signed-off-by: Seth Howell <seth.howell@intel.com> Change-Id: I4db5fa5839502d266c6259892e5719b05134518c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2020-05-21 09:19:00 +00:00
Changpeng Liu	44f69a9e51	nvme: add a quirk for QEMU emulated NVMe SSD When starting the QEMU NVMe SSD with "cmb_size_mb=XX" parameter, the controller memory buffer feature is enabled in the Guest, the SPDK NVMe driver running in the Guest will allocate the submission queue in the controller memory buffer by default, it will use memset and SSE instruction when copying NVMe command to the submission queue entry or zero the whole submission queue, inside the memset implementation the AVX2 instuction will be used if the CPU can support such feature. However, due to the limitation in the QEMU, the maximum access width to the PCI BAR space is set to 8 Bytes, SPDK will report illegal instruction in Guest. Here we add a quirk for the QEMU emulated NVMe. Fix issue #1362. Change-Id: Ib5e2e4198d39ce7f8455175f7db283db0b01eebf Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2196 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-20 14:16:52 +00:00
Changpeng Liu	b2947f528f	nvme/opal: deprecate spdk_opal_supported() The spdk_opal_supported() is redundant with spdk_opal_dev_construct(), because we only return the spdk_opal_dev structure when the drive can support OPAL feature. Change-Id: Ieadf271a0c8530f2440cded05ad139483a8c5937 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/2383 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2020-05-20 14:16:44 +00:00
Alexey Marchuk	9b86f31a38	nvme/rdma: Handle failed send/recv as a fatal error Do not make attempt to resubmit failed send/recv WR, instead report and error to the upper layer (in case of new request) or fail a qpair (in case of active polling). In the case of failed ibv_post_send and disabled `delay_cmd_submit` nvme_rdma_qpair_submit_request returns an error to the caller. The caller completes failed request but RDMA layer still keeps it in a send queue. Later RDMA layer can send the corresponding WR and notify the upper layer about the completion of the request for the second time. Change-Id: I1260f215b8523d39157a5cc3fda39cd4bd87c8ec Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1662 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-20 12:03:50 +00:00
Alexey Marchuk	8c6a345534	nvme/rdma: Use RDMA provider API to send WRs Change-Id: I3dc87751d250da84d988b1c7a9c57112b5bd10b0 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/1661 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Seth Howell <seth.howell@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2020-05-20 12:03:50 +00:00

... 3 4 5 6 7 ...

1550 Commits