ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Shuhei Matsumoto	5485f55dc1	ut/bdev_nvme: Separate disconnected and connected qpair in poll_group More precise stubs for spdk_nvme_poll_group are critically important to verify upcoming changes. Add a flag is_failed to struct spdk_nvme_qpair separately from is_connected. This is used to inject error to a connection. Replace a single list qpairs by two lists, connected_qpairs and disconnected_qpairs for struct spdk_nvme_poll_group. Then utilize these to manage qpair in poll group. spdk_nvme_ctrlr_reconnect_io_qpair() is not used in the NVMe bdev module now. Remove the corresponding stub. Adjust polling count accordingly. Change-Id: I4d867c56ae518276813f6f96d23a5f6933364fd4 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10816 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	728e3721a4	nvme_rdma: Remove a guard for recursive calls from poll_group_disconnect_qpair() nvme_poll_group_disconnect_qpair() is called only by a single place now. We do not need the flag poll_group_disconnect_in_progress any more. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I8f9c0f14baa8fcb9b0637635a5bb3d34a8b11af5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	7ae79a38a5	nvme: Limit spdk_nvme_poll_group_remove() to use only for disconnected qpairs Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3c06c41664ee757423641474141439f9c32fc0b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10671 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	e021cc0147	nvme: Swap ctrlr_disconnect_qpair() and poll_group_remove() in nvme_ctrlr_free_io_qpair() nvme_ctrlr_disconnect_qpair() calls nvme_poll_group_disconnect_qpair() if the qpair uses a poll group, and nvme_poll_group_disconnect_qpair() calls nvme_ctrlr_disconnect_qpair() if the state of the qpair is not DISCONNECTING. This relationship made the code very complex. A few patches starting from this patch simplifies disconnect and free qpair operations. This patch swaps the ordering of nvme_ctrlr_disconnect_qpair() and spdk_nvme_poll_group_remove() in spdk_nvme_ctrlr_free_io_qpair(). This ensures the qpair is disconnected when spdk_nvme_ctrlr_free_io_qpair() calls spdk_nvme_poll_group_remove(). This enables us to limit spdk_nvme_poll_group_remove() to be available only for disconnected qpairs. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0601a74f953a2efc4f177a51a4450baea33533d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10670 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	80e81273e2	bdev/nvme: Do not use ctrlr for I/O submission if reconnect failed repeatedly If ctrlr_loss_timeout_sec is set to -1, reconnect is tried repeatedly indefinitely, and I/Os continue to be queued. This patch adds another option fast_io_fail_timeout_sec, a flag fast_io_fail_timedout to nvme_ctrlr. If the time fast_io_fail_timeout_sec passed after starting reset, set fast_io_fail_timedout to true not to use the path for I/O submission. fast_io_fail_timeout_sec is initialized to zero as same as ctrlr_loss_timeout_sec and reconnect_delay_sec. The name of the parameter follows the famous DM-multipath, its fast_io_fail_tmo. Change-Id: Ib870cf8e2fd29300c47f1df69617776f4e67bd8c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10301 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	ae4e54fdc3	bdev/nvme: Retry reconnecting ctrlr after seconds if reset failed Previously reconnect retry was not controlled and was repeated indefinitely. This patch adds two options, ctrlr_loss_timeout_sec and reconnect_delay_sec, to nvme_ctrlr and add reset_start_tsc, reconnect_is_delayed, and reconnect_delay_timer to nvme_ctrlr to control reconnect retry. Both of ctrlr_loss_timeout_sec and reconnect_delay_sec are initialized to zero. This means reconnect is not throttled as we did before this patch. A few more changes are added. Change nvme_io_path_is_failed() to return false if reset is throttled even if nvme_ctrlr is reseting or is to be reconnected. spdk_nvme_ctrlr_reconnect_poll_async() may continue returning -EAGAIN infinitely. To check out such exceptional case, use ctrlr_loss_timeout_sec. Not only ctrlr reset but also non-multipath ctrlr failover is controlled. So we need to include path failover into ctrlr reconnect. When the active path is removed and switched to one of the alternative paths, if ctrlr reconnect is scheduled, connecting to the alternative path is left to the scheduled reconnect. If reset or reconnect ctrlr is failed and the retry is scheduled, switch the active path to one of alternative paths. Restore unit test cases removed in the previous patches. Change-Id: Idec636c4eced39eb47ff4ef6fde72d6fd9fe4f85 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10128 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	962c4c3800	bdev/nvme: Fix a degradation that I/O gets queued infinitely We noticed the difference between the SPDK 21.10 and the latest master in a test. The simplified scenario is as follows: 1. Start SPDK NVMe-oF target 2. Run bdevperf for the target with -f parameter to suppress exit on failure. 3. Kill the target after I/O started. With the SPDK 21.10, bdevperf retries failed I/Os and exits after the test time is over. With the latest SPDK master, bdevperf hungs and does not exit even after the test time is over. The cause was as follows: reset ctrlr is repeated very quickly (once per 10ms by default) and hence I/Os were queued infinitely because nvme_io_path_is_failed() returned false if nvme_ctrlr is resetting. We should queue I/O when nvme_ctrlr is resetting only if reset is throttoled and fail-fast for the repeated failures is supported. Hence in this patch, fix the degradation and remove the related unit test cases. Reported-by: Evgeniy Kochetov <evgeniik@nvidia.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4047d42dc44488a05264c6a841d101a7c371358b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11062 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 14:25:15 +00:00
Ahriben Gonzalez	0345729e00	nvme: Add metadata support to io commands Adding metadata support for io commands. Currently metadata is ignored even if present in the cmd struct. Making metadata adress readable/writable depending on data transfer bits. Adding extra unit test to make sure metadata fields are populated. Signed-off-by: Ahriben Gonzalez <ahribeng@gmail.com> Change-Id: I1d01974a6b2831c82b43e94073065d235eea429a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10854 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-14 11:10:13 +00:00
Ben Walker	517b557226	nvme: Do not track a separate active namespace list We only populate active namespaces into the main namespace tree, so we don't need a separate list of active namespaces too. Change-Id: Iaf194f806cc1d9672f5567cff3dffafff3165069 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10034 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:10 +00:00
Ben Walker	e7602c158f	nvme: Hold namespaces in an RB_TREE Since this is now sparsely populated, a tree is a better choice. Change-Id: Ie66d913fa1d298de56a7d22ef55f0adf7f8803b8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10031 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-14 08:35:10 +00:00
Ben Walker	b4dace738e	nvme: Do not allocate inactive namespace objects Some subsystems report a very large maximum value for the number of namespaces, but in essentially every case the subsystem is sparsely populated with active namespaces. To save memory, don't allocate objects for the inactive ones. Change-Id: I4cbeb5a7a898d3c685f4a3a9ec4c2ce45efffb92 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9898 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:10 +00:00
Ben Walker	1cfae16563	accel: Use vectored crc32 operations instead of chaining Chaining may be faster, but this is really an implementation detail of the idxd driver. Push the decision on how to implement a vectored crc down into the individual drivers and eliminate it from the generic framework. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Iedbdc5a6dbd3f7d1674d0a83f6827588f4b6b2fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-01-12 08:20:39 +00:00
Konrad Sztyber	6631c2a8aa	nvmf/tcp: initialize zcopy phase in nvmf_tcp_req_get Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia74148fb36733deaf7b2f833ac0247859311a805 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10794 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	a50a70ecdf	nvmf: abort outstanding zcopy reqs in qpair disconnect Zero-copy requests are kept on the outstanding queue for the whole duration of the request - from the initial zcopy_start submission to the completion of zcopy_end. This means, that there's a period in which a request doesn't wait for a completion from the bdev layer, but is still on the oustanding queue (after zcopy_start callback, before zcopy_end submit). If a qpair gets disconnected while a request is in this state, we need to manually force its completion, as otherwise it might hang indefinitely (e.g. waiting for host data). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I53731b8e363b725efa564ca3c7d89b46f5fb2a24 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10793 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	974a32b72e	nvmf: resume queued zcopy requests The zero-copy requests can also be queued when a subsystem is paused, so we need to properly resume and submit them by using zcopy_start. Since only requests that haven't received the zero-copy buffer (i.e. before zcopy_start was called) can be queued, we don't need to bother with checking zcopy_phase. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ie629688f6961eb2ae05741df496720b91be4d80d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10792 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Shuhei Matsumoto	521a9bb22c	bdev/nvme: Fix race between failover and add secondary trid We sort secondary trids to avoid using disconnected trids for failover. However the sort had a bug. This bug was found by running test/nvmf/host/multipath.sh in a loop. Verify the fix by adding unit test. Fixes #2300 Signed-off-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Change-Id: I22b0ede4d2ef98b786c3e0d1f5337a2d568ba56d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10921 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-10 22:18:46 +00:00
Jim Harris	b68f2eeb0b	bdev_nvme: add bdev_nvme_start_discovery RPC This patch adds the framework for a discovery service in the bdev/nvme module. Users can specify an IP/port of a discovery service. The bdev/nvme module will connect to a discovery controller, get the discovery log page, and then register for AERs. It will connect to each subsystem specified in the initial log page. AER completions will trigger fetching the log page again, at which point new subsystems will be connected to, or removed subsystems will be detached. This patch does the following: * Adds the new start_discovery RPC * Connects to the discovery controller * Gets the discovery log page * Registers for AERs * Detach from discovery controllers at shutdown Subsequent patches in this series will: * Connect to subsystems listed in discovery log page * Detach from subsystems that were listed in earlier discovery log pages but subsequently removed * Add a stop_discovery RPC Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54bfa896a48c5619676f156b5ea9f2d1f886c72f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10694 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-10 15:23:39 +00:00
Konrad Sztyber	7a374fbc0b	nvmf: make zcopy_end void Since spdk_bdev_zcopy_end() cannot really fail (it only fails if we pass a bad bdev_io), we can simplify the nvmf zcopy_end functions by making them void and always expect asynchronous completion. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6e88ac28aba13acadea88489ac0dd20d1f52f999 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10790 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	92d7df1f47	nvmf: use spdk_nvmf_request_exec to submit zcopy_start Since this path now supports sending zero-copy, use it for zcopy_start. Additionally, it makes it possible make zcopy_start void, as it reports all errors asynchronously via request_complete(), and remove some of the duplicated error checks. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I41f43ce1651432d9a7d74e3680d4a3f780128a1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10789 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	686b9984b3	nvmf: return async/complete status in bdev zcopy operations Additionally, the NVMe completion status is now updated and the IOs are queued if the bdev layer doesn't have enough IO descriptors. It makes the zcopy operations behave similarly to the other IO operations. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I455ae781e32aa6e60d144d2c91f109bd8be46664 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10787 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	0e09df57dd	nvmf: rename zcopy operations to zcopy_(start\|end) It makes their names consistent with the bdev API. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I314051f0980b46959d6560aa25885f13b4c28f2a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10786 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	f65099d378	nvmf: remove zcopy check in spdk_nvmf_request_exec It will make it possible to submit zero-copy requests through spdk_nvmf_request_exec(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ibc14fe77cd477b11ed55d1350a7486caaad81add Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10783 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	7d23ac8657	nvmf: remove zcopy phase checks from IO functions The code should never reach these functions for requests using zero-copy. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If9f30e05a43b340a982604d5b985242d63ce252b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10782 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	aa1d039836	nvmf: zero-copy enable flag in transport opts It makes it possible for the user to specify whether a transport should try to use zero-copy to execute requests when possible. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I40a92b0d7a6707f4c9292795f380846acb227200 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10780 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Changpeng Liu	2a6c2c289c	nvmf: support static CNTLID SPDK NVMf subsystem supports dynamic controller model, for transports other fabrics, users should use static controller model. Change-Id: I364ea61a71b04d51932fd9e0e16f401a383ff67c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10149 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 01:20:32 +00:00
Alexey Marchuk	3c4a68cafc	nvme: Do not create IO qpair during ctrlr initialization If nvme ctrlr is resetting or initializing, free_io_qids bitmap is already freed or not created yet. In that case an attempt to create IO qpair leads to segmentation fault. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I6a97bf81d5a568db20d23b3f88cf01e994ba42e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10827 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com>	2021-12-27 08:43:03 +00:00
Alexey Marchuk	eb09178a59	nvme/rdma: Correct qpair disconnect process In current implementation RDMA qpair is destroyed right after disconnect. That is not graceful qpair shutdown process since there can be requests submitted to HW and we may receive completions for already destroyed/freed qpair. To avoid this, only disconnect qpair in ctrlr_disconnect_qpair transport callback, all other resources will be released in ctrlr_delete_io_qpair cb. This patch is useful when nvme poll groups are used since in that case we use shared CQ, if the disconnected qpair has WRs submitted to HW then qpair's destruction will be deferred to poll group. When nvme poll groups are not used, this patch doesn't change anything, in that case destruction flow is still ungraceful. However since CQ is destroyed immediately after qpair, we shouldn't receive any requests which point to released resources. A correct solution for non-poll group case requires async diconnect API which may lead to significant rework. There is a bug when Soft Roce is used - we may receive a completion with "normal" status when qpair is already disconnected and all nvme requests are aborted. Added a workaround for it. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0680d9ef9aaa8737d7a6d1454cd70a384bb8efac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10327 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-23 08:44:40 +00:00
GangCao	10f32b9f19	lib/blob: do not assume realloc(NULL, 0) returns a not-NULL value There is situation that num_extent_pages is zero and original pointer is also NULL, the realloc() could return a Not NULL pointer. Related UT has been added and updated. 1) In the default allocation (num_clusters == 0), the extent_pages is not allocated as expected. 2) In the thin provisioning allocation (num_clusters != 0), the extent_pages will be allocated if extent_table is used. More related information as below: The crux of the problem is that according to POSIX: realloc: "If ptr is NULL, then the call is equivalent to malloc(size)" malloc: "If size is 0, then malloc returns either NULL or a unique pointer value that can later be successfully passed to free" blobstore was relying on realloc(NULL, 0) always return a unique pointer value, and not NULL. This is not portable behavior. Change-Id: Ibc28d9696f15a3c0e2aa6bb2371dc23576c28954 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10470 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-20 18:14:06 +00:00
Ben Walker	fca4262987	nvme: Remove nvme_ns_update In the one place this was called, we can call nvme_ns_construct instead. There's no harm in re-fetching the identify pages. Change-Id: I91292ff9650bdc7edd5588a05837b671dcac1922 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10102 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-20 08:49:41 +00:00
Peng Lian	4c1757ffb9	nvmf: update discovery log when removing hostnqn In NVMF Revision spec 1.1a, discovery log should be updated when removing hostnqn of subsystem. Update unit test to check the discovery log when removing hostnqn and destroying subsystem. Signed-off-by: Peng Lian <peng.lian@smartx.com> Change-Id: I51c597a2493295a677a7aa68e4f13a887f7e1140 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10668 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-16 08:52:20 +00:00
Anil Veerabhadrappa	68f0c6160a	ut/fc : fix fc_ls_ut compilation failure This regression was introduced when 'accept' was removed from spdk_nvmf_transport_ops structure. Signed-off-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Change-Id: I5d880791db258a97a1861dbd841e97a7c068ce12 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10676 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-16 08:43:39 +00:00
Changpeng Liu	723adbaf32	UT/vfio-user: fix clang-12 compilation error Add missed STUBs. Change-Id: I20989bf4ea66720d62f8ecc9668bb8f74e459666 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10638 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-12-15 04:32:05 +00:00
Jacek Kalwas	43022da379	nvmf: remove accept poller from generic layer Not every transport requires accept poller - transport specific layer can have its own policy and way of handling new connection. APIs to notify generic layer are already in place - spdk_nvmf_poll_group_add - spdk_nvmf_tgt_new_qpair Having accept poller removed should simplify interrupt mode impl in transport specific layer. Fixes issue #1876 Change-Id: Ia6cac0c2da67a298e88956734c50fb6e6b7521f1 Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7268 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-14 13:18:33 +00:00
Jim Harris	59f3cdacb1	nvmf: don't always update discovery log when adding hosts If a subsystem has no listeners, then there is no need to update the discovery log when adding a host, or setting a subsystem to allow all hosts. This eliminates some unnecessary discovery log update notifications, especially when setting 'allow any hosts' on a subsystem immediately after it is created (and before it has any listeners). Update unit test to check the adding a host to a subsystem without listeners does not rev the genctr. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I63dab5df564269e574bb925890088f52063aa378 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10546 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-10 17:32:18 +00:00
Jim Harris	3867f83dea	test/nvmf: add local var for hostnqn string Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia967512bfcc5d7b1df15b6f6b5c132f21d601dce Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10563 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-10 17:32:18 +00:00
Jim Harris	9ac2cf7ff0	nvmf: don't update discovery log on subsystem create/delete The discovery log isn't updated when a subsystem is created or deleted, it's only updated when a listener for a subsystem is added or removed. So remove the nvmf_update_discovery_log() in the subsystem create and delete paths. They just generate extra AER completions that potentially cause the host to do unneeded work. Note that if a subsystem is deleted with active listeners, the subsystem delete path will remove each of the listeners before deleting the subsystem itself. So the discovery log will still get updated when those listeners are removed. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id01bbfa3b24d3e1279a614a2fd60be41387a03b1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10545 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-10 17:32:18 +00:00
paul luse	fbb24d0ebe	lib/accel: remove batching from the framework and plug-in modules Batching will be made available for DSA specifically through the new idxd_perf tool. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ic51d9ad3692074805b1ffa705cea8be35737c778 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9846 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-08 16:35:40 +00:00
Shuhei Matsumoto	215518069a	bdev/nvme: nvme_ctrlr_create() gets prchk_flags from nvme_async_probe_ctx Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id3deca8e0aba23299347a6aee6f0f44ee683556e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	696ad465d7	bdev/nvme: Remove the failover_in_progress flag from struct nvme_ctrlr The failover_in_progress flag is used to decide the return value of bdev_nvme_failover(). bdev_nvme_delete() calls bdev_nvme_failover() with remove=true to remove nvme_ctrlr->active_path_id. However bdev_nvme_failover() returns zero if nvme_ctrlr->failover_in_progress is true. bdev_nvme_failover() may return zero even if it does not remove nvme_ctrlr->active_path_id. The following will be better. bdev_nvme_failover() returns -EBUSY if nvme_ctrlr->resetting is true, and the caller repeats calling bdev_nvme_failover() until the target trid becomes alternative path or bdev_nvme_failover() returns zero. To do that, the failover_in_progress flag is not necessary any more. Removing the failover_in_progress will also simplify the following patches to unify ctrlr reset and failover. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I57ab944beb1d06ea4def144c81c69705860de35f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10441 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	7cc66c0ab1	bdev/nvme: Check if ns can be shared when configuring multipath We had not checked the bit 0 of the Namespace Multipath I/O and Namespace Sharing Capabilities (NMIC) field in the Identify Namespace data structure. If the bit 0 of the NMIC is zero, it is likely that namespaces are not identical. We should check if the value of the NMIC first, and do it in this patch. Additionally, it is not usual if the bit 0 of the CMIC and the bit 0 of the NMIC do not match. So in unit tests rename the parameter multi_ctrlr by multipath for ut_attach_ctrlr() and use it for the value of the NMIC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6aa7cbcc99be2507dbf18930f7b585a9ea7d0f90 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10380 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	8afa746b4d	bdev/nvme: Use new APIs in a reset ctrlr sequence Replace the spdk_nvme_ctrlr_reset_async() and spdk_nvme_reset_poll_async() calls by the spdk_nvme_ctrlr_disconnect(), spdk_nvme_ctrlr_reconnect_async(), and spdk_nvme_ctrlr_reconnect_poll_async() calls in a reset ctrlr sequence. spdk_nvme_ctrlr_disconnect() can fail if ctrlr is already resetting or removed. But both cases are not possible. reset is controlled and the callback to the hot remove is called when the ctrlr is hot removed. So we assume spdk_nvme_ctrlr_disconnect() always succeed. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1299e198597b2a2110f80b9a868e2dae015682ee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10092 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-08 08:31:24 +00:00
Changpeng Liu	632c8d5613	nvme: make get INTEL log pages can be executed asynchronously Also we don't treat exceptions when getting INTEL log pages as a fatal error, the initialization will still contine. Change-Id: Ic2fd2be510fde2679c1546482934d0a180266936 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10341 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-06 23:17:07 +00:00
Evgeniy Kochetov	1fd2af0150	nvmf/ctrlr_bdev: Set DNR bit in status for failed NVMe passthru When NVMe passthru command (IO or admin) fails on submission (e.g. it is not supported), set DNR bit in completion status field. There is no sense in retrying the command in this case. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I55960c128bd9fc31f6defef0b9832259a71684b1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8578 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-12-03 08:13:52 +00:00
Evgeniy Kochetov	d03b31c61f	nvmf/ctrlr_bdev: Fix status code for failed admin passthru command If NVMe admin passthru command is not supported by underlying bdev, set status code in NVMe completion to INVALID_OPCODE. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I29c4e1f8263b76b27c199cfd2d9b2474432ec70b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10517 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-12-03 08:13:52 +00:00
Evgeniy Kochetov	a9593c7981	bdev: Fail nvme passthru command if not supported by bdev The originally detected problem is that SPDK NVMf target fails command with invalid opcode with status code INTERNAL_DEVICE_ERROR instead of INVALID_OPCODE. All unknown commands on IO queue are passed to underlying block device layer as NVME_IO type. It is not checked if this type of commands is supported and, when command fails, INTERNAL_DEVICE_ERROR is set as status code. If command fails on submission, status code is set to INVALID_OPCODE which is more relevant. This patch adds check if command type is supported to bdev_nvme_*_passthru functions. If not supported, it is failed with ENOTSUP. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I4d7f7639da17dd3b1dc3eee7eb1b4a4f876117a2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8567 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-12-03 08:13:52 +00:00
Josh Soref	c9c7c281f8	spelling: test Part of #2256 * achieve * additionally * against * aliases * already * another * arguments * between * capabilities * comparison * compatibility * configuration * continuing * controlq * cpumask * default * depends * dereferenced * discussed * dissect * driver * environment * everything * excluded * existing * expectation * failed * fails * following * functions * hugepages * identifiers * implicitly * in_capsule * increment * initialization * initiator * integrity * iteration * latencies * libraries * management * namespace * negotiated * negotiation * nonexistent * number * occur * occurred * occurring * offsetting * operations * outstanding * overwhelmed * parameter * parameters * partition * preempts * provisioned * responded * segment * skipped * struct * subsystem * success * successfully * sufficiently * this * threshold * transfer * transferred * unchanged * unexpected * unregistered * useless * utility * value * variable * workload Change-Id: I21ca7dab4ef575b5767e50aaeabc34314ab13396 Signed-off-by: Josh Soref <jsoref@gmail.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10409 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-03 08:13:22 +00:00
Jim Harris	7e68d0baca	nvme: configure AER for discovery controllers Move the CONFIGURE_AER state before SET_KEEP_ALIVE to make sure that we run the CONFIGURE_AER state for discovery controllers. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia4e24f6507c43e3fece06b9161ff8e0b8fa0e97d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10332 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-02 04:02:29 +00:00
Shuhei Matsumoto	f9fba507fe	bdev/nvme: Redirect the reset ctrlr operation into nvme_ctrlr->thread In the following patches, we want to retry reconnect if reconnect failed in a reset ctrlr sequence but we want to delay the retry. While we wait the delayed retry, we want to quiesce ctrlr completely. As part of quiesce ctrlr operations, we want to pause adminq poller but we need to do it on the nvme_ctrlr->thread. If a reset ctrlr sequence runs on the nvme_ctrlr->thread, we can avoid redirecting the pending destruct request at completion too. So we redirect the reset ctrlr sequence into the nvme_ctrlr->thread. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I538b962e2a7b5cf00ebbac2a1e888482ddeeee61 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10075 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-01 09:20:09 +00:00
Jim Harris	1c083e6200	nvme: set keep alive for discovery controllers Discovery services using the SPDK nvme driver may use long-lasting connections that detect AER completions to determine when there are changes in the discovery log. This means that we still need to send keep alives on discovery controller admin queues. So move the SET_KEEP_ALIVE_TIMEOUT state immediately after IDENTIFY, and run the SET_KEEP_ALIVE_TIMEOUT state even for discovery controllers. Note, we need the IDENTIFY's KAS value to properly set the keep alive timeout, so we have to keep the IDENTIFY state before SET_KEEP_ALIVE_TIMEOUT. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5c6403c28fb72d42629c5f9009a89c4bfd44d162 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10329 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-24 08:34:58 +00:00
Shuhei Matsumoto	50b10bc20e	bdev/nvme: bdev_nvme_reset_io() redirect to the orig_thread at completion In the following patches, bdev_nvme_reset() will execute the reset ctrlr operation on the nvme_ctrlr->thread until completion as bdev_nvme_admin_passthru() does. Hence change the callback bdev_nvme_reset_io_continue() to redirect to the orig_thread by using bio. Furthermore, use bio->cpl.cdw0 to store the completion status of the reset processing. bdev_nvme_reset() does not use bio->cpl. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I361cc44494190ba83ad6e360788d78851416c46c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10074 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	b4447abf70	bdev/nvme: Retry failed admin passthru up to retry_count times This patch supports admin passthrough retry when we get any error with DNR=0 but ABORTED_BY_REQUEST up to retry_count times. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1bf29570791fdbe8651fa70c4c8685bb740fb86b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9944 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	a9a86a14c1	bdev/nvme: Retry admin passthru immediately if it got ctrlr path error This patch supports admin passthrough retry when we get ctrlr path error at completion. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ice0045b84054ec66a9db9ef23e21786d2c082b1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9943 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	35a2f4e22e	bdev/nvme: Retry admin passthru a second later if any ctrlr may become available When resetting ctrlr, adminq is disconnected first. If adminq is disconnected, admin passthrough request is rejected with -ENXIO. But resetting ctrlr may succeed. If resetting ctrlr succeeds, adminq is connected again, and admin passthrough request will be submitted successfully. On the other hand, if ctrlr is failed, admin passthrough request is rejected with -ENXIO. But when resetting ctrlr, ctrlr is set to unfailed. Hence bdev_nvme_admin_passthru() skips any ctrlr which is resetting or failed, and calls bdev_nvme_admin_passthru_complete() with -ENXIO if no available ctrlr is found. bdev_nvme_admin_passthru_complete() queues admin passthrough request and retry it one second later if ctrlr is resetting. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic748dc4faf29ebf717ae5c29dcf7c55fe2ea9243 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9942 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Changpeng Liu	0af4a7cd84	nvme: abort outstanding requests case by case For DSM command, the NVMe drive may take a long time to finish it, if we set a small timeout value for DSM command, the bdev/nvme module will try to reset the IO queue pair when timeout happens, in `spdk_nvme_ctrlr_free_io_qpair`, we will abort the outstanding IO requests first, then in the `nvme_pcie_ctrlr_delete_io_qpair`, we will poll the CQ for any requests that have been completed by the NVMe controller, if there are NVMe completions in the CQ, we will finish them again, thus double completions happened. Here we rename `nvme_qpair_abort_reqs` to `nvme_qpair_abort_all_queued_reqs`, so the common layer will just abort queued request, and let each transport to abort outstanding requests case by case. Fix #2233. Change-Id: Icae6214239160c615418cb514fc51cfe77b59211 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10233 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-22 08:35:35 +00:00
Jim Harris	d810a7458d	idxd: change NOTICELOGs to DEBUGLOGs The NOTICELOGs really clutter the output during application start - it's better to make these DEBUGLOGs instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3ae37d5d057d7b972017befbc0834de414b9710b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-17 10:58:17 +00:00
Shuhei Matsumoto	7b8e7212a6	bdev/nvme: Abort the queued I/O for retry The NVMe bdev module queues retried I/Os itself now. bdev_nvme_abort() needs to check and abort the target I/O if it is queued for retry. This change will cover admin passthrough requests too because they will be queued on the same thread as their callers and the public API spdk_bdev_reset() requires to be submitted on the same thread as the target I/O or admin passthrough requests. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: If37e8188bd3875805cef436437439220698124b9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9913 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-17 10:58:12 +00:00
Shuhei Matsumoto	72e4a4d46a	bdev/nvme: Each nvme_bdev_channel caches its current io_path Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3ec3a588ff741cf04383e89f5a701e33bf1987a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9894 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-17 10:58:12 +00:00
Shuhei Matsumoto	ae7019417e	iscsi: Merge immediate data into the following R2T data The recent changes merged multiple Data-OUT PDUs within the same sequence into a single subtask up to 64KB. However, they were not enough. For a large write operation, the hardware iSCSI HBA host sent an immediate data whose size was not block size multiples and then more solicit data through R2T exchanges. One example for a 64KB write operation was as follows: host sent SCSI Write with 5792 bytes and F = 1 target replied a R2T host sent Data-OUT with 15880 bytes host sent Data-OUT with 11536 bytes host sent Data-OUT with 2848 bytes host sent Data-OUT with 11536 bytes host sent Data-OUT with 5744 bytes host sent Data-OUT with 12200 bytes and F = 1 The hardware iSCSI HBA host can decide the size of the unsolicited data but the SPDK iSCSI target can require the host to send the solicited data whose size is block size multiples. Hence we merge immediate data to the following R2T data if the immediate data is not more than 64KB and more R2T data come. Add another test case to check if the fix works for the above example. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4906b4e1a8b61e08862f4ccc27a6caf165126530 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9708 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-16 09:08:27 +00:00
Alexey Marchuk	f72cab94dd	lib/vhost: Fix compilation with dpdk 21.11 Structure vhost_device_ops was renamed to rte_vhost_device_ops Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie9601099d47465536500aa37fc113aeae03a8254 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10223 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-16 09:06:54 +00:00
Ben Walker	84688fdb1c	nvme: Rename max_active_ns_idx to active_ns_count This was sometimes used as the maximum array index and sometimes as the maximum count. Make it consistent everywhere and give it a better name. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I518efd99a7d36584624490b0b3497bb6e81ce9ac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10101 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-15 11:59:59 +00:00
Kai Li	8f633fa1c3	bdev/nvme: display all ctrlrs for this bdev when dump bdev nvme controller After multipath feature is supported, one bdev will have more than one nvme ctrlr. Fore ease of view, display each ctrlr's trid info. Moreover, rename nvme_bdev_ctrlr_get as nvme_bdev_ctrlr_get_by_name here to keep consistent with nvme_ctrlr_get_by_name. Signed-off-by: Kai Li <lik271@chinatelecom.cn> Change-Id: I417506699bbea6ed13dac0fee942749757d2ae47 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10129 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-11 23:24:26 +00:00
Niklas Cassel	b7ad5b0b90	bdev/zone: add support for get zone id In the bdev-zone API, there are a few functions that takes a zone_id: spdk_bdev_get_zone_info(), spdk_bdev_zone_management(), and the spdk_bdev_zone_append() functions. The way a zoned application is usually written is that it starts off by getting the zone report for all zones (zone_id will be sent in as 0), and then the application will keep the whole zone report in memory. Therefore, an application usually have access to the zone_id/zslba for all zones. However, there are cases, e.g. when getting an error on write, where the completion callback will only have the lba of the write that failed. Add a helper function that can be used to get the zone_id/slba for a given lba. Having this helper in bdev-zone will avoid SPDK applications needing to provide their own implementation for this. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I978335f87f7d49bc33aed81afcaa6d9f0af8a1e4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10180 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-11-11 23:23:35 +00:00
Shuhei Matsumoto	eb739d0364	iscsi: Fix the case that incoming data is split between data segment and data digest When data segment size is 64KB and data digest is enabled, if data segment and data digest are split into different two packets, - pdu->mobj[0] became full first when reading data semgment, - pdu->mobj[1] was allocated but unused and data digest was read. In this case, two SCSI write tasks were submitted by mistake and the second SCSI write task had no data. Fix the bug in this patch. When iscsi_pdu_payload_read() is called and pdu->mobj[0] is full, allocate pdu->mobj[1] only if any of data segment remains to read. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9a0c36c05f90092c3c2122a7eb91e10976830b40 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9965 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-11-11 23:22:57 +00:00
Ben Walker	2dbdb9945c	test/nvme: Only test non-contiguous namespaces for NVMe 1.2 or higher This wasn't supported before NVMe 1.2 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Ibf19cd77e522eb11c2091a9f4956f5616876986b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-10 19:36:27 +00:00
Ben Walker	52e432dff2	test/nvme: Fix buffer zeroing math This meant to zero the entire active namespace list. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I2da2293b53acd57d3480cf93b052eb1520de35d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10028 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-10 19:36:27 +00:00
Jim Harris	ec2ad00c92	test/unit/raid: fix set-but-not-used error verify_io() keeps track of a buf pointer, but the buf pointer never actually gets used. So remove this buf pointer. Found by clang-13. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I79dfeac7f004b56f7d4404f41b2ff18b96968a20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10056 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-03 18:30:55 +00:00
Shuhei Matsumoto	84ac18e545	bdev/nvme: Update ANA state if I/O failed by ANA error If I/O got ANA error, ANA state may be out of date. So in this case read ANA log page and update ANA states. Mark nvme_ns to be updating to avoid using while updating ANA state. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia43d38b3a589c84d6d0479dedcced033e76fb194 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9458 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-27 11:53:31 +00:00
Shuhei Matsumoto	f3fec96c20	bdev/nvme: Protect ANA log page from concurrent reads by using an new flag If an I/O failed by ANA error, the corresponding ANA state might be out of date. In the following patches, for this case, read the latest ANA log page and update the ANA state. Such reading ANA log page may be done on multiple threads concurrently including AER ANA change. Hence protect ANA log page by adding an new flag ana_log_page_updating to struct nvme_ctrlr and using it. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8bb84091d50a5fdc0d9893b585be972dfd31c0f1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9526 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-27 11:53:31 +00:00
Shuhei Matsumoto	43adb646b8	bdev/nvme: Retry failed I/O up to retry_count times Add bdev_retry_count to spdk_bdev_nvme_opts and retry_count to nvme_bdev_io, respectively. Set type of both to int because we want use -1 for infinite retry. Set the default value of bdev_retry_count to zero for the backward compatibility. bdev_retry_count is configurable by the RPC bdev_nvme_set_options. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9bc746fcea54aa8722c76f79c70c2ae2b375aa53 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9864 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-27 11:53:31 +00:00
Alexey Marchuk	3d8904c66b	nvmf: Add discovery filtering rules SPDK nvmf target reports all listeners on all subsystems in discovery pages, kernel target reports only subsystems listening on a port where discovery command is received. NVMEoF specification allows to specify any addresses/ transport types. Ch 5: The set of Discovery Log entries should include all applicable addresses on the same fabric as the Discovery Service and may include addresses on other fabrics. To align SPDK and kernel targets behaviour, add filtering rules to allow flexible configuration of what should be listed in discovery log page entries. Fixes #2082 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie981edebb29206793d3310940034dcbb22c52441 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9185 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-10-25 22:57:48 +00:00
Jim Harris	e40bd53175	nvme/pcie: only set qpair state from qpair's thread The qpair's state member is only 3 bits of a uint8_t, and the in_completion_context bit is another bit in that same uint8_t. We know that the qpair's state is only ever updated by one thread, but it is possible that the state could be modified by one thread, while another thread is modifying in_completion_context. in_completion_context is only modified by the thread that is polling the qpair (or the qpair's poll group). But with async mode, another thread that has a qpair on the same PCIe controller could poll its adminq and reap the SQ completion for the qpair that's owned by the other thread. So do not set the generic qpair state to CONNECTED from the SQ completion callback. Instead just set the pcie_state to READY, and let the thread that owns the qpair detect the qpair is READY and set the state to CONNECTED itself. Fixes issue #2157. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9efc0c954504f1841e1c3890ae78211ad0d1990e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9975 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-10-25 19:53:14 +00:00
GangCao	9072c4ad0d	accel: create SW Engine Channel if HW Engine not supports Currently either HW Engine Channel or SW Engine Channel will be used. In the case that HW Engine Channel is used while does not support related operations like IOAT for CRC, it will shift back to the SW Engine's handle. So that this is an issue that it still refers to the HW Engine Channel while needs SW Eninge Channel to handle. This patch introduces the SW Eninge Channel and always initializes there in case that HW Engine does not support some operations. Related UT also added to simulate the case the IOAT does not support CRC and then SW Eninge needs to properly handle it. Change-Id: I4ecdcd09ab669a616b37c567b45b1e6499800ec9 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9874 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-10-20 23:04:38 +00:00
Alexey Marchuk	2696886c75	dma: Update translation result to hold iovec pointer In some cases a single virtually contriguos memory buffer can be translated to several chunks of memory. To make such translation possible, update structure spdk_memory_domain_translation_result to use a pointer to iovec. Add a single iov structure or cases where translation is always 1:1, it will make easier translation callback implementation. For RDMA transport translation of address is always 1:1, so treat iovcnt other than 1 as an error. Change-Id: I65605575d43a490490eba72c1eb19f3a09d55ec6 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9779 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-20 22:55:52 +00:00
Alexey Marchuk	549bcdc0a4	dma: Update memory domain context structure Instead of a union with domain type specific parameters, store an opaque pointer to user context. Depending on the memory domain type, this context can be cast to a specific struct, e.g. to spdk_memory_domain_rdma_ctx for RDMA memory domains. This change provides more flexibility to applications to create and manage custom memory domains Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Change-Id: Ib0a8297de80773d86edc9849beb4cbc693ef5414 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9778 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-10-20 22:55:52 +00:00
Alexey Marchuk	0ecbe09bc1	dma: Add infrstructure for push operation Push operation complements existing pull operation and allows to implement read data flow using memory domains. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Change-Id: I0a3ddcb88c433dff7a9c761a99838658c72c43fd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9701 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-20 22:55:52 +00:00
Shuhei Matsumoto	a59b3f9236	bdev/nvme: Retry I/O immediately if it got I/O path error The previous patch supported I/O retry when no available io_path was found at submission. This patch supports I/O retry when we get I/O path error at completion. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I93a1664944b15ab0a826a321e2ea7a2574263afe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9850 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-20 07:23:40 +00:00
Shuhei Matsumoto	ef409194a1	bdev/nvme: Retry I/O a second later if any I/O path may become available If ANA state is inaccessible or qpair is disconnected, I/O cannot be submitted. But if qpair is connected, ANA state may become accessible, or if qpair is disconnected, it may become connected via resetting. Hence even if find_io_path() returned NULL, queue I/O and retry it one second later if qpair is connected or ctrlr is resetting. Sort retried I/Os by expiration values in ticks, and activate a timed poller per nvme_bdev_channel only if there is any retried I/O. So the poller function bdev_nvme_retry_ios() always returns BUSY because if the poller runs earlier than the closest retried I/O or runs when there is no retried I/O, it is more like a bug of the framework. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id28110a0d63ebc1c5772814e2ff8a47934df1644 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9830 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-20 07:23:40 +00:00
Alexey Marchuk	d47893607b	test/scsi: Fix uninitialized variable dev_ut.c:667:30: error: ‘prev_lun’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 667 \| struct spdk_scsi_lun lun, prev_lun; \| ^~~~~~~~ gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0 aarch64 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Id6608620ef6f18002ff7b7cc6de3e1361be762d0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9860 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-18 21:49:20 +00:00
Alexey Marchuk	9efad7468f	dma: Rename fetch operation to pull The new name suits better to the following "data push" operation Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ic3249f65de203f375477f8e87b0749b9502d165c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9878 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-18 07:56:57 +00:00
Alexey Marchuk	219be8dff1	dma: Change signature of fetch callback iovs are not needed in the callback Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I29718f1f2e65881628b72dea938e40c60348b85d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9877 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-18 07:56:57 +00:00
Ben Walker	c93b5564c8	bdev/nvme: Use an RB_TREE to hold namespaces in the controller If NN is very large this saves a lot of memory. This lookup is not generally used in the I/O path anyway. Change-Id: I98e190006843ad5d0bac8483bf9feb800d4a665a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9884 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-18 07:49:52 +00:00
Shuhei Matsumoto	2b70bf9291	ut/bdev_nvme: Fix bug in spdk_nvme_ctrlr_reset_async/poll_async() stubs In the SPDK NVMe driver, spdk_nvme_ctrlr_reset_async() sets ctrlr->is_failed to false and spdk_nvme_ctrlr_reset_poll_async() sets ctrlr->is_failed to true if it fails. On the other hand, in the unit test for the NVMe bdev module, the stub for spdk_nvme_ctrlr_reset_async() does nothing and the stub for spdk_nvme_ctrlr_reset_poll_async() sets ctrlr->is_failed to false if it succeeds. This bug made us very difficult to write unit test for I/O retry. Hence fix this bug. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic0dcf1109ce543a53fca74708fc86c8c74a17692 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9829 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-18 07:49:14 +00:00
Shuhei Matsumoto	ccee9a9151	bdev/nvme: find_io_path() excludes io_path whose ANA state is not accessible bdev_nvme_find_io_path() selects an io_path whose qpair is connected and ANA state is optimized or non-optimized. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I79c978795562b606ee27aa43020684d8bcbf50c5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9405 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-18 07:49:14 +00:00
Shuhei Matsumoto	56e2d632ce	bdev/nvme: Reset all ctrlrs of a bdev ctrlr sequentially Reset all controllers of a bdev controller sequentially. When resetting a controller is completed, check if there is next controller, and start resetting the controller. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I169a84b931c6b03b36bb971d73d5a05caabf8e65 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7274 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-18 07:49:14 +00:00
Shuhei Matsumoto	5d62af41a3	bdev/nvme: Complete outstanding reset after canceling pending resets Previously the NVMe bdev module had completed the outstanding reset and then canceled pending resets. This was complex. On the other hand, the generic bdev layer cancels pending resets and then completes the outstanding reset. Following the generic bdev layer simplifies the code and makes us easier to control retry reset, delay retry reset by a few seconds, or stop retry after repeated failures and then delete ctrlr. Update unit tests accordingly. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9a68422918ebcb052b3a281316ffba9b3450ecd4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9816 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-18 07:49:14 +00:00
Jim Harris	15535803d4	test/unit/scsi: initialize prev_lun Otherwise compiler complains that this variable is used uninitialized (scsi_dev_find_free_lun does reference it). Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8fce604f22e06f7007669510f8ba0ae27c44261a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9868 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-10-18 07:47:38 +00:00
Jim Harris	f01146ae48	blob: use uint64_t for unmap and write_zeroes lba count Previous patches (`5363eb3c`) tried to work around the 32-bit unmap and write_zeroes LBA counts by breaking up larger operations into smaller chunks of max size UINT32_MAX lba chunks. But some SSDs may just ignore unmap operations that are not aligned to full physical block boundaries - and a UINT32_MAX lba unmap on a 512B logical / 4KiB physical SSD would not be aligned. If the SSD decided to ignore the unmap/deallocate (which it is allowed to do according to NVMe spec), we could end up with not unmapping any blocks. Probably SSDs should always be trying hard to unmap as many blocks as possible, but let's not try to depend on that in blobstore. So one option would be to break them into chunks close to UINT32_MAX which are still aligned to 4KiB boundaries. But the better fix is to just change the unmap and write_zeroes APIs to take 64-bit arguments, and then we can avoid the chunking altogether. Fixes issue #2190. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I23998e493a764d466927c3520c7a8c7f943000a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9737 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-14 08:17:16 +00:00
Shuhei Matsumoto	d456cd93d6	bdev/nvme: admin_passthru() submits to the first found unfailed ctrlr bdev_nvme_admin_passthru() chooses the first ctrlr which is not failed. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: If41a1d1e1bde4bddfa92e5a385509daa3f0ce4de Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9525 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-14 08:16:32 +00:00
Shuhei Matsumoto	e49f77ece3	bdev/nvme: find_io_path() returns io_path instead of ns and qpair We have io_path structure now and returning io_path rather than ns and qpair match the function name. The following patches will cache the returned io_path into nvme_bdev_io. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I5d773da18591fc324667f6b5c489a38f497bf3d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9295 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-14 08:16:32 +00:00
Shuhei Matsumoto	c19ec84378	bdev/nvme: Add multiple namespaces to a single nvme_bdev This patch removes the critical limitation that ctrlrs which are aggregated need to have no namespace. After this patch, we can add multiple namespaces into a single nvme_bdev. The conditions that such namespaces satisfy are, - they are in the same NVM subsystem, - they are in different ctrlrs, - they are identical. Additionally, if we add one or more namespaces to an existing nvme_bdev and there are active nvme_bdev_channels, the corresponding I/O paths are added to these nvme_bdev_channels. Even after this patch, ANA state is not utilized in I/O paths yet. It will be done in the following patches. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I15db35451e640d4beb99b138a4762243bee0d0f4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8131 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-14 08:16:32 +00:00
Monica Kenguva	9221a0d81c	test/accel: add UT for test_spdk_accel_submit_copy_crc32c() Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I1d70d0fccc7e251777a5568ce869bb47fbaefaca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9504 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-13 07:37:36 +00:00
Monica Kenguva	55ee29ec09	test/accel: add UT for test_spdk_accel_submit_crc32cv() Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: Iefe925a61accea896e820cae5c401f1ceb8856e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9503 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-13 07:37:36 +00:00
Monica Kenguva	24128a9703	test/accel: add UT for test_spdk_accel_submit_crc32c() Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I38d0e301399e0ba1e2196a1a109f36c3c4cd1cd9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9487 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-13 07:37:36 +00:00
Monica Kenguva	fb639b53b0	test/accel: add UT for test_spdk_accel_submit_fill() Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I6ecccb015ddc72b3d676d9607bf2ca79aa7435ef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9486 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-13 07:37:36 +00:00
Monica Kenguva	b22cd665c9	test/accel: add UT for test_spdk_accel_submit_compare() Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I3287353d76de226f950620f1a4702887d33b1686 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9297 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-13 07:37:36 +00:00
Monica Kenguva	2e736160e1	test/accel: add UT for test_spdk_accel_submit_dualcast() Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: If20750c3671e794b5079972c612743c5ab189862 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9285 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-10-13 07:37:36 +00:00
Shuhei Matsumoto	5e87727596	scsi: SCSI device supports 256 LUNs at the maximum by default Most SCSI hosts, Linux, Windows, VMware, supports 256 LUNs per device now, and it is not easy to test even if any other non-free OS or driver supports more than 256 LUNs. Hence increase the macro constant SPDK_SCSI_DEV_MAX_LUN from 64 to 256. Then we do not need to expose it publicly now. So move it to lib/scsi/scsi_internal.h. Update the CHANGELOG together. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Iacde46c3854f326eebfb8befb47d41fce383b027 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9631 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-12 09:10:18 +00:00
Shuhei Matsumoto	4265fc50d9	scsi: Manage LUNs per device by not fixed size array but linked list Change a fixed size array to a linked list to manage LUNs per SCSI device. Keep the linked list sorted by LUN ID because this is necessary to efficiently find the lowest free LUN ID or check the specified LUN is free. To avoid traversing the linked list twice, change scsi_dev_find_free_lun() to return the LUN which comes just before where we want to insert an new LUN. Additionally, previously spdk_scsi_dev_add_lun_ext() had not checked if the specified LUN ID was duplicated. Fix the bug in this patch. Add unit test cases for the function scsi_dev_find_free_lun(). These changes will enable the following patches to increase SPDK_SCSI_DEV_MAX_LUN from 64 to 256 without consuming additional memory. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I7f6f070ddc680127cf86ae255055da2d1d29e4ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9630 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-10-12 09:10:18 +00:00
Ben Walker	be6a2feff2	bdev/nvme: bdev_nvme_delete now takes a path_id Specifying only a transport id is not enough. We need to be able to describe the host parameters too. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Iadbea553aee4b38e7cacab0b486e7e5746d0d1ab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9825 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-12 08:52:01 +00:00
Ben Walker	7d28aae7fb	bdev/nvme: Rename connected_trid to active_path_id This is the currently active path identifier in a failover scenario. The path is defined by more than just the transport identifier, so fix the name. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I682c6f4c54f75307e2615bf80e70358180d99fe2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9576 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-12 08:52:01 +00:00
Ben Walker	0262859f3c	bdev/nvme: Rename nvme_ctrlr_trid to nvme_path_id This defines a unique path between a host and a target. Change-Id: Ia3d24c1b34199a8b596aaf17900ca9694a9da77d Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9505 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-10-12 08:52:01 +00:00
Shuhei Matsumoto	845db70ccc	scsi: Report LUNs use spdk_scsi_dev_get_first/next_lun() to iterate LUNs Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ib812c1f4c180f4173c3b3d668a98e3b23ed32899 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9629 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-11 10:08:22 +00:00
Shuhei Matsumoto	153373a5e7	iscsi: Pass iscsi_lun directly to the callback argument of spdk_scsi_lun_open() By this change, we will not need to traverse LUN list or tree in the callback to hot remove. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ibe72fba824553d0189b9120884aa2113599a568d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9627 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-11 10:08:22 +00:00
Shuhei Matsumoto	594f46d7a9	iscsi: iSCSI target uses spdk_scsi_dev_get_first/next_lun() to iterate LUNs Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I99cc770c3637d79689d61f996ae40bb4be25cb00 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9626 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-11 10:08:22 +00:00
Shuhei Matsumoto	f61d81e47c	scsi: Add spdk_scsi_dev_get_first/next_lun() to traverse all LUNs Add two public APIs spdk_scsi_dev_get_first_lun() and spdk_scsi_dev_get_next_lun() to remove the dependency on the macro constant SPDK_SCSI_DEV_MAX_LUN from lib/iscsi and lib/vhost. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6546697f823fe9f4fa34e1161f5c7fa912dd2d59 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9608 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-10-11 10:08:22 +00:00
paul luse	78ef7cfc23	test/accel: add UT for test_spdk_accel_submit_copy() Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I2fb94a20bda23e9da07db5405977603dfaa319f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6473 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-07 09:25:20 +00:00
paul luse	bcd3ed39e7	modules/crypto: remove dependency on rte_cryptodev_pmd.h Call rte_cryptodev_close() to free qpair memory instead of using an internal function. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I1bd7f0dd86de83f278f6be3263cdf3fbd8e1c77f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9720 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-07 09:23:55 +00:00
Konrad Sztyber	c5ebb7ff99	bdev/nvme: use asynchronous ctrlr detach functions This patch replaces the synchronous `spdk_nvme_detach()` calls with its asynchronous counterparts in the controller unregister path. An additional poller is introduced to periodically poll the NVMe driver for detach completion. Once the detach is completed, the poller is unregistered and the nvme_ctrlr is destroyed. The poller uses the same period (1ms) as the async probe poller. Since reset and detach cannot happen at the same time, reset_poller was renamed to reset_detach_poller and it can now store the pointer either to the reset or detach poller, depending on the circumstances. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5eb2dd6383d98d25d1f9748af08c1a13d18acb0e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8729 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-04 15:00:35 +00:00
Changpeng Liu	742ae4ec72	nvmf/vfio-user: check SQ doorbell is valid or not before use According to the specification, we should also post an AER error event for this error case. Fix #2171. Change-Id: Ifb2343453ea5e36ce244938a939537ee6ed1c4e1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9584 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-30 08:08:05 +00:00
Ben Walker	14739d6e13	bdev/nvme: bdev_nvme_detach_controller is now much more flexible It can match by any provided parameter to remove paths. Change-Id: I5e7a87342bbb90943dc97fb52f142814fcf0acfa Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9453 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-28 07:29:50 +00:00
Ben Walker	a91079fd2d	bdev/nvme: connected_trid is now an nvme_ctrlr_trid Instead of storing an spdk_nvme_transport_id, store the object that contains it. This will make a few later patches easier. Change-Id: I36b74889fe39af3b7ab2b900fb3ea4b3f39e1f83 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9484 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-28 07:29:50 +00:00
Jim Harris	bcff088852	scheduler/dynamic: don't adjust tsc too much for very busy cores If a core has a very high busy percentage, we should not assume that moving a thread will gain that thread's busy tsc as newly idle cycles for the current core. So if the current core's percentage is above SCHEDULER_CORE_BUSY (95%), do not adjust the current core's busy/idle tsc when moving a thread off of it. If moving the thread does actually result in some newly idle tsc, it will get adjusted next scheduling period. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I26a0282cd8f8e821809289b80c979cf94335353d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9581 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-28 07:29:03 +00:00
Jim Harris	f1acee8f83	scheduler_dynamic: fix busy tsc accounting For the src thread, add the busy_tsc of the thread we are moving to the idle_tsc of the current core. This is consistent with how are accounting for the cycles in the target core too. We will disable the load_balancing.sh script for now. We will reenable it later in this patch set once a few other changes are made, along with some updates to the load_balancing.sh script based on the changes made in this patch set. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8af82610804e97dabf62ccd90f75a0e6e37d276f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9550 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-28 07:29:03 +00:00
Jim Harris	62b273d7cf	test/reactor_ut: use more variables in dynamic scheduler ut The values 100 and 200 are used a lot in this part of the unit tests, many times for different reasons. So add some more variables and use some of the existing ones more often to make some of this more clear to the reader. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2196bb6a1ac4b86ab0ddd9a3b88863664116cca5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9625 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-28 07:29:03 +00:00
Jim Harris	ae51da29da	test/reactor_ut: don't assert number of events Refactor this part of the unit tests to make it a bit easier to maintain as the dynamic scheduler itself is modified. For example, depending on the simulated thread loads, we may need to pass extra events to cores for purposes of setting interrupt mode. The important thing to test here isn't how many events it takes to do that, but what is the end result. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iad2e861cfa0bfd16c853332650e3ab3a9727f490 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9624 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-28 07:29:03 +00:00
Nick Connolly	7a5bc4905b	ut/rpc: wrap syscalls using spdk.mock.unittest.mk spdk.mock.unittest.mk contains platform specific definitions to wrap syscalls. Allow SPDK_MOCK_SYSCALLS to be predefined before it is included to extend the list of syscalls to be wrapped. Update rpc Makefile to use this mechanism so that the platform specific definitions are used. Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Change-Id: If51c0e7a31cf0eda45a844cb8cfa579efe173c42 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9621 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-27 20:59:37 +00:00
Nick Connolly	d960df9989	ut/nvme_qpair: add missing mutex init Add missing mutex init for ctrlr ctrlr_lock. Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Change-Id: I9f018898a828a2ca4caf246117b3b895c5069150 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9615 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-27 10:54:46 +00:00
Mao Jiang	25e1099b93	test/nvmf/ctrlr_bdev: cases for ctrlr reading and writing cmd Change-Id: I3626b3abe07274c4b3cb3e446899999372e14c47 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9226 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-27 10:54:08 +00:00
Mao Jiang	a43f891e9b	test/nvmf/vfio_user: cases for creating vfio user Change-Id: Id477e1f1f278d34b6d025dafa34ddd9ed1cae1d1 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8770 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-09-27 10:53:32 +00:00
Ziye Yang	34c901e308	nvme/tcp: Fix tcp_req->datao calculation issue. When data digest is enabled for a nvme tcp qpair, we can use accel_fw to calculate the data crc32c. Then if there are multiple c2h pdus are coming, we can use both CPU resource directly and accel_fw framework to caculate the checksum. Then the datao value compare will not match since we will not update "datao" in the pdu coming order. For example, if we receive 4 pdus, named as A, B, C, D. offset data_len (in bytes) A: 0 8192 B: 8192 4096 C: 12288 8192 D: 20480 4096 For receving the pdu, we hope that we can continue exeution even if we use the offloading engine in accel_fw. Then in this situation, if Pdu(C) is offloaded by accel_fw. Then our logic will continue receving PDU(D). And according to the logic in our code, this time we leverage CPU to calculate crc32c (Because we only have one active pdu to receive data). Then we find the expected data offset is still 12288. Because "datao" in tcp_req will only be updated after calling nvme_tcp_c2h_data_payload_handle function. So while we enter nvme_tcp_c2h_data_hdr_handle function, we will find the expected datao value is not as expected compared with the data offset value contained in Pdu(D). So the solution is that we create a new variable "expected_datao" in tcp_req to do the comparation because we want to comply with the tp8000 spec and do the offset check. We still need use "datao" to count whether we receive the whole data or not. So we cannot reuse "datao" variable in an early way. Otherwise, we will release tcp_req structure early and cause another bug. PS: This bug was not found early because previously the sw path in accel_fw directly calculated the crc32c and called the user callback. Now we use a list and the poller to handle, then it triggers this issue. Definitely, it will be much easier to trigger this issue if we use real hardware engine. Fixes #2098 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Change-Id: I10f5938a6342028d08d90820b2c14e4260134d77 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9612 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: GangCao <gang.cao@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-09-27 10:53:04 +00:00
Mao Jiang	159fa94ad8	test/nvmf/subsystem: cases for restoring ns reservation Add rkey checking to enhance nvmf_ns_reservation_restore(). Change-Id: I6d557adcba9bf81f954c118aa09452642318bc98 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9427 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-24 07:42:51 +00:00
Konrad Sztyber	a4b7f87b61	nvme: abort queued admin requests during init Abort any queued admin requests once admin queue gets enabled. A request can get queued if a controller is being reset and it gets submitted while admin qpair is being reconnected. If these requests aren't aborted, the init process will stall, as requests don't get resubmitted while controller is resetting and subsequent admin commands required for the initialization would be queued too. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If456a297d2d434b3cc741816cbfb13b01d37e963 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9324 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-24 07:38:57 +00:00
Alexey Marchuk	9381d8d399	nvme: Update spdk_nvme_ctrlr_get_memory_domain Allow to return more than one memory domain. This change aligns bdev and nvme API and provides more flexibility for custom transports. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ica9b12ad8463c361be6cb62ee2c0513eec0b486d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9546 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-24 07:37:45 +00:00
Alexey Marchuk	ea86c035bb	nvme/tcp: NVME TCP poll group statistics Enable dump of transport stats in functional test. Update unit tests to support the new statistics Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I815aeea7d07bd33a915f19537d60611ba7101361 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8885 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-24 07:30:06 +00:00
Shuhei Matsumoto	75f1d6484a	bdev/nvme: Aggregate multiple ctrlrs into a single bdev ctrlr This patch enables us to aggrete multiple ctrlrs in the same NVM subsystem into a single bdev ctrlr to create multipath. This patch has a critical limitation that ctrlrs which are aggregated need to have no namespace. Hence any nvme bdev is not created. However it will be removed in the next patch. The design is as follows. A nvme_bdev_ctrlr is created to aggregate multiple nvme_ctrlrs in the same NVM subsystem. The name of the nvme_ctrlr is changed to be the name of the nvme_bdev_ctrlr. NVMe bdev module has both the failover feature and the multipath feature now. To choose which of failover or multipath to use, add an new parameter multipath to the RPC bdev_nvme_attach_controller. When we attach a new trid to the existing nvme_bdev_ctrlr, we use the failover feature if multipath is false, we use the multipath feature if multipath is false. nvme_bdev_ctrlr has a list for nvme_ctrlr and it is guarded by the global mutex. Callers can query nvme_ctrlrs from a nvme_bdev_ctrlr via trid as a key. nvme_bdev_ctrlr is not registered as io_device. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I20571bf89a65d53a00fb77236ad1b193e88b8153 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8119 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-24 07:29:05 +00:00
Shuhei Matsumoto	14535253f1	bdev/nvme: Reset the nvme_ctrlr if an I/O qpair is disconnected Previously, if an I/O qpair is disconnected, we tried reconnecting the qpair. However, this reconnect operation was very likely to fail and will not match the upcoming asynchronous connect/reconnect operation. We need an extra callback to make this reconnect operation asynchronous, but we do not want to have it. Hence if an I/O qpair is disconnected, we free the I/O qpair and then reset the corresponding nvme_ctrlr immediately. If the admin qpair is also disconnected, the nvme_ctrlr is reset immediately. However this event may never happen. So we do not wait for the error of the admin qpair. The NVMf host may disconnect connections by itself intentionally. In this case, resetting the nvme_ctrlr will surely fail. But resetting the nvme_ctrlr frees all I/O qpairs of the nvme_ctrlr and these I/O qpairs are not created again until resetting the nvme_ctrlr succeeds. Resetting the nvme_ctrlr once at most is more efficient than repeating reconnecting the I/O qpair. So this change is valuable even for such intentional disconnection. However, it is helpful to know the event that I/O qpair is disconnected. Hence change DEBUGLOG to NOTICELOG in the disconnected callback. The disconnected callback is not repeated, and we do not need to worry about NOTICELOG flooding. Refine the unit test case to verify this change. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I376b749c2f55d010692bf916370e8bb4249b795f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9515 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-24 07:29:05 +00:00
Jim Harris	2bd631dc38	scheduler_dynamic: change directory name to just "dynamic" This is similar to how we name other module library directories. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iadaf59231323180b48b5d0cf2e6acb3d8bfc9807 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9549 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-22 13:52:21 +00:00
Mao Jiang	c17c7b9564	test/nvmf/transport: cases for creating polling group Also make stub for spdk_mempool_get_bulk consistent with DPDK APIs. Change-Id: I021378ea92651d75a73cc9f447df57c2f71680fa Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9356 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-22 06:58:51 +00:00
paul luse	a19781ba61	lib/accel: rework UT for code reuse Moved frequenty used stack vars to globals and added setup and teardown functions. Should be useful in upcoming patches as well wrt code savings. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I468bec8856c354fcc954628e4e733594a6580104 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7013 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com>	2021-09-22 06:56:46 +00:00
Konrad Sztyber	1fbeeb23b3	nvme: rename nvme_qpair_abort_reqs to *_with_cbarg Renamed nvme_qpair_abort_reqs() to nvme_qpair_abort_reqs_with_cbarg() to highlight the fact that it only aborts requests with specified cb_arg and to distinguish it from _nvme_qpair_abort_reqs() which aborts all requests immediately. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I32fec5ab0501b1beb8605689d73ec42a6424fba5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9323 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-22 06:55:28 +00:00
Konrad Sztyber	0825befa59	nvme: use saved CC register value when creating IO qpairs Signed-off-by: Jim Harris <james.r.harris@intel.com> Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6c518df9d8ecd74247ed8f8ffe133305cbd627f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8622 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-22 06:55:28 +00:00
Konrad Sztyber	73050d511a	nvme: enable the controller asynchronously Signed-off-by: Jim Harris <james.r.harris@intel.com> Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I2a8116bbb95f6835cd37118f81ec1144501c5b3a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8620 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-22 06:55:28 +00:00
Shuhei Matsumoto	cf0dbbb924	ut/bdev_nvme: Copy probe_ctx->opts to ctrlr->opts In unit tests, spdk_nvme_ctrlr had opts but did not use it. Hostnqn will be checked to determine if multipath can be created. Hence we implement the stub spdk_nvme_ctrlr_get_default_ctrlr_opts() and copy probe_ctx->opts to ctrlr->opts as we do in lib/nvme. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I13980424d5f463877eae7f7cd1e5ffcae888aebe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9333 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-09-20 10:51:04 +00:00
Konrad Sztyber	42b6254197	ut/rpc: mock out system calls Mock out open, close, unlink, and flock system calls. Flock isn't supported under nfs, so if the repo is mounted through nfs, the test will fail. And a unit test shouldn't be doing these calls aynway. Additionally, changed listen_addr from an IP address to a file path, as the RPC listens on a UNIX socket, so an IP address doesn't make much sense. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Icc759a74e6db4d1b9e766313a1e4672820e1c272 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9446 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-20 10:48:34 +00:00
Jim Harris	fc8d861892	nvme: add new SET_EN_0 state for ctrlr initialization This removes some code that was duplicated in the CHECK_EN and DISABLE_WAIT_FOR_READY_1 states. Signed-off-by: Jim Harris <james.r.harris@intel.com> Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ie5d175540f71c692f7784c7ff22a48f34b9b7082 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8614 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-16 07:16:52 +00:00
Konrad Sztyber	94cec6a78a	ut/nvme_ctrlr: asynchronous register get/set mocks Added mocks in preparation for making the NVMe controller initialization use asynchronous versions of the register operations. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifbcc3c73933fb965db710389fec8cd2d52886d4d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8610 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-16 07:16:52 +00:00
Konrad Sztyber	efb2ed8751	nvme/fabric: extract prop set/get to separate sync/async functions It will make it easier to support asynchronous register set/get functions. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I9915609ff940596ae4d67388238cc685dfa426fa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8608 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-16 07:16:52 +00:00
Shuhei Matsumoto	2ee6ab36f9	bdev/nvme: bdev_nvme_reset() follow spdk_nvme_ctrlr_reset() about return value Previously bdev_nvme_reset() returned -EBUSY if ctrlr is being destructed and returned -EAGAIN if ctrlr is being reset. These did not match what spdk_nvme_ctrlr_reset() returned. Reset operation will be more important than current when multipath is supported and reset operation is made asynchronous. Hence change bdev_nvme_reset() to follow spdk_nvme_ctrlr_reset(). bdev_nvme_reset() returns -ENXIO if ctrlr is being destructed and returns -EBUSY if ctrlr is being reset. Additionally change the return value of bdev_nvme_failover() accordingly. After the change bdev_nvme_failover() returns -ENXIO if being destructed and returns -EBUSY if ctrlr is being reset. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ie2c6f8601050b1043d83de9cf01490751784e4e5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8859 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@gmail.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-14 07:30:10 +00:00
Shuhei Matsumoto	6d2caa652b	bdev/nvme: Include hostid into ctrlr_opts when calling bdev_nvme_create() Following the last patch, include hostid into ctrlr_opts rather than passing it as a parameter for bdev_nvme_create(). Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I0d04db1c5767ec76a9a7cd255c3a8d56b0b8f583 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9344 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@gmail.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-09-14 07:30:10 +00:00
Alexey Marchuk	fb4398ef4b	nvmf: Correct the error path of transport creation. An error might occur after succesful transport creation when the new transport is added to nvmf poll groups, e.g. in nvmf_transport_poll_group_create. In that case transport is not detroyed and poll groups are not fully functional. To correct this behaviour, destroy transport if spdk_nvmf_tgt_add_transport fails. Also update nvmf_tgt initialization step to check that all poll groups were created. Change-Id: I116e6944729d846c1755c2844c77825f65db8c12 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9255 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-09-08 08:08:41 +00:00
Ben Walker	d409971b79	bdev/nvme: Remove common.h/common.c This only existed to share code between OCSSD and regular NVM namespaces. Now OCSSD is gone, so just merge the files into bdev_nvme. Change-Id: Idb73cc05d67144de5dd20af8db24c8f6974d10a7 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9337 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ziye Yang <ziye.yang@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-08 08:06:39 +00:00
Ben Walker	a8b0293094	bdev/nvme: Don't rely on knowing ctrlr->num_ns in nvme_ctrlr_populate_namespaces Avoid relying on this number. Different targets have interpreted its meaning in different ways and it cannot be used anymore in practice. It may also be very, very large. Change-Id: I94e8eae49d6ccdbd8be302b30a120d89242b6d39 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9316 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-08 08:06:39 +00:00
Ben Walker	050346e05e	bdev/nvme: Add accessors for getting namespaces Try to use these accessors instead of directly using the namespaces array. This will make changing the data structure easier later on. Change-Id: I3367d0e0065894f3aa199ed1698d27976b4cbbb5 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9315 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-08 08:06:39 +00:00
Ben Walker	282b8b70a7	bdev/nvme: Don't allocate inactive namespaces If the number of namespaces is very large, this can cause excessive memory allocation. This is especially true because when the number of namespaces is large, it is almost always very sparsely populated. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I27d94956c222ae3c49c6a7422164ae3a8ec8d963 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9302 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-09-08 08:06:39 +00:00
Alexey Marchuk	00277bc5d5	nvme/rdma: Fix searching for qpair by qp_num Poll group holds lists of qpairs in different states and when we got rdma completion with error, we iterate these lists to find a qpair which qp_num matches. qp_num is stored inside of ibv_qp which belongs to spdk_rdma_qp structure. When nvme_rdma_qpair is disconnected, pointer to spdk_rdma_qp is cleaned but qpair may still exist in poll group list and when we start searhing for qpair by qp_num we may dereference NULL pointer. This patch adds a check that pointer to spdk_rdma_qp is valid before dereferencing it. To minimize boilerplate code, wrap all check in macro. Add unit test to verify this fix. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I1925f93efb633fd5c176323d3bbd3641a1a632a9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9050 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-08 08:04:04 +00:00
Alexey Marchuk	97385af196	nvmf: Fix double controller destruction when subsys is deleted When a subsystem is being deleted, we disconnect all qpairs and when the last qpair for some controller is disconnected, we start controller desctruction process. This process requires to send a message to subsystem's thread to remove the controller from the list in the subsystem and after that send a message to controller's thread to release resources. The problem is that the subsystem also destroys all attached controllers. This order is unpredictable and we may get heap-use-after-free or double free. To fix this problem we can rely on the fact that the subsystem can only be destroyed in incative state, that means that all qpairs linked to the subsystem are already disconnected and all controllers are already destroyed or in the process of destruction. spdk_nvmf_subsystem_destroy API is now can be asyncrhonous, it accepts a callback with cb argument. Change-Id: Ic72d69200bc8302dae2f8cd8ca44bc640c6a8116 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6660 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-09-08 08:04:04 +00:00
Jim Harris	971f07b9fb	nvme: add tracing to pcie request path John Kariuki tested this patch on a system with several Intel P5800X Optane SSDs, to determine the performance impact of adding these two spdk_trace_records() in the main NVMe I/O path. The pathological case (512B random reads on a single Xeon core) decreased from 13.10M to 12.88M, or 1.7%. Normal workloads (4KB+) would incur a smaller penalty since the I/O rate would be much lower - maybe even unnoticeable.. This is a really valuable tracepoint to have enabled by default, so I think this small amount of degradation is acceptable. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie2543cadf3541eb74398d31ac0f495522ab49ec0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9303 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-09-08 08:03:20 +00:00
Mao Jiang	5f0ed1cc97	test/nvmf/subsystem: cases for valid nqn Change-Id: I4dbbc4ec555c08395d257568aa2fcc49e7bd3cbf Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9321 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-09-07 07:34:31 +00:00
Jim Harris	6e5d6032a0	bdev/nvme: use spdk_nvme_ctrlr_prepare_for_reset() When preparing for a reset, use this new call to tell the driver to avoid sending DELETE_CQ/SQ commands to a PCIe controller when they aren't needed. Fixes issue #2073. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9ebb7d5c3f7cbb1c3192f162f32edbbea41acde1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9250 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Matt Dumm <matt.dumm@hpe.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2021-09-07 07:33:41 +00:00
Tomasz Zawadzki	a86e40f320	scheduler: create public API and subsystem for scheduler/governor This patch moves schedueler and governor related API from the internal event.h to public scheduler.h. With this it is possible to create subsystem responsible for handling the schedulers. Three schedulers and a governor were moved to scheduler modules from event framework. This will allow next patch to add JSON RPC configuration to the whole subsystem. Along with easier addition of other schedulers. Removed debug logs from gscheduler, as they serve little purpose. Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I98ca1ea4fb281beb71941656444267842a8875b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/6995 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-09-07 07:33:03 +00:00

1 2 3 4 5 ...

2562 Commits