ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Shuhei Matsumoto	80e81273e2	bdev/nvme: Do not use ctrlr for I/O submission if reconnect failed repeatedly If ctrlr_loss_timeout_sec is set to -1, reconnect is tried repeatedly indefinitely, and I/Os continue to be queued. This patch adds another option fast_io_fail_timeout_sec, a flag fast_io_fail_timedout to nvme_ctrlr. If the time fast_io_fail_timeout_sec passed after starting reset, set fast_io_fail_timedout to true not to use the path for I/O submission. fast_io_fail_timeout_sec is initialized to zero as same as ctrlr_loss_timeout_sec and reconnect_delay_sec. The name of the parameter follows the famous DM-multipath, its fast_io_fail_tmo. Change-Id: Ib870cf8e2fd29300c47f1df69617776f4e67bd8c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10301 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	ae4e54fdc3	bdev/nvme: Retry reconnecting ctrlr after seconds if reset failed Previously reconnect retry was not controlled and was repeated indefinitely. This patch adds two options, ctrlr_loss_timeout_sec and reconnect_delay_sec, to nvme_ctrlr and adds reset_start_tsc, reconnect_is_delayed, and reconnect_delay_timer to nvme_ctrlr to control reconnect retry. Both ctrlr_loss_timeout_sec and reconnect_delay_sec are initialized to zero. This means reconnect is not throttled, as was the case before this patch. A few more changes are added. Change nvme_io_path_is_failed() to return false if reset is throttled even if nvme_ctrlr is resetting or is to be reconnected. spdk_nvme_ctrlr_reconnect_poll_async() may continue returning -EAGAIN infinitely. To check for such an exceptional case, use ctrlr_loss_timeout_sec. Not only ctrlr reset but also non-multipath ctrlr failover is controlled. So we need to include path failover into ctrlr reconnect. When the active path is removed and switched to one of the alternative paths, if ctrlr reconnect is scheduled, connecting to the alternative path is left to the scheduled reconnect. If reset or reconnect of the ctrlr fails and the retry is scheduled, switch the active path to one of the alternative paths. Restore unit test cases removed in the previous patches. Change-Id: Idec636c4eced39eb47ff4ef6fde72d6fd9fe4f85 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10128 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	962c4c3800	bdev/nvme: Fix a degradation that I/O gets queued infinitely We noticed the difference between the SPDK 21.10 and the latest master in a test. The simplified scenario is as follows: 1. Start SPDK NVMe-oF target 2. Run bdevperf for the target with -f parameter to suppress exit on failure. 3. Kill the target after I/O started. With the SPDK 21.10, bdevperf retries failed I/Os and exits after the test time is over. With the latest SPDK master, bdevperf hangs and does not exit even after the test time is over. The cause was as follows: reset ctrlr is repeated very quickly (once per 10ms by default) and hence I/Os were queued infinitely because nvme_io_path_is_failed() returned false if nvme_ctrlr is resetting. We should queue I/O when nvme_ctrlr is resetting only if reset is throttled and fail-fast for the repeated failures is supported. Hence in this patch, fix the degradation and remove the related unit test cases. Reported-by: Evgeniy Kochetov <evgeniik@nvidia.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4047d42dc44488a05264c6a841d101a7c371358b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11062 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 14:25:15 +00:00
Liu Xiaodong	75f30ff07d	test/rbd: explicit keyring & conf test Change-Id: Idd8622d7e6601ac83ea4f4a9969395f4d2e1c910 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11089 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <tanl12@chinatelecom.cn> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-17 09:44:56 +00:00
Ahriben Gonzalez	0345729e00	nvme: Add metadata support to io commands Adding metadata support for io commands. Currently metadata is ignored even if present in the cmd struct. Making metadata address readable/writable depending on data transfer bits. Adding extra unit test to make sure metadata fields are populated. Signed-off-by: Ahriben Gonzalez <ahribeng@gmail.com> Change-Id: I1d01974a6b2831c82b43e94073065d235eea429a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10854 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-14 11:10:13 +00:00
Ahriben Gonzalez	9e14341bd9	nvme: Always set result field for passthru cmd Modify admin passthru so that result field of passthru struct is always populated. This should be safe since dw0 is either reserved or contains command specific info. This is specifically meant for the namespace management command when attempting to create a namespace. As per spec: "Dword 0 of the completion queue entry contains the Namespace Identifier created.". So for nvme cli and perhaps other application to see what is the id of the namespace created there needs to be a way to pass the information back. Signed-off-by: Ahriben Gonzalez <ahribeng@gmail.com> Change-Id: Ide4effc126ad9eedac95b0700dd65041ed4b35b1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10633 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-14 11:10:13 +00:00
Ahriben Gonzalez	0c645fdc8e	nvme: change cuse ioctl reply -Change cuse ioctl reply from status code to whole status field. -Add negative test for nvme cli cuse: Power Management on Namespace Signed-off-by: Ahriben Gonzalez <ahribeng@gmail.com> Change-Id: I55a88a4f5ace5040f79c05edfc0b8559905bdd2e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10602 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-14 11:10:13 +00:00
Ben Walker	517b557226	nvme: Do not track a separate active namespace list We only populate active namespaces into the main namespace tree, so we don't need a separate list of active namespaces too. Change-Id: Iaf194f806cc1d9672f5567cff3dffafff3165069 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10034 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:10 +00:00
Ben Walker	e7602c158f	nvme: Hold namespaces in an RB_TREE Since this is now sparsely populated, a tree is a better choice. Change-Id: Ie66d913fa1d298de56a7d22ef55f0adf7f8803b8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10031 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-14 08:35:10 +00:00
Ben Walker	b4dace738e	nvme: Do not allocate inactive namespace objects Some subsystems report a very large maximum value for the number of namespaces, but in essentially every case the subsystem is sparsely populated with active namespaces. To save memory, don't allocate objects for the inactive ones. Change-Id: I4cbeb5a7a898d3c685f4a3a9ec4c2ce45efffb92 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9898 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:10 +00:00
Changpeng Liu	6bc3a3e630	nvme/compliance: also delete CQ at last in a test case For the test case, we only delete the SQ for a queue pair, this will make SPDK NVMe driver not to send delete CQ when shutting down the controller, so here also delete CQ at last. Change-Id: Ibdfb727710ca51524858158edac415e93bb1c1e3 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10528 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-13 08:48:10 +00:00
Yifan Bian	daacb4264a	test/llvm_nvme_fuzz: add NAMESPACE ATTACHMENT and NAMESPACE MANAGEMENT command test cases Change-Id: I48a9a37d54269b2d7038a96dace9aefba334ede8 Signed-off-by: Yifan Bian <yifan.bian@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11011 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-12 08:21:14 +00:00
Konrad Sztyber	d9f1d4e1dd	test/nvmf: run bdevperf in the foreground There's no point in running bdevperf in the background only to send a framework_wait_init. Moreover, since bdevperf is executed with a 1s timeout, there's a risk that it'll finish before that RPC is sent causing a false positive failure. Fixes #2046 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Icb5a8535d39954b7dfd3b4386cd008214d180bf3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11014 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-01-12 08:20:53 +00:00
Ben Walker	1cfae16563	accel: Use vectored crc32 operations instead of chaining Chaining may be faster, but this is really an implementation detail of the idxd driver. Push the decision on how to implement a vectored crc down into the individual drivers and eliminate it from the generic framework. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Iedbdc5a6dbd3f7d1674d0a83f6827588f4b6b2fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-01-12 08:20:39 +00:00
Jim Harris	932ee64b8f	bdev/nvme: add bdev_nvme_stop_discovery RPC This RPC will stop the specified discovery service, including detaching from any controllers that were attached as part of that discovery service. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9222876457fc45e1acde680a7bd1925917c22308 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10832 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-12 08:20:23 +00:00
Jim Harris	61a640b2a6	test: add nvmf/host/discovery.sh Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7c279594f5298f9a0d7476a639866e75aea4043a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10811 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-12 08:20:23 +00:00
Konrad Sztyber	b542979620	test/nvmf: zero-copy abort test Extended the zcopy test with a step checking aborts. It's similar to `test/nvmf/target/abort.sh`, with the difference being that here the aborted requests are executed using zero-copy. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia25936b7097a94d278c467ce07c009091ea3a55a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10799 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	da2fb51627	test/nvmf: zero-copy test The test has two phases: first it runs a bdevperf with rw=verify to check data consistency and then runs bdevperf in the background while sending RPC requests causing the subsystem to be constantly paused/resumed. In-capsule data is set to 0 to make sure all IO requests are using zero-copy. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I1c1d7eb04714c8506307cb95b6cbc5988c8946a3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10797 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	6631c2a8aa	nvmf/tcp: initialize zcopy phase in nvmf_tcp_req_get Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia74148fb36733deaf7b2f833ac0247859311a805 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10794 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	a50a70ecdf	nvmf: abort outstanding zcopy reqs in qpair disconnect Zero-copy requests are kept on the outstanding queue for the whole duration of the request - from the initial zcopy_start submission to the completion of zcopy_end. This means that there's a period in which a request doesn't wait for a completion from the bdev layer, but is still on the outstanding queue (after zcopy_start callback, before zcopy_end submit). If a qpair gets disconnected while a request is in this state, we need to manually force its completion, as otherwise it might hang indefinitely (e.g. waiting for host data). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I53731b8e363b725efa564ca3c7d89b46f5fb2a24 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10793 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	974a32b72e	nvmf: resume queued zcopy requests The zero-copy requests can also be queued when a subsystem is paused, so we need to properly resume and submit them by using zcopy_start. Since only requests that haven't received the zero-copy buffer (i.e. before zcopy_start was called) can be queued, we don't need to bother with checking zcopy_phase. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ie629688f6961eb2ae05741df496720b91be4d80d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10792 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Shuhei Matsumoto	521a9bb22c	bdev/nvme: Fix race between failover and add secondary trid We sort secondary trids to avoid using disconnected trids for failover. However the sort had a bug. This bug was found by running test/nvmf/host/multipath.sh in a loop. Verify the fix by adding unit test. Fixes #2300 Signed-off-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Change-Id: I22b0ede4d2ef98b786c3e0d1f5337a2d568ba56d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10921 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-10 22:18:46 +00:00
Changpeng Liu	e57252f98a	test/app: add missed spdk_env_fini() or spdk_app_fini() when exiting Change-Id: I9d511713277ea85d34e0a0e59b570fcbfa014fb1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10911 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-10 18:36:14 +00:00
Jim Harris	b68f2eeb0b	bdev_nvme: add bdev_nvme_start_discovery RPC This patch adds the framework for a discovery service in the bdev/nvme module. Users can specify an IP/port of a discovery service. The bdev/nvme module will connect to a discovery controller, get the discovery log page, and then register for AERs. It will connect to each subsystem specified in the initial log page. AER completions will trigger fetching the log page again, at which point new subsystems will be connected to, or removed subsystems will be detached. This patch does the following: * Adds the new start_discovery RPC * Connects to the discovery controller * Gets the discovery log page * Registers for AERs * Detach from discovery controllers at shutdown Subsequent patches in this series will: * Connect to subsystems listed in discovery log page * Detach from subsystems that were listed in earlier discovery log pages but subsequently removed * Add a stop_discovery RPC Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54bfa896a48c5619676f156b5ea9f2d1f886c72f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10694 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-10 15:23:39 +00:00
Jim Harris	9035986e35	test: add iptables entry to forward packets across veth bridge It seems like some distros or kernels do not automatically forward packets across the bridge we set up in nvmf_veth_init. So add an iptables entry to explicitly add a forwarding rule. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I50bc203afcfee888c27c0e1d77609e554dedc61e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10810 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-10 15:23:39 +00:00
Konrad Sztyber	7a374fbc0b	nvmf: make zcopy_end void Since spdk_bdev_zcopy_end() cannot really fail (it only fails if we pass a bad bdev_io), we can simplify the nvmf zcopy_end functions by making them void and always expect asynchronous completion. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6e88ac28aba13acadea88489ac0dd20d1f52f999 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10790 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	92d7df1f47	nvmf: use spdk_nvmf_request_exec to submit zcopy_start Since this path now supports sending zero-copy, use it for zcopy_start. Additionally, it makes it possible to make zcopy_start void, as it reports all errors asynchronously via request_complete(), and to remove some of the duplicated error checks. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I41f43ce1651432d9a7d74e3680d4a3f780128a1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10789 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	686b9984b3	nvmf: return async/complete status in bdev zcopy operations Additionally, the NVMe completion status is now updated and the IOs are queued if the bdev layer doesn't have enough IO descriptors. It makes the zcopy operations behave similarly to the other IO operations. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I455ae781e32aa6e60d144d2c91f109bd8be46664 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10787 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	0e09df57dd	nvmf: rename zcopy operations to zcopy_(start\|end) It makes their names consistent with the bdev API. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I314051f0980b46959d6560aa25885f13b4c28f2a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10786 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	f65099d378	nvmf: remove zcopy check in spdk_nvmf_request_exec It will make it possible to submit zero-copy requests through spdk_nvmf_request_exec(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ibc14fe77cd477b11ed55d1350a7486caaad81add Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10783 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	7d23ac8657	nvmf: remove zcopy phase checks from IO functions The code should never reach these functions for requests using zero-copy. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If9f30e05a43b340a982604d5b985242d63ce252b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10782 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	aa1d039836	nvmf: zero-copy enable flag in transport opts It makes it possible for the user to specify whether a transport should try to use zero-copy to execute requests when possible. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I40a92b0d7a6707f4c9292795f380846acb227200 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10780 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Krzysztof Karas	39b7e10fd4	hw_hotplug.sh: fix the incorrect path to log.txt file This test fails due to incorrect path to log.txt file needed to detect start of hotplug application. The change that introduced the bug: Change-Id: Ia14507a282796ad28f067c086b9112ae6463b654 This change corrects the file path. Change-Id: I5d43c47044f46100fa4ca7b845973c427cae237c Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10976 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:52:31 +00:00
Yifan Bian	89f253dd20	test/llvm_nvme_fuzz: add DELETE I/O COMPLETION QUEUE and DELETE I/O SUBMISSION QUEUE command test cases Change-Id: Idf20d03077ea24d347170ada955dda6d92efce49 Signed-off-by: Yifan Bian <yifan.bian@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10984 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:52:05 +00:00
Changpeng Liu	2a6c2c289c	nvmf: support static CNTLID The SPDK NVMf subsystem supports the dynamic controller model; for transports other than fabrics, users should use the static controller model. Change-Id: I364ea61a71b04d51932fd9e0e16f401a383ff67c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10149 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 01:20:32 +00:00
Michal Berger	e0482c503e	autopackage: Move extra RPM tests to nightly This is done to save some time in per-patch testing Signed-off-by: Michal Berger <michalx.berger@intel.com> Change-Id: I777e30ed90a9cd02ec12c0fd0c19a5616c491f6a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8637 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com>	2022-01-05 09:24:59 +00:00
Michal Berger	f3b3c5bd84	test/common: Don't exit from within the killprocess() This breaks the flow of EXIT traps that use this function together with errexit. Consider: set -e trap 'killprocess $pid_not_found; nvmftestfini' EXIT ... if killprocess() suddenly exits the remaining parts of the trap won't be executed. Also, if the process is not found, allow killprocess() to return success - if it's gone then the main goal of the function is already achieved. If the process terminated abruptly errexit should pick that up regardless. Signed-off-by: Michal Berger <michalx.berger@intel.com> Change-Id: I93c00c2a2da5457affb2ed2b5bf0ebae9c4bf291 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10853 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-05 09:24:22 +00:00
Michal Berger	89df50bc2d	scripts/core-collector: Don't limit core dump collection We should gather the core regardless of the RLIMIT_CORE setting. Also, don't limit number of parallel executions of the collector in case more than 2 processes crashed (unlikely to happen anyway, but just in case make sure we cover it). Lastly, wait a bit longer for the potential cores. Signed-off-by: Michal Berger <michalx.berger@intel.com> Change-Id: Ic7de8f8000676122282469b729f45b235ab6f9ca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10852 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-05 09:24:22 +00:00
Michal Berger	5d49965bb9	test/scheduler: Ignore failures while removing dummy cgroups Upon removal, there's a small chance that new threads would jump into these cgroups AFTER we read the list of PIDs to migrate. In such a case, there may be some processes left preventing us from finally removing the cgroup. Since we can't fully control that without continuously reading the list, simply ignore it. Fixes issue #2298 Signed-off-by: Michal Berger <michalx.berger@intel.com> Change-Id: I0d50b975742a8b15edd25c6b4e51ab337b017eaa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10872 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-05 09:23:57 +00:00
wanghailiangx	da7c73016e	test/nvmf: distinguish between two nvmf_fio cases When we find the corresponding nvmf_fio cases in the log, we won't be confused. Change-Id: If37372d287b0ac70142234a4e29208a38d027fbd Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10680 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-05 09:23:24 +00:00
Maciej Wawryk	8dafa5632a	test/nvme: add timeout to bdevperf perform_tests By default, this timeout waiting for response is set to 60 seconds, we want to wait a bit longer than defined RUNTIME value. Signed-off-by: Maciej Wawryk <maciejx.wawryk@intel.com> Change-Id: I6f91da6c3996967271dda7c49320a2b8573bc49f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10169 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-05 09:22:50 +00:00
Yifan Bian	dd6a8a80df	test/llvm_nvme_fuzz: add CREATE I/O SUBMISSION QUEUE command test case Change-Id: I4353543acf8662b14f35d36b30b2eb1fad88a871 Signed-off-by: Yifan Bian <yifan.bian@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10970 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-05 08:49:15 +00:00
Yifan Bian	0ac16751a5	test/llvm_nvme_fuzz: add CREATE I/O COMPLETION QUEUE command test case Change-Id: Id5c6e81c2691942380450f2ebabdf818494f76a2 Signed-off-by: Yifan Bian <yifan.bian@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10953 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-05 02:40:31 +00:00
Changpeng Liu	c7f33deaae	test/llvm_nvme_fuzz: don't use NVME_OPC_FABRIC for ADMIN fuzz tests When doing ADMIN fuzz tests, the NVME_OPC_FABRIC is special for fabric transports, so here we pick up a different one. Change-Id: I00376c08eb9eabdb109656d631615eeb37c9d09c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10847 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-05 02:39:04 +00:00
Mao Jiang	dc1386e686	nvme/overhead: add vfio_user transport support Change-Id: Ib696c7787151e4898b63f57749bd134333e94a23 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10413 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-04 07:53:00 +00:00
junwenx	f9c496b8e2	test/vfio_user: Restart vm, check if memory region register/unregister failed Signed-off-by: Jun Wen <junx.wen@intel.com> Change-Id: Ib18e8c3971da53f91de5352eaa476b4acd0ce4db Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10377 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michal Berger <michalx.berger@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-12-31 10:10:24 +00:00
Mao Jiang	a0d255ac02	test/vfio_user: Build vfio-user test directory Add a test script for launching VMs with the vfio-user transport. The script is not active until a vfio-user capable QEMU is available; the new flag SPDK_TEST_VFIOUSER_QEMU decides whether the case runs. Change-Id: I9b885cec797fe3bb28860f1ec436c31a6fa2f131 Signed-off-by: Mao Jiang <maox.jiang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10096 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michal Berger <michalx.berger@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2021-12-31 10:10:24 +00:00
Bianyfan	5d5b293387	test/llvm_nvme_fuzz: add ABORT command test case Change-Id: Ibfdb75771abc20ecf0c436314d26c1791cf0a518 Signed-off-by: Yifan Bian <yifan.bian@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10922 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-31 09:21:50 +00:00
Alexey Marchuk	3c4a68cafc	nvme: Do not create IO qpair during ctrlr initialization If the nvme ctrlr is resetting or initializing, the free_io_qids bitmap is already freed or not created yet. In that case, an attempt to create an IO qpair leads to a segmentation fault. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I6a97bf81d5a568db20d23b3f88cf01e994ba42e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10827 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com>	2021-12-27 08:43:03 +00:00
Alexey Marchuk	eb09178a59	nvme/rdma: Correct qpair disconnect process In the current implementation, the RDMA qpair is destroyed right after disconnect. That is not a graceful qpair shutdown process, since there can be requests submitted to HW and we may receive completions for an already destroyed/freed qpair. To avoid this, only disconnect the qpair in the ctrlr_disconnect_qpair transport callback; all other resources will be released in the ctrlr_delete_io_qpair cb. This patch is useful when nvme poll groups are used, since in that case we use a shared CQ: if the disconnected qpair has WRs submitted to HW, then the qpair's destruction will be deferred to the poll group. When nvme poll groups are not used, this patch doesn't change anything; in that case the destruction flow is still ungraceful. However, since the CQ is destroyed immediately after the qpair, we shouldn't receive any requests which point to released resources. A correct solution for the non-poll group case requires an async disconnect API, which may lead to significant rework. There is a bug when Soft-RoCE is used: we may receive a completion with "normal" status when the qpair is already disconnected and all nvme requests are aborted. Added a workaround for it. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0680d9ef9aaa8737d7a6d1454cd70a384bb8efac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10327 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-23 08:44:40 +00:00

6014 Commits