ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
paul luse	dd2c08d2d1	configure/misc: make ISA-L a hard dependency Following discussion in a recent SPDK community meeting, it was determined that we no longer need to carry ISA-L as a user configuration option. It will be enabled by default. If running on an architecture that ISA-L isn't fully supported on, the configure script will disable associated features and display a warning and will also not build ISA-L. Same case if there are issues with dependencies. Note that --without-isal is no longer supported as a configure option. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ibd1e5e9454d1b090462c3e757b2f51c52e6cb774 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14393 Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-20 10:18:54 +00:00
Jim Harris	18c8b52afa	trace: allocate shm filesize based on number of cores used Previously we would always allocate the shm file based on max (128) cores which is unnecessary. So use spdk_env APIs to only allocate shm file size based on the cores we might possibly use. With default settings, an shm file was 135MB before this change, now an app using cores 0-7 will just use about 9MB. A lot of the trace-related code depended on there always being a history for every core, even unused ones, so a few additional changes were needed, mainly the trace_parser library. Tested by starting an app using a 0x4 core mask and enabling a trace mask, generating some events, then checking both the size of the shm file and that spdk_trace works properly with the resulting file. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie868b3e3658d6f82b2fea37cb87453e8a9e0abc4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14044 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-20 10:17:45 +00:00
Changpeng Liu	982c25feef	nvmf: add spdk_nvmf_ctrlr_[save|restore]_migr_data() APIs When doing live migration, some spdk_nvmf_ctrlr internal data structures need to be saved and restored. These data structures were designed only for the vfio-user transport; to extend them to other vendor-specific transports, move them into public APIs. Users can use SAVE|RESTORE to restore a new nvmf controller based on the original one. Also remove the registers from the vfio-user transport; these registers are stored in the common nvmf library. Change-Id: I9f5847ef427f7064f8e16adcc963dc6b4a35f235 Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11059 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 10:17:24 +00:00
Liu Xiaodong	762db2a4f4	vhost: register memtable once if unchanged Move memtable registration out of start_device and into the post_handler for vhost-msg SET_MEMTABLE, and unregister the memtable in destroy_connection instead of destroy_device. If the memtable info in the message has not changed, we don't need to register it multiple times. Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Change-Id: I0f8c76c1ee43b6f981d703beeba92da5dac4dbd6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14263 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-19 13:12:24 +00:00
Xinrui Mao	c3f628f141	lib/nbd:export bdev flush and trim ability Fix mkfs failure when using an lvol as the backend of nbd. NBD_FLAG_SEND_FLUSH and NBD_FLAG_SEND_TRIM are set by default, so trim and flush operations are treated as supported, but in fact lvol doesn't support trim and flush operations. Therefore add a check for NBD_FLAG_SEND_FLUSH and NBD_FLAG_SEND_TRIM. Signed-off-by: Xinrui Mao <xinrui.mao@intel.com> Change-Id: I3d21034d12a038c8fc694d3383028103239ea6bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14099 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-09-16 13:32:13 +00:00
MengjinWu	48312019c8	nvme/tcp: Remove duplicate code in nvme_tcp_read_pdu Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I63f51ecba2b4d40579d2592d2c85a7aefdacf7e7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14503 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-15 19:25:02 +00:00
MengjinWu	31fc5f196f	nvme/tcp: simplify state change function The state change function does not need to use a switch to do some work. Do the memset in the state machine. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie66454d8f31860f403171f20858a6b4a24e3c76f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14502 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com>	2022-09-15 19:25:02 +00:00
Aleksey Marchuk	7a7f21b6fe	init: Avoid calling RPC methods twice Some methods are allowed to be run in both STARTUP and RUNTIME states and current implementation calls such methods twice. That can be a problem in some cases, so use the new spdk_rpc_get_method_state_mask function to skip such methods in RUNTIME state. Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I0a109805db428f60072a8c82161805dcde763da7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14407 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-15 08:25:18 +00:00
Aleksey Marchuk	515419ac66	rpc: Add API to get method state mask The new API will be used in the next patch to prevent calling methods a second time when a subsystem is initialized with a config file Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I60ac8196e46ccb3b22b3af0607e1ba35a11a66a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14406 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-15 08:25:18 +00:00
Damiano	6defafc913	bdev: Add functions to [hole,data] seek These functions start from a given offset and seek for next data or for next hole. For bdevs that do not support seeking, it is assumed that only data and no holes are present Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: I6bc831970223333b25683f60ce3fcbbfebb5bb81 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-09-15 08:23:56 +00:00
Damiano	d8a3dee1c1	blob: Add functions to find [un]allocated io_unit These functions start from a given offset and seek for first io_unit belonging to an allocated cluster or first io_unit belonging to an unallocated cluster Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: I0c632e2b3dfd2e96aa22e21796e25a36f2f55f9f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14360 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-09-15 08:23:56 +00:00
Damiano Cipriani	ddf5a8da90	blobstore: Add function to get io_unit per cluster This function returns the number of io_units per cluster Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: I8f33d24a63876a0a918830b9eeaa69a91ff21193 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14431 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-09-15 08:23:56 +00:00
Boris Glimcher	35f7f0ce1e	nvme/tcp: Allow to choose SSL socket implementation Adding `psk` field to `spdk_nvme_ctrlr_opts` Adding `psk` parameter to `bdev_nvme_attach_controller` RPC Change-Id: Ie6f0d8b04ce472e6153934e985c026acded6cdfc Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14046 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-14 07:44:53 +00:00
Kefu Chai	39ecb61ade	event: pass "const struct option" to spdk_app_parse_args() Before this change, we cannot pass a `const struct option` to spdk_app_parse_args() even though the callee does not mutate the value pointed to by the pointer. In other words, we are not able to write something like: static const option g_options[] = {...}; // ... spdk_app_parse_args(argc, argv, &opts, "", g_options, app_parse_arg, app_usage); After this change, the type requirement for the `option` argument is relaxed, so we can now pass a `const struct option*` to this function. Signed-off-by: Kefu Chai <tchaikov@gmail.com> Change-Id: I8794fcf92090f538743850a28ef4a2a8c357f121 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14082 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-13 10:48:58 +00:00
MengjinWu	12807c5bc6	lib/nvmf: Do one memset per new PDU recv While waiting for a new PDU, target will not do too many useless memcpy. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie0825c2b1e44444b210040c4a1761010e0e4cfe5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14444 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-13 07:29:38 +00:00
Kozlowski Mateusz	630922e825	ftl: Add lazy unmap process Since only L2P pages as a whole are marked as invalid during trim, the specific L2P entries won't be updated until someone touches that page. The unmap process will slowly invalidate pages during runtime, by paging them in. This will allow compaction and relocation to benefit from the trim as the user data gets invalidated. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I239b9adf0aaaeac58f440145f4ab78b0d78d98b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13381 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	b3e5d8a723	ftl: Add recovery and restart path for trim Restores necessary metadata and sets L2P during clean/dirty shutdown recovery process. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Iaa44025250b44f424ac9de5859d1db82900ecaa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13380 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	2c7c8b6ceb	ftl: Add rpc functionality for unmap Trim is now also available as a management operation via RPC. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I05b778a611e9809a14bfed50b01986bb4649a35c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13379 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	66fe5f75bb	ftl: Unmap functionality Adds ability to send trim commands to FTL - only 4MiB aligned requests (both for offset and length of request) will be processed. During a trim operation an L2P page (containing 1024 4B entries, 1 per user LBA; which is where the 4MiB alignment comes from) will be marked as unmapped. After this point any L2P access to that page will actually set the entries themselves as FTL_ADDR_INVALID. This is done to make the trim as fast as possible, since for large requests it's probable that most of the L2P pages aren't actually in DRAM. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4a04ee9498a2a6939af31b06f2e45d2b7cccbf19 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13378 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Artur Paszkiewicz	78c3cbf4c9	ftl: metadata for unmap support Setup trim metadata layout. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I9395119cb8d5f7a5de4fde7b3f9506eb06452d7b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13377 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	c7c9211ee0	Ftl: Open chunk recovery At the end of the recovery step, all chunks will be transferred to closed state. Missing write pointer data filled with LBA_INVALID Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Id496e465e46fa24b04b30f2558bdacfdd668e8a4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13375 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	5c5587d805	FTL: L2P chunk recovery Recover L2P from chunks' P2L. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I039cfc54374fad0ba584d6029b752ca2f31925cf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13374 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	d1462266ce	FTL: Recover chunk state Recovers the free/open/close chunk state, initializing them to any specific lists. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Idf689f4fbcd6fc6bd986104dc89f5079c758845a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13373 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	ca53f5a6df	FTL: Band L2P recovery Recovers L2P based on all non-free bands' P2L. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ice9e77b00161b031c795570baf3ed8c92dfecef0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13372 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 19:44:29 +00:00
Changpeng Liu	40f556ca38	vhost: don't kick VM when there are outstanding vhost-user messages For all the vhost-user messages processed in SPDK except VHOST_USER_GET_VRING_BASE, the DPDK rte_vhost "vhost-events" thread already holds every VQ's access lock. Before returning the response to the "vhost-events" thread, SPDK should not call `rte_vhost_vring_call`, so we set a flag to TRUE for these vhost-user messages and avoid kicking the VM. The deferred IRQs will be posted in the next poll round or after restarting the device. Fix issue #2518. Change-Id: I82f14b97d0b0ce602a93fd66d5fdeef64f07d179 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14402 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 15:31:06 +00:00
Changpeng Liu	097691fc18	vhost: do `rte_vhost_vring_call` from spdk context Currently we call `rte_vhost_vring_call` in the DPDK "vhost-events" thread context when starting the device, and the DPDK vhost library already holds every VQ's access lock when starting the device. With the new DPDK/dpdk@c573699 commit, calling `rte_vhost_vring_call` in the "vhost-events" context causes a deadlock, so we increment `used_req_cnt` by 1 to make sure one more `rte_vhost_vring_call` is executed later in the SPDK thread context. Signed-off-by: Jim Harris <james.r.harris@intel.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Change-Id: Iab53941942335744bf25ab6e9b8747bd08b0c698 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14328 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-09 15:31:06 +00:00
Changpeng Liu	9b74b4a3de	lib/vhost: don't clear interrupt counter for error case `rte_vhost_vring_call` may return error, then we can try to call it in next poll. Change-Id: I8f6a591837225079e004c6f57f2d7b01063f87a1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14342 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 15:31:06 +00:00
Jim Harris	75cc6fd62f	vhost: move the session_start_done calls to common layer Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I355790f87ef148af85d5c13002260f1120749ae5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14340 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 15:31:06 +00:00
Jim Harris	f869197b76	virtio: assert and ERRLOG for virtio-user dynamic mem allocations We do not support dynamic memory allocation with the virtio-user library - it results in SET_MEM_TABLE vhost messages for every change which is not supported by the vhost target. Add '-s 256' to vhost fuzz script, to ensure it does not violate the new restriction. This is a follow-on patch for issue #2596. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: If851f53d7d670ac8443f0d9c8f4e3cbe82e0df7c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14249 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-09 13:06:15 +00:00
Michael Piszczek	9ffb0497c1	iommu: Read AMD iommu address width Add code needed to read the virtual address width for AMD processors Fixes issue 2686 Signed-off-by: Michael Piszczek <mpiszczek@ddn.com> Change-Id: I44f988e60d7bbfb1cb137b3cbc4ac44dbb693d35 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 13:06:05 +00:00
Michal Berger	59c10a2fa2	lib/ftl: Fix -Wunused-function under clang utils/ftl_mempool.c:131:1: error: unused function 'ftl_mempool_is_initialized' [-Werror,-Wunused-function] ftl_mempool_is_initialized(struct ftl_mempool *mpool) ^ 1 error generated. Signed-off-by: Michal Berger <michal.berger@intel.com> Change-Id: I81076fb9c931fe63c79241f80584502a1ce56be9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 13:02:07 +00:00
Kefu Chai	5a6f3a6f91	event: accept negative --shm-id as a valid option Before this change, a negative `--shm-id` value is rejected by `spdk_app_parse_args()` and this function simply errors out after detecting it. However, `build_eal_cmdline()` has a dedicated branch that checks for a negative `opts->shm_id` and passes `--no-shconf` down to DPDK as a parameter, so rejecting the value means we cannot disable the shared config support in DPDK. After this change, a negative `--shm-id` value is accepted, but if it cannot be parsed as an integer, `spdk_app_parse_args()` errors out as before. As a result, we can disable shared config support in DPDK by passing `--shm-id=-1` to an SPDK application. Signed-off-by: Kefu Chai <tchaikov@gmail.com> Change-Id: Ibe089f13638eefa9ac28c5c99e303bcc3102f307 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-09 12:57:01 +00:00
Shuhei Matsumoto	cad6f55e33	bdev: Add spdk_bdev_get_current_qd to measure and return current value The generic bdev layer has a public API spdk_bdev_get_qd() but its value is the most recently measured value and it requires qd sampling to be enabled. Some bdev modules will want to wait until all bdev_ios are aborted by a reset. Unfortunately, spdk_bdev_get_qd() is not suitable for the custom bdev module. Furthermore, spdk_bdev_channel::io_outstanding is not accessible from bdev modules. Hence, add a new public API spdk_bdev_get_current_qd(). This function should be used only from the bdev module and it should be ensured that the bdev is not unregistered during execution. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ica30a8d8fe3264e28f0772a39bdf5f9ba72933e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12791 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-09 12:55:39 +00:00
Shuhei Matsumoto	1212b53fb8	bdev: Add spdk_bdev_for_each_bdev_io() to execute function for each bdev_io Some use cases want to abort every bdev_io submitted to the bdev by traversing the bdev channels. However, struct spdk_bdev_channel is private in lib/bdev/bdev.c. Hence, add a helper function spdk_bdev_for_each_bdev_io() to execute the function on the appropriate thread for every bdev_io submitted to the bdev. This function should be used only from the bdev module and it should be ensured that the bdev is not unregistered during execution. We keep this function as generic as possible because we may have other use cases in future. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic0209361bd1228ea8d4cb3241d0df07106be58d9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12751 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-09 12:55:39 +00:00
GangCao	3851a64f9f	Lib/Bdev: add the new utility function For the iostat change, add a new utility function: rpc_bdev_get_iostat_dump() Change-Id: I5883fc3eb8c73a0dc2bf41c7889100e0e492359a Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14418 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-08 07:23:07 +00:00
yidong0635	9e81535efe	reactor: Encapsulate a function _event_call. The former code had many repeated definitions, and only some of them asserted that the event was valid. To get accurate reports in debug mode and catch errors, encapsulate this in a common function and add the assert there. This will help find the right failure point. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I23d71eac6652c4104ceff80419f39634ac5ce395 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14335 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-08 07:17:34 +00:00
John Levon	654738ff45	lib/nvmf: small cleanup in vfio_user_qpair_delete_cb() We already define a convenient variable for the admin CQ: use it. Suggested-by: Alexis Lescouet <alexis.lescouet@nutanix.com> Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: If6570f30844a52113633bdb5f3543eec700f05d7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14391 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-07 07:04:44 +00:00
Kozlowski Mateusz	bcdedd1a2b	FTL: Add recovery iterations In order to fit inside the maximum memory usage limit, recovery needs to be split into multiple parts. During each iteration, part of L2P needs to be read, modified as necessary and saved back to the cache. This patch introduces the load/save steps, initialization of seq_id array and valid map recovery. The actual L2P recovery is done in the followup patch. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I8ceadc5ef280542a173d83b932a983d5d86604a1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13371 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Kozlowski Mateusz	8786f3b465	FTL: Open band recovery Adds recovery of open bands from P2L metadata region. Recovers the committed P2Ls and write pointers for them. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I943c53f55e653dd075035cef7ddba448c990be87 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13370 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Kozlowski Mateusz	0e0f3d9af2	FTL: Shared memory recovery Adds valid map and L2P restoration for shared memory (crash) recovery. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ia4e0cc6cd552ea61dca8985a26aa55c84a1233db Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13369 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Kozlowski Mateusz	764a3675a9	Ftl: Add band state recovery after dirty shutdown Recovers the open/close/free state of bands after shutdown, initializing necessary lists. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4a6bd4ed1013ce8d04f44d1772dcd1f0e4e365bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13368 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Artur Paszkiewicz	1738488e41	ftl: p2l checkpointing Since base device doesn't require VSS, FTL introduces a mechanism that will allow for recovering both the P2L and write pointer of open bands after a dirty shutdown. After writing 1MiB of data to a band, a 4KiB block describing the P2L will be persisted to cache device, effectively emulating VSS for the base device. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ic6be52dc09b237297a5cda3e752d6c038e98b70e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13367 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Artur Paszkiewicz	36049672a3	ftl: sequence id tracking Track the relative sequence of opening and closing bands and chunks. Necessary for detecting the most recent user data during dirty shutdown recovery. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I682030e58284d7b090667e4e5a9f4bbc7615708a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13366 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-07 00:08:34 +00:00
GangCao	b50af42b62	lib/virtio: return error if CMSG_FIRSTHDR returns NULL Fix issue: potential NULL pointer dereference Change-Id: I623096c49e7a75e66404666a2f502ba3209e3530 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14330 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-06 07:17:26 +00:00
Blachut, Bartosz	503835ee63	util: made hexlify and unhexlify functions public hexlify and unhexlify utils from vbdev_crypto.h have been moved so that they could be included and reused outside of vbdev_crypto module. Signed-off-by: Blachut, Bartosz <bartosz.blachut@intel.com> Change-Id: Ia074250176907f4803b84024239ecd4e9d8a5fc1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-06 07:17:13 +00:00
Ben Walker	34c48f1b3b	accel: Do not refer to the "framework" as "engine" The word engine was both used (interchangeably with module) to refer to the things that plug into the framework and to the framework itself. This patch eliminates all use of the word engine that meant the framework. It leaves uses of the word that meant "module". Change-Id: I6b9b50e2f045ac39f2a74d0152ee8d6269be4bd1 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13918 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-06 07:16:17 +00:00
Ben Walker	dd7140e627	accel: Rename spdk_accel_engine_module_finish to spdk_accel_module_finish Also move it into the internal header that defines the interface used by modules. Change-Id: I3aeb41e643f27a69556099cb8d166f64c9e5d67f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13917 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-09-06 07:16:17 +00:00
GangCao	0b9ba6a330	lib/vmd: return -1 if NVMe driver is not found Fix issue: potential NULL pointer dereference Change-Id: I23f90616661fdebaacb041bc9f47284231601136 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14329 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot	2022-09-05 12:50:06 +00:00
Shuhei Matsumoto	cdf61c2f22	nvme: Polls only the qpair if ctrlr is not fabrics when connecting synchronously For non-fabric controllers, the corresponding I/O qpairs are simply re-enabled at controller reset. This had an issue when I/O qpairs span multiple threads and a poll group is used. spdk_nvme_ctrlr_reconnect_poll_async() calls nvme_transport_ctrlr_connect_qpair() with qpair->async being false. Then nvme_transport_ctrlr_connect_qpair() calls spdk_nvme_poll_group_process_completions() until the qpair is connected. spdk_nvme_poll_group_process_completions() may poll other qpairs. This may cause I/O to complete on a wrong thread. For a PCIe controller, spdk_nvme_poll_group_process_completions() simply calls spdk_nvme_qpair_process_completions() for each qpair. Hence change nvme_transport_ctrlr_connect_qpair() to call spdk_nvme_qpair_process_completions() if the controller is non-fabrics. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ieb270c2fb154124021ef6d25577b817d05e5ca9e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14295 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-05 12:50:00 +00:00
Evgeniy Kochetov	2e7a7fe530	blob: Optimize copy-on-write flow for clusters backed by zeroes device Writing to unallocated cluster triggers copy-on-write sequence. If this cluster is backed by zeroes device we can skip the copy part. For a simple thin provisioned volume copy this shortcut is already implemented because `blob->parent_id == SPDK_BLOBID_INVALID`. But this will not work for thin provisioned volumes created from snapshot. In this case we need to traverse the whole stack of underlying `spdk_bs_dev` devices for specific cluster to check if it is zeroes backed. This patch adds `is_zeroes` operation to `spdk_bs_dev`. For zeroes device it always returns 'true', for real bdev (`blob_bs_dev`) always returns false, for another layer of `blob_bs_dev` does lba conversion and forwards to backing device. In blobstore's cluster copy flow we check if cluster is backed by zeroes device and skip copy part if it is. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I640773ac78f8f466b96e96a34c3a6c3c91f87dab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13446 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-05 12:49:46 +00:00
Konrad Sztyber	ab58ddf107	sock: make impl_name const char * in all functions There's no reason for this parameter to be non-const and it makes these functions a pain to use when you want to hardcode a specific sock implementation. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifed4426a02ab54cbd51c8a2051b1eac010f86db9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14303 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-05 12:49:28 +00:00
Shuhei Matsumoto	b3e1db32a3	nvmf/rdma: Ignore async_event if its qp_context is NULL If initiator and target run on the same application, and initiator uses SRQ, target may get async events for initiator, e.g., IBV_EVENT_QP_LAST_WQE_REACHED unexpectedly. The reason is initiator and target may use the same device simultaneously and only target polls async events. Target sets attr.qp_context to rqpair when creating QP, but initiator sets attr.qp_context to NULL when creating QP. Hence one simple fix is to ignore async events whose qp_context is NULL. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id9ead1934f0b2ad1e18b174d2df2f1bf9853f7e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14297 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	0e4b13dc53	nvme_rdma: Destroy qpair after it is disconnected and drained By the previous patches, a qpair is destroyed after it is actually disconnected. But after the qpair is destroyed, it is checked if drained by using rqpair->current_num_sends and rqpair->current_num_recvs. However, if the qpair is the last of a poller of a poll group, CQ is destroyed before checking if the qpair is drained. If CQ is destroyed, at least rqpair->current_num_recvs is not updated, and we may get one second timeout. This should be avoided. Hence, destroy the qpair after it is disconnected and drained. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibd6c83e8a3e7b6e11e9b45cee42669da6d42a621 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14278 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	1d58eb038b	nvme_rdma: Release poller from poll group when qpair is actually disconnected If the qpair being disconnected is the last one of a poller of a poll group, the CQ is destroyed and the poller is released before the qpair is actually disconnected. This patch destroys the CQ and releases the poller after the qpair is actually disconnected. One exception is when spdk_nvme_ctrlr_free_io_qpair() is called on a connected qpair. In this case, the qpair is removed from a poll group before the qpair is actually disconnected, so destroy the CQ and release the poller when the qpair is removed from the poll group. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Idf266bbb6dbb40f04ae6313db724fabf80865763 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14253 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	80d75fda06	nvme_rdma: Clean up releasing poller from poll group We have two cases to call nvme_rdma_poll_group_put_poller(). For consistency, make the two cases follow the same sequence. This will make the next patch easier. The next patch will release the poller from the poll group when the qpair is actually disconnected, whenever possible. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4178113d5277240e287e83a57e97cf32fd0f7457 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14252 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Kozlowski Mateusz	86619848ec	Ftl: Add clean restore management path Adds ability for FTL to startup after clean shutdown. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I2f1b83bb3eb1487b6665c95e76c48881e8899b16 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13364 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	d4b9f2c68b	FTL: Add metadata self test Adds additional debugging functionality - ability to check the validity of all L2P entries and valid map to check for inconsistencies after FTL startup. Since this is a very time consuming process, it's controlled by an environment variable and not executed during normal operations. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4766a1576c058f69fa047f45d2d8be6d0ad0b3cd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13363 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	cbd7ae6df7	FTL: Add metadata restore functionality Adds necessary functions for setting up the state of FTL components based on loaded in metadata. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I3a4c05230c877850e61d4f31d495d38121d27b3f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13362 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	55147295d7	FTL: Add L2P restore path Adds initialization code for L2P done after shutdown (both clean and dirty). Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I7a938b298467c96d68f40cb14c3171d1533e1a08 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	b5e2c59ad6	FTL: Add fast shutdown path Adds the ability to persist only the most important metadata. The rest is stored in shared memory. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4084c04ba09115a7a08ff66fd33552a2ec60d801 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13360 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	ef93cc38ee	FTL: Persist metadata on clean shutdown Add an extra step during FTL shutdown to save all metadata. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Idc2f77e15bbd02028548cc88355cd450175830e8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13359 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	b4b70e8303	FTL: Make L2P caching default mode Flat L2P (all L2P in memory) needs to be specifically built against, due to large memory consumption for big devices. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ib8906e10868455f88725b69b2b033b70a9f7256c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13358 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	94b7f8d82d	FTL: Add L2P cache eviction logic Adds eviction of least recently used pages from the L2P cache - dirty pages will be persisted. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ic646f7e9da777d077b5cb9b409c3f03ef05b1273 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13357 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	73f9b4f5fe	FTL: L2P cache page in logic Adds paging in from the cache device to memory. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I250009d12e9ed5ad52ee861ec5157cf983cf8cfc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13356 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	905fbf946c	ftl: Add L2P cache pin/unpin logic There is a set amount of pinned pages available. If exceeded they will be deferred and processed in the future, using eviction logic. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ic642a5870db009ccf57152dd8a4178a6b2098ee1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13355 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	db65602a39	FTL: Add l2p cache get/set logic This commit also introduces ranking pages, based on usage for determining the least used page to be evicted. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Iaf3812177b61376bb38aa209e4ba8576d784ffb5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13354 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	e7e5bc07b2	FTL: Add initial L2P cache logic L2P cache allows for partial storing of L2P in memory, paging in and out as necessary, lowering the total memory consumption. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I727fec9d2f0ade4ca73e872d62a2ec10cfdb0a88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13353 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-02 17:40:09 +00:00
Jim Harris	01cec2499f	vhost: add start_session vhost_blk_start and vhost_scsi_start are now just a single vhost_user_session_send_event() call, so make this more generic by adding a top-level start_session function. Now this function will do the vhost_user_session_send_event(), using the user_dev_backend's start_session function pointer. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia89ba15011e231f0474405fb7225e713dcc920bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14327 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-02 07:32:54 +00:00
Jim Harris	f8df19a49f	vhost: assign svdev from spdk thread context Currently scsi sets its svdev from the vhost thread context, while blk does it from the spdk thread context. Make scsi match what blk does, to make the code more consistent. This also will allow for an upcoming simplification. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I609513bc8e05b49dd9455f2f61ba0cedc35236e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14326 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-02 07:32:54 +00:00
tongkunkun	bb432b4eea	json: fix parsing json problems when json config is invalid. Treat the following cases as invalid when parsing json: 1. json content that is not enclosed in {} is parsed as invalid, e.g. "abc":"not enclosed in {}" 2. json content where 'subsystems' is not associated with an array reports an error and returns failure, e.g. {"subsystems":"123"} 3. other invalid json formats, e.g. duplicate keys, are reported and return failure. Added `spdk_json_find` API return errcode: EPROTOTYPE - json not enclosed in {}. A json config with content such as 1. "not enclosed in {}" 2. "'subsystems' not be an array" 3. "duplicate key in json", as well as some other invalid cases, will be regarded as an invalid json config, and the app will fail to start. Fixes #2599 Signed-off-by: tongkunkun <tongkunkun_yewu@cmss.chinamobile.com> Change-Id: I02574c9acd7671e336d4c589ebbff8ed21eb3681 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13754 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-02 07:32:21 +00:00
Konrad Sztyber	4cbd23e28b	vmd: method for forcing a rescan Added a new RPC, vmd_rescan, that forces the VMD driver to do a rescan of all devices behind the VMD. A device that was previously removed via spdk_vmd_remove_device() will be found again during vmd_rescan. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ide87eb44c1d6d524234820dc07c78ba5b8bcd3ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13958 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	052ea0baac	vmd: method for removing devices behind VMD Added new RPC, vmd_remove_device, that allows users to remove a PCI device managed by the VMD library simulating a hot-remove. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifb84818ce8d147d1d586b52590527e85fe9c10de Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13957 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	9a9aed4e7b	env/pci: use TAILQ_FOREACH_SAFE in pci_foreach_device() It'll make it possible to remove a PCI device from within the callback. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I4cea2207a29bb145aee968715e873076a8c0993c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13956 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	4c482a623b	vmd: don't create new buses in hotplug This doesn't work anyway and can cause creating duplicate bus objects if vmd_scan_single_bus() is called on a parent bus with previously allocated child buses. Also, while here, removed a few unused functions and flags in struct vmd_adapter. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic757070188157d9851f648acd074ca4943a14c39 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13955 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	ee1ab6f6be	vmd: increment dev_cnt once device is initialized This is done in order to avoid having to decrement this counter in case of a failure. Also, it makes the result valid for the few error cases when we didn't decrement it. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia944fb8b810ce69caa8db5bc7c941e0905c9d3bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13954 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	55bdd88506	env/pci: add detach() callback to pci_device_provider This makes it possible to notify other PCI device providers (VMD) that a PCI device is no longer used. The VMD driver will unhook that device and free any resources tied to it. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I42752afbb371a1d33972dac50fd679f68d05b597 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13887 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	690eebb447	vmd: extract removing devices to separate function Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Idc9c7d0e5d0ebce8278e089bcfe5b7f76b86c270 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13953 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	ffa9953a14	vmd: add attach_device() This patch implements the callback for attaching devices behind the VMD with a given PCI address. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I07cf92c94cc7e6d3c8e31af7a8615e9a4ca641bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13886 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	3b2097f313	vmd: use vmd_container.count when iterating over domains It makes it possible to call this function even if the VMD library wasn't initialized. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I3d0f4677c4a1189f9d8acf07baee50a4e2050459 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14260 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	4b08c07a62	env/pci: call driver callback in pci_hook_device Now that we have an attach_device() callback, the devices can be hooked during spdk_pci_device_attach(). With DPDK, driver->cb_fn() is called in pci_device_init(), so we need to do the same in spdk_pci_hook_device(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Iada8b83ce7592aa62561530192072a50ec3a904b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13884 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	ac8b65bdd2	vmd: extract freeing device resources to vmd_dev_free This allows freeing resources tied to a vmd_pci_device that isn't on the dev_list or wasn't hooked to the PCI driver. Also, use that function whenever a vmd_pci_device is freed instead of regular free(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifca177a7eb6d8180d6f2ee2a9d9e36d58810e8ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14259 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	3f4e968dab	vmd: add device to dev_list after initialization is complete That way, we don't have to do TAILQ_REMOVE if vmd_assign_base_addrs() fails. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id7a5df2093e4f9dfc95ee1fe415eb644c61bc971 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14258 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	35f8bd2a13	vmd: move pci_hook_device to vmd_dev_init_end_device Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I79c35600fc9a758bbd9d58393b7eb98c8ac82acc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14257 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	2dfd36772f	vmd: extract end device initialization It'll make it easier to reuse this part of the code. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id26f3f00abeeea6205df4f44689ffab1d367d777 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13885 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	b20f3678dd	env/pci: method for registering PCI device providers The primary motivation for this patch is to allow the VMD driver to be notified when a user wants to attach a device under a given BDF and to make it more similar to the regular PCI path. Currently, the way the VMD driver scans for the devices is a little bit different. The initial scan is done during initialization and there's a separate poller for checking hotplugs. Also, there's no device_attach() interface, so with hotplug poller disabled, it isn't possible to attach to a device not present in the initial scan, even if the BDF is known. This causes a few issues. First of all, the VMD library isn't notified when a device stops being used (i.e. user calls spdk_pci_device_detach()), so when such a device is hotremoved, it never gets unhooked. But we cannot simply add a spdk_pci_device.detach() callback, as this would break cases when user detaches a device (without hotremove) and then tries to reattach it again (via spdk_pci_device_attach()), as the VMD doesn't get notified about the device_attach() call. So, in order to resolve this, a device_attach() callback is added, which will notify the VMD library that the user wants to attach a device under a specific PCI address. Then, in subsequent patches, a spdk_pci_device_provider.detach_cb() callback is added to make sure that devices are unhooked once they're no longer used. Once that is done, it'll be also possible to get rid of the VMD hotplug poller by adding something like scan_cb() to spdk_pci_device_provider and call it from spdk_pci_enumerate(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I084a27dcd12455f0f841440b7692375e80d07e84 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13883 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Jim Harris	b90d7b5b43	nvme: add admin queue size quirk for Hyper-V Hyper-V NVMe SSD controllers require admin queue size to be even multiples of a page. Add quirk to adjust the admin queue size if user overrides the default value to something other than an even multiple. As part of this change, set the quirks earlier when constructing a pcie controller, so that the quirks value can be used in the generic nvme_ctrlr_construct() function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I417cd3cdc7e3ba512ec412f4876b0e0b7432341c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14220 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-01 08:31:46 +00:00
yidong0635	0447dca450	include: Remove the last line break. The last line doesn't need the line break, otherwise it will wrongly include the next line. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I06257b18d25c060b7c6bb00853fa44963fe5b439 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14241 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-09-01 08:30:24 +00:00
yidong0635	b813f998ea	nvme_pcie_common: Move group right before using. It is better not to cache a value, especially when there is an error return path. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I3b243a66f4db9af34bc2ea01bafdac33004be128 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13650 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-01 08:26:34 +00:00
Jim Harris	3d59045a2a	nvme: remove incorrect comment about spdk_nvme_ctrlr structs This was correct back when we only supported PCIe, but isn't in the newfangled world of fabrics and vfio-user. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I565edd2dab1eff862844585df8c25da508e4816d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14136 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-30 16:20:23 +00:00
Artur Paszkiewicz	8fad5718e1	ftl: validate band metadata in debug mode Adds a debug function, that scans the whole P2L of band, when it's getting closed. The P2L is compared against both L2P and valid map to check for any discrepancies. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ia4d7be65415e6af3752d676de69b6fdcb73effb4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13352 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	57cfab6808	ftl: use valid map to optimize compaction and reloc Utilize the valid map when picking physical blocks to compact/relocate, speeding up the process. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I860e3cf25a5907591e4f3043def67156fec8b0df Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13351 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	cea8dadecf	ftl: valid map Adds P2L validity map tracking - a bitmap marking all physical LBAs as containing valid (current) user data or not. A clear bit denotes the location has no valid data and may be skipped during relocation or compaction. A set bit means it may have valid data (it's still necessary to do the comparison against L2P). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I6a831a97b3080eb7c880d9c4feab41b523467885 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13350 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	1e904e2b75	ftl: fast startup Adding API for the bringup part of fast shutdown/startup. Adds shared memory utilization for necessary functions during initialization. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Iab2da102fd0ccaa56fbdb9b3c765be5eeefff145 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13349 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	0e33da4974	ftl: fast shutdown Adds API for fast shutdown - the ability for FTL to skip most of the metadata persists made during clean shutdown, and relying on their representation in shared memory instead. This allows for faster update of SPDK (or just FTL, assuming no metadata changes), with downtime reduction from 2-5 seconds to 500-1000 ms (for 14TiB+800GiB base and cache drives). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5999d31698a81512db8d5893eabee7b505c80d06 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13348 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	811a027e43	ftl: Add helper functions for creating md regions Helper functions which determine which md regions will be stored in shm. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I94cbfca66dfb56457a350874dbd1de63a2e07661 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14159 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	101a039923	ftl: p2l map on shm Stores P2L map of open bands in shared memory, allowing for faster recovery times from application crash. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I519441af05e4d0f57768835bf01c800556873c58 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13347 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	71a1762821	ftl: mempool support for durable format objects Allows for using shared memory in memory pools. Adds API for accessing such pools after dirty shutdown (claiming them, i.e. marking an entry as actively used; calling ftl_mempool_initialize_ext will reclaim all unused entries back to the pool). Also introduces API for accessing objects, since using direct pointers is not possible (as addresses may change between application startups). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5325b39d68aef7e231945cee9d92c925cab2fb2a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13346 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	f1b079b49f	ftl: bitmap on external memory Main use case is to allow for keeping it in shared memory, to speed up the recovery time after application crash. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I36b6b8331cd6483c5bd202e5f9103c351d705da8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13345 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	43a4d47a1c	FTL: Add relocation logic Relocation will 1. Read LBA map of a given band 2. Pin the LBAs 3. Issue writes of valid LBAs to the new location Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ie753a790e56a86bfa1e451b5eda78b88eeacd3cb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13344 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Jim Harris	ffa823557a	blob: add assert that cluster_sz > 0 Avoids divide-by-zero scanbuild warning on Fedora36. Fixes issue #2667. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib2793c793725e8bb8ba25fb779ffc14334929da0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14238 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-29 11:41:50 +00:00
Konrad Sztyber	475b86aa8d	print better errors when creating mempools from secondary process Multiprocess is only supported by a few libraries (e.g. NVMe driver). Other libraries that don't support it will often fail on mempool initialization when running as a secondary process, as the mempools are already created by the primary process. But the error messages are vague and don't indicate why this happened. So, this patch adds a check to see if a mempool exists after spdk_mempool_create() fails and prints an error message informing users that multiprocess is unsupported. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6f915a94266e64dda380e3b269424cc579372a10 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14234 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-29 11:41:32 +00:00
Shuhei Matsumoto	4a6f858872	nvme_rdma: Set REUSEADDR to reuse source address among multiple CM IDs When we specify source address for admin and I/O qpairs, rdma_resolve_addr() succeeded only for admin qpair and failed for all following I/O qpairs because rdma_resolve_addr() returned -EADDRINUSE. To reuse source address among multiple qpairs, set the REUSEADDR option for each CM ID before executing rdma_resolve_addr() if source address is specified. We may miss something. Even if rdma_set_option() fails, execute rdma_resolve_addr(). Fixes issue #2604 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If03f82d4499cf83c0e428a62e91c9d9e6aad28e0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14229 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-29 11:41:17 +00:00
Jonas Pfefferle	29977e8506	bdev: add additional io types in dump bdev info Add indication of support for compare, compare & write and abort in json bdev info dump. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: Ifc8dc1a1b180f08fcd9e9d58684eab1fd50356ff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14137 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-29 10:51:31 +00:00
Jim Harris	4300c62167	nvme: add spdk_nvme_ctrlr_disable_read_changed_ns_list_log_page() Commit `a119799b` ("test/nvme/aer: remove duplicated changed NS list log") changed the nvme driver to read the CHANGED_NS_LIST log page before calling the application's AER callback (previously it would read it after). Commit `b801af090` ("nvme: add disable_read_changed_ns_list_log_page") added a new ctrlr_opts member to allow the application to tell the driver to not read this log page, and will read the log page itself instead to clear the AEN. But we cannot add this option to the 22.01 LTS branch since it breaks the ABI. So adding this API here, which can then be backported manually to the 22.01 branch for LTS users that require it. Restoring the old behavior is not correct for applications that want to consume the CHANGED_NS_LIST log page contents itself to know which namespaces have changed. Even if the driver reads the log page after the application, that read could happen during a small window between when a namespace change event has occurred and the AEN has been sent to the host. The only safe way for the application to consume CHANGED_NS_LIST log page contents itself is to make sure the driver never issues such a log page request itself. Fixes issue #2647. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaeffe23dc7817c0c94441a36ed4d6f64a1f15a4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14134 Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-25 07:31:44 +00:00
liuqinfei	cd1b7ab0e7	nvmf: balance the get optimal poll group Fixes issue #2636. The existing allocation method (nvmf_rdma_get_optimal_poll_group()) simply traverses the poll groups and is unaware of link disconnections. A fairer method that allocates a poll group based on the number of real-time connections is implemented. Signed-off-by: liuqinfei <18138800392@163.com> Signed-off-by: luo rixin <luorixin@huawei.com> Change-Id: Ic1e6283e386dbb0dd6655bedebe26aeedb16c333 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14002 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-23 07:46:03 +00:00
Jonas Pfefferle	9e50d53b1a	bdev: add compare fall-back separate md support If the bdev does not natively support compare we use the fall-back which performs a read instead of a compare operation. We then compare the results of the read with the buffer provided by the user. In case the bdev has metadata, there are two options: 1) md is interleaved -> the md will be part of the data buffer allocated for the read and compared accordingly 2) md is separate -> currently we do not compare the metadata but just ignore it. This patch fixes 2) by comparing the md buffer after the read is done. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: I1018b8c02540bffcba69408eb283bdc8f06bb747 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14132 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-23 07:18:56 +00:00
Jonas Pfefferle	7ba89d1e48	bdev: set ext_opts=NULL if not used bdev_io is allocated from a memory pool and is not zeroed on reuse. So set bdev_io->u.bdev.ext_opts = NULL for io ops where it is not supported (yet) so we can test against it. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: Ia579ea6b0787cf62572ea3a6bf2251867602e952 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14056 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-08-23 07:18:56 +00:00
Kozlowski Mateusz	711759a029	FTL: Add reloc helper functions Adds functions for reading end metadata and initializing band reloc state. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I3d12c4a7edd36f0437bf10316114c83efe449f0f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13343 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-22 20:21:15 +00:00
Artur Paszkiewicz	f45c007512	ftl: superblock in shared memory Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I86e2cbf364ae3075aad2e09429754027df33eadf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13342 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-22 20:21:15 +00:00
Artur Paszkiewicz	818b9c053b	ftl: support for metadata on shared memory Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ibc259f61f0ef2aeadb0e5ac7230969e29d77f184 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13340 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-22 20:21:15 +00:00
Kozlowski Mateusz	19613862ae	FTL: Add free chunk logic After chunk is compacted it can be moved to the free state, able to be used for new user IO again. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I7f9c341169b171ee246c5aa161d74903b91bdc2f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13338 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-19 17:37:14 +00:00
Kozlowski Mateusz	71f20c9a74	FTL: Add compaction logic During compaction FTL moves valid user data from the nv cache drive to the bottom device. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ia200af39cec80014fac3a10f20d2859b10a81088 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13337 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-19 17:37:14 +00:00
Artur Paszkiewicz	1dadcd8786	ftl: ftl_rq helpers for compaction Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I614b29e7bc7f6db20b10395bc780ff633c497b59 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13336 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Kozlowski Mateusz	31cf633679	FTL: Add writer logic Add writer - tracks and manages band state transitions and write pointer as IO is issued to it. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I5f878dc15bc1c1ac84835f75fe440672fad541d5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13335 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Artur Paszkiewicz	0291b2845a	FTL: Add read path Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ib5bac109b59d5a21a7dad1f8e79b5da7633ffa9d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13334 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Kozlowski Mateusz	5af491a2ee	FTL: Add band state change functions Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I6a985f0b54a05fbebb8d65343cffaed7e47ed60d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13332 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Artur Paszkiewicz	7c9d3ea595	FTL: Add helper functions for IO to band regions Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I17443ba83afd0ccee0cb84e02329b150562cfd63 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13331 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Jim Harris	e36f0d363e	nvme/pcie, nvme/tcp: add cb_arg context tracepoint argument This allows mapping an nvme_request back to the nvme_bdev_io. This requires bumping up the max number of arguments per tracepoint. 5 was previously chosen as max since it exactly fit in 64 bytes (1 cacheline) when all arguments were stored as uint64_t, but now that we support uint32_t arguments we can afford extra arguments when some of them are uint32_t. I've bumped it to 8 so we can avoid having to touch this value multiple times if we find some cases where we need 7 or 8 args. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie2ef5e59d10549860b47542e68c1c34efa63047f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13995 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-19 11:06:31 +00:00
Jim Harris	54f1603954	bdev/nvme: add tracepoint support This will allow us to map spdk_bdev_io events to nvme_request events coming in a future patch. Since we pass the nvme_bdev_io to the nvme driver (not the spdk_bdev_io), we need to add tracepoints for the nvme_bdev_io so that spdk_trace can do the spdk_bdev_io->nvme_bdev_io->nvme_request mapping. An alternative would have been to pass the spdk_bdev_io as the cb_arg to the nvme driver, but that change seemed too invasive, and I think we will find other uses for the nvme_bdev_io events anyways. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id7519e689b01875093359f41a1ca2af912061a8b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13994 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-19 11:06:31 +00:00
Kozlowski Mateusz	81dfe157f3	FTL: Add calculation of device size Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I1f57ea699d7613f89270f9a47f044d1b85c72b60 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13330 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Kozlowski Mateusz	9dbdb02975	FTL: Initialize band metadata on startup Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ie27b3c5058ae6029262ad3861d5c64dd1ac5794f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13329 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Kozlowski Mateusz	88d1c3a69a	FTL: Add debug function for dumping band information Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I6edef1e8e822f8428dff5f5f5da2df923191f6fc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13328 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Kozlowski Mateusz	8c519d31bd	FTL: Add internal band state changes Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Icaecc4e77996919a23f70c1ffad15b783741fd5e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13327 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Artur Paszkiewicz	0f99700db9	ftl: user write limits Calculates general priorities and trigger points for writers (gc and compaction) dependent on number of free bands. GC will be started at SPDK_FTL_LIMIT_START level, while at SPDK_FTL_LIMIT_CRIT compaction needs to be stopped and only GC is allowed to work. This is done to make sure FTL doesn't run out of free bands and deadlock itself. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I1aab98503c2e79e97f8e4e9fb1257530fa9770e2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13326 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-18 19:09:50 +00:00
Artur Paszkiewicz	c7213b9c6d	FTL: Add band P2L map usage Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I7f526c80667ab548a2903689066ac76a8d8d3c53 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13325 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Artur Paszkiewicz	6448f33672	FTL: Add band structure and helper functions Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I986746a008e716705304906ab4f2bdabce0a84c3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13324 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Kozlowski Mateusz	1bbefed63b	FTL: Remove leftover ZNS code Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ica358805a69582d78e0d6c4f17b5a97ff38e44ca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14112 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
paul luse	c746637df8	lib/idxd: add some flag overrides when doing PMEM writes Per upcoming specification changes. Fixes: 2486 Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ic2534148a87b3dec7512f7b01384f484fee4c30f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13572 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: <wayne.gao@intel.com>	2022-08-18 18:47:02 +00:00
paul luse	61631dadb3	lib/idxd: Save device version during kernel and user initialization We'll likely need this eventually to address silicon version specific workarounds. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ie6957674113cf0c7b7d695b468c694668ebbf2bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13571 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-18 18:47:02 +00:00
Michal Berger	5f6ce57fb2	lib/ftl: Fix "unused function" error under clang This is targeted to fix the following error seen under clang: ftl_nv_cache.c:54:1: error: unused function 'nvc_data_blocks' [-Werror,-Wunused-function] nvc_data_blocks(struct ftl_nv_cache *nv_cache) ^ Signed-off-by: Michal Berger <michal.berger@intel.com> Change-Id: I11d52e76df5872819770d9468b6fa4ae54d8927c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14055 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <sebastian.brzezinka@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-18 10:10:09 +00:00
Jim Harris	0f068506ca	nvme: complete register_operations in the correct process In multi-process, we need to make sure we don't complete a register_operation in the wrong process. So save the pid in the nvme_register_completion structure when it is inserted into the STAILQ, then only complete operations where the pid matches. Fixes issue #2630. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I58c995237db486fecdd89d95e9e7a64379d0b0e5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13940 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-18 10:09:55 +00:00
Chen, You	43ebecdf60	lib/idxd: break spdk_idxd_process_events loop after processing DESC_PER_BATCH ops To prevent the processing of outstanding commands from starving the rest of the system Fixes: #2586 Signed-off-by: Chen, You <you.chen@intel.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Change-Id: I392db2359408cdef32cc1f46b76ecd94f0c3332c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13685 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 10:09:23 +00:00
Jim Harris	92335c01cf	event: make opts structures packed This ensures that when fields are added, that the size of the structure will change, ensuring different versions of the structure can be detected using sizeof. Adding -Wno-address-of-packed-member to Makefiles here, although we should consider disabling this warning globally in SPDK just like DPDK. Suppress abidiff errors around spdk_app_opts - structure size and offsets of all existing members were unchanged, so there is no ABI breakage here. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2249eddb604d7b44180cadb92ba30edcd946b9bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14091 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-08-18 10:08:40 +00:00
Jim Harris	b801af090a	nvme: add disable_read_changed_ns_list_log_page Similar to the disable_read_ana_log_page ctrlr_opt, this enables the application to tell the NVMe driver to not read the CHANGED_NS_LIST log page in response to a NS_ATTR_CHANGED AEN, and will do the read itself. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie447734187d4a4cb95ceef6e0131b640b8ba5984 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14088 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-08-18 10:08:40 +00:00
Jim Harris	c50cb569de	include: add STATIC_ASSERTS for opts structures with size member Various opts structures in SPDK have a size member, to enable ABI compatibility should fields be added in the future. But this requires the structures to be packed, otherwise for example a structure may be padded at the end, and a new field added may just consume some of that padding. So add STATIC_ASSERTS for the current sizes in this patch. Upcoming patches will make the structures packed and add in reserved fields to fill in holes. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9107d01d7b533f8542385a3538894bcd9f8c465d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14086 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Community-CI: Mellanox Build Bot	2022-08-18 10:08:40 +00:00
Jim Harris	af0d907604	bdev: wait_for_examine during spdk_bdev_finish. Wait for all bdevs to finish examination before proceeding with the spdk_bdev_finish shutdown logic. This ensures the bdev layer and its modules are not trying to examine bdevs after the bdev layer has reported it has shut down. Theoretically, bdev modules could all defer their fini callbacks until any outstanding examinations are complete, but it is WAY simpler to just use the existing spdK_bdev_wait_for_examine API instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: If90cc2a786281d348b82de8beb17ac37ba269c64 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13850 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 08:35:18 +00:00
Kozlowski Mateusz	e8c5ccf039	FTL: Add write path Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I41985617b5879bd3f4bf6d49d2a03eaffdd5ccb5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13322 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-18 08:34:47 +00:00
Kozlowski Mateusz	4a24a7b3e0	FTL: Add helper L2P set/get functions for nv_cache Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I61ed4434283c21d7dc62b70898f920e66b595a4f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13321 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-18 08:34:47 +00:00
Kozlowski Mateusz	506315a651	FTL: Initialize nv_cache metadata on startup Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ie1a60ec8d1e05b1e4dec85a7187cffad24496460 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13320 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-18 08:34:47 +00:00
Kozlowski Mateusz	ece0e0eee7	FTL: Add state machine for chunks Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I1f208cb9fdb84b8a39d08746d81dde0c59df25c4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13319 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-18 08:34:47 +00:00
John Levon	2eaae37ded	nvmf/vfio-user: complete queue deletion on correct thread If the queue was on another poll group, we need to send a message back to the admin CQ's thread to post the completion from the correct context. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I997987d5d6b822a1a5124f54fc29ce5d7f03190d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14057 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Swapnil Ingle <swapnil.ingle@nutanix.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2022-08-17 07:19:03 +00:00
Shuhei Matsumoto	e93ba047ac	nvme: Restore complete_abort_queued_reqs() call into process_completions() spdk_nvme_qpair_process_completions() had called always _nvme_qpair_complete_abort_queued_reqs() at its end. However, the call was accidentally removed by a commit `59c8bb527b` to fix an issue. By this removal, aborting request was not completed for some error cases. Fix the degradation by restoring the call. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0099eb7a008f823e1282576504423cdc248911d7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14045 Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-08-17 07:17:17 +00:00
Jim Harris	43a3984c6c	configure: add CONFIG_HAVE_ARC4RANDOM glibc 2.36 added arc4random(), which breaks the SPDK iSCSI build since it always implements its own arc4random() implementation on non-FreeBSD OS (meaning always on Linux). So instead add a CONFIG_HAVE_ARC4RANDOM and remove the explicit FreeBSD dependency - this will work on FreeBSD as well as Linux with >= glibc 2.36. Also fix check_format.sh, so that it does not enforce spdk/stdinc.h checks on code snippets in the configure file. Fixes issue #2637. Reported-by: Karl Bonde Torp <k.torp@samsung.com> Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iab9da8ae30d62a56869530846372ffddf7138eed Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14028 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-16 10:25:17 +00:00
Ziv Hirsch	eda407a6f0	nvme: add support for verify command Signed-off-by: Ziv Hirsch <zivhirsch13@gmail.com> Change-Id: Ic9859d5078d9568bb28eefcf8fb70a7fc222ee15 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13928 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-08-16 10:25:01 +00:00
LiadOz	5c3360ce1f	nvme/nvme_tcp: Check for timeout when socket connection fails Fixes #2614 Signed-off-by: LiadOz <liadozil@gmail.com> Change-Id: Ie4942d52b1af42ed859338fc59f3e29dcd59e68c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13891 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-16 10:23:26 +00:00
Jim Harris	a6b7e1839d	nvme/tcp: add trace points for cmd submit/complete Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iad56e7a96cf0210bcf54825c8bcc39af9366b72c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13992 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2022-08-16 10:23:10 +00:00
Jim Harris	9396cb9a94	nvme/tcp: simplify outstanding_reqs handling Avoid putting a new req on the outstanding_reqs TAILQ until we know it can be initialized successfully. This avoids adding to the TAILQ only to remove it just after. This also simplifies the outstanding_reqs TAILQ handling, since reqs are now only inserted and removed in one place each. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5ccc41c14abd541ffcf2a602246e0671386840c7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13991 Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-16 10:23:10 +00:00
Jim Harris	b0396da090	nvme/pcie: rename trace object to NVME_PCIE_REQ We were using "TR" for "tracker" previously, but we are tracing the nvme_requests, not nvme_trackers, so use the right names for the trace object to avoid confusion. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia3886d74b162138c2cdbe0017224d9494f74966c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13990 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-16 10:23:10 +00:00
Jim Harris	97661e86b7	nvme/pcie: add cpl status to PCIE_COMPLETE trace event Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I51e87f0f23b84956f96ab2efc62ad99a8d74cd4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13989 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-16 10:23:10 +00:00
Jim Harris	7b05b29d48	nvme/pcie: use 4-byte trace arguments where possible Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I24c3fd545cadc403ac1f3589c6242a08a7a2f517 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14000 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-16 10:23:10 +00:00
Jim Harris	cdb0726b95	trace, trace_parser: support 4-byte INT/PTR arguments This allows us to pack more arguments into the same amount of shared memory, for cases where those arguments don't need a full 8 bytes. 1- and 2-byte sizes not supported for now, variadic args do automatic promotion of types smaller than int, so support for those may need more work. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iec56cfa851b408a77d7995126d2111b0bf3d7f95 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13999 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:23:10 +00:00
Ben Walker	081f080a49	accel: Rename public header to accel.h The public interface of lib/accel is now include/spdk/accel.h Change-Id: Id94f623a494eb1b524b060f4413f633073ea7466 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13916 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Ben Walker	10ab81b83e	accel: Hide the definition of accel_io_channel from modules They no longer need to see the definition of this structure. Change-Id: I3e3bb5942a50da22e0bf34aa8c10b9d812f42d2f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13915 Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-16 10:22:55 +00:00
Ben Walker	df892eed67	accel: Return correct values for .get_ctx_size() This expects the full size of the task for each module. This only worked because the software module returned the right size. Change-Id: I481cfad8b4bb9c3748301bdacd90e7f44fd2d878 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13913 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Ben Walker	678025c914	accel: Move the software module to its own file This will help keep the mixing of this code with the framework code to a minimum. Change-Id: I5937ebd84f32068456cdf2b9e03d3e194c760a87 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13912 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-16 10:22:55 +00:00
Ben Walker	6074b3a3f9	accel: Move definitions not needed by modules to accel_internal.h spdk_internal/accel_engine.h will become the API for accel modules. Move anything in there that a module doesn't need to see into lib/accel/accel_internal.h Some of the software fallback definitions didn't even need to be in a header and were moved to accel_engine.c Change-Id: Idb8b12b1c0c1de3d462b906e3df3ba9ee8f830b8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13911 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-08-16 10:22:55 +00:00
Ben Walker	aa156d53be	accel: Combine spdk_accel_engine and spdk_accel_module_if These are 1:1 - they do not need to be separate objects. Change-Id: I74ab52863f911d9be59ce98e1525302b5bd40846 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13910 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Changpeng Liu	91eb10b4be	nvmf/vfio-user: only kick controller when in interrupt mode There is a race condition if we call this function in the polling mode when running with multi-cores, same as other places where the function is called, we only kick controller in interrupt mode, also in `vfio_user_ctrlr_intr`, `ctrlr->sqs[0]` may be set to NULL after the controller poll call, so return earlier for this case. Change-Id: I03a7b74a39c966a2b8be610bca0e492d902f6b08 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13696 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-15 19:10:54 +00:00
Boris Glimcher	6212597bda	sock/ssl: Add psk_key and psk_identity options to spdk_sock_impl_opts Note, this change only sets defaults for the ID/KEY, more specific use cases like NVMe/TCP may set the ID and KEY on a per connection basis. Also simplify PSK identity string, that isn't NVMe focused. NVMe libraries using this will need to construct more complicated identity strings and pass them to the sock layer. Example: rpc.py sock_impl_set_options -i ssl --psk-key 4321DEADBEEF1234 rpc.py sock_impl_set_options -i ssl --psk-identity psk.spdk.io ./build/examples/perf --psk-key 4321DEADBEEF1234 --psk-identity psk.spdk.io ./build/examples/hello_sock --psk-key 4321DEADBEEF1234 --psk-identity psk.spdk.io Change-Id: I1cb5b0b706bdeafbccbc71f8320bc8e2961cbb55 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13759 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-08-15 16:52:28 +00:00
Jim Harris	e1eee2ebac	event: always fail if invalid tpoint mask is specified There were a few error cases that weren't caught as errors, meaning the "invalid tpoint mask" string wouldn't be printed. But also change it so that when an invalid tpoint mask is specified, it fails spdk_app_start and causes the application to exit, rather than just silently stopping processing of the tpoint group mask string. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I567a4eee740559914e089dca7d7c3865ed9ce35b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13986 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2022-08-12 14:18:05 +00:00
Kozlowski Mateusz	a68a12a478	FTL: Initial nv cache structure Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ie40cc25ed9bf28976a5ae6d6a67491f438152fca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13317 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 09:01:40 +00:00
Artur Paszkiewicz	b16bdc6d49	FTL: Add L2P API and flat L2P implementation Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ifadc8c6986164584235ee6a67799025fa7703b5d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13315 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 09:01:40 +00:00
Artur Paszkiewicz	b6eecb21e5	FTL: Add address store/load utils Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ibac2fe36ba0f3038915075d7105e2d6119b8ed20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13314 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 09:01:40 +00:00
Changpeng Liu	d0cf194bc4	nvmf/vfio-user: only relisten accept poller when connection is disconnected For the case `nvmf_subsystem_remove_listener` RPC call when VM is connected, we should not relisten the accept poller, because the endpoint will be destroyed for this case. Change-Id: Icf8299f26a3bbf7bbe44fd01edb4ede344692d25 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13548 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-08-12 09:00:50 +00:00
Shuhei Matsumoto	227d83e2fa	nvme: Use spdk_nvme_ctrlr_is_fabrics() to update ioccsz ioccsz is specific for fabrics. spdk_nvme_ctrlr_is_fabrics() returns true for custom fabrics transport. Hence we can use spdk_nvme_ctrlr_is_fabrics() safely in nvme_ctrlr_update_nvmf_ioccsz(). Before this change, in the unit tests, ctrlr->trid.trtype was set to zero at initialization. After this change, spdk_nvme_ctrlr_is_fabrics() should return false for most cases. SPDK_NVME_TRANSPORT_PCIE did not work. Hence, initialize ctrlr->trid.trtype by SPDK_NVME_TRANSPORT_CUSTOM_FABRICS instead. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4bedcab4a9f2876c1c9463ff10ad0966754f1713 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13948 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-12 08:59:52 +00:00
Shuhei Matsumoto	cd65512d08	nvme_rdma: Fix assertion for rqpair->current_num_sends/recvs assert() in nvme_rdma_queue_recv_wr() was wrong and assert() in nvme_rdma_cq_process_completions() was missing. This patch fixes both. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ied057d75dbfd9e54ce3c3671355b9ec3acad7ff5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13597 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	41bb31a36d	nvme_rdma: Replace rdma_dereg_mr() by ibv_dereg_mr() rdma_reg_msgs() was replaced by ibv_reg_mr() recently to support persistent PD per RDMA device. The difference between rdma_dereg_mr() and ibv_dereg_mr() is only return value and errno. For consistency, replace rdma_dereg_mr() by ibv_dereg_mr(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I55e0743690e74f9510863bfa122a75d0632dce4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13949 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	d75daea532	nvme_rdma: Use persistent protection domain for qpair Get a PD for the device from the PD pool managed by the RDMA provider when creating a QP, and put the PD when destroying the PD. By this change, PD is managed completely by the RDMA provider or the hooks. nvme_rdma_ctrlr::pd was added long time ago but is not referenced anywhere. Remove nvme_rdma_ctrlr::pd for cleanup and clarification. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If8dc8ad011eed70149012128bd1b33f1a8b7b90b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13770 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	b5f360c425	rdma: Maintain per device PD which is persistent across reconnect SPDK NVMe RDMA initiator used the default PD per RDMA device. Default PD may be changed when all QPs for the RDMA device are destroyed and created again. For multipath, the RDMA zero copy feature require the PD per RDMA device to be persistent when all QPs for the RDMA device are destroyed and created again. Maintain such persistent PDs in this patch. Add two APIs, spdk_rdma_get_pd() and spdk_rdma_put_pd(). In each call of two APIs, synchronize RDMA device list with rdma_get_devices(). Context may be deleted anytime by rdma-core. To avoid such deletion, hold the returned array by rdma_get_devices(). RDMA device has PD, context, ref. count, and removed flag. If context is missing in rdma_get_devices(), set the removed flag to true. Then, if the ref count becomes zero, free the PD and the RDMA device. The ref. count of a RDMA device is incremented when spdk_rdma_get_pd() is called and decremented when spdk_rdma_put_pd() is called. To simplify synchronization, sort the returned array by rdma_get_devices(). To avoid resource leakage, add destructor function and free all PDs and related data at termination. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I093cb4ec2c7d8432642edfbffa270797ccf3e715 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13769 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-12 08:59:43 +00:00
Shuhei Matsumoto	a26d74173e	nvme: Increase major SO version An earlier commit added ctrlr_ready into struct spdk_nvme_transport_ops. However, the major SO version was not increased. Fixes: `3dd0bc9e` (nvme: Add transport controller ready step) Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id903634f9aaf5bdaa62fd30e92a4fb39a985b86f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13981 Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-11 19:16:32 +00:00
Ben Walker	32ee475a5e	accel: SPDK_ACCEL_MODULE_REGISTER is now passed the module Instead of passing each parameter to create a module, just have the user make one and pass it in. This makes it easier to change the module definition later. Change-Id: I3a29f59432a6f0773129d7b210fbc011175b2252 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13909 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-10 11:00:17 +00:00
paul luse	3d5fd5a59f	lib/idxd: fix bugs with IAA decompression descriptor construction Masked by how accel_perf was doing decomp verification which is changed in the next few patches and verifies these fixes. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Icb03fc169bf8d2f05396addaf1db56d6de1827d1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13038 Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-10 07:25:29 +00:00
paul luse	efa33b8590	lib/accel: add RPC to enable override of opcode to engine Docs explaining how to use the RPC are in the next patch in the series. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I7dab8fdbeb90cdfde8b3e916ed6d19930ad36e66 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12848 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-10 07:25:29 +00:00
王亚飞/Yafei WANG	6fcd7a79e9	lib/vhost: Add submit_inflight_desc() to cpu usage statistic submit_inflight_desc() actually does some meaningful work, so when it really processes tasks, the poller should return BUSY status. Signed-off-by: YafeiWangAlice <yafei.wang@samsung.com> Change-Id: I2103cea6d28e8b355dad4ddd603d917f10e44c08 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13486 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-09 11:48:51 +00:00
Jim Harris	d33497d3f4	thread: defer unregistration when for_each ops exist There may be for_each operations outstanding on an io_device when it is unregistered. Currently we just return when this happens, not unregistering the device but also not notifying the caller that this happened (since it returns void, and the callback function doesn't have a status parameter either). We could just push this responsibility to the caller, to never unregister an io_device if it knows it has outstanding for_each calls waiting to complete. But I think we can simplify this a lot by just handling this inside of the thread library. Mark that the device is pending unregistration, and unregister it (on the original requesting thread!) when the for_each count gets back to zero. Also don't allow any new for_each operations either. Note this requires a bit of refactoring on the thread unit tests, since it is now possible to unregister a device with outstanding for_each operations. Fixes issue #2631. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I978f2d99a25e65d2b7d71ce9b1926a79a6c94263 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13890 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-09 08:27:15 +00:00
Jim Harris	821e673c1d	thread: set non-zero status when spdk_for_each_channel fails If spdk_for_each_channel is called on a device that doesn't exist, we need to set a non-zero status (-ENODEV in this case) to the completion function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I898ad5ea499fb6087338b621b2befcadd6a05414 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13889 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-09 08:27:15 +00:00
GangCao	0c980660b6	FTL: move assert earlier before accessing the field Fix Klockwork issue. Change-Id: Iae9557c152a745549c8963f4f0510ae829f871a4 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13860 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-08 13:48:28 +00:00
Jim Harris	5d651b31c9	event: require opts->name is set This has been implicitly required before, and all in-tree apps (except accel_perf) set it, so let's explicitly require it. This name gets used for things like the shm name for spdk trace event file. While here, add the name for accel_perf. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I47a22466550d4b31bacafee58d30339b4f22f4b4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13876 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michal Berger <michal.berger@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-05 10:48:42 +00:00
vagrant	fa09c9ac9b	lib/blob: Fix deleting a snapshot after decoupling it from its parent When decoupling a snapshot from its parent, we need to clear its parent. So we should remove the xattr BLOB_SNAPSHOT. Modifying the xattrs of a blob only works if its metadata are not in read-only mode. By default, a snapshot is in read-only mode so this operation fails. When we later want to delete the snapshot, we will see that it has a parent, so we will try to remove the snapshot from its parent's clones list. This will cause a crash. The fix is to remove the BLOB_SNAPSHOT xattr only after setting the snapshot's metadata in rw mode. Signed-off-by: Alex Michon <amichon@kalrayinc.com> Change-Id: I80efa6dd3dcb38b4c738ce2e97aa2ffc281cefa5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13723 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-05 08:30:30 +00:00
yidong0635	5de98ef86c	reactor: Check error return for spdk_thread_lib_init_ext. DPDK may use this NULL pointer to access its member, and then hit a segmentation fault. But we only need it to exit or report a normal error. To minimize the impact and prevent this, check the error return from spdk_thread_lib_init_ext in spdk_reactors_init, which covers the case of a NULL mempool. spdk_thread_lib_init_ext contains thread_lib_init, which reports an error for a failed mempool. Thus, the code will return and will not cause a segmentation fault. Fixes issue #2620. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I63369fdaeb231196e8f8daa826eb5b057ed829b8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13842 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michal Berger <michal.berger@intel.com>	2022-08-05 08:29:53 +00:00
yidong0635	c9eb502a4a	thread: Return -ENOMEM for no mempool. Here should return -ENOMEM, and other places are changed. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: Id81cd7485733e66d996b1501061a45f774f2b51a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13863 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michal Berger <michal.berger@intel.com>	2022-08-05 08:29:53 +00:00
Changpeng Liu	a02483e67c	module/bdev_virtio_scsi: use the correct `num_queues` value Parameter `num_queues` for virtio_scsi PCI device means maximum number of queues, it SHOULD include the `eventq` and `controlq`, while for `vhost_user` RPC call, it means the number of IO queues, so here we use it as `max_queues` in lib/virtio and add the fixed number queues for `vhost_user` SCSI device. Also fix `vhost_fuzz` to get `num_queues` earlier than negotiate the feature bits. Change-Id: I41b3da5e4b4dc37127befd414226ea6eafcd9ad0 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13791 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-04 11:24:40 +00:00
Changpeng Liu	84ac072e2c	lib/virtio: eliminate `virtio_user_backend_ops` The `vhost_user` socket transport APIs are already in the same source file, so just call the function directly. No code logic changes in this commit. Change-Id: If471b9b0166d43591fb8614e95a17473c964e87c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13789 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-04 11:24:40 +00:00
Changpeng Liu	4e6e7eafef	lib/virtio: merge vhost_user.c and virtio_user.c into one source file Similar with NVMe device driver, here `virtio` is a specification abstraction library, `pci` and `vhost_user` are transports layer, here we merge vhost_user.c and virtio_user.c into one new source file `virtio_vhost_user.c` so that to make code more clear. No logic change, just code movement in this commit. Change-Id: I8e3e5c477e7c45e6eeebad240b8cc3c9476b86d1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13788 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-04 11:24:40 +00:00
Michal Berger	be1883d978	lib/ftl: Fix -Wunused-function under clang Builds under clang fail with the following: utils/ftl_mempool.c:45:1: error: unused function 'is_element_valid' [-Werror,-Wunused-function] is_element_valid(struct ftl_mempool *mpool, void *element) ^ 1 error generated. Signed-off-by: Michal Berger <michal.berger@intel.com> Change-Id: Ic776f3f226e9ea6ed9d0bbd0a3d8e2a0661e0d11 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13844 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Kamil Godzwon <kamilx.godzwon@intel.com> Community-CI: Mellanox Build Bot	2022-08-04 07:30:59 +00:00
Changpeng Liu	c60cb1a8be	lib/nvmf: don't raise assertion in `nvmf_tgt_destroy_cb` While running into this function, even the subsystem can't be destroyed due to error subsystem state, it's better to continue the execution. Continue to fix #2590, QEMU is stuck for the failure case, and nvmf target should process such error because it may support other normal subsystems at the same time. Change-Id: Ib05e24996378b52070d2b760519f476f9b2d7e76 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13839 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-04 07:29:27 +00:00
Evgeniy Kochetov	3dd0bc9e09	nvme: Add transport controller ready step This step allows custom transports to perform extra actions or checks at controller initialization and fail initialization if required. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ic7cadae5398a35903917ceace3828f4371be63a3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12631 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-04 07:29:03 +00:00
paul luse	44cbea402e	lib/accel: Add new RPC to get valid engine info. The RPC provides a list of initialized engine names along with that engine's supported operations. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I59f9e5cb7aa51a6193f0bd2ec31e543a56c12f17 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13745 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-03 07:53:21 +00:00
paul luse	c6ecddcc1c	lib/accel: add RPC to get list of OP codes per module In prep for upcoming patch that will provide an RPC to override and automatic assignment of an op code to an engine. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I17d4b962fb376a77f97ce051a513679d0fba698e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12829 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-03 07:53:21 +00:00
Bin Yang	1cddc829ff	lib/scsi: use bkdr hash to avoid naa identifier collision fix: If the first six characters of two SCSI LUNs' names are the same, such as aaaaaa0 and aaaaaa1, their NAA identifiers are also the same. Signed-off-by: Bin Yang <bin.yang@jaguarmicro.com> Change-Id: I4e0541b372a0e20e95e0a24d62dd3d85b7abe230 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13824 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-03 07:43:10 +00:00
yidong0635	5daedcc22e	ftl: Fix compile warning. Issue reports: spdk/lib/ftl/ftl_io.c:121:9: warning: variable ‘result’ set but not used [-Wunused-but-set-variable] 121 \| size_t result; \| ^~~~~~ Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I9ed7daea97f311ca33c4116299be32f275e33fbb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13838 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-03 07:42:38 +00:00
Artur Paszkiewicz	c6880a3974	ftl: superblock Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ic8ca0cd3bf3621ad5604e83ed24c0fa59a83f124 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13313 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	f725ca81cf	ftl: vss emulation Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: If22933834d640606526dec9185e849df367ac789 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13311 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	884980d0aa	ftl: vss null buffer workaround Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I94ea399ed30fae29f92b4216eaa9209c02b3478b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13310 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	d67952540f	ftl: wrappers for nv cache bdev io Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I33d99ae35e2bd853a16a6d20336632a955679197 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13309 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Kozlowski Mateusz	950cce2c9e	FTL: Add ftl_io unit tests Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I1052fbfe7516b12e50e4bc4b3b7a4f452f56349f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13308 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	d9a631ad4c	FTL: Add io channel logic Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ibf6bfbabc03c43e7938531c4fe08fde01ce02a3f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13307 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Kozlowski Mateusz	e7a03e68e1	FTL: Add ftl_rq Used for internal metadata update requests Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I742ef2030070e7e159d4354159fb596b98742631 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13306 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	06790f25f1	FTL: Add ftl_io helper structure Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I608b500c6fb14efe289932955f508484f2ecf1b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13305 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-02 19:00:42 +00:00
Kozlowski Mateusz	b431640409	FTL: Add ftl mempools Optimized for single thread utilization Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I56602a3d85e0cd47256c8f3e5d7a3f0ed4e38743 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13303 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-02 19:00:42 +00:00
Shuhei Matsumoto	4f2f1aa9c5	nvme_rdma: Use pd of rdma_qp instead of default pd of cm_id This is another preparation to create and use ibv_context and pd. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Id594fa1ccb2daf535b1aaaef0a397bda2ec98578 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13710 Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-02 07:39:41 +00:00
Shuhei Matsumoto	a3a51453b8	nvme_rdma: Pass pd instead of cm_id to nvme_rdma_reg_mr() The following patches will create and use ibv_context and pd explicitly instead of using default ibv_context and pd created by rdmacm. As a preparation, pass pd instead of cm_id to nvme_rdma_reg_mr(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ifdcd18ed363b8ba4a23a920bf3559237e38821c6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13599 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 07:39:41 +00:00
Apokleos	89c1e5bfc0	SPDK Interrupt Mode: Improve processing of reactor interrupt mode. In interrupt mode, the reactor doesn't correctly handle exited threads, leaving vhost threads in the reactor's lw_threads list. The fix cleans up a thread when its state becomes EXITED. Although the problem was exposed in v22.05.x, the master branch has it too. The fix works as follows: (1) When a thread's state becomes SPDK_THREAD_STATE_EXITED, the reactor first processes the thread's exit. (2) Then the reactor removes the lw_thread and destroys it. Fix issue: #2574 Signed-off-by: Apokleos <oliverliyn@gmail.com> Change-Id: I3ac2681d70480563db3a0aee4aff61c2f272b140 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13706 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 07:38:15 +00:00
Konrad Sztyber	a818564374	nvme: check CSTS.CFS when initializing ctrlrs If Controller Fatal Status (CFS) bit is set, there's no point in waiting for CSTS.RDY and the only way to move forward with the initialization is to perform a controller reset. This fixes issues with test/nvme/sw_hotplug.sh when running under qemu. It seems that during that test, qemu marks the emulated NVMe drives as fatal, so if we didn't check CSTS.CFS, the initialization would time out. Fixes #2201. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I97712debc80c3dd6199545d393c0f340f29d33b2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13820 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Michal Berger <michal.berger@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-08-02 07:37:04 +00:00
Changpeng Liu	78ca4b27c5	nvmf: don't raise assertion when destroying a non-inactive subsystem Sometimes a VM may get a kernel panic when starting, and SPDK CI will kill `nvmf_tgt` after 60 seconds. In that case SPDK used to raise an assertion when destroying the subsystem; here we remove this assertion and print the error information instead. CI will still mark this case as a failed case, and we can then use this error information to understand the erroneous subsystem state in vfio-user. Fix issue #2590. Change-Id: I20b16f9e96a566730eca2dd9ea165645bd9160bd Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13773 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 01:26:10 +00:00
Jacek Kalwas	8c35e1bd79	nvmf/rdma: remove lock on few transport ops it simplifies the code and improves readability sync is done on generic layer Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: If324039ef2b26fa8ba026b80ec49788a7b2dcaa3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13667 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-29 16:34:41 +00:00
Jacek Kalwas	c7ac84d1f2	nvmf/tcp: remove lock on few transport ops it simplifies the code and improves readability sync is done on generic layer Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I75753511842dff237bb27561e406c43ea68269fe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13666 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-29 16:34:41 +00:00
Jacek Kalwas	b17919d8bc	lib/nvmf: add lock around few transport ops this is a prework for further changes - with lock on generic layer lock on specific transport (e.g. tcp, rdma) layer becomes optional possibly it won't be required if some contract introduced on public interfaces (to be considered) - spdk_nvmf_poll_group_[create\|destroy] - spdk_nvmf_tgt_listen_ext, spdk_nvmf_tgt_stop_listen - spdk_nvmf_get_optimal_poll_group Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: Ib132babf9e7022342129fe795991cdad834e7f53 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13665 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-29 16:34:41 +00:00
Alexey Marchuk	7fbda6d916	nvmf/rdma: Fix data_wr_pool corruption When there are not enough transport buffers for a multi SGL request in state NEED_BUFFER, WRs received from the data_wr_pool are returned back to the pool. However, the rdma_req->data.wr.next pointer still points to the first WR from the pool. Usually it doesn't cause any problems since rdma_req will try to fill buffers again, but when a qpair is being destroyed, all requests are completed forcefully. When the request is completed and the data.wr.next pointer is not NULL, we'll try to put already released WRs into the pool one more time. That corrupts the pool and leads to undefined behavior. Fixes #2541 Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: I238b92eec132d8d845330362af6f335421177454 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13760 Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-28 07:12:43 +00:00
Changpeng Liu	673c8a65e1	nvme: remove `nvme_ctrlr_init_ana_log_page` function The function `nvme_ctrlr_init_ana_log_page` is exactly same with `nvme_ctrlr_update_ana_log_page`, so remove it. Change-Id: I1ad51635f47cf95cfa6de217e3b9144885c3b74e Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13652 Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-28 07:07:31 +00:00
MengjinWu	7fc2c0856e	lib/nvmf: use DSA to offload recv data digest crc32 in nvmf-TCP allow DSA device to async offload crc32 calculation in nvmf-TCP This patch can use DSA to accelerate crc32 computation, making the io performance of TCP paths using crc32 approach the io performance of TCP paths that do not use crc32. Using SLIST to minimize the performance drop. SLIST has less operation compared to TAILQ. Thinking about memory thrashing, we should use the same memory as possible to receive new PDUs. So, insert newly freed PDU in to head is better. The performance drop is within 1% compared to the TCP path without crc32. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I480eb8db25f0e730cb198ca5ec19dbe3b4d38440 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11708 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-27 08:39:35 +00:00
Evgeniy Kochetov	b46cfdb6c9	bdev/qos: Process whole QoS queue on every Qos poll We have to process whole QoS queue on each QoS poll. It may contain IOs that still have quota or not affected by QoS rules at all. If we stop on the first queued IO, all IOs will be limited by the minimum QoS rule even if they're not affected by this rule. Here is an example and simple test. We have a NVMf target with Null bdev and QoS configured with read bandwidth limited to 10 MB/s and write bandwidth limited to 100 MB/s. First we start nvme_perf with only write IOs and we see that reported bandwidth is 100 MB/s. Then we start another instance of nvme_perf with only read IOs. We see that reported read bandwidth is 10 MB/s but we also see that write bandwidth also drops to 10 MB/s. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I1edf09d038e65f873deef19ecb0f4bf9725a5ca5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13767 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-26 08:33:45 +00:00
Evgeniy Kochetov	f79af9ab19	bdev/qos: Factor out check for QoS limits into a helper function Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I139f78bb6fc2ccfce871c1f6a81dd1e25c51a826 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13766 Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-26 08:33:45 +00:00
Artur Paszkiewicz	c682c78992	FTL: Add FTL bdev module Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I8c40b96f0726d83d6a307e8b9a04b7c210b80255 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13299 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-25 07:19:29 +00:00
Artur Paszkiewicz	17147949cf	FTL: Add core thread poller Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I70158123d7b503c909b121d418abe31a8d441152 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13298 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-25 07:19:29 +00:00
Evgeniy Kochetov	3b26e2c594	nvme/rdma: Create poller and CQ on demand Original implementation creates pollers and CQs for all discovered devices at poll group creation. Device (ibv_context) that has no references, i.e. has no QPs, may be removed from the system and ibv_context may be closed by rdma_cm. In this case we will have a CQ that refers to closed ibv_context and it may crash in ibv_poll_cq. With this patch pollers are created on demand when we create the first QP for a device. When there are no more QPs on the poller, we destroy the poller. This also helps to avoid polling CQs that don't have any QPs attached. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I46dd2c8b9b2902168dba24e139c904f51bd1b101 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13692 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-22 07:27:22 +00:00
Changpeng Liu	c88345ab3d	nvme: apply `nvme_pcie_poll_group_get_stats` to vfio-user Both PCIE and VFIO-USER can use the same APIs to get IO queue pair statistic data, so merge them here. Change-Id: Iadf9ead2bd5abaf11d2ef5d1884acb67369f85bb Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13538 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-22 06:43:35 +00:00
Boris Glimcher	806744b7c8	sock: Add ktls and tls_version to spdk_sock_impl_opts Since `sock_impl_opts` was added to `sock_opts` Can remove `ktls` and `tls_version` from spdk_sock_opts Example: rpc.py sock_impl_set_options -i ssl --enable-ktls rpc.py sock_impl_set_options -i ssl --disable-ktls rpc.py sock_impl_set_options -i ssl --tls-version=12 ./build/examples/perf --enable-ktls ./build/examples/perf --disable-ktls ./build/examples/perf --tls-version=12 Check kTLS statistics here: /proc/net/tls_stat Change-Id: Icf7ee822bad92fda149710be77feb77fc8d4f163 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13510 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-07-22 06:41:39 +00:00
Nathan Claudel	d0038b70df	bdev: fix use-after-free in bdev registration When a bdev is registered, it is examined by the bdev modules before the bdev register event is notified. Examination may be asynchronous, e.g. when the bdev module has to perform I/O on the new bdev. This causes a race condition where the bdev might be destroyed while examination is not finished. Then, once all modules have signaled that examination is done, `bdev_register_finished` makes an invalid access to the freed bdev pointer. To fix this, defer the unregistration until the examine is completed by opening a descriptor on the bdev. Change-Id: I79a2faa96c1c893fc1cee645fbe31f689b03ea4a Signed-off-by: Nathan Claudel <nclaudel@kalray.eu> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13630 Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-22 06:40:10 +00:00
Artur Paszkiewicz	d974bad6fc	ftl: retrieve device’s attributes and configuration Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ide6bb24d2c1ec2b0da3f20ce4013a4cd6e339114 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13297 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-21 10:53:01 +00:00
Kozlowski Mateusz	92b5ebe014	FTL: Dump statistics on shutdown Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I9168af3cacffe9c4efae169b56df974a35bd4e2c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13296 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-21 10:53:01 +00:00
Kozlowski Mateusz	5022d8f372	FTL: Add first startup basic initialization flow Scrubbing nv cache region and finalizing initialization Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I654b9a92004042c773c3672a5f27b0f66200469d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13295 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-21 10:53:01 +00:00
Kozlowski Mateusz	b872e29fef	FTL: Add config checks during startup flow Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I336880ee263dbb23b613bd933c776f0b922412cc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13294 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-21 10:53:01 +00:00
Artur Paszkiewicz	7a7ac2af33	ftl: metadata utils and initialization Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Iaa9d7dd3f9e3147f0acfe18e23506a33fe3fd5a3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13293 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-21 10:53:01 +00:00
Artur Paszkiewicz	2b5bba569f	ftl: device layout abstraction Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5db829ffb9044179cdf0807c3aeeb3a850a276d2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13292 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-21 10:53:01 +00:00
Artur Paszkiewicz	e49ccfc820	ftl: device startup and shutdown Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ia4a3439a2ac79e24bc6dc11a5c131d44ecb2ad80 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-21 10:53:01 +00:00
Changpeng Liu	dbecab8da0	nvme/pcie: make `nvme_pcie_ctrlr_delete_io_qpair` call trace multi-process safe When a secondary process exits without deleting its allocated IO queue pair, a new secondary process will clean up the previously allocated queue pair, and a segmentation fault will happen because `stat` inside the IO queue pair data structure can't be accessed in this cleanup process. Fix issue #2565. Change-Id: I01a037642683901941b5268ac20d17b78b6c6350 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13537 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-21 08:11:50 +00:00
Jim Harris	ee8167e3e1	virtio: rename header to vhost_user_internal.h This avoids conflict with public vhost_user.h header file which can cause problems with abidiff. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia258b4621eda9f6855d46bbf67d8369a053a7116 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13732 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-21 07:03:26 +00:00
Jim Harris	fff345b145	vmd: rename internal header file to vmd_internal.h This avoids conflict with public vmd.h header which can cause problems with abidiff. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2f00c07226dec273516868f5fa9d7aa384378308 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13731 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-21 07:03:26 +00:00
Jim Harris	e70dc52ff2	blobfs: rename tree.h to cache_tree.h Avoids conflict with public tree.h that can cause problems with abidiff. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3ccf4c0198f7975d8ebbee57f50c52f9f2e96fc0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13730 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-21 07:03:26 +00:00
Jim Harris	79c9b1e5df	idxd: rename internal header file to idxd_internal.h This avoids confusion with the public idxd.h header file which causes problems with abidiff. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7910c93d9d95b99c82f4dfdba845e6804e1b6568 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13729 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-21 07:03:26 +00:00
Changpeng Liu	6abb4764ad	nvmf: check interleaved metadata size when adding NS When doing DIF insert and strip, we will reserve extra buffer in block device layer to save DIF information, so when attaching one device to Namespace, we will check the value first so that the reserved buffer size isn't smaller than metadata size. Change-Id: Id9272886ce8a7c01271279686730af4e5b24f35a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12188 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-07-19 12:31:59 +00:00
Changpeng Liu	a438718fc2	nvmf: don't report E2E Protection Capabilities to client When `dif_insert_or_strip` is enabled, NVMf library will do DIF insert and strip automatically, client isn't aware of it, when `dif_insert_or_strip` is disabled, we will report Namespace E2E Protection Capabilities to client, but we don't process PRACT and PRCHK flags in NVMf library, so here we don't report the capabilities to client and leave the use of extended LBA buffer to users. Change-Id: Ic610dc65fef210a7799c6ab693d89138b99e1193 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12165 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-19 12:31:59 +00:00
Konrad Sztyber	7f83361553	sock: add sock_impl_opts to sock_opts Some of the options in sock_impl_opts could be different for different sockets (even if they're using the same impl). However, outside of a few selected options (recv_buf_size, send_buf_size), there was no interface to change them. This change will allow users to change impl_opts on a per-socket basis when creating a socket. Sockets created through accept() inherit impl_opts from the listening socket. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I7628ae19def25cef6ffa62aa54bd34e446632579 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13661 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-07-19 09:35:03 +00:00
Konrad Sztyber	cfe2d76db2	sock: remove zerocopy_threshold from spdk_sock Now that spdk_sock has impl_opts, we no longer need to store a copy of impl_opts.zerocopy_threshold in spdk_sock. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I96377e330351b1afb57811578acfadf05d53f49c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13660 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-07-19 09:35:03 +00:00
Artur Paszkiewicz	b71eebd85a	ftl: mngt: pass status and ctx directly to completion cb Also remove ftl_mngt_get_status() because it won't be necessary now. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I335831cb1c506379e9afeb0bf87f1f873033073d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13668 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-18 12:54:48 +00:00
Jacek Kalwas	0adabc9eb1	lib/nvmf: rm nvmf_poll_group_add_transport from internal header it is impl and used only in nvmf.c source file Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Change-Id: I1236f9ede28c5da313d118ce73e1da64381379c5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13664 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-18 10:18:19 +00:00
GangCao	0b92da6c48	NVMe/TCP: explicitly initialize the cpl structure To fix the Klocwork issues. Change-Id: Ib9e490cd3f2140a1c2f86300979efd604054b972 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13695 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-18 10:16:29 +00:00
Alexey Marchuk	3512714b3f	nvme_fabrics: Lock mutex when processing set/get regs It is possible to get/set registers from any thread; during regs processing we poll the admin qpair to get a completion. At the same time, another thread can also poll the admin qpair, and that can lead to undefined behavior. This patch fixes an issue when bdev_nvme is configured with io_timeout. If the remote target becomes unresponsive (e.g. due to link down), an IO timeout occurs and bdev_nvme tries to get the csts register in timeout_cb. At the same time another thread can process the adminq, so we may have 2 simultaneous adminq polls. If the admin qpair is disconnecting at that time (RDMA transport), we may destroy resources twice from different threads. We don't see a problem with the set_regs function, but it won't be redundant to lock the mutex in set_regs as well. Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: I7ec3984d25d0249061005533d13b22315b44ddf2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13687 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-07-15 16:06:54 +00:00
Jim Harris	9cb5f885df	nvmf: decrement mgmt_io_outstanding for all AER cases We cannot count AERs as outstanding IO for purposes of subsystem pause, because we cannot expect them to be completed. Previously we would account for this in nvmf_ctrlr_async_event_request() by decrementing the counter, but this did not consider cases in the calling function (nvmf_ctrlr_process_admin_cmd) where an AER might complete with error before this function, resulting in the counter getting stuck indefinitely with a >0 value. Rather than adding a decrement in all of those error cases, do a single check at the beginning of nvmf_ctrlr_process_admin_cmd, and remove the one from nvmf_ctrlr_async_event_request. Fixes issue #2215. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ica969f116d80dfba0168369ff2fba9a4a42fc076 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13678 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-15 12:51:31 +00:00
Konrad Sztyber	3e47d7fa22	sock: asynchronous readv interface This patch defines a new function, spdk_sock_readv_async(), which allows the user to send a readv request and receive a callback once the supplied buffer is filled with data from the socket. It works similarly to asynchronous writes, but there can only be a single outstanding read request at a time. For now, the interface isn't implemented and any calls will return -ENOTSUP. Subsequent patches will add support for it in the uring module as well as emulation in the posix module. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I924e2cdade49ffa18be6390109dc7e65c2728087 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12170 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-14 09:45:54 +00:00
BinYang0	20cd4841f1	lib/nvmf: set low water mark in NVMe/TCP target to 1 byte In the NVMe/TCP target, the socket low water mark is set to sizeof(struct spdk_nvme_tcp_common_pdu_hdr), which is 8 bytes. In a corner-case test, a 4-byte data packet may be sent to the NVMe/TCP target; if no more data is sent to the same socket afterwards, the 4 bytes won't be read by the NVMe/TCP target qpair thread. Because of this, an IO request does not complete in the initiator. If the readv function is then called manually to read the 4 bytes for the PDU in the target, the IO request completes normally in the initiator. It seems the PDU might be split, and in that situation the IO request will not complete until a new IO request arrives. After setting the low water mark in the NVMe/TCP target to 1 byte, as the iSCSI target does, the issue disappears immediately. Signed-off-by: BinYang0 <bin.yang@jaguarmicro.com> Change-Id: I59d3d900f0b25632d786ef25ab096eabe43476bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13633 Reviewed-by: <chuanwei.ji@jaguarmicro.com> Reviewed-by: Qingmin Liu <qingmin.liu@jaguarmicro.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-13 07:28:52 +00:00
Jim Harris	f3dd8f7e0d	bdev: allow NULL md_buf for md-related APIs It is a nicer API to allow users to use an md-related API such as spdk_bdev_read_blocks_with_md passing md_buf as NULL to mean "don't read metadata". This avoids the need for an if-statement in the user's code to check if the md buffer is NULL before deciding which API needs to be called. This basically requires two changes: 1) only check if the metadata is separate for the bdev if the md_buf != NULL 2) do not fail if the buffer is specified but the md buffer is not (we only need to fail the case where the md buffer is specified but the data buffer is not) Note that spdk_bdev_readv/writev_blocks_ext was already allowing the metadata buffer to be NULL, but change those functions too to match the others on how we check if the data buffer isn't allocated. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I764cf49b9f573fccb19e73876a376fd231cc3580 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13612 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-11 22:41:35 +00:00
Konrad Sztyber	ee3ec3f7c2	vhost/rpc: return errno from virtio_blk_create_transport This will allow the code calling this RPC to interpret the error and check whether the transport already exists (-EEXIST) or some other error occurred. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I8c4af84763ddba908c59ff881b09834a439186a8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13577 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-07-11 07:41:22 +00:00
Changpeng Liu	ac31590b37	nvme: make `spdk_nvme_ctrlr_free_io_qpair` multi-process safe In the multi-process case, a process may call `spdk_nvme_ctrlr_free_io_qpair` on a foreign I/O qpair (i.e. one that this process did not create) when that qpair's process exits unexpectedly. The variable `qpair->poll_group` isn't multi-process safe, so we can't use it in `spdk_nvme_ctrlr_free_io_qpair` and related transport poll group APIs. Change-Id: Ic13a6a2c7d760477be5be5a56a45caa2b5518717 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13573 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-07-11 07:41:09 +00:00
MengjinWu	427cbb46a3	lib/nvmf: optimize the performance for h2c handle It will not find the h2c related reqs in the tailq now. We can get it from tqpair->reqs directly. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I25f0900e875b054d7617450477e9719e7a59aa18 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12861 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-11 07:40:53 +00:00
Thanos Makatos	caadae6c10	nvmf/vfio-user: briefly explain live migration Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I08d3aa90ec4f3e29bece820919bd39d20c74c6cf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11745 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-07-11 07:38:04 +00:00
Thanos Makatos	50a4875255	nvmf/vfio-user: ensure migration data are generated in stop-and-copy state Currently we initialize pending_bytes only in pre-copy state. This is pointless since we don't generate any migration data at this state, so if the vfio-user client reads migration data it will be garbage. Even worse, we don't re-initialize pending_bytes in stop-and-copy state, so if the vfio-user client reads the entire migration data in pre-copy state then there will be nothing left to read in the stop-and-copy state, which is where we actually produce the migration data. This results in corruption of the controller's state (e.g. queues). This patch ensures that migration data are available in the stop-and-copy state, by setting pending_bytes accordingly only in that state. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I0b215e64cd1f58f254e1079f06402d196f984099 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11718 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-07-11 07:38:04 +00:00
Thanos Makatos	db73e999e9	nvmf/vfio-user: migration: don't ignore unsupported ranges The read_data, write_data, and data_written migration callbacks assume that the migration data are accessed in one go. Until this is fixed, with this patch we ensure we don't ignore unsupported ranges. Change-Id: I640415858b8c374ffc9e487cd20f5130e0be9305 Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11717 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-11 07:38:04 +00:00
Artur Paszkiewicz	310836b9af	ftl: configuration structure and utils Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5364e09e0e501443ac6e99df5d814cc5fac397e8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13290 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-11 07:23:58 +00:00
Artur Paszkiewicz	293cdc484b	ftl: management framework Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I8261863e80a53a37183b0148d4a08fa97e208dda Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13289 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-07-11 07:23:58 +00:00
Artur Paszkiewicz	5140958837	ftl: utils Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I3476a7b11e3078da519beb39fd5f49b8e838a238 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13409 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-11 07:23:58 +00:00
Artur Paszkiewicz	769984a925	ftl: core structure Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5360b43348c8eb7bdfcbc394bb1ac83768dec49f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13408 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-11 07:23:58 +00:00
Wojciech Malikowski	81dca28884	ftl: remove deprecated ftl library Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Change-Id: I3ebb05be3f1b9864b238cb74f469b4fdf573cd0d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11120 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-11 07:23:58 +00:00
Jim Harris	a6704e454c	nvme: put rdma req in nvme_rdma_req_complete All of the callers immediately put the req right after the nvme_rdma_req_complete call, so just move the put into that function instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic370cf689850924e0c902a6071af8b3a7ed58c0b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13527 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	e415bf0033	nvme: add cmd/cpl printing for rdma errors This follows similar logic in the pcie and tcp completion paths, including omitting error messages when aborting aers by adding a print_on_error parameter to the completion function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id558d0af2cdd705dfb60abb842bd567a0949ccce Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13525 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	05dce1ee78	nvme: don't try to enable intel log pages on fabrics ctrlrs By default, the SPDK nvmf target reports vid==INTEL, which results in the SPDK nvme driver trying to enable Intel vendor-specific log page. Fix this by trying to enable those log pages only for PCIE transport controllers. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I78ebf365d4fa6295d1f610697266c3ead765988d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13524 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	988ce2ecaa	nvme: use assert for INTEL_VID check on log pages We can only get to this code path if the controller has vid==INTEL, so make that more clear by changing the check to an assert. Remove unit test that calls nvme_ctrlr_construct_intel_support_log_page_list() for a controller that is not VID==INTEL - this is no longer valid. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3b58451bc95992bf641e7452f0ac4c2bac9fe31c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13523 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	4a24f581d6	nvme: add cmd/cpl printing for tcp errors This follows similar logic in the pcie completion path, including omitting error messages when aborting aers by adding a print_on_error parameter to the completion function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I96df72280bb8fcbee3847fdc27f38e14a1bf3251 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13522 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	21d15cb043	nvme: cache values in nvme_tcp_req_complete nvme_tcp_req_complete_safe caches values on the request, so that we can free the request before completing it. This allows the recently completed req to get reused in full queue depth workloads, if the callback function submits a new I/O. So do this in nvme_tcp_req_complete as well, to make all of the completion paths identical. The paths that were calling nvme_tcp_req_complete previously are all non-fast-path, so the extra overhead is not important. This allows us to call nvme_tcp_req_complete from nvme_tcp_req_complete_safe to reduce code duplication, so do that in this patch as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I876cea5ea20aba8ccc57d179e63546a463a87b35 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13521 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Jim Harris	d1179a5801	nvme: put req in nvme_tcp_req_complete All callers of nvme_tcp_req_complete call nvme_tcp_req_put immediately afterwards, so move this call into nvme_tcp_req_complete. This will help enable some improvements in later patches. Note that nvme_tcp_req_complete_safe has this same functionality open coded right now, but that will get changed in the next patch. It calls nvme_tcp_req_put immediately after the TAILQ_REMOVE, so do that in nvme_tcp_req_complete as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I368122bc49a7f0772e3011e5427e3c43618380eb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13520 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-04 07:23:13 +00:00
Shuhei Matsumoto	4be6d30438	nvme: Add ctrlr_abort_queued_aborts() into qpair_abort_all_queued_reqs() nvme_qpair_abort_all_queued_reqs() aborts error injections, queued requests, aborting queued requests, and outstanding requests. (Aborting outstanding requests depends on transports.) However, it did not abort queued aborts. Include nvme_ctrlr_abort_queued_aborts() in nvme_qpair_abort_all_queued_reqs() so that it does what the name of the function really indicates. nvme_ctrlr_abort_queued_aborts() is already called in a few cases, but we do not care about the duplication. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I19102cc6603a72ce5c398a7947cb4d606b692991 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12849 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-06-30 07:51:23 +00:00
Boris Glimcher	7104c8332d	sock: Add ktls and tls_version to spdk_sock_opts See https://docs.kernel.org/networking/tls-offload.html See https://www.openssl.org/docs/man3.0/man3/SSL_set_options.html Change-Id: I2fb433cbc34061cb03e1591bb0b47063fcafc68c Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13071 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-30 07:44:26 +00:00
Changpeng Liu	7003bd0de3	nvmf/vfio-user: take endpoint as input parameter in quiesce_done QEMU may exit due to some exception, which means the socket connection may be disconnected at any time. For asynchronous callbacks, especially the subsystem pause/resume callbacks, the controller pointer may therefore become invalid before the callbacks are called. Fix #2530. Change-Id: I6d73597d75761e28844e83bfee7f8a446d85fa49 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12831 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-06-29 07:10:05 +00:00
GangCao	48ce2c978e	Bdev: remove the QD poller at the time of Bdev unregister Fix issue: #2561 The issue here is that in the bdev_set_qd_sampling_period RPC command, the QD sampling period has been set. Then later the related Desc is closed and in the bdev_close() function the QD sampling period is reset to 0. A new QD desc is added so that the QD sampling period update can be handled properly. Meanwhile, a new QD Poll In Progress flag is also added to indicate that there are ongoing QD sampling events, so that the Bdev unregister is handled in the proper way. The related test case and unit test are also updated for this change. Change-Id: Iac86c2c6447fe338c7480cf468897fc8f41f8741 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13016 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-06-28 18:13:02 +00:00
yupeng	1f0b8df7b0	blobstore: implement spdk_bs_grow and bdev_lvol_grow_lvstore RPC The bdev_lvol_grow_lvstore RPC will grow the lvstore size if the underlying bdev size is increased. It invokes spdk_bs_grow internally. spdk_bs_grow will extend the used_clusters bitmap. If there is not enough space reserved for the used_clusters bitmap, the API will fail. The reserved space was calculated according to the num_md_pages at blobstore creation time. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: If6e8c0794dbe4eaa7042acf5031de58138ce7bca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9730 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-06-28 17:55:43 +00:00
yupeng	88833020eb	blobstore: reserve space for growing blobstore Reserve space for used_cluster bitmap. The reserved space is calculated according to the num_md_pages. The reserved space would be used when the blobstore is extended in the future. Add the num_md_pages_per_cluster_ratio parameter to the bdev_lvol_create_lvstore API. Then calculate the num_md_pages according to the num_md_pages_per_cluster_ratio and bdev total size, then pass the num_md_pages to the blobstore. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: I61a28a3c931227e0fd3e1ef6b145fc18a3657751 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9517 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-06-28 17:55:43 +00:00
John Levon	022da3d276	nvmf/vfio-user: correct vfu_setup_log() usage SPDK was previously incorrectly requesting log levels such as LOG_NOTICE. Update libvfio-user so it is in fact supported, and check that setting up the callback actually worked. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I41c2a8cf683868c3c2e40470f78e1af3dba29de4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12839 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Swapnil Ingle <swapnil.ingle@nutanix.com>	2022-06-28 07:05:27 +00:00
John Levon	554b3b3fe9	nvmf/vfio-user: refactor out ctrlr_start() Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I71563037c15ebe0b76cfa603deea7576bad5c73c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com>	2022-06-28 07:05:27 +00:00
John Levon	6066e62ee6	nvmf/vfio-user: allow multiple reactors Update libvfio-user such that the SGL access APIs can be used concurrently. We are guaranteed that the guest memory remains mappable now that the vfio-user transport has implemented quiescence. This is currently only really useful (for a single controller) in poll mode, but shouldn't break interrupt mode, as we still ensure all a controller's queues are on the same poll group in that case. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I0988e731558e9bf63992026afc53abc66ec2a706 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12349 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-06-28 07:05:27 +00:00
Ben Walker	8dd1cd2104	check_format: For C files only, fix return type breaks In SPDK, declarations have the return type on the same line. Definitions have the return type on a separate line. Astyle has an option for enforcing this. Unfortunately, it seems to have two bugs: 1) It doesn't work correctly at all on C++ files. 2) It often fails on functions that return enums, or long type names Deal with 1) by adjusting the check_format.sh script to only tell astyle to fix return type line breaks for C files and not C++. Deal with 2) by adding a few typedefs to work around the problem. Change-Id: Idf28281466cab8411ce252d5f02ab384166790c6 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13437 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-06-27 09:33:48 +00:00
Kefu Chai	9644491dde	thread: let spdk_thread_create() accept const spdk_cpuset* the underlying spdk_cpuset_copy() takes `const spdk_cpuset` as the `src` parameter. there is no need to take non-const spdk_cpuset. hence, in this change, let's relax the requirement of the pointer type. Signed-off-by: Kefu Chai <tchaikov@gmail.com> Change-Id: I1f626c7fea45cf7250bf56b891bcba4a0f2a8917 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13443 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-24 07:22:53 +00:00
Shuhei Matsumoto	ceaa4ee0f7	nvme: Increment ctrlr->outstanding_aborts when aborting req in ctrlr->queued_aborts We had not incremented ctrlr->outstanding_aborts when aborting a request in the ctrlr->queued_aborts, and ctrlr->outstanding_aborts became negative. Fix the bug in this patch. Additionally add assert to check if ctrlr->outstanding_aborts is not negative. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I58090286f070ba854bdea87f0f8ecb7810890338 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13452 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-06-24 07:22:36 +00:00
John Levon	0a153e8af4	nvmf/vfio-user: only process SQs in VFIO_USER_CTRLR_RUNNING state While we are quiesced, we're not allowed to access guest memory via the SGL APIs. Refuse to process any commands unless we're in RUNNING state. We need to synchronize with each poll group via a message before we can call vfu_device_quiesced(), otherwise we could still be processing commands via nvmf_vfio_user_sq_poll(). For interrupt mode, we then might miss processing commands in a corresponding interrupt callback, so make sure we process them when we return to RUNNING state. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ieae5a9ae8d9de722e0bdf4bb8d61e7e678159f1f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12912 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-06-24 07:22:01 +00:00
John Levon	667809a4ae	nvmf/vfio-user: pause all I/O during quiesce An oversight meant that quiesce was in fact only pausing the admin queue, and not ensuring no I/O was ongoing. Fix this by passing the right flag to spdk_nvmf_subsystem_pause(). Change-Id: I930c616d1170ac0299339b04928da57f6a7489ab Signed-off-by: John Levon <john.levon@nutanix.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13441 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-06-24 07:22:01 +00:00
Ben Walker	761056f8d2	nvmf: Make spdk_nvmf_subsystem_pause accept the broadcast NSID If the broadcast NSID is supplied, every namespace is paused. Change-Id: I40cc3e04b5a75b731ab0c8946ed8146275cc8ee4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13394 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-06-24 07:22:01 +00:00
Changpeng Liu	619da10386	libvfio-user: compile shared library based on CONFIG_SHARED flag Fix #2556. Change-Id: I843dace8408d09bdb9222a37731a95732736bb78 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13041 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-06-23 07:43:26 +00:00
zhaoshushu.zss	e450b8e728	jsonrpc: add SOCK_CLOEXEC for spdk.sock fd Signed-off-by: zhaoshushu.zss <zhaoshushu.zss@alibaba-inc.com> Change-Id: I8e2cb7c686900f6c1873dd6a04d4255030505c5f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13063 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-21 07:54:52 +00:00
Balaji G	965d578f51	bdev: SPDK_BDEV_IO_STATUS_ABORTED is not handled in the Fuse command Fixes #2553 Signed-off-by: Balaji G <bg@hpe.com> Change-Id: I0c95ee22b06c40ec9d71f032b6fff4076b227d2b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13025 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-20 10:01:42 +00:00
yidong0635	dabca25646	util: Extract a common lib between iovs and buf. It's useful to add these APIs. spdk_copy_iovs_to_buf and spdk_copy_buf_to_iovs. It prepares that other ones can call these. We don't need to define them in static state repeatedly. And add corresponding unit tests. Change-Id: Ife40fec8d047a48af67b04e6c055e4932282abfb Signed-off-by: yidong0635 <dongx.yi@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12075 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-06-20 10:01:15 +00:00
John Levon	a8326f8155	nvmf/vfio-user: avoid doorbell reads in cq_is_full() Profiling data showed the dereference of the CQ head in cq_is_full() was a significant contributor to the CPU cost of post_completion(). Use the cached ->last_head value instead of a doorbell read every time. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ib8c92ce4fa79683950555d7b0c235449e457b844 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11848 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-06-20 10:01:01 +00:00
Sebastian Brzezinka	14ecc7787d	nvme: Complete pending register operations first Fully asynchronous ctrlr detach (`b6ecc3729`) introduced a register operation state machine that waits for operations to complete. When a controller failed to initialize, `nvme_ctrlr_fail` set the qpair state to `DISCONNECTED` immediately, so qpair completion processing never completed the register operations, which prevented async detach from exiting. Signed-off-by: Sebastian Brzezinka <sebastian.brzezinka@intel.com> Change-Id: I205c5157b8ea7b4535f98ff4052414310e421446 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12858 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-06-20 10:00:17 +00:00
Tomasz Zawadzki	f7e1f48a79	lib/event: do not set default scheduling period during init reactor_run() decides whether to start gather_metrics based on non-zero scheduler period. The default of 1 sec was set during initialization, in scheduler_subsystem_init(). This resulted in unnecessary operations each second, even if only 'static' scheduler is used. This patch moves setting default scheduling period to respective schedulers. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I953aee271a959b6314c8e83434c922dba9638de4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9492 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-06-20 09:56:09 +00:00
yidong0635	f77b678a14	lvol: encapsulate an exit_error_lvs_req function. Put the error lvol exit functions to exit_error_lvs_req. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I39c978e41417d8f4dc82641cb16e81d492958388 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11071 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-06-15 11:19:15 +00:00
Jun Zeng	a773ed9a9a	lib/vfio_user: change the calculation of bar_addr When calculating the bar_addr which is used to access SPARSE MMAP area, we should use the (offset - region->mmaps[i].offset) as the increment to get the valid access address. Signed-off-by: Jun Zeng <jun1.zeng@intel.com> Change-Id: Ie5d0c63cf572847d15dc92f0995fddecf35f1cdc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13021 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-15 08:10:32 +00:00
Tomasz Zawadzki	0f3ddc9c98	env/dpdk: skip build of DPDK based governors when missing rte_power rte_power was added to DPDK a long time ago, but some of the DPDK packages do not include it. For those cases just skip building components that depend on it. This change still allows use of the dynamic scheduler, since the dpdk_governor usage is optional. Fixes #2534 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ied88edc8d58aae07d1384c1c40203fc80b919d80 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12993 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-15 08:08:55 +00:00
Tomasz Zawadzki	ec1d6fb71e	env/dpdk: simplify checks for rte_power dpdk_governor and gscheduler use rte_power, which is only available on Linux and when DPDK env is used. Rather than repeat those checks in each mk or Makefile, added DPDK_POWER flag directly to DPDK env. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I438caad8d333a4df697a79aa45de2930cce71d23 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12992 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-06-15 08:08:55 +00:00
Tomasz Zawadzki	f961b32333	env/dpdk: add rte_net dependency to vhost rte_net is a dependency for both rte_vhost and rte_power. Next patch will simplify the checks to include rte_power, and keeping this dependency next to the component that directly depends on it will make it easier to understand. Since DPDK_LIB_LIST is sorted by the end of the env.mk, it shouldn't be a problem to include the rte_net twice. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If2bb2aa5d972148ca8143023657b0aec45306a08 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12991 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-06-15 08:08:55 +00:00
Richael Zhuang	4295661eb8	nvme_tcp: fix bug about qpair stuck in CONNECTING state When running perf test, sometimes after CONNECT req's resp was received and processed, the qpair still failed to change from state CONNECTING to CONNECTED. For when it goes to nvme_fabric_qpair_connect_poll -> nvme_wait_for_completion_robust_lock_timeout_poll to process the CONNECT req's resp, the req may have not been finished in sock_check_zcopy, although its resp has been received and processed, which means the tcp_req->ordering.bits.send_ack is still 0 and the status->done still is false. And after the req is completed in sock_check_zcopy, we need to poll this qpair again to make the state enter CONNECTED. And if icreq's resp received and processed before nvme_tcp_send_icreq_complete is called by _sock_check_zcopy, the qpair will be stuck in CONNECTING and it never proceed to send the CONNECT req. We also need to put it in pgroup->needs_poll to fix it. I can reproduce this bug with the following configuration. target: 16NVMe SSD, running on 20 cores; initiator: randread test using nvme perf with 32 cpu cores and zerocopy enabled. The error doesn't always occur. CONNECT failure is about 1 failure in ten with the following log. And icreq failure is less frequent with only target side's "keep alive timeout" log. Error reported in initiator side: Initialization complete. Launching workers. [2022-05-23 14:51:07.286794] nvme_qpair.c: 760:spdk_nvme_qpair_process_completions: ERROR: CQ transport error -6 (No such device or address) on qpair id 2 ERROR: unable to connect I/O qpair. ERROR: init_ns_worker_ctx() failed And target side shows: Disconnecting host from subsystem nqn.2016-06.io.spdk:cnode2 due to keep alive timeout Change-Id: Id72c2ffd615ab73c5fc67d36c3ff8b730cebcef7 Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12975 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-14 09:18:04 +00:00
Tomasz Zawadzki	e3377795c3	lib/nvmf: bump SO_VER due to addition of spdk_nvme_cdata_fuses Patch below changed the struct spdk_nvmf_ctrlr_data by inserting spdk_nvme_cdata_fuses. This affects large number of nvmf interfaces. (`cbfd581`) nvmf: Add NVMe fused operations to spdk_nvmf_ctrlr_data Unfortunately was missed due to lack of rebase after ABI update on CI machines. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ifd06d0ddbefe9ea6c9715adae9881d4606e34b44 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13013 Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Kamil Godzwon <kamilx.godzwon@intel.com> Reviewed-by: Michal Berger <michallinuxstuff@gmail.com> Community-CI: Mellanox Build Bot	2022-06-10 11:55:00 +00:00
Jim Harris	fb6f88cc88	env_dpdk: remove -rpath-link from ENV_LINKER_ARGS We already list the libraries with their explicit pathnames, so the -rpath-link serves no purpose. Our Makefile was actually specifying this option without an = sign - i.e: -Wl,-rpath-link /path/to/lib On the submitter's system, this resulted in an error: cc: Missing argument for -Wl,-rpath-link I have no idea why no one has ever run into this error, except for this one submitter. But removing the -rpath-link is the right thing to do here, since it is not needed - so do that rather than adding the = sign and continuing to figure out differences in -Wl option processing on these different systems.. Fixes issue #2540. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I4f6176e55701a5dea5b10bba1ad621250cb5cb51 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12984 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-10 07:56:16 +00:00
Alexis Lescouet	6c2ce12217	nvmf/vfio_user: Add an option to disable compare in vfio_user_transport_opts Add an option to stop nvmf transport advertising support for both the compare command and the fused compare_and_write operation in vfio_user transport. Signed-off-by: Alexis Lescouet <alexis.lescouet@nutanix.com> Change-Id: I3900218c0e9884f86a5c8698a030f8106b64f2f7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12919 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-10 07:54:33 +00:00
Alexis Lescouet	16c65744d8	nvmf: Make nvmf transport advertise compare Compare command, when not supported natively by the underlying bdev is emulated by the bdev layer. Change nvmf ctrlr data to advertise compare command by default. Signed-off-by: Alexis Lescouet <alexis.lescouet@nutanix.com> Change-Id: I88646e6c1a7d7a2829be813ff0241661724bd127 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12918 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-10 07:54:33 +00:00
Alexis Lescouet	cbfd581c13	nvmf: Add NVMe fused operations to spdk_nvmf_ctrlr_data Fused compare_and_write operation is always advertised by the nvmf transport. Add the fuses structure to spdk_nvmf_ctrlr_data to make advertising fused operation configurable. Signed-off-by: Alexis Lescouet <alexis.lescouet@nutanix.com> Change-Id: I73ee03dc8948f1d250cc0a8f0b8a3bde042a45e7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12917 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-10 07:54:33 +00:00
Jim Harris	ddf8904c51	Use SPDX license identifiers in remaining files. There are a few places we can replace existing license text with SPDX license identifiers, that did not match the auto-replacement script in the previous patch. Make those replacements manually in this patch instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I258720c03bc2153d1c56a8adf6357f224b911c0b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12913 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-09 07:35:12 +00:00
Jim Harris	488570ebd4	Replace most BSD 3-clause license text with SPDX identifier. Many open source projects have moved to using SPDX identifiers to specify license information, reducing the amount of boilerplate code in every source file. This patch replaces the bulk of SPDK .c, .cpp and Makefiles with the BSD-3-Clause identifier. Almost all of these files share the exact same license text, and this patch only modifies the files that contain the most common license text. There can be slight variations because the third clause contains company names - most say "Intel Corporation", but there are instances for Nvidia, Samsung, Eideticom and even "the copyright holder". Used a bash script to automate replacement of the license text with SPDX identifier which is checked into scripts/spdx.sh. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaa88ab5e92ea471691dc298cfe41ebfb5d169780 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12904 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: <qun.wan@intel.com>	2022-06-09 07:35:12 +00:00
John Levon	faa0ba86e0	nvmf/vfio-user: rename self_kick() Reflect that we are kicking the entire controller. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: If5723a5f485745ef0a2456942b6df1d54133815b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12665 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com>	2022-06-08 20:40:48 +00:00
John Levon	fa4ddd2d8c	nvmf/vfio-user: refactor set_ctrlr_intr_mode() This function is really about re-arming all SQs for a poll group; refactor to reflect this. This is necessary ground-work before we can support multiple reactors in vfio_user.c in interrupt mode. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I170fae2076fc80e742926cf448973671ac9e3bd9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12664 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-06-08 20:40:48 +00:00
Heinrich Schuchardt	72b5626d33	nvme/pcie: memory barrier for RISC-V Play it safe and add the same memory barrier in nvme_pcie_qpair_process_completions() as for ppc64. Signed-off-by: Heinrich Schuchardt <heinrich.schuchardt@canonical.com> Change-Id: I7079b4769d30106387ef4549495a72b7fea6a77a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12879 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-06-06 07:34:27 +00:00
MengjinWu	bb33310aa0	nvmf: remove XOR in nvme_tcp_pdu_calc_data_digest Prepare for the later patch, and make the later patch code clean Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I12b175c86a5245f38dc76fe2d3918ec4b30a475a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12830 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com>	2022-06-02 08:16:38 +00:00
MengjinWu	b5383af40a	lib/nvmf: another chance to calc crc32 when accel_tasks are used up If accel_tasks are used up, we should not directly return but give an another chance to calc it directly. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I983b65d7dfff0fea3974682e886d2dcf309cd2c3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12841 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com>	2022-06-02 08:16:38 +00:00
Konrad Sztyber	1f3bd08fa0	nvme/tcp: check tcp_req for NULL in pdu_payload_handle For a C2HTermReq PDU, there's no associated tcp_req, so we need to check it for NULL before dereferencing it. Also, while here, moved some of the assignments to the declarations to reduce the number of boilerplate lines. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Iac05ef0ba605e2f40d0026ad1b131c28d29f7314 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12845 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-06-01 08:56:58 +00:00
Konrad Sztyber	14adf7f70f	nvmf/tcp: unregister timeout poller in qpair_destroy The timeout poller might still be registered when a qpair is destroyed if we send C2HTermReq and then destroy the qpair before host terminates the connection. Fixes #2527 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I21acc147fdba3aaac66b0c6ed54e155195fe9816 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12844 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-06-01 08:56:58 +00:00
John Levon	a6b0cd0c05	nvmf/vfio-user: fix set_ctrlr_intr_mode() queue check We need to check that the given SQ is active (i.e. is currently mapped into the process), so make the check the same as that in poll_group_poll(). Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ibd3babd7520f611f596f3bab15765fa13b4d6b99 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12663 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-31 07:08:28 +00:00
John Levon	3c481cc271	nvmf/vfio-user: rename vfio_user_handle_intr() This is better represented under the name vfio_user_ctrlr_intr(). Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ic3fa0fe238fd8ce4930bfd3e34b9dbc1b935aa6e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12662 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-31 07:08:28 +00:00
John Levon	c47c93fac4	nvmf/vfio-user: avoid handle_suppressed_irq() if not needed There's a non-zero cost to looking up the CQ; only call this function in the poll path if we need to. While here, we'll streamline the ctrlr-level check. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I6bf123f759fcd856196f6613cb6c7d0219550136 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12660 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Rui Chang <rui.chang@arm.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-31 07:08:28 +00:00
Jim Harris	64df311eba	nvme: add KEYED_DATA_BLOCK to sgl_types This SGL type was missed in the original commit that added the pretty printing. Fixes: `4d9ab1e9a1` ("nvme: pretty print dptr") Reported-by: Ramanjaneya Burugula <burugula@gmail.com> Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ibc655db4e65009071f39f55f691c94a094cea0bc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12705 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-25 07:43:03 +00:00
Or Gerlitz	9b5dabff7f	nvme/rdma: Always use spdk allocation scheme Use the conventional huge-pages based spdk allocation scheme for the initiator data-structures unconditionally. Change-Id: I5baee7614e3ac9b5497b3d771dfddfbaa7fdf65b Signed-off-by: Or Gerlitz <ogerlitz@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12687 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-25 07:42:47 +00:00
wanghailiangx	31513614a7	some remaining rpc: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I7d3804a84851753992af4a3a37b60dc6de0d22cb Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12780 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-24 07:17:42 +00:00
wanghailiangx	f552937ef4	trace module: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: Ie50c7421f991ad0474edba0e0f339180f7afee00 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12778 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-24 07:16:58 +00:00
paul luse	d780d23532	accel: add ISAL based compress/decompress to accel SW module Note that without ISAL or IAA a call to compress/decompress will fail. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Id20a08f6e61b9a51fa4a1634a5314e6ca18fa504 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12310 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 19:10:46 +00:00
paul luse	997433f918	lib/accel: fix bug in completing SW engine tasks Previously an error would have been completed twice. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ief645fc30754433398531c50357876e92804e4b5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12789 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 19:10:46 +00:00
paul luse	fe044f6988	lib/idxd: add raw request for low level testing Provide an interface to allow the caller to provide a properly formatted descriptor. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I5c397761f556361040ec962d61169459150b6494 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12703 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 19:09:50 +00:00
wanghailiangx	000ee408e7	app module: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: Ia09368e426a83274d9c7fc90ed8b0391f4d0b67c Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12774 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 18:58:46 +00:00
Tomasz Zawadzki	b727e804d6	vhost: add virtio_blk abstraction This patch adds virtio_blk abstraction for custom transports, with 'vhost_user_blk' being the first one used. Added spdk_virtio_blk_transport_ops describing the necessary callbacks to be implemented by each transport. Please use SPDK_VIRTIO_BLK_TRANSPORT_REGISTER to register the transport. Transports can use virtio_blk_process_request() to process the incoming I/O from their queues. virtio_blk_create_transport RPC was added to create one of the registered transports, possibly with custom JSON arguments. Added 'transport' argument to vhost_create_blk_controller RPC, to specify which transport should create the controller. By default the vhost_user_blk transport is used. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ic9d93a6e0f483796eb56b7174a678e41a6ea4808 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9540 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 17:31:16 +00:00
wanghailiangx	81d3cc1b5a	subsystem module: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I56dbaef56ff793e48441219e07dc6b02dda0b470 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12777 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 16:16:55 +00:00
wanghailiangx	23d832a04c	vhost: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I33a497fb134320f13606b66ad55fc7b068d011d9 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12716 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-23 16:13:42 +00:00
wanghailiangx	405be3b794	notify module: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I477da05a42ca607fbad4d178aa541726197d7c83 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12775 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 16:13:30 +00:00
paul luse	b483811ff1	modules/accel/iaa: add IAA accel_fw module And associated RPC to enable. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I06785bcd8b8957293ad41d13bab556fe62f29fd5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12765 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-23 16:04:57 +00:00
paul luse	0ff560ea3b	lib/idxd: Add compress/decompress support to low level lib Accel module coming in next patch... Add support for compress and decompress. The low level IDXD library supports both DSA and IAA hardware. There are separate modules for DSA and IAA. accel_perf patch follows. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I55014122f6555f80985c11d49a54eddc5d51c337 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12292 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-23 16:04:57 +00:00
paul luse	4d9a00d791	lib/idxd: factor out batch allocation in spdk_idxd_get_channel() In prep for upcoming IAA additions. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Id89124a3c3d5b1bcfd4d805ff4ee84a2f64f8a4a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12767 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
paul luse	ecaa8e1000	lib/idxd: prepare some plumbing for adding IAA Misc internal IDXD changes needed to support the upcoming addition of IAA. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Idb180088af545b174ed33a4f8ee113e58640477f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12764 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
paul luse	bf234f4202	pci/accel/idxd: add PCI IDs for IAA device Intel Analytics Accelerator, this is the start of the patches to add this support to accel_fw. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I7410710697d2947355181616b35cc8ab78bbddfe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11985 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
paul luse	76fae14976	lib/idxd: update names from IDXD->DSA where it makes sense In prep for upcoming addition of IAA. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I47c5880aac37da9a38d6af6e52a51cefbfec91b9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12762 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
paul luse	87060965b3	include/env: update PCI ID names from IDXD->DSA In prep for adding IAA support Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I7eed173f9f907aa1c010d12db87b8dc27cd7495b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12760 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
Tomasz Zawadzki	aec00435a0	lib/vhost: separate out bdev events handling Generic vhost-blk layer is responsible for opening the bdev attached to the vhost controller. This patch adds vhost_user_bdev_event_cb() that is called for vhost_user backend. This function will be replaced with a callback to particular virtio-blk transport. Having this piped through to the transports, allows to adjust their behavior upon bdev events. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id73f5131b6e57f0354e970d0bce92716ec69985b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12132 Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-20 19:40:56 +00:00
Tomasz Zawadzki	34c7b6c18c	lib/vhost: expose spdk_bdev to virtio_blk transports There are configuration details that are needed to configure the virtio device based on spdk_bdev properties. Please see vhost_blk_get_config() for an example of vhost_user retrieving properties of bdev such as size or supported I/O type. Rather than trying to anticipate every such property, add vhost_blk_get_bdev() to allow usage of bdev API directly. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I757f96e2fb0861c97b07ce279a7c04c77a2ad11f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12373 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-20 19:40:56 +00:00
Ben Walker	7ac08606e9	idxd: Support running without an IOMMU This requires handling vtophys entries that cross page boundaries. Fixes #2316 Change-Id: I9e9aafc1612bc89375c783bcf91bd04ab523ab9e Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12217 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-05-20 19:40:47 +00:00
Alexey Marchuk	619b4dba8a	lib/reduce: Check if user's buffer crosses huge page boundary If compress driver doesn't support SGL input or output then we need to copy user's buffers into reduce internal buffers. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0c07243a5b668d0e0adcc153e5b573f59c26ab64 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12281 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-20 17:39:57 +00:00
Alexey Marchuk	b86e85f56f	lib/reduce: Properly allocate comp/decomp buffers Reduce library allocates one big chunk of memory and then splits it between requests. The problem is that a chunk of memory assigned to a request may cross huge page boundary and if compress driver doesn't support SGL input or output, the operation will fail. To avoid this problem, align buffer start on 2MiB and check each chunk of memory if it crosses huge page boundary. Fixes issue #2454 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie730b8ba928f27a43bde1222b6c18d29b797575a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12249 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-20 17:39:57 +00:00
Jonas Pfefferle	192e64bcc5	bdev: spdk_bdev_ext_io_opts missing size check ext_io_opts uses the size member to allow backwards compatibility however currently we only check if it is below or equal the current size of the opts struct and that it is not 0. size is only used when we copy opts because of split or push/pull. This patch introduces size checks to allow safe access to e.g. metadata and memory domain pointers of the user provided opts pointer. The minimum size of the struct passed is now the size of the initial version of spdk_bdev_ext_io_opts. To not introduce additional checks when opts are consumed by a bdev module we now always copy if the size is smaller than the current opts struct size. When introducing new members to opts additional checks might be needed if those are directly accessed through the passed pointer or bdev_io->internal.ext_opts. Change-Id: Ibd181a5840a3d5022018a9f61403df961ffd6e1d Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12550 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-20 15:55:50 +00:00
Tomasz Zawadzki	e0516095fc	event/vhost: separate vhost subsystem to scsi and blk Separate out SCSI and BLK vhost subsystems to later add virtio_blk transport abstraction. This allows for further changes to the vhost_blk, not affecting vhost_scsi. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id1ecfeafeb936809a479a43c321e13f75cb3d5ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9539 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-05-20 09:20:07 +00:00
Shuhei Matsumoto	51e897c42e	nvme: Abort queued requests even if they are children of a large I/O An iterator function nvme_request_add_abort() covers not only a small I/O request but also children of a large I/O. However nvme_qpair_abort_queued_reqs_with_cbarg() did not check the latter. Check if cmd_cb_arg matches not only req->cb_arg but also req->parent_cb_arg. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I015e29b0a8f58920b9a13081330a94f9dd976a45 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12557 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-20 09:19:07 +00:00
Changpeng Liu	9df0f59444	nvmf/vfio-user: add check for property_access Only 4-byte or 8-byte accesses to NVMe registers are valid, add the check here. Fix issue #2495. Change-Id: I63b6e16a156f6eba17f397ec9d1a447e6a80b4da Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12643 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-20 09:18:41 +00:00
Shuhei Matsumoto	09c7c76876	nvme: Set I/O qpairs to failed only if reset is synchronous For PCIe transport, we need to stop any activity of the controller before deleting I/O qpair resource in a controller reset sequence. However, we set I/O qpairs to failed before disabling a controller. In the NVMe bdev module, this caused disconnected qpair callback to delete I/O qpairs before disabling the controller. Hence, change the code slightly to set I/O qpairs to failed only if reset is synchronous to keep backward compatibility. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ica71aad0a1dabce45616dfdfff5f11b07131bbd1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12736 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-20 09:17:28 +00:00
Changpeng Liu	7791085984	nvmf/vfio-user: add comments for endpoint and controller Change-Id: Idde0f9c9cea6c26b7e65c8699b2e5f120d759d7f Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11825 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-05-19 21:12:02 +00:00
Changpeng Liu	673859cd0d	nvmf/vfio-user: remove unnecessary controller SHN state check The CSTS.SHN is changed only in shutting down the controller, nvmf library already ensure that all the outstanding IOs will be flushed before that, so we can remove this check here. Change-Id: Ib93a256e986b7b2ec1da0fc7992feb3a02c1d657 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11674 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-19 21:12:02 +00:00
Changpeng Liu	63f6d50b5b	nvmf/vfio-user: resume the subsystem in source VM After finishing migration in source VM, the subsystem is in PAUSED state, the controller is dead for the source VM, we will destroy the controller when disconnecting socket, but after that, we should RESUME the subsystem so that it can be ready for the next new client. Fix issue #2363. Change-Id: Icf0999b9085cebe8be4c8783e1a43bb13d4f7987 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11422 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-19 21:12:02 +00:00
Changpeng Liu	8ab0975b2a	nvmf/vfio-user: set controller state in one thread The completion callback of `spdk_nvmf_subsystem_resume` and `spdk_nvmf_subsystem_pause` can run in different core other than the `vfu_ctx` core, this may lead to race condition when changing controller's state. Here we use a thread message to change it in the same thread context. Change-Id: I53d139adcca6ff72a3b91a2a931f1239f3271fa9 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12558 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-19 08:24:34 +00:00
Shuhei Matsumoto	64454afb7c	nvme: disconnect() sets and reconnect_async() clears prepare_for_reset The following patches swap the ordering of destroying I/O qpairs and disconnecting a controller for PCIe transport. prepare_for_reset is a flag for PCIe transport. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3009de9fea089fc93ecf87adba42e85c9a77e715 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12582 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-19 08:23:57 +00:00
Shuhei Matsumoto	736b9da034	nvme: Do Controller Level Reset when disconnecting adminq for PCIe As described in the previous patches, we need to delete all I/O SQ/CQs before aborting trackers when disconnecting a controller. The following patches reorder the operations. This patch changes adminq disconnection to initiate a Controller Level Reset and adminq completion processes it if ctrlr->is_disconnecting is true. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I64f06bae2ce8a9127124029fd042db0028198e3c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12560 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-19 08:23:57 +00:00
Ben Walker	813756e75e	nvme: Do not abort transport commands when disconnecting a qpair Make this a transport-level decision instead. TCP and RDMA do want to abort, but PCIe cannot because these commands may still be receiving DMA operations from the device. Change-Id: I305acddc3819c903eb3217e8f710d4216d0b3931 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11509 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2022-05-19 08:23:57 +00:00
Shuhei Matsumoto	bdc9fa832d	nvme: Add helper functions to do a Controller Level Reset (Set CC.EN to 0) Previously, we did not do any Controller Level Reset when disconnecting the admin qpair. However, for PCIe transport, we need to stop any activity of the controller, i.e., delete all I/O SQ and CQs before nvme_transport_ctrlr_disconnect_qpair_done() calls nvme_transport_qpair_abort_reqs() (i.e., nvme_pcie_qpair_abort_trackers()). Otherwise, some corruption may occur because completed I/Os may still be in progress on the NVMe device. Not to change any public API, nvme_pcie_ctrlr_disconnect_qpair() is a convenient place to initiate a Controller Level Reset because it is called from spdk_nvme_ctrlr_disconnect(). Then nvme_pcie_qpair_process_completions() can process it until completion. However, necessary functions are not accessible from PCIe transport. This patch adds two helper functions and guards us from some undesirable behaviors because it was not assumed that nvme_ctrlr_process_init() is called from the completion context and ends in the middle of transition. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3d986e94ba71b83beeff7e75cf92033b5fa6f075 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12559 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-19 08:23:57 +00:00
Alexey Marchuk	1eca87c39c	blobstore: Preallocate md_page for new cluster When a new cluster is added to a thin provisioned blob, md_page is allocated to update extents in base dev. This memory allocation reduces performance, it can take 250usec - 1 msec on ARM platform. Since we may have only 1 outstanding cluster allocation per io_channel, we can preallocate md_page on each channel and remove dynamic memory allocation. With this change blob_write_extent_page() expects that md_page is given by the caller. Since this function is also used during snapshot deletion, this patch also updates this process. Now we allocate a single page and reuse it for each extent in the snapshot. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I815a4c8c69bd38d8eff4f45c088e5d05215b9e57 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12129 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-18 09:02:02 +00:00
GangCao	7bcd316de1	bdev: abort all IOs when unregistering the bdev To fix issue: #2484 When unregistering the bdev, will send out the message to each thread to abort all the IOs including IOs from nomem_io queue, need_buf_small queue and need_buf_large queue. The new SPDK_BDEV_STATUS_UNREGISTERING state is newly added to indicate this unregister operation. In this case, the bdev unregister operation becomes the async operation as each thread will be sent the message to abort the IOs and as the last step, it will unregister the required bdev and associated io device. On the other hand, the queued_resets will be handled separately and not aborted in the bdev unregister. New unit test cases are also added: enomem_multi_bdev_unregister: to abort the IO from nomem_io queue during the unregister operation bdev_open_ext_unregister: to handle the events and async operations from the unregister operation Change-Id: Ib1663c0f71ffe87144869cb3a684e18eb956046b Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12573 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-05-18 07:30:00 +00:00
Ben Walker	855390a585	idxd: Release batches based on refcnt Instead of releasing the batch memory when the batch generates a completion, instead do it via refcnt. This will allow us to later hold onto batch memory longer if vectored transactions end up spanning a batch. Change-Id: I942d6aa5052029eb0951e51a046dd98943108b94 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12259 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-18 07:28:57 +00:00
Ben Walker	14757fe8fb	accel: Correctly set nbytes for copy_crc32cv tasks If nbytes is not set, then the destination iovec sent to the underlying driver has a length of 0. Change-Id: Ia55f5ece942bd70f32bfdb3bcf02134ba98fca96 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12612 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-05-18 07:28:57 +00:00
Alexey Marchuk	622ceb7f07	nvme/rdma: Use rdma qpair as cm_id context It simplifies code and removes cast of nvme_qpair to rdma_qpair Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I363246cf9d8c9cbafd48b26facdb5cc37fdd8e67 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12701 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-18 00:34:29 +00:00
Alexey Marchuk	1003e28623	nvme/rdma: Fix qpair destroy/disconnect race When qpair is attached to a poll group, disconnect process is async - we are waiting for the DISCONNECTED event from rdmacm to destroy rdma resources. However the user (nvme_perf) can destroy qpair immediately, so memory allocated for qpair is freed but rdma resources are still allocated. That means that we may receive rdmacm event (DISCONNECTED) for the destroyed qpair, that leads to use-after-free. To fix this problem, add a check for internal qpair state when qpair is destroyed, if disconnect is not finished, then we forcefully destroy rdma resources. Fixes issue #2515 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reported-by: Or Gerlitz <ogerlitz@nvidia.com> Change-Id: I7bfa53c9f6fe6ed787323a8941f1f2db17ea0c20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12700 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-18 00:34:29 +00:00
Alexey Marchuk	007fb1d3cb	nvme: Fix keyed/unkeyed SGL nvme cmd dump Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0a08518b5c30455a17158aa440715515d0c066fc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12133 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-17 20:11:43 +00:00

9958 Commits