ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Shuhei Matsumoto	e0daee9840	bdev/error: Use spdk_bdev_part_submit_request_ext() to use custom completion callback This is a preparation to inject data corruption at completion for read I/O. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I839f454d643254f2b805f3d4c65282deee037d71 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15003 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-08 08:21:22 +00:00
Alexey Marchuk	6fa5007edd	bdev/zone: Call bdev_with_md even if md is NULL The bdev_with_md APIs now allow to pass NULL md pointer, so calling this function without checking for metadata simplifies code Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: Ie4137f7a6a7628a13d14c7c9a5e9aa1ceb99d322 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15091 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-11-03 14:54:41 +00:00
Alexey Marchuk	c89891ea8c	bdev/delay: Use ext bdev API Fixes commit `c3a5848` where support of memory domains was added without usage of the ext API Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: I7b318f515d7421b8876d4717c0ef293084401bbc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15089 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-11-03 14:54:41 +00:00
Alexey Marchuk	d8d1a4dd38	bdev/passthru: Use ext bdev API Fixes commit `c3a5848` where support of memory domains was added without usage of the ext API Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: Ia0d7132f11c233e334965669ab0d237c24074745 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15088 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-11-03 14:54:41 +00:00
Alexey Marchuk	21db73f909	bdev_nvme: Return memory domains of each controller Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: I7417dcf69bbb8a526308075459c5887283896823 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15087 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-11-03 14:54:41 +00:00
Shuhei Matsumoto	fa5e7d1b8d	bdev/error: Use switch-case to process error injection at submission The following patches will support data corruption. For write I/O, data corruption will be injected before submission, and for read I/O, data corruption will be injected after completion. To do these cleanly, use switch-case and reorder to process error injection at submission. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3b830b4331cb4c7d0794a555957cdcc73902c14f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15026 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-03 14:54:28 +00:00
Shuhei Matsumoto	7078874b80	bdev/error: Passthrough I/Os other than read, write, unmap, or flush If we use error bdev in general use cases, the upper layer may submit I/O commands other than read, write, unmap, or flush. However, before this patch, the upper layer could submit only read, write, unmap, and flush. To improve the usability of error bdev, pass thorugh I/Os other than read, write, unmap, or flush. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ia642b13771f42505055f1372733825153085b805 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15027 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com>	2022-11-03 14:54:28 +00:00
Shuhei Matsumoto	ffee98ddd9	bdev/error: Consolidate params for injection into a options structure This will make it easier to add more parameters for error injection. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ie5b22c31b5ba9d8c256d369213fa8fb4b985fa26 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15025 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com>	2022-11-03 14:54:28 +00:00
Shuhei Matsumoto	972013e29d	bdev/error: Use custom JSON decoders for bdev_error_injection_error This is a small clean up. Use custom JSON decoders for the io_type and error_type parameters in the bdev_error_injection_error RPC. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I528fe4a31fac7eddb8ec33594b90e107d71693be Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15024 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com>	2022-11-03 14:54:28 +00:00
Denis Barakhtanov	9191665486	bdev/daos: early bdev creation failure detection If during a channel creation, an error happens, due to incorrect parameters e.g. wrong pool / container name, or some other internal DAOS errors (like reaching CART context limit), bdev_daos_io_channel_create_cb() signals about such errors, however, spdk_io_device_register() does not takes them into account. The device creation succeeds, returning successful RPC response and leaving bdev in the bdev lists but it's completely unusable and not amendable. This patch tries to detect it early and return an RPC error on failure. Signed-off-by: Denis Barakhtanov <denis.barahtanov@croit.io> Change-Id: I04758e6243566b4e619a1420aa7c01f6041441a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15168 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-11-02 10:50:35 +00:00
Evgeniy Kochetov	b052435962	vbdev/passthru: Add copy IO type support Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I46b7775c956435e2ffb8ec124576ac97992dee58 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14386 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-02 10:33:00 +00:00
Evgeniy Kochetov	789f48dec7	bdev/nvme: Add copy IO type support Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I6a272161e129b592f535f8671e174e6b20e09fd0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14347 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-02 10:33:00 +00:00
Evgeniy Kochetov	1f47bbba51	bdev/malloc: Add copy IO type support Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I05bd40eb191d2f70347dee5f1cf4cb87e15809fd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14346 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-02 10:33:00 +00:00
Shuhei Matsumoto	00bff560dd	bdev/malloc: Support protection information for read and write For write, verify DIF/DIX before submission and for read, verify DIF/DIX after successful completion. As same as the NVMe bdev module and the NULL bdev module, DIF/DIX verification is done based on the DIF type and DIF insert/strip is not supported. In near future, the bdev I/O APIs bring an I/O flag to the underlying bdev and the malloc bdev module will be able to decide DIF/DIX verification based on the I/O flag. One important feature is to setup protection information when creating a malloc disk. Otherwise, all initial reads will fail if protection information is enabled. For users, add some explanation about the dif_type parameter into doc/jsonrpc.md. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I93757b77c03cade766c872e418bb46d44918bee2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14985 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-10-28 06:49:40 +00:00
Shuhei Matsumoto	aef00d4420	bdev/malloc: Support both of interleaved and separated metadata The malloc bdev module supports both of interleaved and separated metadata in this patch. Different from the NULL bdev module, opts->block_size is a data block size and a block size is caculated internally as a sum of opts->block_size and opts->md_size if opts->md_interleave is true, or opts->block_size otherwise. This will be more intuitive. Additionally, opts->md_size accepts only either of 0, 8, 16, 32, 64, or 128. Protection information (T10 DIF/DIX) will be supported in the following patches. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Icd9e92c8ea94e30139e416f8c533ab4cf473d2a8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14984 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-10-28 06:49:40 +00:00
Shuhei Matsumoto	e6b2b9075a	bdev/malloc: Use options structure to create a malloc bdev Define a options structure, malloc_bdev_opts, and use it directly for the bdev_malloc_create RPC. To do this, bdev_malloc.h includes bdev_module.h instead of bdev.h to have the definition of the struct spdk_uuid, and the struct malloc_bdev_opts has a instance of struct spdk_uuid. Clean up file inclusion together. Furthermore, use spdk_uuid_copy() to copy uuid from the malloc_bdev_opts to the malloc disk rather than the = operator, and remove a duplicated size check. These are helpful to add more parameters for creation. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ief25f12586c21b1666180ce10cfc6256ede8eba9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14982 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-10-28 06:49:40 +00:00
Shuhei Matsumoto	83eff61df6	bdev/malloc: Use custom decoder for malloc disk's uuid If we use a custom decoder for malloc disk's uuid for the bdev_malloc_create RPC, the code is simplified. Furthermore, when we add an options structure, we will be able to include the options structure into struct rpc_construct_malloc directly. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib36fa628569f973218f2cc5ce65a51181cd9fb71 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15125 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-28 06:49:40 +00:00
Jim Harris	6202973033	bdev_nvme: change discovery_poller check to assert spdk_nvme_probe_poll_async() can only return 0 or -EAGAIN. But the code currently indicates that other values might be possible - so change the code to remove that confusion. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8e7b2e1183f6650043dd751d610d434d626efa7e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15110 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-10-25 07:13:08 +00:00
Richael Zhuang	dbd7999f5e	bdev/nvme: rename functions and add more logs Change 'bdev_nvme_compare_trids' to name 'bdev_nvme_check_secondary_trid' and 'bdev_nvme_compare_namespaces' to 'bdev_nvme_check_secondary_namespace' which better explain the aim of these function. RPC bdev_nvme_attach_controller only response "invalid parameter" if the path is not sucessfully added due to not meeting the conditions in bdev_nvme_compare_trids. Add more log. Change-Id: If9ba6ec1d397d49689aa2ed6ab74fbfa96e16afa Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14251 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-10-25 07:12:24 +00:00
Denis Barakhtanov	3d7851e011	bdev/daos: Add object class parameter A new `oclass` parameter allow to specify DAOS DFS object class that is responsible for data redundancy and protection. Examples of the object classes could be found here: https://github.com/daos-stack/daos/blob/master/src/include/daos_obj_class.h The default value is OC_SX for the max IOPS. Signed-off-by: Denis Barakhtanov <denis.barahtanov@croit.io> Change-Id: Ia48681832458c2266eb7c3bcae0df2055e59e309 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15006 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-10-25 07:12:06 +00:00
Denis Barakhtanov	bdc683aaa9	bdev/daos: add resize rpc call Signed-off-by: Denis Barakhtanov <denis.barahtanov@croit.io> Change-Id: I71d643733e31eb2229649cff3db610d5bee07796 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14921 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-10-21 07:19:17 +00:00
Yuhua	4c6a2e3daa	bdev/aio: implement read-only Support Aio bdev 'readonly' option in RPC call. The read-only flag can be dumped from the bdev info. Any writes on a read-only aio bdev will be fail. Signed-off-by: Yuhua <yuhua@smartx.com> Change-Id: I939f72479f8953a3678a8df3083ecce0f96844fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14955 Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-10-21 07:18:15 +00:00
Shuhei Matsumoto	0954302091	sock: Ensure recv/send_buf_size to be larger than g_sock_impl_opts.recv/send_buf_size In a use case, a custom sock module supports zero copy for read. The custom sock module wants to keep the recv_buf_size to be sufficiently large, for example 16MB. However, most upper layers overwrite the recv_buf_size by a smaller value via spdk_sock_set_recvbuf() later. This is not desirable. To fix the described issue, change the meaning of impl_opts->recv_buf_size to be the minimum size, and spdk_sock_set_recvbuf() uses the maximum value among the requested size, g_spdk_sock_impl_opts->recv_buf_size, and MIN_SO_RCVBUF_SIZE. We may have to change the code to initially create a socket. However, for most cases, the upper layer calls spdk_sock_set_recvbuf() anyway. Hence this fix will be minimal and enough. For the use case, it is enough to change recv_buf_size of the posix sock module. However, the custom sock module may support I/O uring in future. Hence, change I/O uring sock module together. Additionally, for consistency, change the meaning of impl_opts->send_buf_size to be the minimum size, and spdk_sock_set_sendbuf() uses the maximum value among the requested size, g_spdk_sock_impl_opts->send_buf_size, and MIN_SO_SNDBUF_SIZE. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I051ba7cb50bc9dcad229e922198b04fe45335219 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14915 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-10-21 07:17:37 +00:00
gongwei	6f445382a8	bdev_iscsi: add bdev iscsi config json save bdev iscsi opts config Signed-off-by: gongwei <gongwei833x@gmail.com> Change-Id: I9601098b426b8c080ae374f2fa1c23eec14f140b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14898 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-10-17 12:37:36 +00:00
Artur Paszkiewicz	9cf1ab5b1c	module/raid: fix async module stopping If the module stop handler is asynchronous we must wait until it finishes before unregistering the io_device. Change-Id: I149b716d9f4b0c1680b3e43b395fc9ec5b90d70c Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14717 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	c2eea87ac4	raid5f: calculate and write parity Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ia1b82d555c966b9b291eeb2426c42846b93e7fec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7703 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	7131bec415	raid5f: full stripe writes Implement support for full stripe writes without parity calculation. The size and alignment of write IOs must be a multiple of full stripe size. To reflect this, the raid bdev's write_unit_size is set accordingly. We rely on the bdev layer to split larger IOs based on that. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I6940280ad870f3bd678fd19346b06ba4bdadd52e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7702 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	c89e20084b	module/raid: support for raid module private io channel Let the raid modules create their own IO channels by implementing the get_io_channel callback. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Id4f6c90721474edd70a6e987c67f8f774737da27 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7700 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	95a0494902	module/raid: fix device destruction Move device cleanup to spdk_io_device_unregister() callback. This fixes a case when the device would be freed before its last io channel was closed, leading to use after free condition. Repurpose raid_bdev_free() to actually free the bdev. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ib667b4d5ac1b34a0f2dda69f6b0775d9363dbfee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11398 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	77bddc0f96	raid5f: read support Depend on bdev layer to send down chunk sized IOs, based on optimal_io_boundary. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Iec45f4917117d35c3a9e807940e49091dfcba870 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7699 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 22:52:45 +00:00
Changpeng Liu	8b260d5c92	virtio/vfio_user: add virtio_scsi device support Example usage using `bdevperf` as the client: Start `spdk_tgt` with created virtio_scsi device: 1. scripts/rpc.py bdev_malloc_create -b malloc0 $((512)) 512 2. scripts/rpc.py vfu_virtio_create_scsi_endpoint vfu.0 --cpumask 0x1 --num-io-queues=4 \ --qsize=128 --packed-ring 3. scripts/rpc.py vfu_virtio_scsi_add_target vfu.0 --scsi-target-num=0 --bdev-name malloc0 Start `bdevperf`: 1. test/bdev/bdevperf/bdevperf -r /var/tmp/spdk.sock.1 -g -s 2048 -q 128 -o 4096 \ -w randread -t 30 -m 0x2 2. scripts/rpc.py -s /var/tmp/spdk.sock.1 bdev_virtio_attach_controller --dev-type scsi \ --trtype vfio-user --traddr vfu.0 VirtioScsi0 3. test/bdev/bdevperf/bdevperf.py -s /var/tmp/spdk.sock.1 perform_tests Change-Id: Id95908a5243e9a224a9bd709a22b87740c50b835 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13897 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-29 19:42:56 +00:00
Changpeng Liu	295e54d144	virtio/vfio_user: add virtio_blk device support Add vfio-user transport support based on existing virtio client library. Test steps using bdevperf: Start `spdk_tgt` with created virtio_blk device: 1. build/bin/spdk_tgt 2. scripts/rpc.py bdev_malloc_create -b malloc0 $((512)) 512 3. scripts/rpc.py vfu_virtio_create_blk_endpoint vfu.0 --bdev-name malloc0 \ --cpumask=0x1 --num-queues=2 \ --qsize=256 --packed-ring Start `bdevperf`: 1. test/bdev/bdevperf/bdevperf -r /var/tmp/spdk.sock.1 -g -s 2048 -q 128 -o 4096 \ -w randread -t 30 -m 0x2 2. scripts/rpc.py -s /var/tmp/spdk.sock.1 bdev_virtio_attach_controller --dev-type blk \ --trtype vfio-user --traddr vfu.0 VirtioBlk0 3. test/bdev/bdevperf/bdevperf.py -s /var/tmp/spdk.sock.1 perform_tests Change-Id: I368c4becebbca57328a25fc750e41c353420e481 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13896 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 19:42:56 +00:00
Changpeng Liu	5004d7b8e8	module/vfu_device: add virtio-scsi emulation Here we use vfu_tgt library and emulate a virtio-scsi device as the next use case. Compared with vhost-user-scsi, the packed ring is supported with this patch. Example usage method: 1. scripts/rpc.py bdev_malloc_create -b malloc0 $((512)) 512 2. scripts/rpc.py vfu_virtio_create_scsi_endpoint vfu.0 --cpumask 0x1 --num-io-queues=4 \ --qsize=128 --packed-ring 3. scripts/rpc.py vfu_virtio_scsi_add_target vfu.0 --scsi-target-num=0 --bdev-name malloc0 4. Start QEMU with '-device vfio-user-pci,socket=/spdk/vfu.0' Change-Id: I8f35d1d21aaec34844d6ddb59dc997a64f141179 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 19:42:56 +00:00
Changpeng Liu	23ef63882c	module/vfu_device: add virtio-blk emulation Here we use vfu-tgt library and emulate a virtio-blk device as the first use case of vfu-tgt library. Usage example with QEMU: 1. build/bin/spdk_tgt 2. scripts/rpc.py bdev_malloc_create -b malloc0 $((512)) 512 3. scripts/rpc.py vfu_virtio_create_blk_endpoint vfu.0 --bdev-name malloc0 \ --cpumask=0x1 --num-queues=2 \ --qsize=256 --packed-ring 4. Start QEMU with '-device vfio-user-pci,socket=/spdk/vfu.0' Change-Id: I45e45360c669584583b0b8a3f83250ab6c48efec Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12315 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-09-29 19:42:56 +00:00
Changpeng Liu	da231290b2	lib/vfu_tgt: add library for PCI device emulation Previously SPDK use libvfio-user library to provide emulated NVMe devices to VM, but it's limited to NVMe device type only. Here we add SPDK vfu_target library abstraction based on libvfio-user which supports more PCI device types. We will add virtio-blk and virtio-scsi devices emulation based on vfu_tgt library in following patches, actually this library can support NVMe emulation too, due to the fact that the NVMe emulation is already exist, so we will keep the NVMe emulation which based on libvfio-user directly as it is. Change-Id: Ib0ead6c6118fa62308355fe432003dd928a2fae9 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12597 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 19:42:56 +00:00
Krishna Kanth Reddy	78ff96bb73	bdev_xnvme: free bdev_xnvme structure after device is fully unregistered Fix for the issue 2702. spdk_bdev plugin crashes while handling xnvme devices. xnvme_queue_term gets invoked after the bdev_xnvme_free executes and frees the xnvme structures. xnvme_queue_term references the xnvme->dev structure, after it is freed, resulting in a segmentation fault. Implemented a bdev_xnvme_destruct_cb, that frees the xnvme structures after the xnvme_queue_term invokes and the device is fully unregistered. Signed-off-by: Krishna Kanth Reddy <krish.reddy@samsung.com> Change-Id: I9338c84baf4b61ec2e0d324e67bfefcb96485156 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14680 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-28 06:47:35 +00:00
Indraneel M	5eafc3a279	bdev/uring: Proper handling for conventional zones Identify and properly handle conventional zones (in smr drives) by using zone type and WP state. Bdevs supporting zoned devices(like uring, nvme and vbdev_zone_block) now update the zone type information. As a result, the fio plugin now uses this info instead of hard coding the zone type. Also adds new WP state(ZONE_STATE_NOT_WP) for handling zones w/o WP. Signed-off-by: Indraneel M <Indraneel.Mukherjee@wdc.com> Change-Id: If031e0742d68c55c35e95ddc33d478939bbd52fe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14572 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-09-27 19:40:44 +00:00
Simon A. F. Lund	cfe5d18823	bdev/xnvme: enable polling on io_uring_cmd Fixes #2708 for io_uring_cmd. This is a minimal approach to enabling polling for bdev_xnvme. Ideally, more would be done, however, there are several blockers to remove when doing so. Thus, this commit only enables polling on io_mechanism=io_uring_cmd. Signed-off-by: Simon A. F. Lund <simon.lund@samsung.com> Change-Id: Ifa604b52bb2b924fab4b559fae06f26a3574db42 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14679 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-27 15:36:32 +00:00
paul luse	850cd90082	accel/idxd/iaa: Convert to use iovecs In prep for upcoming iovec based compression/decompression patches. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I413493f764bead9e56266e488b74f8bca979e225 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14633 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 00:10:08 +00:00
paul luse	28886ac352	lib/accel: rename iovec elements with src prefix In prep for adding both src and dst iovec support for compression. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I704b8d2bd459de03deb7f8ee45d76261910a3727 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13746 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 00:10:08 +00:00
Krzysztof Karas	dfc9894396	bdev: send bdev reset based on outstanding IO and a new timeout parameter A new parameter io_drain_timeout has been added to spdk_bdev structure. If this value is unset, the bdev reset behavior does not change. The io_drain_timeout controls how long a bdev reset must wait for IO to complete prior to issuing a reset to the underlying device. If there is no outstanding IO at the end of that period, the reset is skipped. Change-Id: I585af427064ce234a4f60afc3d69bc9fc3252432 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14501 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-22 19:18:30 +00:00
Ben Walker	712e8cb7ef	accel: Refer to plugins as 'modules' instead of 'engines' This is consistent with the use of terms in other parts of SPDK and fits with the code living under module/ Change-Id: If182f7cf2d160d57443a1b5f24e0065f191b59b2 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13919 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-21 08:17:48 +00:00
Shuhei Matsumoto	2826ef13ce	bdev/nvme: Fail reset sequence immediately if ctrlr is already removed. After a controller was hot-removed, if a reset sequence started to the controller, spdk_nvme_ctrlr_disconnect() failed and caused core dump in debug mode. When implemented, how to cause the failure and how to process the failure were not clear. Hence assert was added to detect the failure. We know how we cause the failure now. Let's handle the failure appropriately. If spdk_nvme_ctrlr_disconnect() fails, we are on the nvme_ctrlr->thread. Hence call bdev_nvme_reset_complete() with failure immediately. Even if spdk_nvme_ctrlr_disconnect() completes synchronously, the completion callback is executed asynchronously when polling an adminq. Hence set the completion callback only if spdk_nvme_ctrlr_disconnect() succeeds. Fixes issue #2632 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I11f61853aba9eca2515592f964a291e59def7247 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13892 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-21 07:52:44 +00:00
Shuhei Matsumoto	6439abc0d8	bdev/nvme: Ignore failure of I/O qpair creation if reconnect and retry are enabled By a recent improvement, failure of I/O qpair creation is ignored if the nvme_ctrlr is being reset or scheduled to reconnect. However, failure of I/O qpair creation is not ignored if a new I/O channel is allocated. It is normal to allocate a new I/O channel when a nvme_ctrlr is being reset or scheduled to reconnect. Fix this bug by relaxing the condition to ignore the result of bdev_nvme_create_qpair() to if reconnect_delay_sec is non-zero and bdev_retry_count is non-zero. If reconnect_delay_sec is non-zero, reconnect will be tried sooner or later, and if bdev_retry_count is non-zero, submitted IOs will be queued until it succeeds. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Icf26e1ea65d292f9b8d24966abe25907d2cc33ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14446 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-21 07:52:44 +00:00
Artur Paszkiewicz	d1dd6ca814	ftl: check structure sizes for future ABI compatibility Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ic32f6fe085d94b00d025b6cab7e5073341169a73 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13677 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Artur Paszkiewicz	1790ee8a8d	ftl: I/O statistics Add gathering of some performance counters and RPC for printing them. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I2e77d37fb66459240ff2e241f2b1f77c60f4eef4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13390 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-20 19:24:26 +00:00
Tomasz Zawadzki	69bcb185f3	bdev/xnvme: save xnvme bdevs from runtime config_json callback is used to preserve all bdevs that were created during application runtime. xnvme bdev module was missing this callback. While here, io_mechanism is now saved in the bdev_xnvme structure for reference in the config_json. Haven't seen an option to pull this from xnvme itself. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I1a88ad2bb761f589d214fec8f0690c38572824d6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14116 Reviewed-by: Michal Berger <michal.berger@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-19 13:12:07 +00:00
Konrad Sztyber	35a71c7748	sock/ssl: free SSL resources in case of failures Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifba581c92269c675b03b67586b1fea0b096f9e95 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14511 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-16 13:33:17 +00:00
jun.ran	488d6e84c4	bdev_aio: handle unexpected res=0 correctly If res = 0, do not pass this value directly to bdev io completion function, otherwise it will be treated as a successful completion. Fix issue: 2679 Signed-off-by: jun.ran <ranjunsh@163.com> Change-Id: Ice525de1bb5b0ada9ac30d3b59e5c731419efe2f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14364 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-16 13:32:40 +00:00
Damiano Cipriani	67bb33160a	vbdev_lvol: Implement SEEK_[DATA,HOLE] io type Two functions have been added to implement bdev io type SEEK_DATA and SEEK_HOLE. Blob functions to find next [un]allocated io_unit are used Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: Ic3be3c0c86bd3010a23ba2681f0f00c62abcaaba Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14362 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-15 08:23:56 +00:00
Boris Glimcher	2de485346e	sock/ssl: move SSL_CTX creation to accept() This will allow to have context per connection. And free context when connection closes. Fixes #2689 Change-Id: Ic4e9adfa3f1bd8574b9ccf75ff42c4f3bd442b26 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14443 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-14 07:45:09 +00:00
Boris Glimcher	35f7f0ce1e	nvme/tcp: Allow to choose SSL socket implementation Adding `psk` field to `spdk_nvme_ctrlr_opts` Adding `psk` parameter to `bdev_nvme_attach_controller` RPC Change-Id: Ie6f0d8b04ce472e6153934e985c026acded6cdfc Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14046 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-14 07:44:53 +00:00
Amir Haroush	ef65d8467c	bdev/raid: fix flush on raid0 the calculation (offset_blocks + num_blocks - 1) didn't check if num_blocks is 0 which it is in case of flush. in flush case, offset_blocks was also 0, so we got (-1) and we got big unsigned number. just replace it with (offset_blocks + num_blocks - (num_blocks > 0)) to solve the issue. NOTE this is only fixing the wrong math, there might be more issues that outside the scope of this commit for example, if flush(x, y) called, no bdev implementation really use those values, and all bdevs just flush the whole disk also, a convention is that fluch(0, 0) should flush it all (but like I said before, all bdevs flush the whole disk anyway) Change-Id: I7e991653bc3050349dc155365b2c37ecc2d6b24c Signed-off-by: Amir Haroush <amir.haroush@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13579 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-14 07:44:19 +00:00
Yifan Bian	c6824c7944	bdev/rbd: return a error value(-1) when create RBD bdev on non existing RBD image Fix issue #2690 Change-Id: Ia41b32a7f54be7e9a63c9b6284fba7d8ad4eebd6 Signed-off-by: Yifan Bian <yifan.bian@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14484 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-13 07:28:57 +00:00
Boris Glimcher	4f8bf4c380	sock/ssl: free SSL on socket close() Fixes #2684 Change-Id: I0a91e7a70c53c8130921c70872b28d199d862346 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14432 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-12 07:23:10 +00:00
Kozlowski Mateusz	2c7c8b6ceb	ftl: Add rpc functionality for unmap Trim is now also available as a management operation via RPC. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I05b778a611e9809a14bfed50b01986bb4649a35c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13379 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	66fe5f75bb	ftl: Unmap functionality Adds ability to send trim commands to FTL - only 4MiB aligned requests (both for offset and length of request) will be processed. During a trim operation an L2P page (containing 1024 4B entries, 1 per user LBA; which is where the 4MiB alignment comes from) will be marked as unmapped. After this point any L2P access to that page will actually set the entries themselves as FTL_ADDR_INVALID. This is done to make the trim as fast as possible, since for large requests it's probable that most of the L2P pages aren't actually in DRAM. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4a04ee9498a2a6939af31b06f2e45d2b7cccbf19 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13378 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
gongwei	c8e594c2a0	bdev_iscsi: modify the timeout parameter name of iscsi opts the current timeout parameter of the interface bdev_iscsi_set_opts is duplicated with the timeout parameter of the JSONRPCClient parameter, which may cause the iscsi timeout and JSONRPCClient parameters to overwrite each other. Signed-off-by: gongwei <gongwei833x@gmail.com> Change-Id: I96604a7e1a495ac2e99518812297230680df42fd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14306 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 13:02:25 +00:00
Shuhei Matsumoto	dd3460582b	bdev/nvme: Rename check_multipath_params by check_io_error_resiliency_params These checked parameters are necessary themselves even for single path configuration. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ie1eb2f51eeec1dbc634c6bae462a41d4c209d6ac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12052 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Community-CI: Mellanox Build Bot	2022-09-09 12:56:12 +00:00
Shuhei Matsumoto	57f15765f7	bdev/nvme: Use custom decoder for multipath param of bdev_nvme_attach_controller RPC Clean up the code by using a custom decoder. Use multipath mode to follow doc/nvme_multipath.doc. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ie1f4109dae3a2929dcf933939a9c1b67bece0caf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12051 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2022-09-09 12:56:12 +00:00
Shuhei Matsumoto	72cb529751	bdev/rbd: Reset returns completion after all inflight I/Os complete Previously, reset just set a long timer to wait until all inflight I/Os complete. As already noted as a TODO item, check if any I/O is still inflight before completing the reset by using a new API spdk_bdev_get_current_qd(). The RBD bdev module can count outstanding I/Os itself but is not efficient. Signed-off-by: liu-darong <liu.darong@xsky.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iecaf90b06cae8e21198ec3822b978b54f5404d2b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13945 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: GangCao <gang.cao@intel.com>	2022-09-09 12:55:39 +00:00
Konrad Sztyber	a8a7edcd52	sock/ssl: don't free SSL_CTX on accept() failure SSL_CTX isn't created in accept(), but when a socket on which accept() is called is created, so it shouldn't be freed when accept() fails, as this makes the socket unusable (any subsequent operations using SSL_CTX would be using freed memory). This caused the segfaults reported in issue #2681, where the second connection would crash the application. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I1a01a748c5a34ce3dd0fd3c557b860c0ff314b85 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14355 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-07 07:04:11 +00:00
Konrad Sztyber	f2ddc6e78b	sock: make sure ssl impl has lowest priority Bumped up the priority of the posix and uring sock implementations to make sure they are selected before SSL, when no impl is explicitly specified. Fixes #2681 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic8e1e2e13f7bce7ccd746f66087e348677df28d9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14354 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-07 07:04:11 +00:00
Blachut, Bartosz	503835ee63	util: made hexlify and unhexlify functions public hexlify and unhexlify utils from vbdev_crypto.h have been moved so that they could be included and reused outside of vbdev_crypto module. Signed-off-by: Blachut, Bartosz <bartosz.blachut@intel.com> Change-Id: Ia074250176907f4803b84024239ecd4e9d8a5fc1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-06 07:17:13 +00:00
Blachut, Bartosz	1a24dc8fef	hexlify util: v2c helper function return type is signed char `v2c` helper function returns -1 on error, hovewer its return type was char - which may cause an overflow situation on architectures where char is interpreted as unsigned char, which was reported by the CI. The return type has been changed to signed char. Signed-off-by: Blachut, Bartosz <bartosz.blachut@intel.com> Change-Id: I84f5784b2de7d681f78c69b5f3646e851e8dee88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14273 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2022-09-06 07:17:13 +00:00
Ben Walker	34c48f1b3b	accel: Do not refer to the "framework" as "engine" The word engine was both used (interchangeably with module) to refer to the things that plug into the framework and to the framework itself. This patch eliminates all use of the word engine that meant the framework. It leaves uses of the word that meant "module". Change-Id: I6b9b50e2f045ac39f2a74d0152ee8d6269be4bd1 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13918 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-06 07:16:17 +00:00
Ben Walker	dd7140e627	accel: Rename spdk_accel_engine_module_finish to spdk_accel_module_finish Also move it into the internal header that defines the interface used by modules. Change-Id: I3aeb41e643f27a69556099cb8d166f64c9e5d67f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13917 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-09-06 07:16:17 +00:00
Evgeniy Kochetov	2e7a7fe530	blob: Optimize copy-on-write flow for clusters backed by zeroes device Writing to unallocated cluster triggers copy-on-write sequence. If this cluster is backed by zeroes device we can skip the copy part. For a simple thin provisioned volume copy this shortcut is already implemented because `blob->parent_id == SPDK_BLOBID_INVALID`. But this will not work for thin provisioned volumes created from snapshot. In this case we need to traverse the whole stack of underlying `spdk_bs_dev` devices for specific cluster to check if it is zeroes backed. This patch adds `is_zeroes` operation to `spdk_bs_dev`. For zeroes device it always returns 'true', for real bdev (`blob_bs_dev`) always returns false, for another layer of `blob_bs_dev` does lba conversion and forwards to backing device. In blobstore's cluster copy flow we check if cluster is backed by zeroes device and skip copy part if it is. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I640773ac78f8f466b96e96a34c3a6c3c91f87dab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13446 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-05 12:49:46 +00:00
Kozlowski Mateusz	e7e5bc07b2	FTL: Add initial L2P cache logic L2P cache allows for partial storing of L2P in memory, paging in and out as necessary, lowering the total memory consumption. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I727fec9d2f0ade4ca73e872d62a2ec10cfdb0a88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13353 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-02 17:40:09 +00:00
Konrad Sztyber	4cbd23e28b	vmd: method for forcing a rescan Added a new RPC, vmd_rescan, that forces the VMD driver to do a rescan of all devices behind the VMD. A device that was previously removed via spdk_vmd_remove_device() will be found again during vmd_rescan. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ide87eb44c1d6d524234820dc07c78ba5b8bcd3ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13958 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	052ea0baac	vmd: method for removing devices behind VMD Added new RPC, vmd_remove_device, that allows users to remove a PCI device managed by the VMD library simulating a hot-remove. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifb84818ce8d147d1d586b52590527e85fe9c10de Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13957 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	f0441b29db	vmd: rename enable_vmd RPC to vmd_enable The new name is consistent with the naming scheme of <subsystem>_<action> that all of our other RPCs use. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I2cae7af5715add8eba26501cd192a6ac4884ec69 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13952 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	dc5b33d982	module/vmd: move subsystem start to ->init() This makes sure that if the app is killed before the subsystems have been initialized (i.e. before framework_start_init is called), we don't leak resources. Before this change, we initialized the VMD library and registered the hotplug poller from the context of the RPC and we only freed it in subsystem's fini(). However, if subsystems were never initialized, we never freed them. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I56b746c46b94135ef56a478c5f80194ebe51a942 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13951 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Shuhei Matsumoto	03f8da8819	bdev/nvme: Set multipath policy correctly when creating nvme_bdev_channel bdev_nvme_create_bdev_channel_cb() did not initialized the multipath policy of the newly created channel. 0 was active-passive and hence multipath policy was always initialized to active-passive. Fix the bug and add unit tests for verification. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I6e44108740da4b9ff72311ae4b5500558c65c5c3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14225 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-09-01 08:25:22 +00:00
Shuhei Matsumoto	db75f4b678	bdev/nvme: Remove admin passthrough retry and failover Admin passthrough supported retry and failover as same as I/O by using the bdev_retry_count. However, doing retry or failover for admin passthrough may have unexpected side effects and its value is not clear. The safest way is to limit retry and failover for I/O. If we need to support retry and failover for admin passthrough, restore the code and add a new option bdev_admin_retry_count. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I680513a40a80041f6ea6f546c74c672f2a81812d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14227 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-01 08:25:22 +00:00
Shuhei Matsumoto	fc912a0284	bdev/nvme: Prioritize aborted_by_request higher than ctrlr_is_unavailable Abort is used usually for error cases. When abort is done, controller may not be available. At completion, controller availability was checked prior to aborted by request. Hence if abort is done when controller is not available, the aborted request is retried. Fix the case in this patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3e7b356797dd50faed0e5113f6f7a47fea26d9cc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14098 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-01 08:25:22 +00:00
Kozlowski Mateusz	0e33da4974	ftl: fast shutdown Adds API for fast shutdown - the ability for FTL to skip most of the metadata persists made during clean shutdown, and relying on their representation in shared memory instead. This allows for faster update of SPDK (or just FTL, assuming no metadata changes), with downtime reduction from 2-5 seconds to 500-1000 ms (for 14TiB+800GiB base and cache drives). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5999d31698a81512db8d5893eabee7b505c80d06 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13348 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Indraneel M	8b8401959e	bdev/uring: Add support for zoned io in uring bdev. Enables the use of uring bdev with ZNS devices. Uses BLKXXXZONE ioctls for implementing the zone operations. Signed-off-by: Indraneel M <Indraneel.Mukherjee@wdc.com> Change-Id: I440e316138182e25d89eb7224932e19bef9a005f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13550 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 07:17:06 +00:00
yidong0635	fc3a8514c7	xnvme: Remove unused rc. Here no need to define a variable only return. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I6babb8ef94a9b111d5522f1e782cbd095c7da83a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14207 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: wanghailiang <hailiangx.e.wang@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com>	2022-08-29 11:40:08 +00:00
yidong0635	43c5293e16	xnvme: Fix memory leak for xnvme dev. If we use dd or bdevperf to run the xnvme bdev. the destruct function of xnvme misses to free dev while destructing bdev_xnvme. xnvme_dev_close does free dev, and we can put it in bdev_xnvme_free. Meanwhile fixing cleanup for delete_xnvme_bdev. This rpc for delete_xnvme_bdev doesn't really cleanup to remove the bdev from the list and free the names. Fixes issue: #2654 Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I1c493cb8130b012d891ba8ee90cd0bfb127207d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14177 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-29 11:40:08 +00:00
Jim Harris	08422b5843	bdev_virtio: fix use-after-free in scsi scan_ctx Found while debugging issue #2596, unfortunately this is not the root cause of that issue. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I27501e283ce7c9bf7a431e8b48842c83f80792c8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14165 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-29 10:51:48 +00:00
gongwei	1519aa4715	bdev_iscsi: support iscsi timeout setting Change-Id: I189ae677dfb13feb834b73fd1f55bf2545679213 Signed-off-by: gongwei <gongwei833x@gmail.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14110 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-08-25 07:43:24 +00:00
0xe0f	2e283fcb67	bdev/daos: introduction of daos bdev This commmit introduces a new bdev type backed up by DAOS DFS. Design wise this bdev is a file named as the bdev itself in the DAOS POSIX container that uses daos event queue per io channel. Having an event queue per io channel is showing the best IO throughput. The implementation uses the independent pool and container connections per device's channel for the best IO throughput. The semantic of usage is the same as any other bdev type. To build SPDK with daos support, daos-devel package has to be installed. The current supported DAOS version is v2.X, please see the installatoin and setup guide here: https://docs.daos.io/v2.0/ $ ./configure --with-daos To run it, the target machine should have daos_agent up and running, as well as the pool and POSIX container ready to use, please see the detailed requirements here: https://docs.daos.io/v2.0/admin/hardware/. To export bdev over tcp: $ ./nvmf_tgt & $ ./scripts/rpc.py nvmf_create_transport -t TCP -u 2097152 -i 2097152 $ ./scripts/rpc.py bdev_daos_create daosdev0 <pool-label> <cont-label> 1048576 4096 $ ./scripts/rpc.py nvmf_create_subsystem nqn.2016-06.io.spdk1:cnode1 -a -s SPDK00000000000001 -d SPDK_Virtual_Controller_1 $ ./scripts/rpc.py nvmf_subsystem_add_ns nqn.2016-06.io.spdk1:cnode1 daosdev0 $ ./scripts/rpc.py nvmf_subsystem_add_listener nqn.2016-06.io.spdk1:cnode1 -t tcp -a <IP> -s 4420 On the initiator side, make sure that `nvme-tcp` module is loaded then connect drives, for instance: $ nvme connect-all -t tcp -a 172.31.91.61 -s 4420 $ nvme list Signed-off-by: Denis Barakhtanov <denis.barahtanov@croit.io> Change-Id: I51945465122e0fb96de4326db742169419966806 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12260 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-23 07:15:13 +00:00
Artur Paszkiewicz	0291b2845a	FTL: Add read path Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ib5bac109b59d5a21a7dad1f8e79b5da7633ffa9d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13334 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Jim Harris	e36f0d363e	nvme/pcie, nvme/tcp: add cb_arg context tracepoint argument This allows mapping an nvme_request back to the nvme_bdev_io. This requires bumping up the max number of arguments per tracepoint. 5 was previously chosen as max since it exactly fit in 64 bytes (1 cacheline) when all arguments were stored as uint64_t, but now that we support uint32_t arguments we can afford extra arguments when some of them are uint32_t. I've bumped it to 8 so we can avoid having to touch this value multiple times if we find some cases where we need 7 or 8 args. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie2ef5e59d10549860b47542e68c1c34efa63047f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13995 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-19 11:06:31 +00:00
Jim Harris	54f1603954	bdev/nvme: add tracepoint support This will allow us to map spdk_bdev_io events to nvme_request events coming in a future patch. Since we pass the nvme_bdev_io to the nvme driver (not the spdk_bdev_io), we need to add tracepoints for the nvme_bdev_io so that spdk_trace can do the spdk_bdev_io->nvme_bdev_io->nvme_request mapping. An alternative would have been to pass the spdk_bdev_io as the cb_arg to the nvme driver, but that change seemed to invasive, and I think we will find other uses for the nvme_bdev_io events anyways. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id7519e689b01875093359f41a1ca2af912061a8b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13994 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-19 11:06:31 +00:00
Jim Harris	8bc1165edd	bdev/nvme: add __bdev_nvme_io_complete This is just a simple wrapper for now around the calls to spdk_bdev_io_complete and its nvme status variant. Upcoming patch will add an spdk_trace_record to this function as well. This avoids having to litter spdk_trace_record calls in too many places. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id2fb3aeb8b070ad6e09c1dfb9a30a61666a35688 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13993 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-08-19 11:06:31 +00:00
Kozlowski Mateusz	1bbefed63b	FTL: Remove leftover ZNS code Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ica358805a69582d78e0d6c4f17b5a97ff38e44ca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14112 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-18 19:09:50 +00:00
Kozlowski Mateusz	e8c5ccf039	FTL: Add write path Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I41985617b5879bd3f4bf6d49d2a03eaffdd5ccb5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13322 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-18 08:34:47 +00:00
GangCao	8b6fc34939	NVMe Bdev: add explicit warning on not permitted operation Below command is failed without intuitive warning on why the operation is not permitted: request: { "action_on_timeout": "abort", "delay_cmd_submit": true, "disable_auto_failback": false, "method": "bdev_nvme_set_options", "req_id": 1 } Got JSON-RPC error response response: { "code": -1, "message": "RPC not permitted with nvme controllers already attached" } Change-Id: Ic35292885aa4b507fe8bb278a4b41363cfbae9a5 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13950 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-08-16 10:25:55 +00:00
Ben Walker	081f080a49	accel: Rename public header to accel.h The public interface of lib/accel is now include/spdk/accel.h Change-Id: Id94f623a494eb1b524b060f4413f633073ea7466 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13916 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Ben Walker	225ae35a2f	accel: Add per-module task structures These hold a pointer to the channel which eliminates the need to look it back up in the completion path. Change-Id: Ie4fc98d92d6434262e64b9483ef8b3b0591d764a Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13914 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Ben Walker	df892eed67	accel: Return correct values for .get_ctx_size() This expects the full size of the task for each module. This only worked because the software module returned the right size. Change-Id: I481cfad8b4bb9c3748301bdacd90e7f44fd2d878 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13913 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Ben Walker	aa156d53be	accel: Combine spdk_accel_engine and spdk_accel_module_if These are 1:1 - they do not need to be separate objects. Change-Id: I74ab52863f911d9be59ce98e1525302b5bd40846 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13910 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-16 10:22:55 +00:00
Krishna Kanth Reddy	17948bcaa9	bdev : xNVMe BDEV module performance fix This change fixes the lower performance issue of the xNVMe BDEV for libaio and io_uring backends. There is no drop in the performance for the io_uring passthrough backend. Signed-off-by: Krishna Kanth Reddy <krish.reddy@samsung.com> Change-Id: I21de1b49a534cfc642d1873ef623271063da6af8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14015 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-15 19:11:14 +00:00
Boris Glimcher	6212597bda	sock/ssl: Add psk_key and psk_identity options to spdk_sock_impl_opts Note, this change only sets defaults for the ID/KEY, more specific use cases like NVMe/TCP may set the ID and KEY on a per connection basis. Also simplify PSK identity string, that isn't NVMe focused. NVMe libraries using this will need to construct more complicated identity strings and pass them to the sock layer. Example: rpc.py sock_impl_set_options -i ssl --psk-key 4321DEADBEEF1234 rpc.py sock_impl_set_options -i ssl --psk-identity psk.spdk.io ./build/examples/perf --psk-key 4321DEADBEEF1234 --psk-identity psk.spdk.io ./build/examples/hello_sock --psk-key 4321DEADBEEF1234 --psk-identity psk.spdk.io Change-Id: I1cb5b0b706bdeafbccbc71f8320bc8e2961cbb55 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13759 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-08-15 16:52:28 +00:00
Shuhei Matsumoto	a4ebc5714f	bdev/nvme: Fix race between for_each_channel() and io_device_unregister() If a nvme_ctrlr is unregistered while I/O path caches are clearing, the unregistration would fail. This race was not considered for a case that a nvme_ctrlr is unregsitered when adminq poller started I/O path cache clearing. As a bug fix, control ANA log page updating and I/O path cache clearing separately by adding a new flag io_path_cache_clearing. Fixes issue #2617 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id58684caf9b2a10fa68bdb717288ff2bd799c3f7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13924 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-12 09:00:59 +00:00
Jim Harris	b7f04e2be0	bdev/nvme: add newline in aer_cb WARNLOG Noticed while looking at mhae's log in issue #2632. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie7f17a010368696eb59585a2d31191e309c4576c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13888 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-08-11 21:04:38 +00:00
Ben Walker	32ee475a5e	accel: SPDK_ACCEL_MODULE_REGISTER is now passed the module Instead of passing each parameter to create a module, just have the user make one and pass it in. This makes it easier to change the module definition later. Change-Id: I3a29f59432a6f0773129d7b210fbc011175b2252 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13909 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-10 11:00:17 +00:00
Shuhei Matsumoto	6b8de2ea66	bdev/nvme: Move up _nvme_ctrlr_read_ana_log_page_done() in a file Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ida517be9e16dd6f7cf39a0c6ef2044fd500f19a3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13923 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-10 07:26:10 +00:00
Changpeng Liu	a02483e67c	module/bdev_virtio_scsi: use the correct `num_queues` value Parameter `num_queues` for virtio_scsi PCI device means maximum number of queues, it SHOULD include the `eventq` and `controlq`, while for `vhost_user` RPC call, it means the number of IO queues, so here we use it as `max_queues` in lib/virtio and add the fixed number queues for `vhost_user` SCSI device. Also fix `vhost_fuzz` to get `num_queues` earlier than negotiate the feature bits. Change-Id: I41b3da5e4b4dc37127befd414226ea6eafcd9ad0 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13791 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-08-04 11:24:40 +00:00
Changpeng Liu	515d028ec4	bdev/virtio_blk\|scsi: don't negotiate `VHOST_USER_F_BITS` for PCI devices VHOST_USER_F_PROTOCOL_FEATURES is used for `vhost_user` transport, so unmask it for PCI devices. Change-Id: If84d6c0ee7558886f14647dad07e41530e306206 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13790 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-04 11:24:40 +00:00
Changpeng Liu	c60cb1a8be	lib/nvmf: don't raise assertion in `nvmf_tgt_destroy_cb` While running into this function, even the subsystem can't be destroyed due to error subsystem state, it's better to continue the execution. Continue to fix #2590, QEMU is stuck for the failure case, and nvmf target should process such error because it may support other normal subsystems at the same time. Change-Id: Ib05e24996378b52070d2b760519f476f9b2d7e76 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13839 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-04 07:29:27 +00:00
Artur Paszkiewicz	884980d0aa	ftl: vss null buffer workaround Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I94ea399ed30fae29f92b4216eaa9209c02b3478b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13310 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	06790f25f1	FTL: Add ftl_io helper structure Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I608b500c6fb14efe289932955f508484f2ecf1b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13305 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-02 19:00:42 +00:00
Artur Paszkiewicz	be90ea6e7b	FTL: Add bdev_ftl_create and delete rpc definitions Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I0837028fbe349e8df7f05fb3c9db1f4682f04679 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13301 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 19:00:42 +00:00
Changpeng Liu	78ca4b27c5	nvmf: don't raise assertion when destroying an non-inactive subsystem Sometimes VM may get a kernel panic when starting, and SPDK CI will kill `nvmf_tgt` after 60 seconds, and for this exception, SPDK will raise an assertion when destroying the subsystem, while here, we remove this assertion and print the error information. CI will still mark this case as a failed case, then we can use this error information to understand error subsystem state in vfio-user. Fix issue #2590. Change-Id: I20b16f9e96a566730eca2dd9ea165645bd9160bd Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13773 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-02 01:26:10 +00:00
Konrad Sztyber	bae771fcdb	sock: add assertions checking sock_impl_opts size Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5afc3481470f876a59505d9c4c9dc3d699c5cfd9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13714 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-29 16:49:54 +00:00
Artur Paszkiewicz	83a4b155ca	module/raid: raid5: rename to raid5f Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4ccd54f995f0f413dadd700ba3f6ed82fe17e223 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12395 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-27 08:51:21 +00:00
Artur Paszkiewicz	c682c78992	FTL: Add FTL bdev module Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I8c40b96f0726d83d6a307e8b9a04b7c210b80255 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13299 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-25 07:19:29 +00:00
Chuanwei Ji	2460515509	aio: aio rescan rpc response is sent incorrectly. Failed rpc response will be sent when rescan successfully. Signed-off-by: Chuanwei Ji <chuanwei.ji@jaguarmicro.com> Change-Id: I99a2491ec76b63cb01fb384e621b41b10ee0ed83 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13711 Community-CI: Mellanox Build Bot Reviewed-by: Qingmin Liu <qingmin.liu@jaguarmicro.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-22 07:28:49 +00:00
Changpeng Liu	c88345ab3d	nvme: apply `nvme_pcie_poll_group_get_stats` to vfio-user Both PCIE and VFIO-USER can use the same APIs to get IO queue pair statistic data, so merge them here. Change-Id: Iadf9ead2bd5abaf11d2ef5d1884acb67369f85bb Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13538 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-22 06:43:35 +00:00
Boris Glimcher	806744b7c8	sock: Add ktls and tls_version to spdk_sock_impl_opts Since `sock_impl_opts` was added to `sock_opts` Can remove `ktls` and `tls_version` from spdk_sock_opts Example: rpc.py sock_impl_set_options -i ssl --enable-ktls rpc.py sock_impl_set_options -i ssl --disable-ktls rpc.py sock_impl_set_options -i ssl --tls-version=12 ./build/examples/perf --enable-ktls ./build/examples/perf --disable-ktls ./build/examples/perf --tls-version=12 Check kTLS statistics here: /proc/net/tls_stat Change-Id: Icf7ee822bad92fda149710be77feb77fc8d4f163 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13510 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-07-22 06:41:39 +00:00
Konrad Sztyber	7f83361553	sock: add sock_impl_opts to sock_opts Some of the options in sock_impl_opts could be different for different sockets (even if they're using the same impl). However, outside of a few selected options (recv_buf_size, send_buf_size), there was no interface to change them. This change will allow users to change impl_opts on a per-socket basis when creating a socket. Sockets created through accept() inherit impl_opts from the listening socket. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I7628ae19def25cef6ffa62aa54bd34e446632579 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13661 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-07-19 09:35:03 +00:00
Konrad Sztyber	50afaf1ee8	sock: store impl_opts in socket structure It'll make it possible to change some of the impl_opts options on a per-socket basis, as well as make it easier to use fields common to all implementations in the generic layer. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id3b5e0a0b302fdecc2387d07fb87b75b487dc5c5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13659 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-19 09:35:03 +00:00
Alexey Marchuk	36589e1e44	bdev_nvme_rpc: Update bdev_nvme_get_io_paths RPC This RPC prints only cntlid, to identify a controller it is needed to issue bdev_get_bdevs and find more info using cntlid. Printing the controller's trid helps to identify the controller faster. Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: I97325b822528ef9e71afbe2ff1c30b3bce2ae203 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13655 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-07-19 08:05:07 +00:00
Krishna Kanth Reddy	6f338d4bf3	bdev : xNVMe BDEV module implementation This implementation of xNVMe BDEV module supports the char-device / ioctl-over-uring, along with the "regular" io_uring, libaio, POSIX aio, emulated aio (via threadpools) etc. Code changes done : a. Addition of xNVMe submodule to SPDK b. Modification of RPC scripts to Create / Delete xNVMe BDEVs c. Implementation of xNVMe BDEV module Signed-off-by: Krishna Kanth Reddy <krish.reddy@samsung.com> Change-Id: If814ca1c784124df429d283015a6570068b44f87 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11161 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-15 12:52:13 +00:00
Konrad Sztyber	030bbebeb2	sock: extract copying impl_opts to a function Both get_opts and set_opts use very similar macros to achieve almost the same thing, so it makes sense to extract it to a separate function. Additionally, it'll be also useful in subsequent patches introducing per-sock impl_opts, as there will be more places when we want to copy impl_opts. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I8ab27298d62ea0118463ee945c708acd91aa5104 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13658 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-14 09:48:25 +00:00
Konrad Sztyber	38f82ecf1e	sock: move (get\|set)_opts up Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I9ddbfb1018d23117582b947058fcd6c322c2ef6b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13657 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-07-14 09:48:25 +00:00
Shuhei Matsumoto	81e92f6bca	bdev/nvme: Calculate and use active ns count to read ANA log page nvme_ctrlr_init_ana_log_page() had used cdata->mnan as active ns count to allocate a buffer and read a ANA log page into the buffer. However, cdata->mnan was larger than the real active ns count and caused an issue when the corresponding NVMe-oF target set the bit 18 of the SGL support field in the Identify Controller Data structure. We still need to use cdata->mnan to allocate the buffer because number of namespaces may be increased dynamically after initialization. Hence, rename nvme_ctrlr::ana_log_page_size to nvme_ctrlr::max_ana_log_page_size and calculate and use the active ns count to read the ANA log page. Check if the current ana_log_page_size is not larger than nvme_ctrlr->max_ana_log_page_size. Fixes issue #2584 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ieb10b9c793c4f48ffd88d517c0e9a55184b7d935 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13653 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-07-14 09:47:41 +00:00
yidong0635	256bfe7685	raid/concat: update bdev readv/writev to ext API. Other parts have changed these since readv and writev support ext_opts. And change corresponding unittest. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I79260a7e6110aa46df2016e579f1da5c241c9844 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13620 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-07-14 09:46:56 +00:00
Konrad Sztyber	c61554bb68	sock/uring: mark non-listen sockets as blocking Instead, we pass MSG_DONTWAIT flag to all recvmsg()/sendmsg() calls. That way, the IOs remain non-blocking, but the socket is marked as blocking allowing us to pass flags like MSG_WAITALL, which only make sense if a socket is in blocking mode. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic52162e84aa14efaf6ad0fb3343d289822758e81 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12592 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-07-14 09:45:54 +00:00
Konrad Sztyber	6d3506d893	sock/uring: extract advancing req's offset to a function It'll make it possible to reuse this code for asynchronous read requests. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I56dab62587884e2e37fad11b5f0d12df92e175ea Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12590 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-07-14 09:45:54 +00:00
Konrad Sztyber	6453cbe783	sock/uring: rename sock_complete_reqs -> sock_complete_write_reqs This function processes and completes asynchronous write requests, so it makes sense to rename it. This is done in preparation for handling asychronous read requests. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I3f36631dc24a3170204aaaba56f4968be0672fe5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12172 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-14 09:45:54 +00:00
Konrad Sztyber	98b3ff5656	sock/uring: rename recv task to errqueue This task is used to receive the socket's error queue to check for completed zero-copy write requests. The rename is done in preparation for adding a read task that will actually receive the data from the socket. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I61bf45e210bb09bd89f3161a75478b41bd8eb070 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12171 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-14 09:45:54 +00:00
Konrad Sztyber	3e47d7fa22	sock: asynchronous readv interface This patch defines a new function, spdk_sock_readv_async(), which allows the user to send a readv request and receive a callback once the supplied buffer is filled with data from the socket. It works simiarly to asynchronous writes, but there can only be a single outstanding read request at a time. For now, the interface isn't implemented and any calls will return -ENOTSUP. Subsequent patches will add support for it in the uring module and as well as emulation in the posix module. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I924e2cdade49ffa18be6390109dc7e65c2728087 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12170 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-14 09:45:54 +00:00
Wojciech Malikowski	81dca28884	ftl: remove deprecated ftl library Signed-off-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Change-Id: I3ebb05be3f1b9864b238cb74f469b4fdf573cd0d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11120 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-07-11 07:23:58 +00:00
Konrad Sztyber	0e25e2a6e2	bdev/compress: make lb_size optional in bdev_compress_create Treat it as zero if the user doesn't provide it. This will make it consistent with other RPCs and the behavior of that command in scripts/rpc.py. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic5505f853bc203fb1144ba3f1f44c736563a3677 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13529 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Karol Latecki <karol.latecki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-07-04 07:23:26 +00:00
Boris Glimcher	7104c8332d	sock: Add ktls and tls_version to spdk_sock_opts See https://docs.kernel.org/networking/tls-offload.html See https://www.openssl.org/docs/man3.0/man3/SSL_set_options.html Change-Id: I2fb433cbc34061cb03e1591bb0b47063fcafc68c Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13071 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-30 07:44:26 +00:00
yupeng	1f0b8df7b0	blobstore: implement spdk_bs_grow and bdev_lvol_grow_lvstore RPC The bdev_lvol_grow_lvstore will grow the lvstore size if the undering bdev size is increased. It invokes spdk_bs_grow internally. The spdk_bs_grow will extend the used_clusters bitmap. If there is no enough space resereved for the used_clusters bitmap, the api will fail. The reserved space was calculated according to the num_md_pages at blobstore creating time. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: If6e8c0794dbe4eaa7042acf5031de58138ce7bca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9730 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-06-28 17:55:43 +00:00
yupeng	88833020eb	blobstore: reserve space for growing blobstore Reserve space for used_cluster bitmap. The reserved space is calculated according to the num_md_pages. The reserved space would be used when the blobstore is extended in the future. Add the num_md_pages_per_cluster_ratio parameter to the bdev_lvol_create_lvstore API. Then calculate the num_md_pages according to the num_md_pages_per_cluster_ratio and bdev total size, then pass the num_md_pages to the blobstore. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: I61a28a3c931227e0fd3e1ef6b145fc18a3657751 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9517 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-06-28 17:55:43 +00:00
Ben Walker	8dd1cd2104	check_format: For C files only, fix return type breaks In SPDK, declarations have the return type on the same line. Definitions have the return type on a separate line. Astyle has an option for enforcing this. Unfortunately, it seems to have two bugs: 1) It doesn't work correctly at all on C++ files. 2) It often fails on functions that return enums, or long type names Deal with 1) by adjusting the check_format.sh script to only tell astyle to fix return type line breaks for C files and not C++. Deal with 2) by adding a few typedefs to work around the problem. Change-Id: Idf28281466cab8411ce252d5f02ab384166790c6 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13437 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-06-27 09:33:48 +00:00
GangCao	df1e07e909	Bdev/RAID: cleanup the RAID config even there is no base bdev added Fix issue: #2557 Related test also added. Change-Id: I1fa8895cf9dd81c75e5b8b1092733e62be8abd32 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13061 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-06-21 07:55:09 +00:00
Boris Glimcher	120382b7ec	test/sock: Fixing hexstr2buf for PSK Adding more unit tests using standard openssl The unfortunate small sleep is needed due to issue: https://www.mail-archive.com/openssl-users@openssl.org/msg02937.html Change-Id: I6f55453f12371bec6a402ba4c1d20e21aed73cf4 Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12625 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-06-20 14:59:47 +00:00
Boris Glimcher	2fb5ff4985	sock: Add support for SSL Added new `ssl` based socket implementation. For now we are using hard-coded PSK and only support TLS 1.3 One can use it via sock_set_default_impl RPCs Nvme/TCP published secure channel specification (TP 8011) Which is based on TLS 1.3 and PSK. So this is a primary but not the oly use case. Before any SSL connection can be established we need to create SSL context. The context should be client/server aware. Similar to regular sockets, to establish connection server must call SSL_accept and client must call SSL_connect. For now I'm using PSK and not certificates since we aim this for NVMe/TCP TP-8011 which supports only PSK. Adding certificates later on will be very easy. The complication with SSL state machine during accep and connect comes with returned SSL_ERROR_WANT_READ or SSL_ERROR_WANT_WRITE. According to documentation, call have to be repeated in this case. Using openssl here for TLS from user space. openssl also has support for kTLS. Will be part of the next changes. openssl doesn't have implemetation for iovec only basic SSL_read and SSL_write. So adding here SSL_readv and SSL_writev wrappers. Tested using: ./build/examples/hello_sock -N ssl -H 127.0.0.1 -P 12345 ./build/examples/hello_sock -N ssl -H 127.0.0.1 -P 12345 -S Also tested using: nvmf_tgt + sock_set_default_impl + perf Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Change-Id: Ie730077c5c581b7e112c18f5f9e1b683015e7b4b Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12327 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-20 14:59:47 +00:00
Tomasz Zawadzki	f7e1f48a79	lib/event: do not set default scheduling period during init reactor_run() decides whether to start gather_metrics based on non-zero scheduler period. The default of 1 sec was set during initialization, in scheduler_subsystem_init(). This resulted in unessecary operations each second, even if only 'static' scheduler is used. This patch moves setting default scheduling period to respective schedulers. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I953aee271a959b6314c8e83434c922dba9638de4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9492 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-06-20 09:56:09 +00:00
Anton Eidelman	0b9100e8a5	bdev/nvme: replace nn with mnan in ana_log size calculation Calculation of the ANA log page size should use the identify ctrl MNAN field (maximum number of allowed namespaces) not the NN (maximum valid nsid value). An ANA-enabled controller must have a non-zero MNAN value, see NVMe Base Specification, Figure 251, therefore nvme_ctrlr_init_ana_log_page() may safely use MNAN. Since NN might be much higher than MNAN, ANA log size based on NN may results in a very large log page and cause a failure to get ANA log, e.g. if it is larger than the controller's MDTS. Fix: replace cdata->nn with cdata->mnan in nvme_ctrlr_init_ana_log_page() Signed-off-by: Anton Eidelman <anton@lightbitslabs.com> Change-Id: I2a522dca833a27dddad25848d7688efa23d23091 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13039 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2022-06-15 08:10:48 +00:00
Tomasz Zawadzki	0f3ddc9c98	env/dpdk: skip build of DPDK based governors when missing rte_power rte_power was added to DPDK long time ago, but some of the DPDK packages do not include it. For those cases just skip building components that depend on in. This change still allows to use dynamic scheduler, since the dpdk_governor usage is optional. Fixes #2534 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ied88edc8d58aae07d1384c1c40203fc80b919d80 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12993 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-15 08:08:55 +00:00
Tomasz Zawadzki	ec1d6fb71e	env/dpdk: simplify checks for rte_power dpdk_governor and gscheduler use rte_power, which is only available on Linux and when DPDK env is used. Rather than repeat those checks in each mk or Makefile, added DPDK_POWER flag directly to DPDK env. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I438caad8d333a4df697a79aa45de2930cce71d23 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12992 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-06-15 08:08:55 +00:00
Jim Harris	488570ebd4	Replace most BSD 3-clause license text with SPDX identifier. Many open source projects have moved to using SPDX identifiers to specify license information, reducing the amount of boilerplate code in every source file. This patch replaces the bulk of SPDK .c, .cpp and Makefiles with the BSD-3-Clause identifier. Almost all of these files share the exact same license text, and this patch only modifies the files that contain the most common license text. There can be slight variations because the third clause contains company names - most say "Intel Corporation", but there are instances for Nvidia, Samsung, Eideticom and even "the copyright holder". Used a bash script to automate replacement of the license text with SPDX identifier which is checked into scripts/spdx.sh. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaa88ab5e92ea471691dc298cfe41ebfb5d169780 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12904 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: <qun.wan@intel.com>	2022-06-09 07:35:12 +00:00
Jim Harris	68108360f9	bdev/nvme: check bdev's module when setting multipath policy We cannot try to set multipath policy on a non-nvme bdev. While here, make the error messages match what we use for setting preferred path. Fixes issue #2543. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I823077f92634ee3c16e77e7e0d67eb343ec3584e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12916 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: <qun.wan@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-06-08 08:08:04 +00:00
Liu Xiaodong	9be660eabd	bdev_aio: fix aio error indication From aio_abi.h, io_event.res is defined __s64, negative errno is assigned to io_event.res for error situation in Linux Kernel. But from libaio.h, io_event.res is defined unsigned long, so convert it to signed value for error detection. Fixes issue #2325 Change-Id: I99f8b03e8bfedfa260fe91f3e9e8ef0e5ecd0b77 Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12882 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-07 00:04:09 +00:00
Monica Kenguva	95e057210e	test/nvmf: test reconnect_delay_sec parameter Signed-off-by: Monica Kenguva <monica.kenguva@intel.com> Change-Id: I45dc2b2fe660e5a53c8976dea2640ea53ec00a3d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12184 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-06-06 07:51:54 +00:00
Jim Harris	89c939e0a6	Eliminate license header differences. There are a few places where a typo, extra character or newline was added to the BSD 3-clause license text which made it differ very slightly from the rest of the license headers in the source tree. Remove those differences in this patch, to help with automation of SPDX identifier replacement in the next patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I542dc53cd252b1699253fd6dcc3ccac9643d7878 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12905 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-06-06 07:34:55 +00:00
wanghailiangx	465f99e9ff	accel module: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I5a715e9b9e991c6febec5e505384728281eee8b7 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12773 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 16:14:41 +00:00
wanghailiangx	25114fce69	mvnf module: remove support for deprecated RPC names(missed two) These were deprecated in 2019, it's time to remove support for them now. Change-Id: Ia28abb9b724c9589e1da66f0f22a1ba95d96f5dc Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12776 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 16:14:16 +00:00
wanghailiangx	7aa92ad513	bdev module: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I9e203a52877802127df8144e68090d7975f9d200 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12772 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 16:13:00 +00:00
paul luse	b483811ff1	modules/accel/iaa: add IAA accel_fw module And associated RPC to enable. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I06785bcd8b8957293ad41d13bab556fe62f29fd5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12765 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-05-23 16:04:57 +00:00
wanghailiangx	3ac967baa6	bdev_iscsi: remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: I25aea510648a55d751db3740b36fb9924d1f52ed Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12747 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-05-23 07:02:37 +00:00
paul luse	ecaa8e1000	lib/idxd: prepare some plumbing for adding IAA Misc internal IDXD changes needed to support the upcoming addition of IAA. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Idb180088af545b174ed33a4f8ee113e58640477f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12764 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
paul luse	ffef30ae0d	modules/accel_dsa: update IDXD references to DSA where it makes sense IDXD has always been used everywhere but technically it stands for the driver, not the HW (Intel Data Streaming Accelerator Driver) where the X comes from "Streaming Accelerator" somehow. Anyway, the underlying hardware is just DSA. It doesn't matter much now but upcoming patches will add support for a new HW accelerator called the Intel In-Memory Analytics Accelerator which we'll call IAA and it will use the same (mostly) device driver (IDXD) as DSA. So, calling the HW what it is will lessen confusion when adding IAA support. This patch just does renaming for the accel_fw module and associated files (RPC, etc). Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ib3b1f982cc60359ecfea5dbcbeeb33e4d69aee6a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11984 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-23 07:02:21 +00:00
Alexey Marchuk	366e19efe8	vbdev_compress: Remove mentioning of 2MiB huge pages We explicitly refer to 2MiB huge pages when a buffer spans huge page boundary. That is not correct since it may happen even with 1GiB huge pages. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reported-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ief72b677ccf3d8c1d9835bb8b74621f2b349175a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12250 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-20 17:39:57 +00:00
Tomasz Zawadzki	e0516095fc	event/vhost: separate vhost subsystem to scsi and blk Separate out SCSI and BLK vhost subsystems to later add virtio_blk transport abstraction. This allows for further changes to the vhost_blk, not affecting vhost_scsi. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id1ecfeafeb936809a479a43c321e13f75cb3d5ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9539 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-05-20 09:20:07 +00:00
Changpeng Liu	e1b5d28c8b	bdev/nvme: set `max_segment_size` and `max_num_segments` Maximum Data Transfer Size indicates the maximum data transfer size for a command that transfers data between memory and the controller. SPDK NVMe driver will split IO request based on MDTS and stripe boundary, however, if user submited a big enough IO size, SPDK NVMe driver will run out of requests data structure and return -EINVAL as the error code, it's OK for application who call SPDK NVMe driver directly. SPDK bdev module also will do similar split based on `max_segment_size` and `max_num_segments`, but we don't set them for bdev/nvme driver, once there was a big enough IO request, the NVMe driver will return -EINVAL to bdev module, here we set `max_segment_size` based on MDTS and set `max_num_segments` based on number of requests of NVMe controller. Fix #2403. Change-Id: Ic6e14a0b12413783597122285ac648b87e1f1e16 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12186 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-20 09:18:56 +00:00
GangCao	7cfb12f437	Bdev/Lvol: check base bdev's md before examining To fix issue #2514 Change-Id: If507382202e729f5934a354e2515a035ad5aeb0c Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12750 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-20 09:18:18 +00:00
Shuhei Matsumoto	e4584d937e	bdev/nvme: Poll adminq more often during ctrlr disconnection During ctrlr reconnection, spdk_nvme_ctrlr_reconnect_poll_async() is executed by a non-timed poller. We should poll adminq more often during ctrlr disconnection too. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib1f5b41015aed20deda8df6f2c837981ac233c04 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12615 Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-20 09:17:28 +00:00
Shuhei Matsumoto	fcf52fbff5	bdev/nvme: Reversed orderings for reset between PCIe and NVMe-oF As described in the NVMe specification, a controller level reset includes the following actions: - the controller stops processing any outstanding admin or I/O commands; - all I/O SQs and CQs are deleted. In a full controller reset sequence for a PCIe controller, if we do a controller level reset first, we can abort outstanding commands after the hardware has actually been stopped. For NVMe-oF controller, each I/O qpair is an independent network connection and is disconnected safely. We do not want to change NVMe-oF controller. Fixes the issue #2360 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If05febac74705bfd3df5abd15064c1203126e027 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12447 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-20 09:17:28 +00:00
wanghailiangx	c77f17a53e	bdev_malloc and bdev_null : remove support for deprecated RPC names These were deprecated in 2019, it's time to remove support for them now. Change-Id: Ic80ce74344b24814dad792cfff6a4791d0430527 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12741 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-19 13:07:41 +00:00
Konrad Sztyber	e5f9e82291	bdev/nvme: add timeout option to start_discovery It's now possible to specify a time to wait until a connection to the discovery controller and the NVM controllers it exposes is made. Whenever that time is exceeded, a callback is immediately executed. However, depending on the stage of the discovery process, we might need to wait a while before actually stopping it (e.g. because a controller attach is in progress). That means that a discovery service might be visible for a while after it timed out. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I2d01837b581e0fa24c8e777730d88d990c94b1d8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12684 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-18 07:24:06 +00:00
Konrad Sztyber	10dbbf6d8a	bdev/nvme: free log_page when freeing discovery ctx Currently this should be a no-op as the log_page is always freed in discovery_remove_controllers(), but it'll make it easier to handle cases when we want to stop the discovery service while it's attaching NVM controllers. We'll be relying on this in the subsequent patch adding attach timeout to start_discovery(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia03fde92bf5ae5590bca507b7a0f963885d85f4f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12721 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-05-18 07:24:06 +00:00
Shuhei Matsumoto	00d46b80b2	bdev/nvme: Disable automatic failback in multipath mode By default, failback to the preferred I/O path is done automatically if it is restored. Some users may want to keep using the backup I/O path even if the preferred I/O path is restored. In this case, bdev_nvme_set_preferred_path can be used to do manual failback. We may be able to clear/fill I/O path cache more strictly but it will be complicated and have bugs. This patch does the minimal change, just skips an apparent case. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I78fe5faee6ff04e88ae3d7c6be6da1c20637c912 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12431 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-05-17 12:54:45 +00:00
Konrad Sztyber	0aa212bddb	bdev/nvme: use stop_discovery in module_fini Using stop_discovery() will also free the entry contexts tied to that discovery service. It's not a big deal to leave them, as module_fini is usually called on app shutdown, but this gets rid of asan reporting memory leaks. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I81fc843708694791a5846c24cbde4b32e0fa7287 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12683 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-16 10:20:49 +00:00
Konrad Sztyber	82889ba8e5	bdev/nvme: free entry_ctx_in_use It was never freed, so we leaked it. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I9e17d043ebd0d8d11bb776cb2676ac7f53f8ee41 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12682 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-16 10:20:49 +00:00
Konrad Sztyber	3c61b8ca6d	bdev/nvme: extract stopping discovery to a function For now, it's only called from a single location, bdev_nvme_stop_discovery, but it'll make it possible to stop the discovery from other places. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I3ba447f30fd8ed23c392d008b3d03cdad30cdb33 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12681 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-16 10:20:49 +00:00
Konrad Sztyber	623105bb09	bdev/nvme: zero out the referred discovery trid If the trid is not zeroed out, trid.trstring might contain garbage value, which means that nvme_probe_internal() might not populate it based on trtype and will use that garbage value to get a transport, leading to the following failure: ``` nvme.c: 834:nvme_probe_internal: ERROR: NVMe trtype 3 () not available ``` Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I56bc6502d285ee5ce094184e00d3297f6332e8c0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12680 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-16 10:20:49 +00:00
Konrad Sztyber	5c2b5fc14b	bdev/nvme: fix wait_for_attach assignment Moved assigning ctx->start_cb_fn before it's checked for NULL to set ctx->wait_for_attach, otherwise it was always false. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I819d9e326dbb36f943279c3714695ae604dd64b2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12628 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-16 10:20:49 +00:00
Konrad Sztyber	ef675d38c1	bdev/nvme: check discovery service trids during start Check that we're not already connected to a discovery service that has the same address (or has a referall to) as the service we're trying to start. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I84863fd959f62b30e9a348f69d10c7f1edffda7a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12627 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-16 10:20:49 +00:00
Konrad Sztyber	33c14a1445	bdev/nvme: don't allow duplicate discovery service names Otherwise, we'll try to use the same name to create the bdevs. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I38cfb9073f343a7bf3966754008065c7ab89e609 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12626 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-16 10:20:49 +00:00
Gal Hammer	90aba31ac3	bdev/ocf: Implement ENOMEM handling when submitting IO to base device Replaced the TODO comment with an error handing in case SPDK fails to allocate memory for the IO request. Returning the proper error code will use the bdev's no memory retry mechanism to handle the failure. Signed-off-by: Gal Hammer <gal.hammer@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Change-Id: I151668f26b7f122dca95eaf65934be675e601952 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12372 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Rafal Stefanowski <rafal.stefanowski@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-16 10:15:27 +00:00
Alexey Marchuk	b0262063d3	vbdev_lvol: Report memory domains Update functional test to verify that lvol supports memory domains Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I5e91eedc8879359c3add45d417b6f3eaad4d75b9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11375 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-16 10:14:26 +00:00
Alexey Marchuk	248ccd8607	lvol: Use blobstore ext API in data path The new blobstore ext API is used when the user provides ext_io_opts in bdev layer. To store blobstore ext_io_opts, vbdev_lvol reports non-zero get_ctx_size in bdev module interface. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I64076b5369533be0c1d69ca48aef9d70a9abe488 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11373 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-16 10:14:26 +00:00
Alexey Marchuk	c3a40396cf	blob/bdev: Use extended blob API Queued vectored IOs use extended API - if ext_opts is NULL there is no difference between regular and extended blob API Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I091f36d81dab04bcd483ea61e667a368441d70e4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11372 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-16 10:14:26 +00:00
Alexey Marchuk	ba8f1a9e5d	blob: Add readv/writev ext ops to spdk_bs_dev Introduce spdk_blob_ext_io_opts structure which is used in the new *_ext functions. Zeroes dev is updated with implementation of readv_ext which uses memory domains memzero or regular memset(). Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Id94542196eff999827bf00591fd43804256fccb4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11369 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-16 10:14:26 +00:00
Shuhei Matsumoto	fe9a8fab7b	bdev/nvme: Factor out set callback and call ctrlr_disconnect() into a helper function The following patches will swap the ordering of destroying I/O qpairs and disconnecting a controller for PCIe transport to fix a github issue. Setting callback and calling spdk_nvme_ctrlr_disconnect() have been executed in two cases now. After the following patches, these will be executed in three cases. Factoring out these into a helper function will be rewarded. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I8597908d7fd8acc6dc62ec442ba2e8c4c08f11a4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12562 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-12 07:28:02 +00:00
Shuhei Matsumoto	91165150cc	bdev/nvme: Factor out deleting all I/O qpairs for reset into a helper function The following patches will swap the ordering of destroying I/O qpairs and disconnecting a controller for PCIe transport to fix a github issue. For PCIe transport, destroying I/O qpairs will be as the completion callback to controller disconnection. To do this easier, factor out destroying I/O qpairs into a helper function. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I96bb6d0487fc67b71c759ee64200bd7fb71aceb1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12561 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-12 07:28:02 +00:00
Gal Hammer	84863d99a8	bdev/ocf: Fix no memory error value check On I/O request completion, OCF uses OCF_ERR_NO_MEM (and not ENOMEM) to report a memory allocation failure. Signed-off-by: Gal Hammer <gal.hammer@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Change-Id: If06608d7aa0fc747a79564b7372c915d99307235 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12352 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Rafal Stefanowski <rafal.stefanowski@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-05-11 09:34:45 +00:00
Konrad Sztyber	f331ae167b	bdev/nvme: add RPC returning information about discovery service The RPC returns a list of active discovery service connections. Each discovery service is described by a name, its trid, and a list of discovery service trids it refers to. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifa4b9501dd353e7b4948ad830575a6c94dafd86b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12380 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-05-05 14:08:57 +00:00
paul luse	d58a2f6cc5	lib/accel: support multiple accel modules (aka engines) at once We enable multiple engines by: * getting rid of the globals that point to the one available HW and one available SW engine * adding a submit_tasks() entry point for the SW engine so that it is treated like any other engine allowing us to just call submit_tasks() to the assigned engine for the opcode instead of checking what is supported * changing the definition of engine capabilities from "HW accelerated" to simply "supported" * during init, use a global (g_engines_opc) that contains engines and is indexed by opcode so we know what the best engine is for each op code * future patches will add RPC's to override engine priorities or specifically assign an opcode(s) to an engine. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I9b9f3d5a2e499124aa7ccf71f0da83c8ee3dd9f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11870 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-05 07:11:32 +00:00
Shuhei Matsumoto	8f9b977504	bdev/nvme: Add active/active policy for multipath mode The NVMe bdev module supported active-passive policy for multipath mode first. By this patch, the NVMe bdev module supports active-active policy for multipath node next. Following the Linux kernel native NVMe multipath, the NVMe bdev module supports round robin algorithm for active-active policy. The multipath policy, active-passive or active-active, is managed per nvme_bdev. The multipath policy is copied to all corresponding nvme_bdev_channels. Different from active-passive, active-active caches even non_optimized path to provide load balance across multiple paths. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ie18b24db60d3da1ce2f83725b6cd3079f628f95b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12001 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-05 07:11:24 +00:00
Shuhei Matsumoto	0869265d66	bdev/nvme: Factor out searching io_path operation from find_io_path() Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I26939b39cfd4b92bdbc1d4ef10961ba35145043c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12000 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-05-05 07:11:24 +00:00
Shuhei Matsumoto	22b77a3c80	bdev/nvme: Set preferred I/O path in multipath mode If we specify a preferred path manually for each NVMe bdev, we will be able to realize a simple static load balancing and make the failover more controllable in the multipath mode. The idea is to move I/O path to the NVMe-oF controller to the head of the list and then clear the I/O path cache for each NVMe bdev channel. We can set the I/O path to the I/O path cache directly but it must be conditional and make the code very complex. Hence, let find_io_path() do that. However, a NVMe bdev channel may be acquired after setting the preferred path. To cover such case, sort the nvme_ns list of the NVMe bdev too. This feature supports only multipath mode. The NVMe bdev module supports failover mode too. However, to support the latter, the new RPC needs to have trid as parameters and the code and the usage will be come very complex. Add a note for such limitation. To verify one by one exactly, add unit test. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ia51c74f530d6d7dc1f73d5b65f854967363e76b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12262 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: <tanl12@chinatelecom.cn> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-05-05 07:11:24 +00:00
Richael Zhuang	9bff828f99	sock: introduce dynamic zerocopy according to data size MSG_ZEROCOPY is not always effective as mentioned in https://www.kernel.org/doc/html/v4.15/networking/msg_zerocopy.html. Currently in spdk, once we enable sendmsg zerocopy, then all data transferred through _sock_flush are sent with zerocopy, and vice versa. Here dynamic zerocopy is introduced to allow data sent with MSG_ZEROCOPY or not according to its size, which can be enabled by setting "enable_dynamic_zerocopy" as true. Test with 16 P4610 NVMe SSD, 2 initiators, target's and initiators' configurations are the same as spdk report: https://ci.spdk.io/download/performance-reports/SPDK_tcp_perf_report_2104.pdf For posix socket, rw_percent=0(randwrite), it has 1.9%~8.3% performance boost tested with target 1~40 cpu cores and qdepth=128,256,512. And it has no obvious influence when read percentage is greater than 50%. For uring socket, rw_percent=0(randwrite), it has 1.8%~7.9% performance boost tested with target 1~40 cpu cores and qdepth=128,256,512. And it still has 1%~7% improvement when read percentage is greater than 50%. The following is part of the detailed data. posix: qdepth=128 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 286.5 298.5 4.19% 307 304.15 -0.93% 4 1042.5 1107 6.19% 1135.5 1136 0.04% 8 1952.5 2058 5.40% 2170.5 2170.5 0.00% 12 2658.5 2879 8.29% 3042 3046 0.13% 16 3247.5 3460.5 6.56% 3793.5 3775 -0.49% 24 4232.5 4459.5 5.36% 4614.5 4756.5 3.08% 32 4810 5095 5.93% 4488 4845 7.95% 40 5306.5 5435 2.42% 4427.5 4902 10.72% qdepth=512 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 275 287 4.36% 294.4 295.45 0.36% 4 979 1041 6.33% 1073 1083.5 0.98% 8 1822.5 1914.5 5.05% 2030.5 2018.5 -0.59% 12 2441 2598.5 6.45% 2808.5 2779.5 -1.03% 16 2920.5 3109.5 6.47% 3455 3411.5 -1.26% 24 3709 3972.5 7.10% 4483.5 4502.5 0.42% 32 4225.5 4532.5 7.27% 4463.5 4733 6.04% 40 4790.5 4884.5 1.96% 4427 4904.5 10.79% uring: qdepth=128 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 270.5 287.5 6.28% 295.75 304.75 3.04% 4 1018.5 1089.5 6.97% 1119.5 1156.5 3.31% 8 1907 2055 7.76% 2127 2211.5 3.97% 12 2614 2801 7.15% 2982.5 3061.5 2.65% 16 3169.5 3420 7.90% 3654.5 3781.5 3.48% 24 4109.5 4414 7.41% 4691.5 4750.5 1.26% 32 4752.5 4908 3.27% 4494 4825.5 7.38% 40 5233.5 5327 1.79% 4374.5 4891 11.81% qdepth=512 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 259.95 276 6.17% 286.65 294.8 2.84% 4 955 1021 6.91% 1070.5 1100 2.76% 8 1772 1903.5 7.42% 1992.5 2077.5 4.27% 12 2380.5 2543.5 6.85% 2752.5 2860 3.91% 16 2920.5 3099 6.11% 3391.5 3540 4.38% 24 3697 3912 5.82% 4401 4637 5.36% 32 4256.5 4454.5 4.65% 4516 4777 5.78% 40 4707 4968.5 5.56% 4400.5 4933 12.10% Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: I730dcf89ed2bf3efe91586421a89045fc11c81f0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12210 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-28 07:29:28 +00:00
Jim Harris	ab9c7a6a81	bdev/nvme: account for ACWU values being 0-based ACWU and NACWU are 0-based values. But spdk_bdev_get_acwu() specifies the compare-and-write-unit in terms of blocks (i.e. 1-based). So the bdev/nvme module needs to add 1 to this value before registering the bdev. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7c19975a2bd8c09bb65374838fe20aad690d1ecf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12384 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-27 07:36:44 +00:00
Alex Michon	2bc134eb4b	bdev/nvme: Fix aborting fuse commands When sending a fused compare and write command, we pass a callback bdev_nvme_comparev_and_writev_done that we expect to be called twice before marking the io as completed. In order to detect if a call to bdev_nvme_comparev_and_writev_done is the first or the second one, we currently rely on the opcode in cdw0. However, cdw0 may be set to 0, especially when aborting the command. This may cause use-after-free issues and this may call the user callbacks twice instead of once. Use a bit in the nvme_bdev_io instead to keep track of the number of calls to bdev_nvme_comparev_and_writev_done. Signed-off-by: Alex Michon <amichon@kalrayinc.com> Change-Id: I0474329e87648e44b08998d0552b2a9dd5d34ac2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12180 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-26 07:47:09 +00:00
Shuhei Matsumoto	2a6a64485c	bdev/nvme: Add bdev_nvme_get_io_paths RPC to monitor I/O path states Add an new RPC bdev_nvme_get_io_paths to query all active I/O paths. One io_path belongs to One nvme_bdev_channel. Each nvme_bdev_channel is associated with one nvme_bdev. If the RPC bdev_nvme_get_io_paths has a bdev name as a parameter it can use spdk_for_each_channel() simply for the corresponding nvme_bdev. However, users will want to know I/O paths of all nvme_bdevs like the RPC bdev_get_bdevs. One io_path has one nvme_qpair. One nvme_qpair belongs to one nvme_poll_group. By relying on these relationships, the RPC bdev_nvme_get_io_paths traverses all nvme_poll_groups by using spdk_for_each_channel() to g_bdev_nvme_ctrlrs. The RPC bdev_nvme_get_io_paths has two modes, display all or the specified NVMe bdev's active I/O paths. The specified bdev name is used just for comparison and empty array is returned if no matched io_path is found. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4a0dbf3ef7aaa9a7b7345fc03dc493cc6d37bc99 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12146 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	2730f5cac0	bdev/nvme: Add cntlid to bdev_get_bdevs and bdev_nvme_get_controllers RPCs NVMe bdev name already includes the name of the NVMe bdev controller and the NSID. CNTLID will be a good ID to identify a namespace from a NVMe bdev when multipath is configured. However, the query RPCs, bdev_get_bdevs and bdev_nvme_get_controllers had not returned such information. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2f2e355ff13f69ced616be803a3152c838cdc980 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12276 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	972a9f6c40	bdev/nvme: Add multipath info to the bdev_get_bdevs RPC Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iacc3458a209e31b758455f55ab3bae276ae60dd8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12312 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	50b6329ca0	bdev/nvme: Factor out ctrlr info json dump into a helper function Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I7f1e08ff13d890cb780e7b66c18a77ab85c82029 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12311 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	13ca6e52d3	bdev/nvme: Handle ANA transition (change or inaccessible state) correctly Previously, if a namespace is in ANA inaccessible state, I/O had been queued infinitely. Fix this issue according to the NVMe spec. Add a temporary poller anatt_timer and a flag ana_transition_timedout for each nvme_ns. Start anatt_timer if the nvme_ns enters ANA transition. If anatt_timer is expired, set ana_transition_timedout to true. Cancel anatt_timer or clear ana_transition_timedout if the nvme_ns exits ANA transition. nvme_io_path_become_available() returns false if ana_transition_timedout is true. Add unit test case to verify these addition. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic76933242046b3e8e553de88221b943ad097c91c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12194 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	da2fc15f2a	bdev/nvme: Factor out updating ANA state of ns operation Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib703f57c4bc00c7305856b2f0613fe68428c953e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12193 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-22 09:44:57 +00:00
paul luse	bf5aca3274	module/accel/idxd: change json write of kernel mode from U32 to bool g_kernel_mode is a bool everywhere it is used but was written as a U32 in write_config_json routine. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I16b68b88a259df0d8240b64464729bd4a0ef84ec Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12275 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: <wayne.gao@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-21 08:12:37 +00:00
Shuhei Matsumoto	e63eebca1b	bdev/nvme: Retry creating qpair if it fails when creating bdev channel We may fail creating qpair when adding io_path while creating a bdev_channel if connection is down. But if we enable I/O error recovery, we can retry creating qpair later. So let nvme_qpair_create() succeed if the ctrlr is being reset or I/O error recovery is enabled even if creating qpair failed. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I7d4ff036187bb79ada258cfc299582b4d287018b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12288 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: <tanl12@chinatelecom.cn> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-20 10:56:54 +00:00
Shuhei Matsumoto	8cd418883d	bdev/nvme: Call failover() instead of reset() if I/O qpair gets error first Previously, only if admin qpair gets error, bdev_nvme_failover() was called. However, I/O qpair may get error earlier than admin qpair. In this case, bdev_nvme_failover() was called but reset was already in progress. So bdev_nvme_failover() returned without doing anything. bdev_nvme_reset_complete() executes bdev_nvme_failover() if reset failed. However the test time of test/nvmf/host/failover.sh was very short. Timeout came before trying bdev_nvme_failover(). We can replace other bdev_nvme_reset() calls by bdev_nvme_failover() but this patch focuses on the critical case. Fixes issue #2128. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I68f54bbf54f92343aa56ae41f2b4cd92421c4bbb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12295 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-20 10:56:54 +00:00
zhangduan	87cfed8442	sock: Add ack_timeout to spdk_sock_opts Due to the same reason as transport_ack_timeout for RDMA transport, TCP transport also needs ack timeout. This timeout in msec will make TCP socket to wait for ack util closes connection. Signed-off-by: zhangduan <zhangd28@chinatelecom.cn> Change-Id: I81c0089ac0d4afe4afdd2f2c7e5bff1790f59199 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12214 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-14 08:34:29 +00:00
paul luse	37b68d7287	accel: cleanup by getting rid of capabilties enum In support of upcoming patches and to greatly simplify things, the capabilites enum which held bit positions for each opcode has been removed. Only the opcodes enum remains and thus only opcodes are used throughout. For the capabiltiies bitmap a helper function is added to convert from opcode to bit position. Right now it is used in the IO path but in upcoming patches that goes away and the conversion is only done at init time. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ic4ad15b9f24ad3675a7bba4831f4e81de9b7bc70 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11949 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-14 08:32:50 +00:00
yupeng	df80e8ffc6	bdev/raid: stop the raid bdev in raid_bdev_destruct The raid bdev may receive IO requests after the raid layer invokes spdk_bdev_unregister function. This patch moves the raid_bdev->module->stop to the raid_bdev_destruct function. Then there will be no IO after the raid_bdev->module->stop is invoked. Below is a way to reproduce this issue: (1) create a malloc bdev sudo ./scripts/rpc.py bdev_malloc_create --name malloc0 128 4096 (2) create a concat bdev base on the malloc bdev sudo ./scripts/rpc.py bdev_raid_create --name concat0 \ --raid-level concat --base-bdevs malloc0 --strip-size-kb 4 (3) create a lvstore base on the concat bdev sudo ./scripts/rpc.py bdev_lvol_create_lvstore \ --cluster-sz 4194304 --clear-method unmap concat0 lvs0 (4) remove the concat bdev sudo ./scripts/rpc.py bdev_raid_delete concat0 In the step(4), the spdk app will crash because an IO request is sent to the concat bdev after the concat_stop is invoked. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: I76d6964ee9528d4590ed86e9c5c28d53e85da32f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12221 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-13 08:33:19 +00:00
Jim Harris	0ab324dd2c	bdev/pmem: add support for IO_TYPE_FLUSH There's nothing to actually flush, the support just immediately completes the command with success. Fixes issue #2446. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib634bbb2dfc6f68b273a2d933d6c9b5dbfa03758 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12203 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: fengchunsong <fengchunsong@huawei.com> Reviewed-by: Rafal Stefanowski <rafal.stefanowski@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-11 07:43:23 +00:00
Ben Walker	c86778398b	bdev/nvme: Remove ctrlr from nvme_ctrlr_channel This was neither set nor used. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I3119135843c5fc0b8724e593db40df46e6b5bdb0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12097 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-07 07:23:56 +00:00
yupeng	64eebbd132	bdev/raid: Add concat module The concat module can combine multiple underlying bdevs to a single bdev. It is a special raid level. You can add a new bdev to the end of the concat bdev, then the concat bdev size is increased, and it won't change the layout of the exist data. This is the major difference between concat and raid0. If you add a new underling device to raid0, the whole data layout will be changed. So the concat bdev is extentable. Change-Id: Ibbeeaf0606ff79b595320c597a5605ab9e4e13c4 Signed-off-by: Peng Yu <yupeng0921@gmail.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11070 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-05 07:39:00 +00:00
Alexey Marchuk	cff525d336	vbdev_gpt: Report memory domains vbdev_gpt reports memory domains of its base bdev if the base bdev is not configured to check DIF refence tag since in that case bdev_part will remap reftags thereby it will touch metadata. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie1fee7f267c318b3cede47fbbbe9f7639571bdeb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11050 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-04 09:57:56 +00:00

... 2 3 4 5 6 ...

1655 Commits