ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Yuriy Umanets	4a11579eaf	bdev/crypto: Improved debug messages - More user-friendly style of error and debug messages. - Remove "ERROR" word from SPDK_ERRLOG() to avoid duplicating in the log. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Iaee068f96e66f567fc23b34ae0ae6221c1bd710c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11632 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-03-24 09:21:35 +00:00
Yuriy Umanets	5ba9b78e17	bdev/crypto: Cleanup with crypto opts fields duplication - Fixed duplication of key, key2, drv_name, cipher, etc., fields in struct bdev_names and struct vbdev_crypto. Moved all of them into the new struct vbdev_crypto_opts, which is re-used by both structs. This also removes duplication in error handling and finalization logic that checks the keys are zeroed out and properly freed. - Moved unhexlify into vbdev rpc code. All keys are passed to vbdev already in the binary form. - Provide meaningful error messages in the rpc response on keys validation issues during setup of crypto vbdev. - Updated unit tests. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I1fab8771bbbc0cd2f359f0d105fec28fb86893b3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11631 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-24 09:21:35 +00:00
Yuriy Umanets	1434e255ef	bdev/crypto: Critical fix about using binary keys - Added hexlify() and unhexlify() for key and key2. This is required for keys that contain zero and non-ASCII characters. Since binary keys may contain zero character, strlen(key) cannot be used and key_size and key2_size are used instead. Non-ASCII chars are not allowed in json and using hexlified keys fixes this issue as well. - Updated documentation to clearly state that hexlified keys must be used. - Updated test scripts. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I3fce7839f7eaa67d0307071eba80b4cea472d731 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11891 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-24 09:21:35 +00:00
Yuriy Umanets	45f24aebe7	bdev/crypto: MLX5 AES_XTS general support - General MLX5 crypto support. - Unit-tests MLX5 crypto support. - Documentation update to list the MLX5 driver as supported, enumerate the cipher algorithms and provide some configuration hints. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I0da1f49f4acd068d75a4d8633f84fe626d774431 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11630 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-24 09:21:35 +00:00
Alexey Marchuk	c20dd8afee	bdev: Add ext_opts in public bdev_io section Bdev modules must not access internal bdev_io structure, so add a new pointer in a public section. Pointer in internal section will be used in next patch Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ib631563015b3e5fa9300d22b7ae59d8db43c8275 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10421 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-23 09:01:40 +00:00
Ben Walker	f0bf4e75f5	idxd: Eliminate configs SPDK has settled on what the optimal DSA configuration is, so let's always use it. Change-Id: I24b9b717709d553789285198b1aa391f4d7f0445 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11532 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-03-21 11:05:28 +00:00
Jim Harris	26d4436a52	bdev/nvme: rename discovery_ctx->detach to ->stop Detach and stop are two different operations. This ->detach field was used to denote when the associated discovery service should be stopped. So call the field 'stop' instead. That may trigger the currently attached discovery controller to be detached. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I61c7fc860cd9dbcfab71eedfd223c06c51a41f27 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11771 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-21 11:05:17 +00:00
Jim Harris	40a9d6a03e	bdev/nvme: add entry_ctx_in_use We will now keep a list of the possible paths to the discovery subsystem. One of them will be the path we are currently connected to (which at service start, is the path specified by the user). Additional entries are added for discovery log page entries referencing the discovery subsystem. When the discovery service starts, we just have the initial entry in the list - the discovery poller tries to connect to it, and if the connect starts successfully, removes it from the list and points ctx->entry_ctx_in_use to it. This will be useful later when we want to iterate through the available paths to the discovery subsystem if the current path fails. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5b18e0f20c4607e29ac0f12f27ba7eb169d0206d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11770 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-21 11:05:17 +00:00
Jim Harris	5428fb0133	bdev/nvme: add function to create an entry_ctx for discovery This reduces some code duplication since the same function will be reused in an upcoming patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id6764171ff93c95de49792a4488f2c205b8eddb6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11769 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-21 11:05:17 +00:00
Jim Harris	81587f0663	bdev/nvme: always defer start of discovery service We used to wait until the discovery service could connect to the discovery subsystem before calling the callback function provided by the caller (mainly the start_discovery RPC). Moving forward, we will be handling the case where the discovery subsystem is unavailable temporarily. For now, let's not fail the bdev_nvme_start_discovery call if we cannot connect to the discovery subsystem. This will keep the initial service start path the same as the path where the discovery subsystem is temporarily unavailable. In the future, we can consider adding functionality to the start_discovery RPC that waits up to X number of seconds to see if we were able to connect and fail otherwise. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Icb05523b9d59f508bfbc0233595c8bf58c10488f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11768 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-21 11:05:17 +00:00
Shuhei Matsumoto	0b32309bf6	bdev/nvme: Check not only I/O qpair but also adminq when finding optimal I/O path For RDMA transport, adminq will find transport error first because usually only adminq polls CM events. Change-Id: I7b22cc8883bf02198f1a90d2654c1de6f2e736e6 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11331 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	3edbcba287	bdev/nvme: Factor out clearing all I/O path caches into a helper function This is a preparation to the following patches. Change-Id: I1bb0052c745d4f83ff621e4110907a8ac1f1d597 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11330 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	3182be6d26	bdev/nvme: Fail fast I/O qpair if poll_group_process_completions() returns negated errno If qpair is disconnected asynchronously, it takes time from detecting transport error to actually disconnected. We should avoid using the path as soon as possible after detecting any transport error. Poll group clears I/O path cache if it finds transport error and avoid using the path which had transport error. These changes will reduce the failover time. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I00580159a84372a115ed5e62a6ce13eed4368999 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11329 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	aca0d56e3d	bdev/nvme: Reconnect ctrlr after it is disconnected at completion poller spdk_nvme_ctrlr_disconnect() will be made asynchronous in the following patches and so we will need to have some changes. spdk_nvme_ctrlr_disconnect() disconnects adminq and ctrlr synchronously now. If spdk_nvme_ctrlr_disconnect() is made asynchronous, spdk_nvme_ctrlr_process_admin_completions() will complete to disconnect adminq and ctrlr, and will return -ENXIO only if adminq is disconnected. However even now spdk_nvme_ctrlr_process_admin_completions() returns -ENXIO if adminq is disconnected. So as a preparation, set a callback before calling spdk_nvme_ctrlr_disconnect() and call the callback if it is set and spdk_nvme_ctrlr_process_admin_completions() returns -ENXIO. Besides, fix the return value of bdev_nvme_poll_adminq() in this patch. Change-Id: I2559f86bb8cf9a92b5b386ed816c00b08c9832df Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10950 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Shuhei Matsumoto	a76bbe3553	bdev/nvme: Disconnect and then free I/O qpair in a ctrlr reset sequence As we do when deleting ctrlr_channel, disconnect and then free I/O qpair in a ctrlr reset sequence. Deleting ctrlr_channel and resetting ctrlr_channel may cause conflicts. This patch processes such conflicts correctly. If destroy_ctrlr_channel_cb() is executed between pending and executing reset_destroy_qpair(), reset_destroy_qpair() is not executed because ctrlr_channel is not found. In this case, destroy_qpair_channel() starts disconnecting qpair and deletes ctrlr_channel. Then disconnected_qpair_cb() releases a reference to poll group. If destroy_ctrlr_channel_cb() is executed between executing reset_destroy_qpair() and disconnected_qpair_cb(), destroy_ctrlr_channel_cb() skips ctrlr_channel for a reset sequence. Change-Id: I1f49f74b94aefbea178680aa53ded3a12876c676 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10766 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-21 10:49:11 +00:00
Yuriy Umanets	8ec34933e9	bdev/crypto: Add qp_desc_nr to struct vbdev_crypto At the moment MLX5 uses different number of qp descriptors than the other pmd crypto drivers. Adding it to vbdev_crypto on init and re-use everywhere we need it. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Iea4d4787fc5fd91f27c4a70cf78c5660f09bc854 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11878 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-16 08:20:03 +00:00
Yuriy Umanets	0d857f441c	bdev/crypto: Zero out key and key2 before release. Even released memory contains key and key2 until it is re-allocated for other purposes. Zero out key and key2 when no longer needed. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: If80f3faeb98b5b5acab7f2f857f284909247d1ac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11877 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-16 08:20:03 +00:00
Yuriy Umanets	15a5bd8264	bdev/crypto: Rename AES_CBC_IV_LENGTH to IV_LENGTH Since IV length is the same for all pmd crypto drivers, AES_CBC_IV_LENGTH is renamed to IV_LENGTH. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: If8769db119eb599a17c267e8950f18f5a0ea995b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11875 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-16 08:20:03 +00:00
Liu Xiaodong	884bcfcf15	bdev_nvme: update reset_start_tsc for failover When connection is disconnected, bdev_nvme will call bdev_nvme_failover, and then reset the controller. nvme_ctrlr->reset_start_tsc should be updated in function bdev_nvme_failover, then bdev_nvme_check_xxx_timeout can work well. Change-Id: I99b639545e9dd4082cdc14696bb7872cb4917b1d Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11957 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-03-16 08:19:58 +00:00
Krzysztof Karas	32da70152f	scheduler: create and parse JSON values for dynamic scheduler params Creates a JSON on scheduler side to return after .get_opts is called and parses a JSON on .set_opts call. The JSON passed to dynamic scheduler on .set_stats is a copy of a pointer already available during RPC framework_set_scheduler call. Getting and setting scheduler stats via RPC calls is going to be implemented in the next patch in this series. Change-Id: I62880a71066a140c74336a5725e7b10952008e5c Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11448 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-03-16 08:19:26 +00:00
Shuhei Matsumoto	068ca77ab2	bdev/nvme: Disconnect and then free I/O qpair when deleting ctrlr_channel For RDMA transport, current synchronous qpair disconnect occupied CPU for a second when qpair disconnect gets timeout. To remove this limitation, we will do the following: - make spdk_nvme_ctrlr_disconnect_io_qpair() asynchronous, - spdk_nvme_qpair_process_completions() returns -ENXIO only if the qpair is actually disconnected. Even at this patch, spdk_nvme_poll_group_process_completions() invokes disconnected_qpair_cb only if a qpair is actually disconnected. This behavior will be maintained. To use the upcoming asynchronous qpair disconnect easily, when deleting a ctrlr_channel, disconnect the qpair, and then free the qpair and release a reference to the poll group when the qpair is actually disconnected. We need to delete a nvme_qpair asynchronously after the corresponding nvme_ctrlr_channel is deleted and defer the deletion of the corresponding nvme_ctrlr until the nvme_qpair is deleted. To satisfy this requirement, utilize the reference count of the nvme_ctrlr. disconnected_qpair_cb() may call spdk_nvme_ctrlr_free_io_qpair() and spdk_io_device_unregister() successively. The spdk_io_device_unregister() will execute spdk_nvme_detach_async() from its callback. spdk_nvme_ctrlr_free_io_qpair() has to complete earlier than spdk_nvme_detach_async() starts. spdk_nvme_ctrlr_free_io_qpair() is executed after unwinding stack. spdk_nvme_detach_async() is executed after sending a message. Sending message is later than unwinding stack. Hence the requirement is satisfied naturally. spdk_io_device_unregister() for the nvme_ctrlr is required to be called on the nvme_ctrlr->thread. To satisfy this requirement, redirect nvme_ctrlr_unregister() to the nvme_ctrlr->thread. This change is too small to stand as an independent patch. So include the change in this patch. Change-Id: Id8c01966c40b1dae9c4ef17f1b0b3f60a0bd17d5 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10765 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-15 09:05:09 +00:00
Shuhei Matsumoto	c113e4cdca	bdev/nvme: Alloc qpair context dynamically on nvme_ctrlr_channel This is another preparation to disconnect qpair asynchronously. Add nvme_qpair object and move the qpair and poll_group pointers and the io_path_list list from nvme_ctrlr_channel to nvme_qpair. nvme_qpair is allocated dynamically when creating nvme_ctrlr_channel, and nvme_ctrlr_channel points to nvme_qpair. We want to keep the times of references at I/O path. Change nvme_io_path to point nvme_qpair instead of nvme_ctrlr_channel, and add nvme_ctrlr_channel pointer to nvme_qpair. nvme_ctrlr_channel may be freed earlier than nvme_qpair. nvme_poll_group lists nvme_qpair instead of nvme_ctrlr_channel and nvme_qpair has a pointer to nvme_ctrlr. By using the nvme_ctrlr pointer of the nvme_qpair, a helper function nvme_ctrlr_channel_get_ctrlr() is not necessary any more. Remove it. Change-Id: Ib3f579d3441f31b9db7d3844ec56c49e2bb53a5d Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11832 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-03-15 09:05:09 +00:00
Shuhei Matsumoto	c1b0b339cf	bdev/nvme: Refactor create/destroy_ctrlr_channel_cb() The following patches will have the following changes. Add nvme_qpair object and move qpair and poll_group pointers and the io_path_list list from nvme_ctrlr_channel to nvme_qpair. nvme_qpair is allocated dynamically when creating nvme_ctrlr_channel, and nvme_ctrlr_channel points to nvme_qpair. qpair is disconnected asynchronously and nvme_ctrlr_channel is deleted asynchronously. To make the following patches simpler, refactor two functions, bdev_nvme_create_ctrlr_channel_cb() and bdev_nvme_destroy_ctrlr_channel_cb(). The details are as follows. Factor out nvme_qpair_create() from bdev_nvme_create_ctrlr_channel_cb() and factor out nvme_qpair_delete() from bdev_nvme_destroy_ctrlr_channel_cb(). Then reorder a few operations in these. Additionally, reorder an operation in _bdev_nvme_add_io_path(). Change-Id: Idf0328fa77a54f40fe52ca72c3842dde82d55972 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11831 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-15 09:05:09 +00:00
Shuhei Matsumoto	d7f0a1820e	bdev/nvme: Inline bdev_nvme_destroy_qpair() In the following patches, spdk_nvme_ctrlr_disconnect_io_qpair() will be changed to be asynchronous, spdk_nvme_ctrlr_disconnect_io_qpair() will be called first and then spdk_nvme_ctrlr_free_io_qpair() after the qpair is actually disconnected. We will not be able to keep the current bdev_nvme_destroy_qpair() function. As a preparation, inline bdev_nvme_destroy_qpair() and remove it. Additionally, this patch has the following changes. Previously I/O qpair was freed and then I/O path caches were cleared. Both are SPDK thread local. So there is no dependency for the ordering of these two operations. However, it will reduce the size of the following patches if we clear I/O path caches before freeing I/O qpair when the qpair is disconnected. Hence we clear I/O path caches and then free I/O qpair. Remove DTRACE for bdev_nvme_destroy_qpair() for now. It will be restored in the following patches. Furthermore, fix potential NULL pointer access in bdev_nvme_create_qpair(). Change-Id: I0ab78ccb0d240e56b95b53179341afcd909a31f6 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10746 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-03-15 09:05:09 +00:00
Shuhei Matsumoto	0fba8dc8cb	bdev/nvme: I/O error resiliency can be configured by global options Add three options for I/O error resiliency to spdk_nvme_bdev_opts. Then the RPC bdev_nvme_set_options can configure these. These can be overridden if these are given by the RPC bdev_nvme_attach_controller. Change-Id: If3ee23aeef8b7585fe0fb5ec4695df5866fc1e74 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11830 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-15 09:05:09 +00:00
Jim Harris	002b25cc5a	bdev_nvme: use INFOLOG for discovery messages This is not in the fast path, so using INFOLOG instead of DEBUGLOG allows these messages to be enabled in release builds. While here, set this flag in the discovery.sh test script so that we get better information if there are test failures. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I1c0d087b5c0cb40118691f4a1bc16adc2fdaad9c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11932 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-14 08:44:21 +00:00
Richael Zhuang	3ee923eff1	uring: fix heap-use-after-free bug in sock_flush_client If the req's cb_fn will close the socket, there is heap-use-after-free error if continuing to access sock. Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: I88c6adb9d25e52d94b08f53e8ccac611c4d29fff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11855 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-11 08:04:59 +00:00
John Levon	c4f7ddd2c7	lib/nvme: report shadow doorbell update stats Currently shadow doorbell updates are not counted; add statistics for those, and rename the other statistic for clarity. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I211a77902e38265c99b15862034c6d022dc582a0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11844 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-10 09:49:25 +00:00
Krzysztof Karas	fcf5ae7e4e	scheduler: prepare scheduler code for dynamic changes Change dynamic scheduler parameters from #define to global variables. Change-Id: I5bbbf40ac66971bcc24fc8bf0ac5d13efdc7412f Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11447 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-03-09 13:33:21 +00:00
John Levon	ba4ffda671	lib/nvme: correct typo in transport stats "doorbell" not "doobell" Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I9261559576e72a09b63fbc984ae0ec2a2572eb2c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11841 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-03-09 08:03:42 +00:00
Shuhei Matsumoto	00a7998254	bdev/nvme: Move per controller settings into a option structure The following patches will enable us to specify I/O error resiliency options per nvme_ctrlr as global options. To do it easier, move per controller options about I/O error resiliency into struct nvme_ctrlr_opts. prchk_flags is not exactly for resiliency but move it into struct nvme_ctrlr_opts too. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I85fd1738bb6e293cd804b086ade82274485f213d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11829 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-09 08:00:45 +00:00
Shuhei Matsumoto	df32eb112b	bdev/nvme: Add helper functions to decode T10 DIF parameters Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If218cbc8ee0a5354e5c4e58eaae111660eb9c099 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11828 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-09 08:00:45 +00:00
Shuhei Matsumoto	d40292d05a	bdev/nvme: Add prefix "drv_" to instance or pointer of spdk_nvme_ctrlr_opts The following patches will add options per struct nvme_ctrlr in the NVMe bdev module. bdev_opts will be used for it. Additionally, fabrics_connect_timeout_us is set directly to spdk_nvme_ctrlr_opts. So remove it from the RPC request structure. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I981cda5e69375edc43a8581cd3b43497c38a3d56 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11827 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-09 08:00:45 +00:00
Shuhei Matsumoto	1a00f5c094	bdev/nvme: Fix overflow of RB tree comparison when the NSID is very big If 0 - UINT32_MAX or UINT32_MAX - 0 is substituted into a int variable, we cannot get any expected result. Fix the bug and add unit test case to verify the fix. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib045273238753e16755328805b38569909c8b83a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-03-09 08:00:45 +00:00
Yuriy Umanets	c58d5161e9	bdev/crypto: Fixed g_session_mp init error handling In vbdev_crypto_init_crypto_drivers() when g_session_mp init failed it was possible to jump to cleanup label but return 0 instead of -ENOMEM. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I128968699b0d2dbb2f769ac5fd7bd53ab409562b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11659 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-03-08 15:07:46 +00:00
Yuriy Umanets	bfb676e93d	bdev/crypto: Fixed bdev_io double completion _crypto_operation_complete(bdev_io) should not be called in _crypto_operation() because it is done by caller function on read or write. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Ie03412c72f41abf661b069d4b00eaf74f40261d6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11629 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-08 15:07:46 +00:00
Yuriy Umanets	3d0bae35c4	bdev/crypto: Error handling fixes in vbdev_crypto_claim() - Fixed missed spdk_bdev_module_release_bdev() during error handling. - Fill the keys with zeros before releasing memory. - Fixed issue with g_number_of_claimed_volumes that can become negative because of invalid error handling. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I4171f4326d87b1d8f886416bf53b0f2043ccbfe7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11628 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-03-08 15:07:46 +00:00
Jim Harris	43d17a844c	bdev/nvme: handle detach first in discovery_poller This will be helpful in later patches, when we handle detach not just at discovery service stop, but also when a discovery controller is disconnected. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie62d62f73b328c6e058f6480c61fbdf91e854e2a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11767 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-08 07:52:49 +00:00
Jim Harris	a96c1bfdbd	bdev/nvme: change order of add/remove for discovery If the path to a subsystem changes from one discovery log to the next, we should add the new paths first, and only then remove paths. This ensures we don't remove the last path to a subsystem, causing associated bdevs to get unregistered and reregistered. This requires adding a new log_page member to discovery_ctx, since we now need to walk the log page to find removed paths after all the new paths are attached. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I99fc2e40e6f7e2e26d558ebe7bc5208fe474c0ea Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11766 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-08 07:52:49 +00:00
paul luse	89ee5a13bf	module/bdev/compress: print a reasonable message on create error Specifically when a compress bdev already exists on the supplied base. Before this you'd get a bunch of nasty messages providing really no clue as to what was wrong. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I8cce8902909659fba0e9613891c7ef8ebe4b06d0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11806 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-08 07:50:49 +00:00
paul luse	8951c15759	accel/idxd: add and respect flag to support writes to PMEM Plumbing for flags was added in prior patches. This patch introduces and respects the relevant flags for use with PMEM aka durable memory through the accel_fw, IDXD, IOAT and SW modules. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I792f31459e061d220965feced60e0c236d819a68 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9455 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-04 21:56:54 +00:00
paul luse	12c40f05e2	accel: plumb accel flags through operations that need them This patch is just plumbing the flags param. Use of it for PMEM will come in upcoming patches. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I620df072aaad3f8062a0312bbea3da1bc3f911b9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9281 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-04 21:56:54 +00:00
paul luse	7d63e716eb	idxd: Add flags parameter to all low level API for prep/submission Previously required flags were hardcoded in the low level library. By having the user pass them in there is more flexibility and control. This was driven by the need to add a new flag for pmem durability, coming in a future patch in this series. There is no change in functionality with this patch, just movement of where flags are set and by whom and the plumbing of 'flags'. Also note that some flags in scenarios that we know are required are still set by the library. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I194278f9e3cec0886628585cf84bcc2eae635e0a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9449 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-04 21:56:54 +00:00
Yuriy Umanets	5990c4ecef	bdev/crypto: Close device on errors during init - Always stop/close crypto devices properly on error handling in vbdev_crypto_init_crypto_drivers(). - Stop/close crypto devs during finalization in vbdev_crypto_finish(). - When finalizing device qp, check for dev id. Maintain g_qat_total_qp counter correctly. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I285788c562007847d9fc5921eb59b59cc73920bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11627 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-03-04 09:40:04 +00:00
Yuriy Umanets	9afa85b543	bdev/crypto: Fixed page boundary bug in _crypto_operation() It is possible that physical address returned from spdk_vtophys() will lie on the page boundary for the mbuf size we want. In this case we have to allocate one more mbuf and setup its chaining with the original mbuf. This holds true for src and dst mbufs, though reproduced only for dst. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Ibf82a97fac2ee0217a906a7c6f8558bdc2eedda2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11626 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-03-04 09:40:04 +00:00
Yuriy Umanets	ddb443751b	bdev/crypto: Cancel re-enqueue in crypto_dev_poller() If re-enqueue of pending crypto ops failed in crypto_dev_poller() and DPDK reports errors then stop re-enqueue, remove the ops from the re-submit queue and fail the IO. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I258f7b8986f35fa70e4af25bc8ad2b3b26aa206b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11625 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-04 09:40:04 +00:00
Yuriy Umanets	8ecf8dfcd7	bdev/crypto: Continue init after AESNI_MB failure - Continue init of the other crypto devices (mlx5) after failure of rte_vdev_init(AESNI_MB) in vbdev_crypto_init_crypto_drivers(). It simply may not be enabled in DPDK because it requires IPSec_MB>=1.0 installed in the system. Reproduces with --with-dpdk=dpdk/install option used, when the target DPDK is built without control of IPSec version from the SPDK side. - Updated crypto_ut to test the new behavior of error handling from rte_vdev_init(AESNI_MB) in vbdev_crypto_init_crypto_drivers(). Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Icd4db8877afe87db8166c40d6e7b414cd43c9c25 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11624 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-03-04 09:40:04 +00:00
Yuriy Umanets	a837ea37da	bdev/crypto: Switched to pkt_mbuf API - Switched to using rte_mempool for mbufs instead of spdk_mempool. This allows using rte pkt_mbuf API that properly handles mbuf fields we need for mlx5 and we don't have to do it manually when sending crypto ops. - Using rte_mempool *g_mbuf_mp in vbdev crypto ut and added the mocking API code. - crypto_ut update to follow pkt_mbuf API rules. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Ia5576c672ac2eebb260bfdbb528ddb9edcd8f036 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11623 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-04 09:40:04 +00:00
Yuriy Umanets	03d9c67139	bdev/crypto: Error handling in create_vbdev_dev() - Properly rte_cryptodev_stop() and rte_cryptodev_close() device on errors in create_vbdev_dev(). - Check for device id before removing its qp from the qp list. - Maintain correct g_qat_total_qp counter if qat qp is removed on errors. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: I088d7057eebff89ff0d995adcc2a05c724c3323b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11622 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-03-04 09:40:04 +00:00
Yuriy Umanets	706d2c1dc1	bdev/crypto: Fixes with json export - Fixed bug in vbdev_crypto_config_json(). crypto_bdev->key was used for "key2" json field. - Fixed bug in vbdev_crypto_dump_info_json(). crypto_bdev->key was used for "key2" json field. Signed-off-by: Yuriy Umanets <yumanets@nvidia.com> Change-Id: Iac441bc30b03234c96d646db14ee36ad56a546dc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11621 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-03-04 09:40:04 +00:00
Kefu Chai	a7d174e2ef	bdev/null: call spdk_bdev_module_fini_done() even if not registered in bdev subsystem, if any of the bdev modules fails to initialize in bdev_modules_init(), this function just stops immediately. in general, the non-zero rc is returned to the callback func passed to spdk_subsystem_init(). if spdk app is used for building the spdk application, it's very likely that app_start_rpc() is used as this very callback func. in this case, app_start_rpc() would just pass the `rc` to spdk_app_stop() which tears down all subsystems one after another. bdev tears itself down by calling all its modules' module_fini(), including those whose .module_init never gets called. the problem is, if a bdev module marks its `.async_fini` true, and it calls spdk_bdev_module_fini_done() only if spdk_io_device_unregister(), then a bdev module which fails to initialize would leave us an spdk application hanging in the air. a typical logging message sequence looks like: [2022-02-27 20:47:13.766578] bdev.c:1438:spdk_bdev_initialize: ERROR: bdev modules init failed [2022-02-27 20:47:13.766622] subsystem.c: 169:spdk_subsystem_init_next: ERROR: Init subsystem bdev failed [2022-02-27 20:47:13.766638] app.c: 691:spdk_app_stop: WARNING: spdk_app_stop'd on non-zero [2022-02-27 20:47:13.766658] thread.c:2050:spdk_io_device_unregister: ERROR: io_device 0x10d3c30 not found this is exactly the case we could run into if a bdev module fails to initialize and bdev_null is unable to call spdk_bdev_module_fini_done() when being torn down, because spdk_io_device_unregister() just refuses to call the callback if the I/O device is never registered. since `g_null_read_buf` is set in bdev_null_initialize(), in this change, this pointer is checked for zero before calling spdk_io_device_unregister(), if it is NULL, spdk_bdev_module_fini_done() is called directly instead of calling spdk_io_device_unregister(). this helps to address the hanging issue. Signed-off-by: Kefu Chai <tchaikov@gmail.com> Change-Id: I3a41fcd2f1c986e416dacecd5ca352dfd1e379b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11750 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-02 08:39:40 +00:00
Rafal Stefanowski	c39647df83	bdev/ocf: Improve OCF mpools - Reduce the size of initial memory needed by OCF. Number of allocator buffers equal to 16383 is tested to work on 24 caches running IO of io_size=512 and io_depth=512, which should be more than enough for any real life scenario. This reduces initial OCF memory usage from 726 MiB to 392 MiB. - Fix string handling for the name of the mempool. Signed-off-by: Rafal Stefanowski <rafal.stefanowski@intel.com> Change-Id: I40063ab1897c479c25904ae4096c5dae3351f73b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10843 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-03-02 08:34:39 +00:00
Jim Harris	84bec316c2	bdev/nvme: add additional DEBUGLOGs for discovery Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iba16c5f3273fe2335b847b6bd396e45aa97da7c9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11734 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-02-28 11:06:16 +00:00
Jim Harris	bcb75753dc	bdev/nvme: add DISCOVERY_DEBUGLOG/ERRLOG These macros are used to prefix the following to any discovery-related DEBUGLOG or ERRLOG: Discovery[127.0.0.1:8009] Inside the brackets are the traddr and trsvcid of the discovery service associated with that message. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib1991a13f550bb8c9aaf1194a56b218cbd71c96c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11733 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-02-28 11:06:16 +00:00
Jim Harris	9614dca9f9	bdev/nvme: save the trid of the discovery service This is useful for adding trid details to discovery related log messages in a later patch. Future patches will update this trid if the current discovery ctrlr fails and we need to fail over to a different path to the discovery subsystem. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I51712bab2d891ae9c683f8716b4228741f64e7db Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11732 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-02-28 11:06:16 +00:00
Jim Harris	a0690464dd	bdev/nvme: allocate discovery_entry_ctx for discovery subsystems For now, just allocate entries and put them on a new TAILQ on the discovery_ctx. Future patches will use these to try to reattach to the discovery subsystem if the current discovery ctrlr fails. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3cd841df2260bbe8a497bbbf36dea4a1081f25c0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11731 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-02-28 11:06:16 +00:00
Jim Harris	21aa2ba37e	bdev/nvme: move discovery_attach_cb up in file It will be referenced in a second location in an upcoming patch, so move its definition now to reduce the size of that patch and avoid a forward declaration. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iae12cc613190c03f0d48d71475df98384f8e47c7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11730 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-02-28 11:06:16 +00:00
Jim Harris	9b04bd8d5c	bdev_nvme: rename discovery_ctrlr_ctx to discovery_entry_ctx This name better describes the purpose of this structure. Currently it is used to represent discovery log page entries for NVM subsystems found by the discovery service. Upcoming patches will also use this structure to represent discovery log page entries for the discovery subsystem. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I84996c9968200c50c32427f0233cb707cdc2d54c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11547 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-02-28 11:06:16 +00:00
Jim Harris	09240a1c3c	bdev/nvme: don't connect to discovered discovery subsystems For now, if the discovery service finds a discovery subsystem, don't connect to it. Support for nested discovery controllers will be coming soon, but for now we need to make sure we don't try to connect to a discovery subsystem as if it was an NVM subsystem. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I00234718b0e39eda6e1cb1b1150a4fadcf6d8b11 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11546 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-02-28 11:06:16 +00:00
Shuhei Matsumoto	ecdbaa2310	bdev/nvme: Call spdk_free() to the object allocated by spdk_malloc() This is a bug fix. free() was called to the object allocated by spdk_malloc(). Hence free(): invalid pointer: 0x00002000146ece00 was printed. This was found during multipath testing. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Icf6aa6dcdda728fef91b3acad7a1f1ee219c27af Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11710 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-02-24 14:56:03 +00:00
John Levon	5f27092835	thread: add spdk_thread_exec_msg() A common pattern is: if (foo->thread == spdk_get_thread()) cb(arg); else spdk_thread_send_msg(foo->thread, cb, arg); for cases where it's important the callback runs on a particular thread, but it doesn't matter if it's synchronous or asynchronous. Add a new API to support this pattern, and convert over the current instances. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Idfbf77c02c9321c52e07181ffd8b0c437e1ab335 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11503 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-02-23 10:06:49 +00:00
Alexey Marchuk	b5b752792f	bdev/aio: Correct error message when IO fails structure io_event defined in aio_abi.h has res member with type __s64 which is typically mapped to long long int. When we print error message, res member can be treated as an error code. In the following error message: failed to complete aio: requested len is 4096, but completed len is 18446744073709551611 the last digit in int representation is -5 which is -EIO Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Reported-by: Anil <aniruddha080699@gmail.com> Change-Id: I33b98d2118bbc9cace2d9da7cf9cd9bd06d784e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11453 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-02-22 09:03:51 +00:00
zgu3	af2bd57875	bdev/nvme/bdev_nvme_rpc.c: apply_firmware_complete: free bdev_io after each command finish apply_firmware_complete bug fix: after each firmware image download command finishes, apply_firmware_complete is called, issues the next firmware image download command, and gets another bdev_io. After the last command, apply_firmware_complete_reset only releases the last bdev_io, and the bdev_ios from all previous commands are not released. So after cycling rpc_bdev_nvme_apply_firmware, the io pool will be used up and cause an assert. Signed-off-by: Gu, Zhimin <kookoo.gu@intel.com> Change-Id: Icb1c722d85b1985521e5f25031ae70557b7ba84a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11586 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-02-17 17:10:21 +00:00
Jim Harris	5d2d253125	nvmf: remove deprecated conn_sched parameter This parameter was ignored, and was a parameter to the nvmf_set_config RPC. For reference, this was deprecated in June 2020, commit `c37cf9fb`. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I013f4d7cf874e7e26a8a1d299fdf9d8fa05da580 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11544 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-02-15 14:38:37 +00:00
Shuhei Matsumoto	0ee07a484e	bdev/nvme: Missing newline (\n) for SPDK_ERRLOG Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I77711a6d3fdbaf6698ebec5a233cf6cd795726ba Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11401 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-02-09 18:06:15 +00:00
Shuhei Matsumoto	79829ae40b	bdev/nvme: Set ana_state_updating only after starting read ANA log page In a test case, test/nvmf/host/failover.sh, we got ANA error even if the target did not enable ANA reporting. We marked the corresponding namespace as ANA state updating but we had no way to clear it. Check if we can read ANA log page before setting the flag. If read ANA log page failed, disable ANA feature until the nvme_ctrlr is created again. In this operation, all ana_state_updating flags are cleared. Fixes #2335 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4e2608a35d9dfa0395ad74fceebae9faf8cd973c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11399 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-02-09 18:06:15 +00:00
Richael Zhuang	f36c033c71	uring: fix bug when inserting sock into pending_recv list There is an I/O error when running the NVMe over TCP fio test with the uring socket. It's easy to reproduce the bug with the following configuration: target 1 core, 16 NVMe SSDs, 2 initiators each connected to 8 NVMe namespaces, each running fio with numjobs=3. If in each round we insert the sock at the head of the pending_recv list, and then take max_events socks from the head of the list to process, it is possible that some socks are never processed. There was already a strategy to cycle the pending_recv list to make sure we do not poll things in the same order: for a list A B C D E F with max_events of 3, this strategy rearranges the list to D E F A B C. But the strategy is not effective when using TAILQ_INSERT_HEAD(&group->pending_recv, sock...). Using TAILQ_INSERT_TAIL(&group->pending_recv, sock...) fixes it. Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: I8429b8eee29a9f9f820ad291d1b65ce2c2be22ea Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11154 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-02-04 20:57:53 +00:00
Tomasz Zawadzki	047c067c05	so_ver: increase all major versions To allow SO_MINOR updates on LTS for the whole year it is supported, the major version for all components needs to be increased. This is to prevent a scenario where two versions exist with matching versions, but conflicting ABI. Ex. Next SPDK release adds an API call increasing the minor version, then LTS needs just a subset of those additions. Increasing major so version after LTS, allows the future releases to update versions as needed. Yet allowing LTS to increase minor version separately. Disabled test for increasing SO version without ABI change, as that is the goal of this patch. This check shall be removed with SPDK 22.05 release. This patch: - increases SO_VER by 1 for all components - resets SO_MINOR to 0 for all components - removes suppressions for ABI tests Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Id1a5358882dc496faa5b0b5c9a63b326c378c551 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-31 15:29:56 +00:00
Michael Haeuptle	7a0c901a4c	bdev/nvme: RPCs for adding/removing error injections Provides RPCs for the qpair error injection APIs to bdev_nvme. These RPCs are useful in testing NVMeoF/NVMe behavior for various error scenarios in production. Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Change-Id: I0db7995d7a712d4f8a60e643d564faa6908c3a55 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10992 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-31 09:51:07 +00:00
Alexey Marchuk	2ccaf2acfa	bdev/nvme: Add transport_ack_timeout to bdev_nvme_set_options RPC It may take a long time to detect network transport error when e.g. port is removed on remote target. This timeout depends on 2 parameters - retry_count and ack_timeout. bdev_nvme_set_options supports configuration of retry_count but transport_ack_timeout is missed. Note: this parameter is used by RDMA transport only. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I7c3090dc8e4078f64d444e2392a9e0a6ecdc31c0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11175 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: <tanl12@chinatelecom.cn> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-31 09:44:28 +00:00
Jim Harris	120a6c1c2a	nvmf: do not encode core number into thread name The nvmf subsystem cannot control which core its threads get scheduled on. Even in the normal, default case, the app thread has already been scheduled on the first core, so the first nvmf thread will get scheduled on the second core, etc. So instead, always use a 0-based index for the names of the nvmf threads. Reported-by: Jacek Kalwas <jacek.kalwas@intel.com> Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8a0f161860b985f36920845de28b39dbae9fdca5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11351 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>	2022-01-31 09:43:16 +00:00
Evgeniy Kochetov	08f9b40113	bdev/nvme: Fix namespace comparison This patch aligns namespace comparison with Linux kernel implementation: - UUID is optional and may be NULL - command set (CSI) should be the same Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I8f889989f24cd51b104057217f87eb303b30fa68 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11312 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-27 18:53:41 +00:00
shuochen0311	b635d19a26	aio: add aio bdev rescan feature Signed-off-by: shuochen0311 <shuo.chen@databricks.com> Change-Id: I7f2788640a56d1e1bc8b7b311622628e8a6be56e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11084 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-25 08:13:28 +00:00
Krzysztof Karas	30ea7ecc6f	bdev/nvme: implement additional dtrace probes Add more dtrace probes to help with identifying issues in production. Change-Id: I8fb621a15c5e33ae94d75b4fc31135e2635dcfce Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10561 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-20 21:12:56 +00:00
Tomasz Zawadzki	c2bd95ee54	vbdev_compress: reduce MAX_NUM_QP This is a workaround for #2338. Ideally the fix should remove this define and use number of cores from the application. With a large number of QAT devices the following error can be observed: compdev_isal_create(): ISA-L library version used: 2.30.0 vbdev_compress.c: 358:vbdev_init_compress_drivers: NOTICE: created virtual PMD compress_isal EAL: memzone_reserve_aligned_thread_unsafe(): Number of requested memzone segments exceeds RTE_MAX_MEMZONE RING: Cannot reserve memory isal_comp_pmd_qp_setup(): Failed to create unique name for isal compression device vbdev_compress.c: 268:create_compress_dev: NOTICE: FYI failed to setup a queue pair on compressdev 48 with error 4294967295 so limiting to 84 qpairs Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I689ab6bda991e3864da9f4135f57849e3c0c3986 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11179 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-20 20:08:12 +00:00
Tomasz Zawadzki	ca89b502aa	vbdev_crypto: skip handling QAT_ASYM devices Historically only QAT_SYM devices for crypto were supported. The DPDK submodule explicitly disabled compilation of QAT_ASYM. For details please see: https://review.spdk.io/gerrit/c/spdk/dpdk/+/9217 Starting with DPDK 21.11, QAT_SYM and QAT_ASYM were merged together, so it is no longer possible to disable QAT_ASYM as before. As vbdev_crypto didn't make use of it, this driver is now skipped in preparation for the update to DPDK 21.11. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ib606a4b450cd224d96bc21a64384297b2182967c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11178 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-20 20:08:12 +00:00
Alexey Marchuk	11f0a6ec0f	nvmf: Destroy subsystems before destroying poll groups When the nvmf_tgt application shuts down, it stops all subsystems, then destroys poll groups, and then destroys nvmf_tgt. Part of nvmf_tgt destruction is destruction of subsystems, and this process may require cross-thread communication, but since poll groups and threads are already destroyed, we may get segfaults. One possible solution is to change the order and destroy nvmf_tgt before destroying poll groups, but it doesn't work since nvmf_tgt is registered as an io_device and poll groups hold its channel, so it can't be destroyed while poll groups exist. This patch adds a new state to the nvmf_tgt state machine which destroys all subsystems before destroying poll groups and nvmf_tgt. It guarantees that all threads exist when subsystems are destroyed. Also rename state NVMF_TGT_FINI_FREE_RESOURCES to NVMF_TGT_FINI_DESTROY_TARGET; the new name better reflects the purpose of this state. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I08971d78cc9ad70d43cd43c346fd74d35c8bda60 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9668 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-20 20:00:51 +00:00
GangCao	6b7e9d0af2	Lib/iSCSI: add the LUN Resize support From SAM-4, section 5.13 (Sense Data): “When a command terminates with a CHECK CONDITION status, sense data shall be returned in the same I_T_L_Q nexus transaction (see 3.1.50) as the CHECK CONDITION status. After the sense data is returned, it shall be cleared except when it is associated with a unit attention condition and the UA_INTLCK_CTRL field in the Control mode page (see SPC-4) contains 10b or 11b.” SPDK does not set UA_INTLCK_CTRL to 10b or 11b, so we set the unit attention condition immediately against a single IO or Admin IO after reporting it via a CHECK CONDITION. Once the failed IO is received at the iSCSI initiator side, it will be retried. In the case of a resize operation, if there is no IO from the iSCSI initiator side, reporting of the unit attention condition is delayed until the first IO is received at the iSCSI target side. Meanwhile, we clear the resizing (newly added) flag on our SCSI LUN structure after the first time we report the resize unit attention condition. The kernel initiator won’t actually resize the corresponding block device automatically. It will report a uevent, and then you can set up udev rules to trigger a rescan. The SPDK iSCSI initiator will automatically report the LUN size change. Change-Id: Ifc85b8d4d3fbea13e76fb5d1faf1ac6c8f662e6c Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11086 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-20 07:56:23 +00:00
Ben Walker	5914f02b2a	accel: Don't query the channel queue depth. Rely on -EBUSY We can just queue things up until we get -EBUSY and not track the queue depth. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I49d3bcae0e6705a322de54fa91c9e1c6dfaea0c2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11028 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-20 07:54:55 +00:00
Jim Harris	e0415f1720	bdev/nvme: set default bdev_retry_count to 3 Now that we have a much more robust retry framework, set the default bdev_retry_count to 3. Users can still override this default with the bdev_nvme_set_options RPC as before. This ensures that by default, we will retry I/O when possible. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I045bf4969d02be32b951e72a148ce6b6e251dec1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11107 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-19 08:55:46 +00:00
Ben Walker	61c9017c64	idxd: Eliminate spdk_idxd_configure_chan We can do all of the configuration in spdk_idxd_get_channel, and the configuration step was always done immediately after getting the channel anyway. Change-Id: I9fef342e393261f0db6308cd5be4f49720420aa0 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10349 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-19 08:49:25 +00:00
Shuhei Matsumoto	a9fd7f0ba6	bdev/nvme: Add nvme_ctrlr's state string to the bdev_nvme_get_controllers RPC The state of a nvme_ctrlr can be more fine grained than a boolean and such state gives more information to end users for debug or root cause analysis. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3e2459f449e2dac73f04b155e38b696495f1a335 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10183 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	80e81273e2	bdev/nvme: Do not use ctrlr for I/O submission if reconnect failed repeatedly If ctrlr_loss_timeout_sec is set to -1, reconnect is retried indefinitely, and I/Os continue to be queued. This patch adds another option, fast_io_fail_timeout_sec, and a flag, fast_io_fail_timedout, to nvme_ctrlr. If fast_io_fail_timeout_sec has passed since the reset started, fast_io_fail_timedout is set to true so that the path is not used for I/O submission. fast_io_fail_timeout_sec is initialized to zero, the same as ctrlr_loss_timeout_sec and reconnect_delay_sec. The name of the parameter follows DM-multipath's fast_io_fail_tmo. Change-Id: Ib870cf8e2fd29300c47f1df69617776f4e67bd8c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10301 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	ae4e54fdc3	bdev/nvme: Retry reconnecting ctrlr after seconds if reset failed Previously reconnect retry was not controlled and was repeated indefinitely. This patch adds two options, ctrlr_loss_timeout_sec and reconnect_delay_sec, to nvme_ctrlr and adds reset_start_tsc, reconnect_is_delayed, and reconnect_delay_timer to nvme_ctrlr to control reconnect retry. Both ctrlr_loss_timeout_sec and reconnect_delay_sec are initialized to zero. This means reconnect is not throttled, as was the case before this patch. A few more changes are added. Change nvme_io_path_is_failed() to return false if reset is throttled even if nvme_ctrlr is resetting or is to be reconnected. spdk_nvme_ctrlr_reconnect_poll_async() may continue returning -EAGAIN indefinitely. To catch this exceptional case, use ctrlr_loss_timeout_sec. Not only ctrlr reset but also non-multipath ctrlr failover is controlled, so we need to include path failover in ctrlr reconnect. When the active path is removed and switched to one of the alternative paths, if ctrlr reconnect is scheduled, connecting to the alternative path is left to the scheduled reconnect. If ctrlr reset or reconnect fails and the retry is scheduled, switch the active path to one of the alternative paths. Restore unit test cases removed in the previous patches. Change-Id: Idec636c4eced39eb47ff4ef6fde72d6fd9fe4f85 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10128 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	f85370b168	bdev/nvme: Use enum to select operations after reset complete This is a clean up as a preparation to the following patches. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib8bc90e17f52086d4e887463e04f65273bb1079b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11068 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	962c4c3800	bdev/nvme: Fix a degradation that I/O gets queued infinitely We noticed a difference between SPDK 21.10 and the latest master in a test. The simplified scenario is as follows: 1. Start SPDK NVMe-oF target 2. Run bdevperf for the target with -f parameter to suppress exit on failure. 3. Kill the target after I/O started. With SPDK 21.10, bdevperf retries failed I/Os and exits after the test time is over. With the latest SPDK master, bdevperf hangs and does not exit even after the test time is over. The cause was as follows: ctrlr reset is repeated very quickly (once per 10ms by default) and hence I/Os were queued infinitely because nvme_io_path_is_failed() returned false if nvme_ctrlr is resetting. We should queue I/O when nvme_ctrlr is resetting only if reset is throttled and fail-fast for the repeated failures is supported. Hence in this patch, fix the degradation and remove the related unit test cases. Reported-by: Evgeniy Kochetov <evgeniik@nvidia.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4047d42dc44488a05264c6a841d101a7c371358b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11062 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 14:25:15 +00:00
Tan Long	a79af5e7a5	bdev/rbd: Support config_param and config_file simultaneously for rbd_register_cluster config_param and config_file do not conflict when specifying rados configurations, so supporting both together is more reasonable. Therefore, after this patch, users can choose one of three ways: config_param, config_file + key_file, or config_param + config_file + key_file. Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Change-Id: Ide17af72c4965df1e6541f4f50d4fa5309865486 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10679 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 09:44:56 +00:00
Tan Long	20c8a3b8db	bdev/rbd: Add key_file to the rbd_register_cluster RPC In practice, config_file and key_file are often used to connect to a rados cluster. config_file includes "mon_host" and other rados configurations like "rbd_cache", and key_file includes the secret key and the current user's access authority to each pool. This patch adds the key_file option; users can specify config_file and key_file, or only config_param, to connect to a rados cluster. This gives users more flexibility. Signed-off-by: Tan Long <tanl12@chinatelecom.cn> Change-Id: I6b49aad70b578bdeb3ac8ea9ca0fcbd931582025 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10485 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 09:44:56 +00:00
Changpeng Liu	31d684d759	bdev_malloc: exit early in case of no acceleration task If acceleration tasks are exhausted, then we can exit the submission loop earlier, also print number of IOVs for each R/W request. Change-Id: Ia98ed43b0bb2be229b7c0054f3ade0ad39337b09 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:32 +00:00
Ben Walker	8d2b6e6873	idxd: Add support for vectored crc32 + copy Change-Id: Ib017280d6d0b2e115f5609b6b1a50793953ffa29 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10290 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-01-12 08:20:39 +00:00
Ben Walker	e2efeef080	idxd: Add support for vectored crc32c generation This uses a batch with the fence flag for now. There are several other implementation options that will be explored in the future. Change-Id: I4f344d671400508de05f80b026d42f775c5b9588 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10289 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-12 08:20:39 +00:00
Ben Walker	fa6ac87778	idxd: Add support for vectored fill operations Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I0d58320a03ee82169e83be6449ba52c9d2ee3a55 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10288 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-12 08:20:39 +00:00
Ben Walker	f11869c44d	idxd: Add support for vectored compare operations Compare two scattered memory regions Change-Id: I6ce5c9e7bc1ee1ef0e9173c00e86628d43a1e41f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10287 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-12 08:20:39 +00:00
Ben Walker	fe70548070	idxd: Add support for vectored copy operations Change-Id: Icb650129488b3cea76cf9082c02667f5b13b5ab4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10286 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-12 08:20:39 +00:00
Jim Harris	932ee64b8f	bdev/nvme: add bdev_nvme_stop_discovery RPC This RPC will stop the specified discovery service, including detaching from any controllers that were attached as part of that discovery service. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9222876457fc45e1acde680a7bd1925917c22308 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10832 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-12 08:20:23 +00:00
Jim Harris	f2bf7e9727	bdev/nvme: connect to discovered controllers Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3b05ab3d22851d433e3d0573e65943c4a30b9aa4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10695 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-01-12 08:20:23 +00:00
Konrad Sztyber	2bb8a83e0a	bdev/malloc: complete requests through poller Requests that are completed immediately (i.e. those not using the accel engine) are now queued and their completion is delayed to the completion poller. This ensures that they're not completed from the context of a submission, which gets rid of an spdk_thread_send_msg() call. It significantly improves performance on some workloads. For instance, 4k zcopy reads (queue depth 128) on a malloc bdev exposed through NVMe/TCP went from 204k IOPS to 485k IOPS. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I196f55fc07d167f1ed117d2430e9c37f9d05f70d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10805 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	0f0c16a76a	bdev/malloc: remove bdev_malloc_(reset\|flush) The only thing these functions were doing was completing the IO, so it could just be inlined. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5fbd9df763dd68953b1bda9c7752c57ef9ee5dd6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10804 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	0a49fbd241	bdev/malloc: completion poller This poller is registered on each IO channel and can be used to schedule asynchronous completion of a request. This can be especially useful for requests that can be completed immediately. For now, nothing enqueues the requests to be completed through this poller - this will be changed in the following patch. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If6b26541907bb46402fc0904216bff74dad57b88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10803 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	fcd5f60144	bdev/malloc: malloc IO channel It'll allow the malloc bdev to store per-thread data. For now, it's only used to keep the pointer to the accel library's IO channel, more fields will be added in subsequent patches. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I604a38877ae8d6075b911f5a484d1793d4bc2ddb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10802 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-12 08:20:11 +00:00

1320 Commits