ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Mike Gerdts	ad6ece23d0	blob: blob_open_opts_copy macro uses wrong type The FIELD_OK macro in blob_open_opts_copy() should consider offsets in struct spdk_blob_open_opts, not struct spdk_blob_opts. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I62e22acbe7dfb994453a379c92f78b7e9bc7fc13 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14962 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2023-01-09 12:41:30 +00:00
Mike Gerdts	f4dc558245	blob: log blob ID as hex Blob IDs are sequentially assigned starting at 0x100000000. When debugging with a small number of blob IDs, it is much more intuitive to see blob ID 0x100000000 rather than blob ID 4294967296. In commit `76a577b082` a similar change was made to blobcli. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ic5321a83b57cf8c9f8df48cd424a926b6fec4ba8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14963 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2023-01-09 12:41:21 +00:00
Konrad Sztyber	33b12a4411	util: add spdk_iovmove() It's the same as spdk_iovcpy(), but the dst/src buffers can overlap. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6daa0a846d7d1deac2c01d1a1be09171fa8bf796 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15747 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	940be80363	accel: accel buffer allocation functions The data buffers backed by these accel buffers aren't allocated immediately, but only when they're necessary to execute a given operation. It allows users to append operations to a sequence, without actually reserving large space for the data. That way, if some of these buffers aren't needed to execute a sequence, they won't be allocated. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ieeea8a011b40c7f2f33e9a6f03fe34264e9316f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15746 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	7b0f452b4f	accel: add iobuf channel to accel channel It will be used for allocating buffers from accel domain and allocating bounce buffers to push/pull the data from memory domains for modules that don't support memory domains. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Idbe4d2129d0aff87d9e517214e9f81e8470c5088 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15745 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	d3ac42caa4	dma: add "virtual" accel memory domain This domain is meant to represent data being transformed by accel engine. Users will be able to allocate buffers from that memory domain and use them when appending operations to an accel sequence. Since these buffers are only meant to be used as placeholders for actual buffers, none of the push/pull/translate callbacks are implemented. To access the data after it was transformed by accel, users should make sure that the final command's destination buffer isn't allocated from accel memory domain. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia031c7b205e98792d0a93f01513101b86afa9faa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15744 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	7b36fe5238	accel: add support for reversing a sequence Reversing a sequence means that the order of its operations is reversed, i.e. the first operation becomes last and vice versa. It's especially useful in read paths, as it makes it possible to build the sequence during submission, then, once the data is read from storage, reverse the sequence and execute it. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I93d617c1e6d251f8c59b94c50dc4300e51908096 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15636 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	f778e8e53a	accel: remove redundant copy operations Operation sequence should always be treated as a whole, meaning that users cannot rely on the contents of any intermediate buffers and should only care about the buffer that's the destination of the whole operation. This allows us to remove some of those copy operations by changing source / destination buffer of a preceding / following operation. If a sequence is using buffers from non-local memory domain, users can append a copy operation to a sequence to specify a local destination buffer. If the module executing the operations is aware of memory domains, this can avoid doing an extra spdk_memory_domain_pull_data(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I93b94d46ee32700819e9e6f1c55350692db8a67a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15530 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	59f55d23f2	accel: add support for appending a decompress operation Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5f091a554e08f0e052ab9e7eb9a1789d381b885f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15635 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-09 12:37:37 +00:00
Konrad Sztyber	6293ac8759	accel: initial operation chaining support This patch introduces the concept of chaining multiple accel operations and executing them all at once in a single step. This means that it will be possible to schedule accel operations at different layers of the stack (e.g. copy in NVMe-oF transport, crypto in bdev_crypto), but execute them all in a single place. Thanks to this, we can take advantage of hardware accelerators that support executing multiple operations as a single operation (e.g. copy + crypto). This operation group is called spdk_accel_sequence and operations can be appended to that object via one of the spdk_accel_append_* functions. New operations are always added at the end of a sequence. Users can specify a callback to be notified when a particular operation in a sequence is completed, but they don't receive the status of whether it was successful or not. This is by design, as they shouldn't care about the status of an individual operation and should rely on other means to receive the status of the whole sequence. It's also important to note that any intermediate steps within a sequence may not produce observable results. For instance, appending a copy from A to B and then a copy from B to C, it's indeterminate whether A's data will be in B after a sequence is executed. It is only guaranteed that A's data will be in C. A sequence can also be reversed using spdk_accel_sequence_reverse(), meaning that the first operation becomes last and vice versa. It's especially useful in read paths, as it makes it possible to build the sequence during submission, then, once the data is read from storage, reverse the sequence and execute it. Finally, there are two ways to terminate a sequence: aborting or executing. It can be aborted via spdk_accel_sequence_abort() which will execute individual operations' callbacks and free any allocated resources. To execute it, one must use spdk_accel_sequence_finish(). For now, each operation is executed one by one and is submitted to the appropriate accel module. Executing multiple operations as a single one will be added in the future. Also, currently, only fill and copy operations can be appended to a sequence. Support for more operations will be added in subsequent patches. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id35d093e14feb59b996f780ef77e000e10bfcd20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15529 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-09 12:37:37 +00:00
Changpeng Liu	b0df03c531	lib/vhost: rename device stop function calls Existing `vhost_user_session_send_event` is only used to stop vhost user device's session now, so we rename it to `vhost_user_wait_for_session_stop` and also rename the whole function calls when stopping the device with more apposite names. Change-Id: Ib8ea48273e85f7856ca2dfca57b5fd933ac4cf7a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15296 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-06 16:14:35 +00:00
Changpeng Liu	73f06e0d57	lib/vhost: remove `active_session_num` for vhost-user device For vhost-user device, the variable `active_session_num` is used to count number of sessions of a vhost-user device, we don't use it anywhere, and the assertion of this variable is already guaranteed by `vsessions_num`, so just remove it. Change-Id: I335a75d17583b3744a41152b35cd5a1a8762a687 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15189 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-06 16:14:35 +00:00
Changpeng Liu	e753aa807f	lib/vhost: quit vhost subsystem while VM is connected If we kill the vhost process while VM is connected, the `g_fini_cb` will not be called due to active session is in the vhost-user device, but we're sure that this VM is stopped for this case, because `vhost_driver_unregister` is called in the shutdown thread, so here we reuse `g_vhost_user_started` flag for this case and free the sessions, the following call to `vhost_driver_unregister` can also handle this case, because the Unix Domain socket is already unregistered. Fixes commit `327d1c98` ("vhost: defer vhost_dev_unregister until scsi tgts removed") Change-Id: I4f368ac8c304dd9525d15abdce8fd5b2ed79b96e Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15623 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-06 16:14:35 +00:00
Changpeng Liu	63dab84449	lib/vhost: fix race condition when destroying a device The `rte_vhost_driver_unregister` API for removing a socket is not asynchronous and may call SPDK ops for adding a new connection or removing a connection, so we can't hold the user device lock when calling this function, and we reject adding a new connection while `rte_vhost_driver_unregister` is being called. Fix issue #2748. Change-Id: I5594224f26374b2336d64175ecd5e5ec3d545a58 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15483 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-06 16:14:35 +00:00
Changpeng Liu	376c25ed0c	lib/vhost: use user_dev's lock to protect vhost sessions `spdk_vhost_dev` is created|deleted via RPC or APIs, and we use a global `spdk_vhost_lock` to protect it, but for some other places such as: vhost-user message processing, we also use the global lock for now, actually we don't need to use this lock, because these vhost-user messages processing will not delete nor add vhost devices. While here, we add a `spdk_vhost_user_dev` access lock to protect vhost-user message processing as an optimization. Change-Id: Ia9c45b056cebb7b65f458d56ed775a15e386f905 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15184 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Feng Li <lifeng1519@gmail.com>	2023-01-06 16:14:35 +00:00
Xue Liu	e9a94122b8	nvme/pcie: add memory barrier for LOONGARCH Add memory barrier for LOONGARCH in nvme_pcie_qpair_process_completions. Signed-off-by: Xue Liu <liuxue@loongson.cn> Change-Id: Icc992ef612a00dd18ff33f70ab8f54e8c5d5c5b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16083 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2023-01-06 15:54:46 +00:00
John Levon	bae7cfb49b	lib/nvmf: sanity check connect buffer nvmf_ctrlr_cmd_connect() can only handle a request in one buffer (req->data); sanity check it's not split across IOVs. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I595d8542ce71e56cf2b074f4cf41bce440f6dc26 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16123 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-01-06 15:54:18 +00:00
John Levon	ad5217307e	lib/nvmf: fix req->data usage in nvmf_ctrlr_get_features() handlers This code has a similar potential problem as the identify and log page commands did: stop using req->data in favour of IOVs. We also need to fix the unit tests to initialize the iovs. We don't change the existing "set" behaviour of requiring a single IOV here. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I257567a7abd5fc3ed9ee21b432c7da7d70fbbde0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16122 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-01-06 15:54:18 +00:00
John Levon	acc4d1766c	lib/nvmf: fix identify command corruption In the previous fix: `adc2942ad` nvmf: nvmf_ctrlr_get_log_page use iovs to store the log page a data corruption bug in the log page code was fixed. Previously, it used req->data, which may be too short a buffer in the case that the buffer is split across more than one IOV. req->data is never safe to use in this situation. The code was changed to use the provided iovs instead of req->data. However, the identify command handling was still vulnerable to this problem, and has been seen in real life at least with a CentOS guest VM. The fix is basically the same: use the IOV utility functions to write out the response instead of directly trying to use req->data. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I00445895af20e43be73189629576eee0667f86dd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16121 Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-01-06 15:54:18 +00:00
John Levon	56fe6fdf85	lib/nvmf: relocate iov utility code Move the IOV handling code in ctrlr.c to the top of the file, for subsequent use. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ibddde1cb964d8aaecf4673ffa6d4147d0a48020c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16120 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2023-01-06 15:54:18 +00:00
John Levon	b6f674772c	nvme: add SPDK_NVME_IDENTIFY_BUFLEN Add a define for the Identify command buffer instead of using a raw value. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I9073ff84e2fa2ef9268051b898fe1027d8e97baa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16119 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-01-06 15:54:18 +00:00
Mike Gerdts	4bb902a6f4	bdev: add claim type In preparation for supporting additional claim types, create a claim type that represents the current claim type. Everything that sticks to the public APIs should continue to work as before. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I0d02e4b3f4bbf4eb5a7391028aa31e999f9da915 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15286 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-01-05 23:28:32 +00:00
Mike Gerdts	9fd2f931cd	bdev: claim_module becomes claim.v1.module In preparation for an updated claims API, refactor bdev->internal.claim_module into a union that will eventually hold different information based on the type of claim. Change-Id: I7ade6f03128bdb0f8375a95ae953cb63d6aa686d Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15285 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-01-05 23:28:32 +00:00
Mike Gerdts	93b53c0268	bdev: call bdev_ok_to_examine() once per examine This calls bdev_ok_to_examine() once per bdev_examine(). Prior to this commit, bdev_ok_to_examine() may be called up to twice per bdev module. The results returned by bdev_ok_to_examine() could be affected by: 1. g_bdev_opts.bdev_auto_examime changing 2. spdk_bdev_examine() being called on a particular bdev 3. An alias being added for an existing bdev It's not clear that anything good comes from racing in conditions 1 and 3. In condition 2, spdk_bdev_examine() calls bdev_examine(), so any required examine_config() and examine_disk() calls are still made, just now with less of a race with the previous invocation of spdk_examine_confg(). Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I496fc44fd74693837d6b449d7fa60f58f9dbf36f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15284 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2023-01-05 23:28:32 +00:00
Mike Gerdts	7241a075be	bdev: hold spinlock while changing claim_module This closes races between concurrent spdk_bdev_module_claim_bdev() and/or spdk_bdev_module_release_bdev() calls affecting the same bdev by holding bdev->internal.spinlock while claiming and releasing a bdev. It also closes a potential TOCTOU bug in bdev_finish_unregister_bdevs_iter() that optimizing compilers probably already eliminate, and documents that bdev->internal.claim_module is protected by bdev->internal.spinlock. This can be removed when the bdev_register_examine_thread deprecation is removed. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ib48552df065d5172139a61bbc00b391f36552c0c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15282 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-01-05 23:28:32 +00:00
Mike Gerdts	b5075dcc5b	bdev: action_in_progress counting is racy Since bdev_examine() can happen on any thread and it happens without any other lock being held on the spdk_bdev_module, it is possible for multiple threads to try to simultaneously increment module->internal.action_in_progress. Decrements may also race. This commit adds bdev_module->internal.spinlock and holds it while modifying module->internal.action_in_progress. This can be removed when the bdev_register_examine_thread deprecation is removed. Change-Id: I9c401eeb3c7c97c484e16fa9cfd82668b32e508b Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15281 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-01-05 23:28:32 +00:00
Mike Gerdts	a6e58cc44c	bdev: examine and register on app thread This introduces a deprecation for calling spdk_bdev_register() and spdk_bdev_examine() on a thread other than the app thread. The deprecation period starts in SPDK 23.01 and removal is expected in SPDK 23.05. The intent of this deprecation is to ensure that bdev modules' examine_config() and examine_disk() callbacks are only ever called on the app thread. This is largely a formalization of what has long happened due to the RPC poller running on the first thread started by spdk_app_start(). Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ic9d7b87b6522be20357d2eab2d0c77cd5753452f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15690 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Mateusz Kozlowski <mateusz.kozlowski@intel.com>	2023-01-05 23:28:32 +00:00
Sebastian Brzezinka	be59f5d513	nvmf/vfio_user: add numdw to avoid signed integer overflow This patch fixes issue #2835. Signed-off-by: Sebastian Brzezinka <sebastian.brzezinka@intel.com> Change-Id: Ide49314c39a17e1da78303e59dde5855a0ee38a0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16029 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-05 23:27:12 +00:00
Fengnan Chang	958d4e0e05	nvme: fix memleak when submit request failed Some memory is allocated in nvme_allocate_request_user_copy and the request is submitted through nvme_qpair_submit_request. If the nvme ctrlr is failed or the qpair state does not meet the requirements, submit returns -ENXIO and calls nvme_free_request(), but that does not free req->payload.contig_or_cb_arg; this memory only gets freed when the request is actually completed, through nvme_user_copy_cmd_complete(). Fix this by adding a check when submit fails. Fixes issue #2832 Change-Id: I54f0fc60dbb53ced9f52da7d89017be13db2eee1 Signed-off-by: Fengnan Chang <changfengnan@bytedance.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15985 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-05 23:26:42 +00:00
Fengnan Chang	02ecb2dcba	nvme: make submit request error handle in one place Set rc to -ENXIO and goto error so that all error handling happens in one place, which makes it easy to add more checks in a later patch. Change-Id: I13edeef75bbf6c52e18d6b94b78c2e560012bfee Signed-off-by: Fengnan Chang <changfengnan@bytedance.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16004 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-01-05 23:26:42 +00:00
Michael Haeuptle	7706450f2a	nvme_rdma: Support TOS for RDMA initiator The spdk_nvme_ctrlr_opts now supports a transport_tos option that allows setting of the 'type of service' value in the IPv4 header. This is needed to support lossless RoCE setups. Note: Only RDMA is supported at this point. Change-Id: I21825fc197c60f539a7d2d651a970ea380d8b56d Signed-off-by: Michael Haeuptle <michael.haeuptle@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15908 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-05 19:54:53 +00:00
Shuhei Matsumoto	ce92d919d7	nvme: Add a helper function to return status type string Add spdk_nvme_cpl_get_status_type_string() to return ASCII string for the type of an error. Append a dummy entry to return "RESERVED" for unknown types. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibc07132ee067f146ac149884c6344f313bfcbfff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15835 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-04 08:22:31 +00:00
Shuhei Matsumoto	8f990f5e47	nvme: Update status-string array to add newly or missing status codes spdk_nvme_cpl_get_status_string() will be used to count and display NVMe specific errors via JSON-RPC. This patch is a preparation. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ia96890172d752d2906549e3033c0b26eef9c20bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15834 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-01-04 08:22:31 +00:00
Shuhei Matsumoto	8c439a6799	bdev: Add function pointers to display and reset module specific I/O statistics When querying or resetting module specific statistics, the generic bdev layer has to access them. For this purpose, add function pointers. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ie86d0a4a406cec7e0f1e9a62de5982cd3d877eae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14839 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-04 08:22:31 +00:00
Shuhei Matsumoto	53a9a8c4d1	bdev: Add counts per I/O error status into I/O statistics Define struct spdk_bdev_io_error_stat privately in lib/bdev/bdev.c. Add a pointer to struct spdk_bdev_io_error_stat to struct spdk_bdev_io_stat. Allocate spdk_bdev_io_error_stat for bdev and RPC, but do not allocate spdk_bdev_io_error_stat for I/O channel. Dump the contents of spdk_bdev_io_error_stat only if its total is non-zero. As a result of these, only spdk_bdev_get_device_stat() can query spdk_bdev_io_error_stat for the bdev_get_iostat RPC. This will be acceptable. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Idae868afe65347a96529eedc3dcc692101de4a29 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14826 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2023-01-04 08:22:31 +00:00
Shuhei Matsumoto	c134d11ca7	bdev: Rename io_stat helper functions to bdev_ + verb + _io_stat The following patches will make some of io_stat helper functions public APIs. Then, for consistency, bdev_ + verb + _io_stat will be better naming rules. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If36d4ed29253e87954c23c270e8414731d083f03 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15896 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-01-04 08:22:31 +00:00
GangCao	46d02f3e95	lib/nvme: add the NULL check after getting ns Change-Id: Ib6188269dfce1a9229850b06dc61d8bfc0ede74a Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16072 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2023-01-03 07:59:59 +00:00
Jim Harris	c695156049	iscsi: add EXITING conns to pg after full_feature_migrate Commit `41f59559e` added code to skip adding EXITING connections to the new poll group in the full_feature_migrate message callback. The problem is that since the connection is in EXITING state and is not in a poll group, it will never move to EXITED state, nor get removed from g_active_conns, and hence will block the iscsi subsystem from being able to shutdown. So instead, assert that the connection is not in EXITED state. If it is in EXITING state, we will add it to the poll group, and then when the poll group is next polled, it will destroy the connection, moving it to EXITED state and removing it from the g_active_conns STAILQ. This fix is related to issue #2416. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie8e64c811a5602ba4b28871bc535f5fa49dffc18 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16019 Reviewed-by: Michal Berger <michal.berger@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-12-23 09:27:48 +00:00
GangCao	56f5f7e9d4	lib/iscsi: missing a comma for the string Change-Id: I67f2b73923c2ea0fe985c4a92f6f72cd2fb4a438 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16008 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: wanghailiang <hailiangx.e.wang@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot	2022-12-20 09:20:31 +00:00
GangCao	de02db6366	lib/nvmf: check the return value of the resume operation Change-Id: I87975e8cfc450463f46f00e90b4c6ff1744014ee Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16007 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:19:57 +00:00
Mike Gerdts	cc27c1ab11	blobstore: missing lock leads to md page race Many parts of the blobstore.c seem to have gone with the assumption that blob creation, deletion, etc. all happen on the md thread. This assumption would allow modification of the bs->used_md_pages and bs->used_clusters bit arrays without holding a lock. Placing "assert(spdk_get_thread() == bs->md_thread)" in bs_claim_md_page() and bs_claim_cluster() shows that each of these functions is called on other threads due to writes to thin provisioned volumes. This problem was first seen in the wild with this failed assertion: bs_claim_md_page: Assertion `spdk_bit_array_get(bs->used_md_pages, page) == false' failed. This commit adds "assert(spdk_spin_held(&bs->used_lock))" in those places where bs->used_md_pages and bs->used_lock are modified, then holds bs->used_lock in the places needed to satisfy these assertions. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I0523dd343ec490d994352932b2a73379a80e36f4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15953 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:19:09 +00:00
Mike Gerdts	67c7e85809	blobstore: use common return path in bs_create_blob() A future commit will add to the complexity when returning with a non-zero value. Rather than further complicating the several error return locations, all affected error returns are handled after the error label. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I56e8e338b0560f849399c085d0bb07efb7df26fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15983 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:19:09 +00:00
Mike Gerdts	c1544908e0	blobstore: use common return path in blob_resize() A future commit may need to release a lock before returning. This refactors blob_resize() to always return at end of the function using an out label and goto. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I671fbdbe0e3b766c264c45589dad3a864ba1f192 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15982 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:19:09 +00:00
Mike Gerdts	316cf9ef99	blobstore: convert used_lock to spinlock Convert bs->used_lock to a spinlock. This is being done to help with the debugging and fixing of a race that has led to a failed assertion in bs_claim_md_page. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I11b80096de022f79a217c65d787ee57ca54240f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15952 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:19:09 +00:00
Mike Gerdts	2a608d0241	blobstore: rename used_clusters_mutex to used_lock The bs->used_clusters_mutex protects used_md_pages, used_clusters, and num_free_clusters. A more generic name is appropriate. The next patch in this series will convert it from a mutex to a spinlock and having "mutex" or "spin" in the name is of little help to maintainers, so a more generic name is used. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I5ce7b85b84fdec2a0c5d2ac959e0109e1d80c7f5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15981 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:19:09 +00:00
GangCao	58549382d0	lib/jsonrpc: check the return value from setsockopt Change-Id: I47c0635dcc53e28a8c7cfa85416b42c6475a3b65 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15915 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-20 09:17:59 +00:00
GangCao	4f4bf8c482	lib/env_dpdk: add a valid check before fclose Change-Id: I43fc46500aa95a1f34365d0ac269dc1aa4b4bfa6 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15955 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-12-16 09:43:47 +00:00
GangCao	1450c5470b	lib/bdev: send back the eligible QoS IO to the original thread Fix issue: #2815 Change-Id: Ic1533b9ed055734a721be0fd7159754e5db1791b Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15917 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-12-16 09:43:28 +00:00
Jim Harris	e39512ec18	nvmf: add completed_nvme_io to nvmf_poll_group_stat Basic IO completion counting can be done at the common layer, to enable some level of stat tracking even for transports that don't have transport-specific tracking yet. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: If04f854b97440089b8ad149b64cb59173c73975c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15912 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-16 09:27:50 +00:00
Tomasz Zawadzki	32e6ffb55c	env_dpdk: add support for DPDK main branch for 23.03 For validation of upcoming DPDK releases, pci_dpdk needs to initialize and work. This patch adds support for testing DPDK main branch, with appropriate notice given when that DPDK version is used. Change-Id: I5257beac3a3926bd432d9c00e50858facd21e6f5 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15891 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-12-16 09:27:11 +00:00


9958 Commits