ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Rui Chang	4274fe55c9	nvmf/vfio-user: add copy support in vfio-user Fix req length issue in supporting copy command in vfio-user. Signed-off-by: Rui Chang <rui.chang@arm.com> Change-Id: If4ec325777e1a1f00d15edb2fea4dc85016b3b95 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17279 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-24 07:26:14 +00:00
Michal Berger	ede1caf025	lib/vhost: Rename rte_vhost_slave_config_change() As per https://github.com/DPDK/dpdk/commit/71998eb61ff Change-Id: Ie4e5a38976145e1037ef45593b4dc4265091482d Signed-off-by: Michal Berger <michal.berger@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17322 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Pawel Piatek <pawelx.piatek@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-24 07:23:19 +00:00
Peng Lian	b13ee3005d	nvmf: clean sgroup->queued in _nvmf_qpair_destroy when ctrlr is NULL Let us consider the following process: 1. one fabric connect request A comes but the subsystem is paused due to adding/removing ns or other operations, so this request A will be put into sgroup->queued until the subsystem becomes active; 2. the subsystem is paused for a long time until the connect timeout, related qpair is destroyed, the sgroup->queued will not be cleaned because qpair's ctrlr is NULL; 3. if a new request B comes, it is more likely to be allocated to the same memory as the previous fabric command request. And it will be put into sgroup->queued again, where has already exists the exactly same pointer with request B. This leads to the pointer hanging problem and it will cause infinitely loop when traversing sgroup->queued! So this patch avoids the ptr-hanging problem by checking and cleaning all sgroups queued req whose qpair is the being destroyed qpair in _nvmf_qpair_destroy when ctrlr is NULL. This problem is already described in issue #2133. Signed-off-by: Peng Lian<peng.lian@smartx.com> Change-Id: I909d673b5050f21fa193914cc4ffe6634232fa7d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17147 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2023-03-22 10:11:30 +00:00
Mike Gerdts	c64ce716e4	blob: add spdk_blob_is_esnap_clone Add an API to easily determine if a blob is an esnap clone, similar to what already exists for snapshot, clone, and thin_provisioned. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ie07cd09b30513893e82f1c85e94a24a93c79d71e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16862 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2023-03-22 09:39:29 +00:00
Mike Gerdts	2948183f2b	blob: prepare sequences for esnap channels When a sequence is used to perform IO on an esnap clone, differenent channels will be needed for the blobstore device and the esnap device. No special esnap handling is required when a sequence is used to perform IO directly on the blobstore device. This commit splits bs_sequence_start() into bs_sequence_start_bs() and bs_sequence_start_blob() to handle these two scenarios. A later commit introduces special handling of ensap clone blobs. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I3a6f46640cdb7fdc380bf557736638f1b39f05e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17172 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-03-22 09:39:29 +00:00
Mike Gerdts	31c2852bb8	blob: prepare sets for esnap channels For the various forms for read_bs_dev() and readv_bs_dev() to perform reads from esnap devices, the spdk_bs_request_set used for the IO needs to keep track of the back_bs_dev IO channel as well as the blobstore's IO channel. This commit has no change in functionality: it is preparation for a change in a later commit. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I8edd9c4bf29bc074194331b42c5ef9d27590ce88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14973 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-03-22 09:39:29 +00:00
Mike Gerdts	34d31cdc20	blob: refactor destruction of back_bs_dev External snapshots have a slightly more complicated cleanup of back_bs_dev. This moves all calls to back_bs_dev->destroy() into a function so that this more complicated cleanup can have a single implementation. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I78460aa3877481788118e2b0b76931dcf5c56338 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14972 Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-22 09:39:29 +00:00
Mike Gerdts	4d5ee263b1	blob: pass blob context to esnap_bs_dev_create When consumers open a blob with spdk_bs_open_blob_ext(), they can set esnap_ctx in struct spdk_blob_open_opts to have that context passed to bs->external_bs_dev_create(). Change-Id: I0c1a9cec0e5aed5ef2a7143103e822cbe400aabb Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14971 Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-22 09:39:29 +00:00
Amir Haroush	6984eff3c5	ocf: fix env_ticks_to_{msec,usec,nsec} precision & accuracy - fix precision when one convert to seconds and then multiply we can have precision errors for example if one have 77ms, it will go to 0 when converted to seconds and then multiply that 0 by 1000 will return 0 instead of 77ms. - fix mismatch nsec/usec nsec was multiplied by 10001000 while usec by 100010001000 it should be the opposite. anyway the implementation had changed. - implementation description env_ticks_to_msec: j / (tick_hz / 1000) this is exactly the same as (j * 1000) / tick_hz (eq #2). but this implementation (eq #2) can only handle 54b in j (before overflowing) because of the multiplication by 1000 (10b). with the correct implementation we use all 64b in j. we assume that tick_hz will be prefectly divisible by 1000 so we are ok. * env_ticks_to_usec: j / (tick_hz / (1000 * 1000)) same as in msec case, we use all 64b in j. here we assume that tick_hz is perfectly divisible by (1000 * 1000) i.e. we assume that CPU frequency is some multiple of 1MHz. * env_ticks_to_nsec: (j * 1000) / (tick_hz / (1000 * 1000)) in this case we can't assume that tick_hz is divisible by 10^9 because there are many CPUs with 2.8GHz or 3.3GHz for example. so we multiply j by 1000 this means that we can only handle correctly j up to 54b. (64b - 10b, 10b for the *1000 operation) Signed-off-by: Amir Haroush <amir.haroush@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Change-Id: Ia8ea7f88b718df206fa0731e3f39f419ee922aa7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17078 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-21 19:08:59 +00:00
Amir Haroush	7c7267e931	ocf: fix env atomic64 functions arguments and return types atomic64 functions should operate with atomic64 and long types. Signed-off-by: Amir Haroush <amir.haroush@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Change-Id: I2ea8f1cc06d6df0f7dd5b9d628839138b78bc412 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17077 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-21 19:08:59 +00:00
Amir Haroush	a0d24145bf	ocf: fix ENV_WARN to use SPDK_WARNLOG instead of SPDK_NOTICELOG Signed-off-by: Amir Haroush <amir.haroush@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Change-Id: Ie5bbdb003573fdca6d56439f6a006749a29e9d6b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17076 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-03-21 19:08:59 +00:00
Jim Harris	7c3c0b6630	blob: track last md_page index correctly during resize During resize, we correctly determine if we have enough md_pages for new extent pages, before proceeding with actually allocating clusters and associated extent pages. But during actual allocation, we were incrementing the lfmd output parameter, which was incorrect. Technically we should increment it any time bs_allocate_cluster() allocated an md_page. But it's also fine to just not increment it at the call site at all - worst case, we just check that bit index again which isn't going to cause a performance problem. Also add a unit test that demonstrated the original problem, and works fine with this patch. Fixes issue #2932. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iba177a66e880fb99363944ee44d3d060a44a03a4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17150 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: 阿克曼 <lilei.777@bytedance.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2023-03-15 09:27:17 +00:00
Jim Harris	037c8b01a1	blob: remove short-circuiting path for blob_freeze If blob_freeze_io() is called twice in a row, and the second time occurs before the for_each_channel for the first completes, the second caller will receive its callback too soon. Instead just simplify the whole process, always do the for_each_channel and don't try to optimize it at all. These are infrequent operations - correctness and simplicity are in order. A few additional changes: 1) Make same changes for unfreeze path. 2) Add blob_verify_md_op() calls, just to be sure these are only called from md_thread. This was already checked in calling functions, but as these functions get called from new code paths (i.e. esnap clones) it can't hurt to add additional checks. 3) Add unit test that failed with original code, but passes with this patch. Fixes issue #2935. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ibefba554547ddf3e26aaabfa4288c8073d3c04ff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17148 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Mike Gerdts <mgerdts@nvidia.com> Community-CI: Mellanox Build Bot	2023-03-15 09:27:17 +00:00
Konrad Sztyber	3fbe74fd82	accel: don't modify user iovs when allocating buffers It is quite common for a user to use the exact same iovec (in memory) to describe buffers for two different operations. If that iovec was describing accel buffer, accel would modify it replacing it with an actual buffer. This is broken if that iovec was used by some other task in a sequence, as accel wouldn't be aware that it has been changed too. To address this, accel will use a new iovec from the aux_iovs array. It means that accel buffers always must be passed using a single iovec. Theoretically, users could chunk that buffer into several iovecs, but spdk_accel_get_buf() always returns a single buffer, so, in practice, this should never happen, and therefore is unsupported. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I25271bc032987dd6028fb7b3adde061657759b4b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17039 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	d69e6f64b3	bdev: prevent aborting reqs doing push/pull or accel seq exec Requests that have their data pushed/pulled from a memory domain or have an accel sequence executed aren't handled by a bdev module, so we shouldn't submit an abort request. Those operations cannot be aborted either, so the abort request is failed in this case. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Icd185c4a2951a555d321cd037de0af1ab157f37a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17020 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	250566568a	bdev: delay reset until accel/memory domain ops completion These operations are handled internally by the bdev layer, so it should first wait until they're completed before issuing reset to a bdev module. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I74f0d42dcb9a289aa7c3115ca309cb92870548e2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17019 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	000b9697e7	bdev: track IOs doing memory domain pull/push Similarly to requests executed by accel, we need to track bdev_ios that have their data pushed/pulled. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ie6b0d2c058e9f13916a065acf8e05d1484eae535 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16978 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	2326924683	bdev: track IOs executing accel sequence It will make it possible to check if a request is being processed by accel when doing resets/aborts. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ice07211df316e1eee9640e750ff8e176c8a3ca6f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16977 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2023-03-13 21:02:27 +00:00
Konrad Sztyber	04c222f2db	bdev: accel sequence support for read requests This patch enables passing accel sequence for read requests. The handling is pretty similar to writes, but the sequence is executed after a request is completed by a bdev module. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I79fd7d4873265c81a9f4a66362634a1c4901d0c9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16975 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	22c0e97884	bdev: accel sequence support for write requests It is now possible to submit a write request with a sequence of accel operations that need to be executed before actually writing the data. Such requests will be directly passed to a bdev module (so that it can append subsequent operations to an accel sequence) if that bdev supports accel sequences and the request doesn't need to be split. If either of these conditions are not met, bdev layer will execute all the accumulated accel operations before passing the request to a bdev module. The reason for not submitting split IOs with an accel sequence is that we would need to split that accel sequence too. Currently, there's no such functionality in accel, so we treat this case in the same way as if the underlying bdev module didn't support accel sequences (it's executed before bdev_io is split). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I66c53b3a1a87a35ea2687292206c899f80aaed4a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16974 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	54a935a669	bdev: cache whether IO needs to be split bdev_io_should_split() adds some non-zero overhead, so checking it multiple times in an IO path is inefficient. So, to avoid that, call bdev_io_should_split() once during IO initialization and cache the result in bdev_io. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I1da6514d409f8a4e4bbb14722dd53b2c88988cac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17058 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	f555961ff1	bdev: move bdev.submit_request() to a function Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I64556e1ae3241fc69fa68fec7568c50db9152d7f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16973 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	80b22cf314	bdev: allocate accel_channel for each bdev_channel This channel will be used to execute accel operation sequences. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ied4bb57d14a50a923908ffb13ef4ba34ca65175c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16972 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	1be4e82d15	bdev: allow bdevs to report accel_sequnce support Modules can now report that they support accel chaining for specific operations through the accel_sequnce_supported() callback. The support is reported per IO type. This allows modules to support accel sequences for some operations, while relying on the bdev layer to handle them for other IO types. Only bdevs without separate metadata buffers are allowed to support this new mode. That's because metadata in separate buffer is expected to use the same memory domain as data buffers. With an accel sequence, those data memory domains can change, while metadata's memory domain always stays the same. To support bdevs with separate metadata buffers, we'd need to add separate pointers for metadata's memory domain. For now, simply disallow registering bdevs with separate metadata supporting accel sequences. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I0c49cc00096837d70681a69b2633c2cb3dfd4e39 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16971 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	9beb6b163c	bdev: fix completion for unsubmitted IOs If an IO is completed, before submitting it to a module, it isn't put on the io_submitted list, so we can't use bdev_io_complete() to complete it, as it'll break that list. To avoid that, a new function was added, bdev_io_complete_unsubmitted(), that will safely complete the IOs in such case. For now, it's equivalent to executing user's completion callback, but it'll serve as a good place to release any resources that should be freed before an IO is completed. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I1442ead9d272d9210553803bed1d1c989a2bf761 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16970 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	7b71fdc28c	accel: move spdk_accel_get_memory_domain() to accel.h This function can be useful in places other than accel modules (e.g. to check if a buffer belongs to accel), so it needs to be declared in accel.h. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I8fdd58b2ed40dc4a4acce2a8d3e1c5f76944c929 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16969 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-13 21:02:27 +00:00
Konrad Sztyber	4832235759	accel: re-enable iobuf thread caches They were disabled before the v23.01 release, because none of the other libraries were using the new spdk_accel_append_* API. But now, they will be used in the bdev layer and bdev modules, so they need to be re-enabled. We're using the same values as we do in the bdev layer. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ibda86ca5619e4104e107048ce0965171501fdc5a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16968 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-13 21:02:27 +00:00
zhipeng Lu	216991edec	vhost_rpc:add sessions information for vhost_get_controllers we can get sessions information by vhost_get_controllers Signed-off-by: zhipeng Lu <luzhipeng@cestc.cn> Change-Id: I8e63aea64d02b3467a62f30a712e1dcbf6fb8854 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16315 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2023-03-13 07:59:08 +00:00
Mike Gerdts	a4a73fec9c	blob: pass bs context with esnap_bs_dev_create When a blobstore consumer creates or loads a blobstore, it should be able to set a per-blobstore context pointer that will be passed back to the consumer via bs->esnap_bs_dev_create(). Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I59c0ebe21eaf65c3d79a4ac3469715283f56313a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14970 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-13 07:57:24 +00:00
Tomasz Zawadzki	1e39a6df17	lib/nvme_rdma: return negated error from nvme_rdma_parse_addr All paths in nvme_rdma_parse_addr(), except the one in this patch already returned negated error values, so fix it. Change-Id: I615956e4139f70bfc171bcab94e6e89f60e62ac3 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17098 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-10 16:44:37 +00:00
Tomasz Zawadzki	f6866117ac	freebsd: return negated error from getaddrinfo() On FreeBSD getaddrinfo() report positive error code values, meanwhile Linux does it with negative ones. Make sure that regardless of the system used, error codes with same sign are reported. This can be observed in the log reported in #2936. Besides the above, in some instances replaced EINVAL with the actual return value. Change-Id: I7f88c314bdf5c3a03f8661c2213e33b2fc276ef7 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-10 16:44:37 +00:00
Tomasz Zawadzki	f92411c4da	lib/nvme_tcp: check destination port before parsing address nvme_tcp_parse_addr() uses getaddrinfo() to parse the address. Depending on the system behavior of this function differs. On FreeBSD the port is verified not to be exceeding 65535 for IPv4, meanwhile Linux does not check it at this point. test_nvme_tcp_qpair_connect_sock() UT was attempting to test the code path that is moved in this patch, but on FreeBSD was encountering failure during getaddrinfo() with different error code. This patch moves the destination port check before parsing addresses to take the same path regardless of the system used. Fixes #2936 Change-Id: I271e8c32e07a15dcf0e0ee7e90dd174c96b18858 Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17095 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-10 16:44:37 +00:00
Alexey Marchuk	ada9333423	nvme/tcp: Report correct max sge NVME TCP driver supports up to 16 sge elements while only 1 sge is reported - that leads to unnecessary requests split which degrades perf. Also pass correct iovcnt to nvme_tcp_build_iovs - it should be 32. Otherwise, pdu header consumes 1 iov and data is written partially. Add a check that at least data_len bytes were appended to the socket iovs and fail request otherwise. Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: Ie83c807dd3fec2c7e7cbcda1e493d6fd74ebe599 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17006 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-03-10 11:43:40 +00:00
Krzysztof Karas	a8f7d7cf0a	trace: update description of tracepoint number Make it clear that number of entries might not be equal to the number of recorded traces, as some of the latter might occupy two entries due to their length. Change-Id: I3099cfb719c38bdee48fbe20fccef3ef43e820a3 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16916 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-03-10 11:42:23 +00:00
Shuhei Matsumoto	7413e1e497	nvme: Initialize cpl->sqid when aborting requests for RDMA and TCP nvme_rdma_qpair_abort_reqs() and nvme_tcp_qpair_abort_reqs() did not initialize cpl->sqid. Hence, unexpected message was printed by spdk_nvme_print_completion(). Fix the bugs in this patch. Fixes #2930 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I8b41166e58b26ce22c453ab85794b46dbe3dd3a2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17067 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-03-09 23:31:15 +00:00
Shuhei Matsumoto	a475aed55d	nvme_pcie: Call transport_ctrlr_disconnect_qpair_done() if ctrlr_disable_poll() failed nvme_ctrlr_disable_poll() continued to be called until it returned 0. However, if the corresponding drive was unresponsive, the continuous calls consumed CPU and affected other operations. If the corresponding drive is unresponsive, we cannot complete disabling the controller. Hence, call nvme_transport_ctrlr_disconnect_qpair_done() if nvme_ctrlr_disable_poll() returned any value other than -EAGAIN. Even before this patch, nvme_ctrlr_disable_poll() collected an error log if it failed. Hence, we do not have to add more error logs. Fixes issue #2931 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I26cabb94e5744e3a2d975670adbf2e4e48d5bd7a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17002 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-09 23:31:15 +00:00
Shuhei Matsumoto	cb2a73304b	nvme_pcie: Fix nvme_qpair_abort_all_queued_reqs() to be called for adminq By the patch `736b9da034` nvme_qpair_abort_all_queued_reqs() was changed to be called after the adminq is actually disconnected. However, the patch `ac31590b37` unexpectedly disabled to call nvme_qpair_abort_all_queued_reqs() for adminq because qpair->active_proc is NULL for adminq. Add one more condition to nvme_transport_ctrlr_disconnect_qpair_done(). Fixes issue #2928 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic65f4cd952e6e89275788ff4b86ceca050f624d5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17001 Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-03-09 23:31:15 +00:00
Ankit Kumar	eb37d519b8	lib/nvme: include dword 13 field to extendable command structure Add command dword 13 field to the extendable structure spdk_nvme_ns_cmd_ext_io_opts. This now enables us to pass dspec and dsm fields. Signed-off-by: Ankit Kumar <ankit.kumar@samsung.com> Change-Id: Id4d3dac14fdbf0e2a57e0bf287551dfd827dd503 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16945 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2023-03-09 23:22:41 +00:00
Swapnil Ingle	1afb1effc4	nvmf/vfio_user: simplify cq_is_full() Made cq_is_full() as wrapper around cq_free_slots() Signed-off-by: Swapnil Ingle <swapnil.ingle@nutanix.com> Change-Id: I392f62e959c7e23b4360e77759027ea55c2398b9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16789 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2023-03-08 08:45:00 +00:00
Swapnil Ingle	23b518a013	nvmf/vfio_user: mitigate cq full race Linux host nvme driver processes all pending cqe's in one batch along with completing backing blk_mq req's and later rings cq_doorbell once for all processed cqes. As blk_mq req's are completed there is room for more submissions before ringing cq_doorbell. This may race with vfio_user cq_is_full() which uses cq_doorbell to make final decision and as host has not updated cq_doorbell we fail with cq_full error. To mitigate this only process commands from sq which have free cq slot. Signed-off-by: Swapnil Ingle <swapnil.ingle@nutanix.com> Change-Id: I0cefb41df8099eb71de25923d05a9fcb28e4d124 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16788 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2023-03-08 08:45:00 +00:00
Rui Chang	8613654074	bdev: Add default copy command support in bdev Add default copy command support in bdev layer for backing devices that does not support copy command. Signed-off-by: Rui Chang <rui.chang@arm.com> Change-Id: I5632e25544e95ac0c53ff91c4cd135dac53323ae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16638 Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-07 11:52:45 +00:00
sijie.sun	e44d631724	nvme_rdma: handle DEVICE_REMOVAL event in RDMA initiator When IBV_EVENT_DEVICE_FATAL & RDMA_CM_EVENT_DEVICE_REMOVAL occurs, destroy qpair immediately and do no assume that no successful WQE will be received after rdma_disconnect. Signed-off-by: sijie.sun <sijie.sun@smartx.com> Change-Id: I23e44dd32c8adea301e5251659b1be519f5dfdf7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16314 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2023-03-07 11:50:05 +00:00
sijie.sun	549be9ad81	nvmf/rdma: Recreate resources and listeners after IB device is hotplugged IB device may be unplugged & hotplugged when modifying slaves of bonding IB devices. This patch will try to recreate ibv device contexts, poller and listeners after IB devices come back. Signed-off-by: sijie.sun <sijie.sun@smartx.com> Change-Id: I3288174bad847edc2d9859cb34aa93c6af8c673b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15616 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-03-07 11:50:05 +00:00
sijie.sun	8ddc5cd4a7	nvmf/rdma: Destroy all related resources after IB device removed When IBV_EVENT_DEVICE_FATAL & RDMA_CM_EVENT_DEVICE_REMOVAL occurs, destory all userspace resources such as qp, poller and ibv_context. Signed-off-by: sijie.sun <sijie.sun@smartx.com> Change-Id: Ie4832e4804eb572d6ec3bdc44fb7f9339f443d7e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15615 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-03-07 11:50:05 +00:00
Jim Harris	584d295245	nvmf/fc: fix memleaks Submitted by @udayawati via GitHub comment on issue #2872. Fixes issue #2872. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id662fc0178f6112dfe791733bda43f634107403f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16932 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2023-03-06 13:21:38 +00:00
Mike Gerdts	ce67e0c787	blob: clones of external snapshots This is the beginning of support for external snapshots. An external snapshot is a read-only blobstore device (struct spdk_bs_dev) that can be used as a blob's back device. Normally a blob will have no back device (a normal blob), a zeroes back device (a thin provisioned blob), or a blob back device (a clone blob). When a blob has an external snapshot ("esnap") as its back device, it is called an esnap clone. With this patch, esnap clones can be created but they are not yet useful. Subsequent patches in the series will plumb the IO path, enable various features, and allow lvol bdevs to be esnap clones. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I29206b628a2b03b6386a88532565e228df988e0e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14969 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-03-03 11:25:35 +00:00
Marcin Spiewak	fa272c9bc6	lib/idxd: Removed SPDK_IDXD_FLAG_PERSISTENT flag Removed SPDK_IDXD_FLAG_PERSISTENT flag and associated code. Change-Id: Ib4e038794792ae9866bdf344f1ec58dd04dbd483 Signed-off-by: Marcin Spiewak <marcin.spiewak@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16986 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-03 11:20:03 +00:00
Marcin Spiewak	27d5b38523	lib/accel: remove PMEM support from accel library This patch removes references to deprecated PMEM from accel library. The code that was executed when ACCEL_FLAG_PERSISTENT flag is set, is no longer needed and is removed. _sw_accel_copy() function is removed and replaced with memcpy(), as after PMEM removal its functionality is the same as memcpy(). _sw_accel_dualcast() is no longer needed, replaced with direct calls to memcpy() Removed 'flags' parameter - it is no longer needed accel_ut.c: removed references to PMDK deprecation.md updated ACCEL_FLAG_PERSISTENT flag will be removed in next patch. Change-Id: I86130466fe7a5f6ee547df1517b803035ff41a7a Signed-off-by: Marcin Spiewak <marcin.spiewak@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16899 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-03 11:20:03 +00:00
Konrad Sztyber	e6a8401a1d	accel: execute accel sequences using a driver If a driver is registered and selected, it'll now be used to execute sequences of accel operations. The driver has priority over accel modules, so the modules will only be used to execute operations that the driver cannot perform. Once driver completes a task (or a number of tasks), it notifies accel using standard spdk_accel_task_complete(). To let accel continue processing a sequence, driver can call spdk_accel_sequence_continue(). This can be done when the driver executes all tasks (1), an error occurs (2), or the driver doesn't know how to execute a given opcode (3). In case of (3), that operation will be executed using appropriate accel module and, while the rest of the sequence will be sent back to the driver. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If414c02073ffc731454e03d25c7ee02bef58463b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16548 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-03-01 09:46:40 +00:00
Richael Zhuang	fed358a0e7	util: fix misaligned load for uint64_t type The following error was reported when running gpt_ut which is related to crc32_update(). "load of misaligned address 0x001ffeff78cc for type 'const uint64_t', which requires 8 byte alignment". This patch preprocesses the first several bytes to make the buf address passed to __crc32_d or__crc32_cd is 8 byte aligned. And finally process the trailing bytes. For function spdk_crc32c_update in crc32c.c, memcpy was used to avoid misaligned load problem. Update it with above solution to reduce extra overhead. Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: I7c7aaa41e1c042a96668158818b06729fb3ceec6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16801 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-03-01 08:56:01 +00:00

1 2 3 4 5 ...

10167 Commits