ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Mike Gerdts	54adced8e5	blob: log blob ID as hex, again This is a followup to commit `f4dc558245` which strove to log blob IDs as hex to make small blob IDs more recognizable. That commit missed a few cases where the blob ID is logged as decimal. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I75d1b5973ee7e812f7caf0e826d3edbcba126743 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17641 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Amir Haroush	8d65ab7476	OCF: fix compilation dependencies We don't have dependency files for OCF sources/headers. For example, if someone runs 'touch metadata_collision.h', nothing is recompiled. With this fix, all the relevant files are recompiled. Signed-off-by: Amir Haroush <amir.haroush@huawei.com> Signed-off-by: Shai Fultheim <shai.fultheim@huawei.com> Change-Id: I35b1c1f80a60f4be59cdca95f68bbafc7a212774 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17914 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	509652cc4b	bdev: Use unified split logic for write_zeroes command fallback Write_zeroes command fallback had used its own split logic but multiple writes had been serialized. Use the unified split logic also for the write_zeroes command fallback. This not only improves the performance but also simplifies the code. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I955870947ae036482871453b4870f06f6f7f947b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17902 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	9454d16810	bdev: Calculate max_write_zeroes once at bdev registration for fallback case As with the copy command, the calculation of the max write_zeroes size for the fallback case includes a division and is costly. The result is constant for each bdev. Hence, we can calculate it only once and store it into bdev->max_write_zeroes at bdev registration. However, in unit tests, bdev->blocklen and bdev->md_len can be changed dynamically. Hence, adjust bdev->max_write_zeroes for such changes. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I16e4980e7a283caa6c995a7dc61f7e77585d464e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17911 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	387bdaca1b	bdev: Fix max write_zeroes calculation for fallback case ZERO_BUFFER_SIZE is in bytes but it is easier to calculate max write_zeroes in blocks first and then get the minimum between max write_zeroes in blocks and remaining_num_blocks rather than converting remaining_num_blocks to num_bytes. This is helpful to store the result into bdev->max_write_zeroes for fallback case. We have one small fix in this patch. As we recently fixed bdev_io_get_max_buf_len(), to get aligned length, spdk_bdev_get_buf_align() - 1 is correct. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I104bc837c9eee1303664bfdb3559b0e840d6f0e5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17910 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	95975beb1c	bdev: Copy command fallback supports split to make copy size unlimited The generic bdev layer has a fallback mechanism for the copy command used when the backend bdev module does not support it. However, its max size is limited. To remove the limitation, the fallback supports split by using the unified split logic rather than following the write zeroes command. bdev_copy_should_split() and bdev_copy_split() use spdk_bdev_get_max_copy() rather than referring to bdev->max_copy to include the fallback case. Then, spdk_bdev_copy_blocks() does the following. If the copy size is large and should be split, use the generic split logic regardless of whether copy is supported or not. If copy is supported, send the copy request, or if copy is not supported, emulate it using regular read and write requests. Add unit test case to verify this addition. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iaf51db56bb4b95f99a0ea7a0237d8fa8ae039a54 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17073 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	1a5af9305f	bdev: Small clean up for copy command fallback As name suffix, _done has been used more often than _complete for fallback function names. 100 chars per line is suggested implicitly. Do these small clean up in this patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id14dd3f09be8fd49b947b7a8f8b87108fb56c346 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17900 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	fa2c6446fa	bdev: Calculate max_copy once at bdev registration for fallback case Calculation of max copy size for fallback case includes division and is costly. The result is constant for each bdev. Hence we can calculate it only once and store it into bdev->max_copy at bdev registration. Calculation of max copy size for fallback case is almost same as calculation of max write zero size for fallback case. To reuse the calculation, the helper function is named as bdev_get_max_write() and has a num_bytes parameter. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Iac83a1f16b908d8b36b51d9c51782de40313b6c8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17909 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Shuhei Matsumoto	4d37b13cf7	bdev: Fix spdk_bdev_get_max_copy() for fallback case As we recently fixed bdev_io_get_max_buf_len(), to get aligned length, spdk_bdev_get_buf_align() - 1 is correct. _bdev_get_block_size_with_md() considers both interleaved metadata and separate metadata cases. It is simpler to use _bdev_get_block_size_with_md(). The copy command fallback uses write command. As the write zeroes fallback does, bdev->write_unit_size should be considered. Fix all in this patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I88fe1b250289f2bab7b541523e8be931eeb8150c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17899 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	80ab43ae97	thread: detect spinlocks that are not initialized If spdk_spin_lock() is called on an uninitialized spinlock, it will deadlock. This commit detects whether a lock is initialized and aborts instead of deadlocking. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ie7497633091edd4127c06ca0530e9a1dff530d1b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16002 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	4077eff3ae	thread: spinlock aborts print stack traces Debug builds have information about when each spinlock was initialized, last locked and last unlocked. This commit logs that information when a spinlock operation aborts. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I11232f4000f04d222dcaaed44c46303b7ea6cf6b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16001 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	e85368a325	thread: get debug stack traces on spinlocks To help debug spinlocks, capture stack traces as spinlocks are used. Future commits in this series will make debugging with these stack traces easier. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I597b730ca771ea3c5b831f5ba4058d359215f7f6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15998 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	9b687d7753	bdev_part: allow UUID to be specified This introduces spdk_bdev_part_construct_ext(), which takes an options structure as an optional parameter. The options structure has one option: uuid. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I5e9fdc8e88b78b303e60a0e721d7a74854ac37a9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17835 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	f5962f9145	bdev: add extra function when pushing bounce data This is done in preparation for retrying IOs on ENOMEM when pushing bounce data. Also, rename md_buffer to md_buf to keep the naming consistent with other code which uses this abbreviation. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I014f178a45a2a751ecca40d119f45bf323f37d0c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17762 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	d89b59610d	bdev: retry IOs on ENOMEM from pull/append_copy The IOs will now be retried after ENOMEM is received when doing memory domain pull or appending an accel copy. The retries are performed using the mechanism that's already in place for IOs completed with SPDK_BDEV_IO_STATUS_NOMEM. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I284643bf9971338094e14617974f7511f745f24e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17761 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	e6634bb501	bdev: count push/pull/seq_finish as io_outstanding The IOs with an outstanding memory domain push/pull or accel sequence finish operation are now added to the io_outstanding counter. It'll be necessary to correctly calculate nomem_threshold when handling ENOMEM from those operations. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ice1fb94f1c9054a3a96312a0960ac5085d0b21bc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17760 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	29651a8971	bdev: remove leading underscore from _bdev_io_(inc\|dec)rement_outstanding The leading underscore usually indicates that a function provides the actual implementation for something that's called from some other wrapper function without the leading underscore. That is not the case for these functions, so this patch removes the leading underscores. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6e1186b156116249ee53a3845ae99ba87db5122b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17868 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	1280245cb2	bdev: add _bdev_io_increment_outstanding() In the next patches we'll need to increment the io_outstanding from a few more places, so it'll be good to have a dedicated function for that. Also, move _bdev_io_decrement_outstanding() up, so that both functions are near each other. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I1af5dbe288f7f701c8ba5e85406f02330ae21a39 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17759 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Konrad Sztyber	444d4e8c35	bdev: add common sequence finish callback There are some common operations that need to be done each time a sequence is executed (and more will be added in the following patches), so it makes sense to have a common callback. data_transfer_cpl is used for executing user's callbacks since it's unused at this point. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I4570acbdbe158512d13c31c0ee0c7bb7bf62d18c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17678 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	417d3f6738	bdev: keep IOs on the io_memory_domain queue during pull/push The IOs are now kept on the io_memory_domain queue only if they have an outstanding pull/push operation. It'll make it easier to support retrying pull/push in case of ENOMEM. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If5a54fac532206ee8472bacf364a5ef6cde8edea Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17677 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	72345f3a69	bdev: allow different ways of handling nomem IOs This is a preparation for reusing the code handling nomem_io for other type of NOMEM errors (e.g. from pull/push/append_copy). This patch doesn't actually change anything functionally - only IOs completed by a module with SPDK_BDEV_IO_STATUS_NOMEM status are retried. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I12ecb2efcf2d2cdf75b302f9f766b4c16ac99f3e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17676 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Konrad Sztyber	268f5ee272	bdev: move adding IOs to the nomem_io queue to functions Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I0da93c55371652c5725da6cf602fa40391670da3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17867 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	6bb629ffa7	bdev: push bounce data only for successful IOs The actual memory domain push already only happened for successfully completed requests, but the code would still go through _bdev_io_push_bounce_data_buffer(), which could cause issues for IOs completed with NOMEM, because the bounce buffer would be released in _bdev_io_complete_push_bounce_done(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id1af1e31cb416e91bf11101a5ce7919530245e1e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17866 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	fe3978b26d	bdev: use parent_io when executing sequence for split IOs The sequence is associated with parent IO, so that's the IO that should be used when executing a sequence. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifcdb06094b38a5eaee1691e5aa8de1c8dc9d01a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17865 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	143ab947c1	bdev: move pulling md_buf to a function Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I935983a14bedc386ffe31abacc8fa200cd79f750 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17675 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	aef861c9a3	bdev: move pulling data to bounce buffer to a function Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Idbabcd5bd812cede6f5159ba0691b2dc28a4022a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17674 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	2f00e990fc	bdev: move resubmitting nomem IOs to a function Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I9f91af30ee1dd5f2568d9f76a30f00497aff6bbc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Jim Harris	c79cfb193b	nvmf: fix comparison in nvmf_stop_listen_disconnect_qpairs This function disconnects any qpairs that match both the listen trid and the subsystem pointer. If the specified subsystem is NULL, it will just disconnect all qpairs matching the listen trid. But there are cases where a qpair doesn't yet have an associated subsystem - for example, before a CONNECT is received. Currently we would always disconnect such a qpair, even if a subsystem pointer is passed. Presumably this check was added to ensure we don't dereference qpair->ctrlr when it is NULL but it was added incorrectly. Also while here, move and improve the comment about skipping qpairs. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8b7988b22799de2a069be692f4a5b4da59c2bad4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17854 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2023-05-09 17:58:11 +08:00
Jim Harris	4a47f1f926	usdt: add SPDK_DTRACE_PROBE variants that don't collect ticks While userspace probes have a high overhead when enabled due to the trap, it is still cleaner and slightly more efficient to not have all of the SPDK_DTRACE_PROBE macros implicitly capture the tsc counter as an argument. So rename the existing SPDK_DTRACE_PROBE macros to SPDK_DTRACE_PROBE_TICKS, and create new SPDK_DTRACE_PROBE macros without the implicit ticks argument. Note this does cause slight breakage if there is any out-of-tree code that used SPDK_DTRACE_PROBE previously, and programs written against those probes would need to adjust their arguments. But the likelihood of such code existing is practically nil, so I'm just renaming the macros to their ideal state. All of the nvmf SPDK_DTRACE_PROBE calls are changed to use the new _TICKS variants. The event one is left without _TICKS - we have no in-tree scripts that use the tsc for that event. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Icb965b7b8f13c23d671263326029acb88c82d9df Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17669 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2023-05-09 17:58:11 +08:00
Alexey Marchuk	e16f4bc7ce	lib/nvmf: Defer port removal while qpairs exist in poll group The following heap-use-after-free may happen when RDMA listener is removed: 1. At least 2 listeners exist, at least 1 qpair is created on each listening port 2. Listener A is removed, in nvmf_stop_listen_disconnect_qpairs we iterate all qpair (let's say A1 and B1) and we check if qpair's source trid matches listener's trid by calling nvmf_transport_qpair_get_listen_trid. Trid is retrieved from qpair->listen_id which points to the listener A cmid. Assume that qpair's A1 trid matches, A1 starts the disconnect process 3. After iterating all qpairs on step 2 we switch to the next IO channel and then complete port removal on RDMA transport layer where we destroy cmid of the listener A 4. Qpair A1 still has IO submitted to bdev, destruction is postponed 5. Listener B is removed, in nvmf_stop_listen_disconnect_qpairs we iterate all qpairs (A1 and B1) and try to check A1's listen trid. But listener A is already destroyed, so RDMA qpair->listen_id points to freed memory chunk To fix this issue, nvmf_stop_listen_disconnect_qpairs was modified to ensure that no qpairs with listen_trid == removed_trid exist before destroying the listener. Fixes issue #2948 Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: Iba263981ff02726f0c850bea90264118289e500c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17287 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	57b47f209f	lvol: esnap clones must end on cluster boundary When regular lvols are created, their size is rounded up to the next cluster boundary. This is not acceptable for esnap clones as this means that the clone may be silently grown larger than the external snapshot. This can cause a variety of problems for the consumer of an esnap clone lvol. While the better long-term solution is to allow lvol sizes to fall on any block boundary, the implementation of that needs to be surprisingly complex to support creation and deletion of snapshots and clones of esnap clones, inflation, and backward compatibility. For now, it is best to put in a restriction on the esnap clone size during creation so as to not hit problems long after creation. Since lvols are generally expected to be large relative to the cluster size, it is somewhat unlikely that this restriction will be a significant limitation. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Id7a628f852a40c8ec2b7146504183943d723deba Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17607 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Mateusz Kozlowski	35298940a8	lib/ftl: Give correct type for seq_id variables/return types Change-Id: I7d2fd31620481cf66f5f4400e6de4fc736ee3dad Signed-off-by: Mateusz Kozlowski <mateusz.kozlowski@solidigm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17608 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Anil Veerabhadrappa	bf84d7d814	nvmf/fc: delegate memory object free to LLD 'args' object in nvmf_fc_adm_evnt_i_t_delete() is actually allocated in the FC LLD driver and passed to nvmf/fc in nvmf_fc_main_enqueue_event() call. So this object should be freed in the LLD's callback function. Change-Id: I04eb0510ad7dd4bef53fc4e0f299f7226b303748 Signed-off-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-05-09 17:58:11 +08:00
Richael Zhuang	46113de13b	nvme_tcp: fix memory leak when resetting controller In the failover test, a memory leak of tqpair->stats is reported when detaching a tcp controller and failing over to the other controller. This is because, when resetting the controller, we first disconnect the controller and then reconnect. When disconnecting, the adminq is not freed, which means the corresponding tqpair and tqpair->stats are not freed. But when reconnecting, nvme_tcp_ctrlr_connect_qpair will allocate memory for tqpair->stats again, which causes the memory leak. So this patch fixes the bug by not reallocating memory for tqpair->stats if it's not NULL. We keep the old stats because from the user's perspective, the spdk_nvme_qpair is the same one. Besides, when destroying a qpair, the qpair->poll_group is set to NULL, which means if qpair->poll_group is not NULL, it should be a new qpair. So there's no need to check whether stats is NULL if qpair->poll_group is not NULL. So adjust the if...else... in _nvme_pcie_ctrlr_create_io_qpair. Change-Id: I4108a980aeffe4797e5bca5b1a8ea89f7457162b Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17718 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Ziye Yang	7be13fbd61	env_dpdk: optimizing spdk_call_unaffinitized Purpose: Reduce unnecessary affinity setting. For some usage cases, the app will not use spdk framework and already call spdk_unaffinitize_thread after calling spdk_env_init(). Change-Id: I5fa8349913c4567ab63c5a01271e7b2755e53257 Signed-off-by: Ziye Yang <ziye.yang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17720 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2023-05-09 17:58:11 +08:00
Krzysztof Karas	9046dfb058	vhost_blk: make sure to_blk_dev() return value is not NULL Assert that return pointer of to_blk_dev() is not NULL, before dereferencing it. Change-Id: I15adeac0926f23f84fdb3af88fc15ac07c580d91 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17536 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2023-05-09 17:58:11 +08:00
Krzysztof Karas	b890bd21cd	nvme_transport: return NULL if transport does not exist spdk_nvme_ctrlr_get_registers() calls nvme_get_transport() to get a reference for a transport, whose registers should be returned, but nvme_get_transport() explicitly returns NULL, if the transport does not exist. This would result in dereferencing a NULL pointer on line 862. To remedy that, if no transport was found, return NULL. Additionally change "THis" to "This" on line 46. Change-Id: I3944925659991e9424e2177b5c940b2e2626d1f4 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17532 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2023-05-09 17:58:11 +08:00
Jim Harris	d333b553f3	nvmf: initialize trid param in get_*_trid paths When removing a listener, for example with nvmf_subsystem_remove_listener RPC, we use the concept of a "listen trid" to determine which existing connections should be disconnected. This listen trid has the trtype, adrfam, traddr and trsvcid defined, but *not* the subnqn. We use the subsystem pointer itself to match the subsystem. nvmf_stop_listen_disconnect_qpairs gets the listen trid for each qpair, compares it to the trid passed by the RPC, and if it matches, then it compares the subsystem pointers and will disconnect the qpair if it matches. The problem is that the spdk_nvmf_qpair_get_listen_trid path does not initialize the subnqn to an empty string, and in this case the caller does not initialize it either. So sometimes the subnqn on the stack used to get the qpair's listen trid ends up with some garbage as the subnqn string, which causes the transport_id_compare to fail, and then the qpair won't get disconnected even if the other trid fields and subsystem pointers match. For the failover.sh test, this means that the qpair doesn't get disconnected, so we never go down the reset path on the initiator side and don't see the "Resetting" strings expected in the log. This similarly impacts the host/timeout.sh test, which is also fixed by this patch. There were multiple failing signatures, all related to remove_listener not working correctly due to this bug. While the get_listen_trid path is the one that caused these bugs, the get_local_trid and get_peer_trid paths have similar problems, so they are similarly fixed in this patch. Fixes issue #2862. Fixes issue #2595. Fixes issue #2865. Fixes issue #2864. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I36eb519cd1f434d50eebf724ecd6dbc2528288c3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17788 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Mike Gerdts <mgerdts@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: <sebastian.brzezinka@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	a0790ea1af	vbdev_lvol: load esnaps via examine_config This introduces an examine_config callback that triggers hotplug of missing esnap devices. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I5ced2ff26bfd393d2df4fd4718700be30eb48063 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16626 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	7c42a93a35	lvol: add spdk_lvol_get_by_* API spdk_lvol_get_by_uuid() allows lookup of lvols by the lvol's uuid. spdk_lvol_get_by_names() allows lookup of lvols by the lvol's lvstore name and lvol name. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Id165a3d17b76e5dde0616091dee5dff8327f44d0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17546 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	b920476b3a	lvol: add spdk_lvol_iter_immediate_clones() Add an iterator that calls a callback for each clone of a snapshot volume. This follows the typical pattern of stopping iteration when the callback returns non-zero. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: If88ad769b72a19ba0993303e89da107db8a6adfc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17545 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	250acc1792	lvol: hotplug of missing esnaps This introduces spdk_lvs_notify_hotplug() to trigger the lvstore to call the appropriate lvstore's esnap_bs_dev_create() callback for each esnap clone lvol that is missing the device identified by esnap_id. Change-Id: I0e2eb26375c62043b0f895197b24d6e056905aa2 Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16428 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	6d71e476ec	lvol: keep track of missing external snapshots If an lvol is opened in degraded mode, keep track of the missing esnap IDs and which lvols need them. A future commit will make use of this information to bring lvols out of degraded mode when their external snapshot device appears. Change-Id: I55c16ad042a73e46e225369bfff2631958a2ed46 Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16427 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	2e1b12b45f	blob: esnap clones are not clones spdk_blob_is_clone() should return true only for normal clones. To detect esnap clones, use spdk_blob_is_esnap_clone(). This also clarifies documentation of spdk_blob_is_esnap_clone() to match the implementation. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I9993ab60c1a097531a46fb6760124a632f6857cd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17544 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	7d9bc09008	blob: add is_degraded() to spdk_blob_bs_dev The health of clones of esnap clones depends on the health of the esnap clone. This allows recursion through a chain of clones so that degraded state propagates up from any back_bs_dev that is degraded. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Iadd879d589f6ce4d0b654945db065d304b0c8357 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17517 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	42e50f66d0	blob: add spdk_blob_is_degraded() In preparation for supporting degraded lvols, spdk_blob_is_degraded() is added. To support this, bs_dev gains an optional is_degraded() callback. spdk_blob_is_degraded() returns false so long as no bs_dev that the blob depends on is degraded. Depended upon bs_devs include the blobstore's device and the blob's back_bs_dev. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ib02227f5735b00038ed30923813e1d5b57deb1ab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17516 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2023-05-09 17:58:11 +08:00
Mike Gerdts	510b723ba9	blob: add spdk_blob_get_esnap_bs_dev() While getting memory domains, vbdev_lvol will need to be able to access the bdev that acts as the lvol's external snapshot. The introduction of spdk_blob_get_esnap_bs_dev() facilitates this access. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I604c957a468392d40b824c3d2afb00cbfe89cd21 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/16429 Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	3a99974701	accel: add method for getting per-channel opcode stats Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic3cc0ddc5907e113b6d9d752c9bff0f526458a11 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17625 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	3570d392ac	accel: collect stats on the number of processed bytes For operations that have differently sized input/output buffers (e.g. compress, decompress), the size of the src buffer is recorded. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I1ee47a2e678ac1b5172ad3d8da6ab548e1aa3631 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17624 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2023-05-09 17:58:11 +08:00
Konrad Sztyber	20d6833849	accel: specify number of events when updating stats Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5b611c8978b581ac504b033e1f335a2e10a9315b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17623 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2023-05-09 17:58:11 +08:00
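
The uninitialized-subnqn failure described in commit d333b553f3 can be illustrated with a small standalone sketch. Everything below is hypothetical: `struct demo_trid`, the `demo_*` functions, and the sample address are simplified stand-ins, not SPDK's actual types or API. Leftover stack contents are simulated deterministically by pre-filling the field with a known string, and the fix is assumed to amount to clearing the output struct before filling it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical, simplified stand-in for a transport ID (not the SPDK struct). */
struct demo_trid {
	char traddr[32];
	char trsvcid[8];
	char subnqn[64];
};

/* Compare every field, including subnqn, as a full trid comparison would. */
static int demo_trid_compare(const struct demo_trid *a, const struct demo_trid *b)
{
	int rc = strcmp(a->traddr, b->traddr);

	if (rc != 0) {
		return rc;
	}
	rc = strcmp(a->trsvcid, b->trsvcid);
	if (rc != 0) {
		return rc;
	}
	return strcmp(a->subnqn, b->subnqn);
}

/* Buggy getter: fills the address fields but leaves subnqn as the caller left it. */
static void demo_get_listen_trid_buggy(struct demo_trid *out)
{
	assert(out != NULL);
	snprintf(out->traddr, sizeof(out->traddr), "%s", "127.0.0.1");
	snprintf(out->trsvcid, sizeof(out->trsvcid), "%s", "4420");
}

/* Fixed getter (assumed shape of the fix): clear the whole struct first. */
static void demo_get_listen_trid_fixed(struct demo_trid *out)
{
	assert(out != NULL);
	memset(out, 0, sizeof(*out));
	snprintf(out->traddr, sizeof(out->traddr), "%s", "127.0.0.1");
	snprintf(out->trsvcid, sizeof(out->trsvcid), "%s", "4420");
}

/* Returns true if the qpair's trid matches the RPC's trid, i.e. it would be disconnected. */
static bool demo_would_disconnect(void (*get_trid)(struct demo_trid *))
{
	struct demo_trid rpc_trid;
	struct demo_trid qpair_trid;

	/* The RPC supplies address fields only; subnqn is the empty string. */
	memset(&rpc_trid, 0, sizeof(rpc_trid));
	snprintf(rpc_trid.traddr, sizeof(rpc_trid.traddr), "%s", "127.0.0.1");
	snprintf(rpc_trid.trsvcid, sizeof(rpc_trid.trsvcid), "%s", "4420");

	/* Simulate leftover stack contents in a well-defined way. */
	memset(&qpair_trid, 0, sizeof(qpair_trid));
	snprintf(qpair_trid.subnqn, sizeof(qpair_trid.subnqn), "%s", "garbage");

	get_trid(&qpair_trid);
	return demo_trid_compare(&rpc_trid, &qpair_trid) == 0;
}
```

With the buggy getter, `demo_would_disconnect()` reports no match because the stale subnqn differs from the empty string. With the fixed getter, the comparison succeeds and the qpair would be disconnected.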


10298 Commits