ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Ben Walker	f2efa8f293	idxd: Keep ops on a singly linked list This does not actually need a doubly linked list. Single is enough. That frees up 4 more bytes in the op for other uses. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I8c2a30de175b42815afd0a3ba3c694aef2f35882 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12258 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-04-28 07:29:51 +00:00
Ben Walker	863200c7d4	idxd: Rearrange logic in spdk_idxd_process_events If we don't find a completion, break out of the loop at the top. This removes a level of indentation but most importantly makes it very clear that we only remove elements from ops_outstanding from the front of the list. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I5e8784a5af5449c14ff7015bc8e6062e6aee6b4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12257 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-04-28 07:29:51 +00:00
Jonas Pfefferle	da744fc9e0	bdev/part: add compare and cmp&write io types The partition bdev checks if the underlying device supports the io type and sends the bdev_io directly down to the bdev. This patch adds missing compare and compare&write io types to the partition bdev. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: Ice7e5c0332ce7e564bad2bb8d7f4bb1d535388c1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12390 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-04-28 07:29:43 +00:00
Jim Harris	0064713871	bdev: more ZERO_BUFFER_SIZE to bdev_internal.h The bdevio test app has some test cases verifying that write zeroes commands are handled correctly, but using knowledge of the ZERO_BUFFER_SIZE that the bdev library uses for splitting larger write zeroes commands. Instead of hardcoding that 1MB value in bdevio.c, have bdevio.c use ZERO_BUFFER_SIZE directly instead. But this requires moving ZERO_BUFFER_SIZE into bdev_internal.h and having bdevio.c include that file. We do this instead of putting ZERO_BUFFER_SIZE in the public API because we don't want users to make any kind of dependencies on this value. While here, also rename the tests that are using this value, so that the test names don't include any reference to the specific size of this bdev-internal zero buffer size. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia29d92a706cb1f86b4c29374dc2a9beccf679208 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12383 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-28 07:29:43 +00:00
Richael Zhuang	9bff828f99	sock: introduce dynamic zerocopy according to data size MSG_ZEROCOPY is not always effective as mentioned in https://www.kernel.org/doc/html/v4.15/networking/msg_zerocopy.html. Currently in spdk, once we enable sendmsg zerocopy, then all data transferred through _sock_flush are sent with zerocopy, and vice versa. Here dynamic zerocopy is introduced to allow data sent with MSG_ZEROCOPY or not according to its size, which can be enabled by setting "enable_dynamic_zerocopy" as true. Test with 16 P4610 NVMe SSD, 2 initiators, target's and initiators' configurations are the same as spdk report: https://ci.spdk.io/download/performance-reports/SPDK_tcp_perf_report_2104.pdf For posix socket, rw_percent=0(randwrite), it has 1.9%~8.3% performance boost tested with target 1~40 cpu cores and qdepth=128,256,512. And it has no obvious influence when read percentage is greater than 50%. For uring socket, rw_percent=0(randwrite), it has 1.8%~7.9% performance boost tested with target 1~40 cpu cores and qdepth=128,256,512. And it still has 1%~7% improvement when read percentage is greater than 50%. The following is part of the detailed data. posix: qdepth=128 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 286.5 298.5 4.19% 307 304.15 -0.93% 4 1042.5 1107 6.19% 1135.5 1136 0.04% 8 1952.5 2058 5.40% 2170.5 2170.5 0.00% 12 2658.5 2879 8.29% 3042 3046 0.13% 16 3247.5 3460.5 6.56% 3793.5 3775 -0.49% 24 4232.5 4459.5 5.36% 4614.5 4756.5 3.08% 32 4810 5095 5.93% 4488 4845 7.95% 40 5306.5 5435 2.42% 4427.5 4902 10.72% qdepth=512 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 275 287 4.36% 294.4 295.45 0.36% 4 979 1041 6.33% 1073 1083.5 0.98% 8 1822.5 1914.5 5.05% 2030.5 2018.5 -0.59% 12 2441 2598.5 6.45% 2808.5 2779.5 -1.03% 16 2920.5 3109.5 6.47% 3455 3411.5 -1.26% 24 3709 3972.5 7.10% 4483.5 4502.5 0.42% 32 4225.5 4532.5 7.27% 4463.5 4733 6.04% 40 4790.5 4884.5 1.96% 4427 4904.5 10.79% uring: qdepth=128 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 270.5 287.5 6.28% 295.75 304.75 3.04% 4 1018.5 1089.5 6.97% 1119.5 1156.5 3.31% 8 1907 2055 7.76% 2127 2211.5 3.97% 12 2614 2801 7.15% 2982.5 3061.5 2.65% 16 3169.5 3420 7.90% 3654.5 3781.5 3.48% 24 4109.5 4414 7.41% 4691.5 4750.5 1.26% 32 4752.5 4908 3.27% 4494 4825.5 7.38% 40 5233.5 5327 1.79% 4374.5 4891 11.81% qdepth=512 rw_percent 0 \| 30 cpu origin thisPatch opt \| origin thisPatch opt 1 259.95 276 6.17% 286.65 294.8 2.84% 4 955 1021 6.91% 1070.5 1100 2.76% 8 1772 1903.5 7.42% 1992.5 2077.5 4.27% 12 2380.5 2543.5 6.85% 2752.5 2860 3.91% 16 2920.5 3099 6.11% 3391.5 3540 4.38% 24 3697 3912 5.82% 4401 4637 5.36% 32 4256.5 4454.5 4.65% 4516 4777 5.78% 40 4707 4968.5 5.56% 4400.5 4933 12.10% Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Change-Id: I730dcf89ed2bf3efe91586421a89045fc11c81f0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12210 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-28 07:29:28 +00:00
Tomasz Zawadzki	eef6af95d1	vhost_blk: separate out generic virtio_blk request processing This patch adds process_virtio_blk_request() that will be called by virtio_blk transports to process incoming requests. Meanwhile vhost_user_blk_request_finish() will be replaced with a callback to the virtio_blk transport to notify of the result. blk_request_finish() should only be called as direct result of process_virtio_blk_request(), usually from it. Some error paths now call vhost_user_blk_request_finish() directly, if vhost_user_process_blk_request() was not called yet. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I0cce22f15b922fe45f30fb659c384b6e836def4c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9537 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-27 08:51:55 +00:00
Tomasz Zawadzki	8c609f29bc	lib/vhost_blk: split up vhost_blk task struct spdk_vhost_blk_task shall contain only fields that are relevant to the generic virtio blk layer. Moved to vhost_internal.h header to allow for broader use. In contrast to spdk_vhost_user_blk_task that is specific to vhost_user transport. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I8438d3e6dbca816855f55ee998a632f50acde045 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12282 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-27 08:51:55 +00:00
Tomasz Zawadzki	57e6a387c2	lib/vhost: remove vhost_user fields from I/O processing logs Later in the series, the spdk_vhost_blk_task will not contain vhost_user specific fields. As such the generic part of processing I/O will not be able to refer to those. References to req_idx are replaced with pointer to the task, meanwhile bvsession reference when queueing I/O is removed. Note that vhost_user fields are accessible in the final callback for the I/O, and will be printed. See blk_request_finish(). Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I100d60968146da778bd6bf4fbcf2a2694d3be6e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12335 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-27 08:51:55 +00:00
Tomasz Zawadzki	a8757f6081	lib/vhost: cache vdev and io channel for bdev io wait Previously the blk_request_queue_io() and blk_request_resubmit() relied on vdev and channel contained in task or bvsession structures. In an effor to make the I/O processing for virtio blk not reliant on vhost_user, this patch caches the vbdev and io channel submited in process_blk_request(). Later in the series, vhost_user structures will be separated out from the spdk_vhost_blk_task. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: If1ea38a77af8fcfee12054f5857a6db2db2093c6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12334 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-27 08:51:55 +00:00
Jim Harris	b2ee0bc180	nvmf: make acwu 0-based based ACWU is a 0's based value, and our intent is to report that our target's ACWU is 1 block. This means we should report ACWU as 0, not 1. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6ad0606be07fd38bc6c2e3a8e4bb78225b3dfadc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12385 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-27 07:36:44 +00:00
John Levon	370df805b2	nvmf/vfio-user: remove unnecessary read barrier The spdk_rmb() in nvmf_vfio_user_poll_group_poll() is unnecessary: we already have a read barrier for SQ tail updates at the per-SQ level, so this doesn't add anything. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I88cddd968f4a949640754526e19cb869d9fb31af Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12381 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-27 07:36:30 +00:00
John Levon	acba80827a	nvmf/vfio-user: reduce read barrier costs There's no need to spdk_rmb() in nvmf_vfio_user_sq_poll() unless we actually found the tail has advanced. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I778835c527409764c3db78459b2aa76420cc0105 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12378 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com>	2022-04-27 07:36:30 +00:00
Alex Michon	f89cf818c0	nvme/pcie: Fix doorbell delay with fuse operations When sending the first part of a fuse command, we set the first_fused_submitted flag so that we don't ring the doorbell immediately. When the second part is sent, we ring the doorbell for both commands. However, this doesn't work well when we use the option to delay ringing the doorbell. We send both parts, then later when we try to ring the doorbell, we don't because of the first_fused_submitted flag from the first command. Replace this mechanism by keeping track of the last submitted fuse. Change-Id: Ia4ac9b3ce9c319ee4c7e42f86eadda93dac85fca Signed-off-by: Alex Michon <amichon@kalrayinc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12182 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-04-27 07:36:20 +00:00
John Levon	fa3986b10e	support live migration of shadow doorbells We need to keep track of the shadow doorbell buffer locations, and make sure to re-initialize on resume. Co-authored-by: Thanos Makatos <thanos.makatos@nutanix.com> Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: If3ba456fb35f6f6199e4ff14cec1aad96775f71a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12237 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com>	2022-04-26 07:47:04 +00:00
Tomasz Zawadzki	c29112b1d7	lib/vhost: accept generic vhost arugments in process_blk_request() process_blk_request() accepted spdk_vhost_blk_session argument which is specific for vhost_user. In preparation to making this function usable outside of vhost_user, replace this field with struct spdk_io_channel and spdk_vhost_dev. For this purpose vhost_user_process_blk_request() was created to translate from spdk_vhost_blk_task to the generic arguments. to_blk_dev() was moved further up so it can be more commonly used throughout the file. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ie61a1ae2a615c4f1a95601e533b9eec51998cd07 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12333 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-25 07:36:30 +00:00
suhua	eb15d158d4	vhost: Add VIRTIO_F_ANY_LAYOUT feature for vhost In the qemu virtualization environment, when the virtio driver of the windows image is upgraded to virtio-win 0.1.185, and virtio uses legacy mode. In the negotiation stage of vhost, qemu's host_features will enable VIRTIO_F_ANY_LAYOUT, and guest_features will also enable this feature, because qemu's feature is a superset, so the feature will be passed to the spdk side through the set feature process, which leads to "VHOST_CONFIG: Processing VHOST_USER_SET_FEATURES failed. VHOST_CONFIG: vhost message handling failed.", so we enable this bit for spdk vhost. Signed-off-by: suhua <suhua1@kingsoft.com> Signed-off-by: lizhaoxin <lizhaoxin1@kingsoft.com> Change-Id: I27323bf5a03dce774c8a74cfb070ddd43be05534 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12300 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-25 07:36:20 +00:00
John Levon	7a31179f4f	nvmf/vfio-user: fix a couple of debug log messages There were a couple of places not using the standard formatting for qid still. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: If96c3f6d762128b0f274e2c4e9eebf4e80e35139 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12346 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-25 07:36:11 +00:00
Konrad Sztyber	3056c8ac02	nvmf/tcp: delay qpair destruction This patch adds an extra spdk_thread_send_msg() call to destroy a qpair to make sure that it isn't freed from the context of a socket write callback. Otherwise, spdk_sock_close() won't abort pending requests, causing their completions to be exected after the qpair is freed. Fixes #2471 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia510d5d754baccca1e444afdb10696ab9b58e28b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12332 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-25 07:36:05 +00:00
Shuhei Matsumoto	494eb6e58b	bdev: Fix race among bdev_reset(), bdev_close(), and bdev_unregister() There is a race condition when a bdev is unregistered while reset is submitted from the upper layer very frequently. spdk_io_device_unregister() may fail because it is called while spdk_for_each_channel() is processed. spdk_io_device_unregister io_device bdev_Nvme0n1 (0x7f4be8053aa1) has 1 for_each calls outstanding To avoid this failure, defer calling spdk_io_device_unregister() until reset completes if reset is in progress when unregistration is ready to do, and then reset completion calls spdk_io_device_unregister() later. A bdev cannot be opened if it is already deleting. So we do not need to hold mutex. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ida1681ba9f3096670ff62274b35bb3e4fd69398a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12222 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-04-22 09:45:14 +00:00
Alexey Marchuk	b0f4249c59	nvme/rdma: Add async set/get registers Now controller initialization with RDMA transport is fully async Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I26e857740d3137d0b0e987facc81fc5f6ef81f2b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10756 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-22 09:44:57 +00:00
Shuhei Matsumoto	dbe7e74cee	nvme: Change nvme_qpair_abort_queued_reqs() to set SC_ABORTED_SQ_DELETION Transport specific qpair_abort_reqs() set SC to SC_ABORTED_SQ_DELETION. However, nvme_qpair_abort_queued_reqs() set SC to SC_ABORTED_BY_REQUEST even if its call is not requested by the upper layer. Change nvme_qpair_abort_queued_reqs() to set SC to SC_ABORTED_SQ_DELETION for consistency. nvme_qpair_abort_queued_reqs() is used to abort queued requests that were sent while adminq was connecting. SC_ABORTED_SQ_DELETION will not be so bad even for the case. This change is required for the NVMe bdev module to be resilient for I/O error. The NVMe bdev module does not retry I/O if SC is SC_ABORTED_BY_REQUEST. SC is set to SC_INTERNAL_DEVICE_ERROR if a request is failed to submit to qpair by a generic qpair layer. We can change it to SC_ABORTED_SQ_DELETION as well but we keep this for now. SC_INTERNAL_DEVICE_ERROR is also retriable for the NVMe bdev module. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I7d8d5e97b222fe9275afc4fed024c1654c9579a2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12121 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-22 09:44:57 +00:00
Andreas Economides	3b047a6162	nvmf/vfio-user: support shadow doorbells As per the NVMe specification, a host can identify two areas of guest memory: one of which is used for the host-written doorbells, and one of which contains event indexes. The host writes to the shadow doorbell area, but also writes to the controller's BAR0 doorbell area if the corresponding event index is crossed by the update. This avoids many mmio exits in interrupt mode, where BAR0 doorbells are not directly mapped into the guest VM, with greatly improved performance. This isn't a useful feature in BAR0 doorbells are mapped into the VM, so we explicitly disable support in that case. NB: the Windows NVMe driver doesn't yet support this feature. Although the specification says that the admin queues should also engage in this behaviour, in practice, no VM does, so have to include some hacks to account for this. Co-authored-by: John Levon <john.levon@nutanix.com> Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I0646b234d31fbbf9a6b85572042c6cdaf8366659 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11492 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-21 08:12:29 +00:00
Ben Walker	7f75e1081a	idxd: Do not allow calls to spdk_idxd_set_config after devices have been probed This can cause a mismatch of kernel vs user driver and isn't allowed. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I9c572ea1fa1da89d7b41e31ab4719eec719fb50a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10588 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-21 08:11:33 +00:00
Ben Walker	85580d47e1	idxd: Remove _idxd_batch_is_valid The only place a batch can be created is by assigning it to the channel now, so this isn't a mistake that can be made and the checks can all be removed. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I915edb4f212c0751396554655ffe95ae3bb20cd6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11538 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-21 08:11:33 +00:00
Ben Walker	b2bdbbac56	idxd: Always store the current batch in chan->batch This effectively means there is only ever a single batch being build at a time, which simplifies a lot of the APIs. Change-Id: Ifd66cd1ce6f6f0abe2011528dd862c5324213658 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11223 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-04-21 08:11:33 +00:00
Ben Walker	de732691a3	idxd: Simplify the kernel mode to only create 1 WQ per device Change-Id: I32e4fe2592c63752f08c326fb9845aa44ef7775b Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11537 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-04-21 08:11:33 +00:00
John Levon	c20e41cd38	nvmf/vfio-user: move map_one() This lets us use it more widely. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I9c67be19020677fab3eafe05c1e0f91c3d04611d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12307 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-20 08:22:21 +00:00
zhangduan	31db7b139b	nvme_tcp: set transport_ack_timeout to ack_timeout The value of ack_timeout is calculated according to the formula 2^(transport_ack_timeout) msec. Signed-off-by: zhangduan <zhangd28@chinatelecom.cn> Change-Id: I5a938635d70693ddd405fa5907555bb745b4df0f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12215 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-20 08:21:42 +00:00
John Levon	48408177b5	lib/nvmf: add a comment on max admin queue size Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I247e95843bd15a341a66f7ab07d9639bea403bd4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12301 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-20 08:21:02 +00:00
Ben Walker	e22c933edb	idxd: Make many internal idxd_user functions take an idxd_user object This reduces a lot of casting. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Ibc04f422858642d0e20c9b020bb6c5d1b70256fe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11534 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-04-20 08:20:45 +00:00
Konrad Sztyber	72925e3db8	nvmf/tcp: delay completion for zcopy reqs w/ in-progress writes When a qpair is disconnected, any outstanding zero-copy requests are freed to release their buffers before the qpair gets destroyed. However, if there is a PDU being sent to the host as part of this request (e.g. C2HData/R2T), we need to wait until that write is done before freeing the request to avoid freeing it twice. Fixes #2445 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I2a6e82f26a4f011dfd18c55c821e9039de7e584a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12255 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-19 11:15:45 +00:00
Konrad Sztyber	75169d0dec	nvmf/tcp: update pdu_in_use flag in write functions This makes the flag indicate whether there's an outstanding PDU write for a given request. Additionally, it reduces the number of places we need to update this flag. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id7e587f84955b096c46bfbf88d4dd222214d4a6a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12254 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-19 11:15:45 +00:00
Konrad Sztyber	c676c0815d	nvmf/tcp: use different callbacks for sending mgmt/req PDUs This will make it possible to have some common handling in request's PDU write completion. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Icaff38da0e47dd93327e3d8f09edd9fdba8f532e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12253 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-19 11:15:45 +00:00
Konrad Sztyber	37dc93b9ef	nvmf/tcp: adjust assert for zcopy req complete When an request using zcopy is completed, it might have an unreleased zcopy_bdev_io attached in three cases: 1) the request was a read, 2) the request was a failed write, 3) the qpair is being disconnected. The last case was missing from the assertion. Fixes #2425 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I5cbeaa198a1fd878c98caf148a0bc47060e35bca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12263 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-19 11:14:56 +00:00
Konrad Sztyber	aa21240574	nvme/pcie: increase min admin queue size to 256 Now that IO qpairs can be created asynchronously, we need to make sure that all the create IO CQ/SQ commands can be executed simultaneously. It is pretty common to create multiple IO qpairs at the same time, e.g. adding an NVMe bdev to an nvmf subsystem will create an IO qpair on each poll group. In that case, if the number of cores exceed the size of the admin queue (actually it can be even lower due to outstanding AERs), we might run out nvme_requests on the admin queue. The chosen minimum value for the admin queue size, 256, should be enough to cover most cases. Fixes #2465 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I55c59aef64f3fdb33f7b4824d3e9beb403602633 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12270 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-19 08:18:34 +00:00
Shuhei Matsumoto	2c13441ba8	nvme_rdma: Destroy qpair after qpair is actually disconnected The RDMA transport can disconnect qpair asynchronously now. Previously, we tried to release the resource of the qpair after disconnected. However it did not work because it was done when deleting the qpair. The admin qpair was not deleted in a ctrlr reset sequence. This patch tries to satisfy the same aim again but by a different way. Previously, we released the resource of the qpair before starting actual disconnection process. This patch release the resource of the qpair after the qpair is actually disconnected. The related patches are: `b9518a5540` `eb09178a59` Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id6a814895a35b1589b781a91744ef872b42aaa69 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11783 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	4b73223542	nvme_rdma: Wait until lingering qpair becomes quiet before completing disconnection The code to handle the lingering qpair when deleting it was really complicated. The RDMA transport can connect or disconnect qpair asynchronously. Then we can include the code to handle the lingering qpair into the code to disconnect qpair now. If the disconnected qpair is still busy, defer completion of the disconnection until qpair becomes idle. If poll group is not used, we can complete disconnection immediately because cq is already destroyed. The related data and unit test cases are not necessary anymore. So delete them in this patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic8f81143fcad0714ac9b7db862313aa8094eeefb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11778 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	20cf90801e	nvme_rdma: Handle stale connection asynchronously Include delayed disconnect/connect retries with finite times into the state machine of asynchronous qpair connnection. We do not need to call back to the common transport layer but we need to do the following, clear rqpair->cq before starting disconnection if qpair uses poll group, and clear qpair->transport_failure_reason after disconnected. Additionally locate the new state STALE_CONN before INITIALIZING because cq is not ready to use for admin qpair when the state is STALE_CONN. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibc779a2b772be9506ffd8226d5f64d6d12102ff2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11690 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	77c4657140	nvme_rdma: Factor out destroying rdma qpair operation Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I18e166a726cca69f13e7c5818eba57f478726286 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11689 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	aa36c18196	nvme_rdma: Pass callback to ctrlr_disconnect_qpair() via a parameter Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I06cbb9739286d1928ad9fc07de3715a449914d75 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11688 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	75d38a301d	nvme: poll_group_process_completions() returns -ENXIO if any qpair failed TCP transport already does it but was not documented clearly. RDMA and PCIe transports follow it and document it clearly. Then we can check each qpair's state if spdk_nvme_poll_group_process_completions() returns -ENXIO before disconnected_qpair_cb() is called. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2afe920cfd06c374251fccc1c205948fb498dd33 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11328 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Shuhei Matsumoto	9717b0c3df	nvme_rdma: Connect and disconnect qpair asynchronously Add three states, INITIALIZING, EXITING, and EXITED to the rqpair state. Add async parameter to nvme_rdma_ctrlr_create_qpair() and set it to opts->async_mode for I/O qpair and true for admin qpair. Replace all nvme_rdma_process_event() calls by nvme_rdma_process_event_start() calls. nvme_rdma_ctrlr_connect_qpair() sets rqpair->state to INITIALIZING when starting to process CM events. nvme_rdma_ctrlr_connect_qpair_poll() calls nvme_rdma_process_event_poll() with ctrlr->ctrlr_lock if qpair is not admin qpair. nvme_rdma_ctrlr_disconnect_qpair() returns if qpair->async is true or qpair->poll_group is not NULL before polling CM events, or polls CM events until completion otherwise. Add comments to clarify why we do like this. nvme_rdma_poll_group_process_completions() does not process submission for any qpair which is still connecting. Change-Id: Ie04c3408785124f2919eaaba7b2bd68f8da452c9 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11442 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-18 18:35:29 +00:00
Jim Harris	ac0c53ae58	env_dpdk: do not set RTE_MEMPOOL_F_NO_IOVA_CONTIG This was added in patch `07526d85`, back in March 2018. This was before DPDK supported dynamic hugepage allocations. Presumably this flag was added to reduce the amount of memory lost due to mempool buffers that would otherwise span an IOVA boundary (mostly typical with IOMMU off and we are relying on physical addresses). Removing it simplifies any code in SPDK that uses mempool buffers for DMA operations, since it doesn't have to worry about splitting buffers that span an IOVA boundary - DPDK has already done it for us. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I49f6c1407fad02acae7e07c9dd00cb0449bd3554 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12277 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-15 08:25:54 +00:00
Tomasz Zawadzki	0368340581	lib/vhost: consolidate successful and invalid request path Both blk_request_finish() and invalid_blk_request() acomplished the same thing, with variation on handled statuses and debug logs. Consolidating those two into single function will help later on when replacing completion of request processing to single callback. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Iae7b93db01bfd98819b2bb8fad9e11afcdb3a459 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12196 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-15 07:49:32 +00:00
Tomasz Zawadzki	4f95fd7be6	lib/vhost_blk: get bdev io_channels via vhost_blk functions This patch adds vhost_blk_[get/put]_io_channel() to be used by virtio_blk transports. Functions related to vhost_user sessions were modified to use it. dummy_io_channel reference is managed at the vhost_blk layer and as such continues to use the spdk_[get/put]_io_channel() APIs. The description is updated to reflect its not specific to vhost_user transport. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6644198da83bfa0210c167e203d3875e96f1e7ea Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11101 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-15 07:49:32 +00:00
Tomasz Zawadzki	223f1f1446	lib/vhost: separate out vhost_user specific json config The vhost_user_config_json() will be replaced with callback to virtio_blk transport. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I6ea0ea38f505f0d354cd34ee5ab9cd3a858bd82e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9538 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-15 07:49:32 +00:00
Tomasz Zawadzki	6f89388ed3	lib/vhost: move vhost_user related fields from spdk_vhost_dev spdk_vhost_dev structure should only contain generic fields that are to be used by either vhost, vhost_blk or vhost_scsi layer. The vhost_user backend can hold its properties in spdk_vhost_user_dev, which is maintained within rte_vhost. Both structures contain references back to each other. The reference in spdk_vhost_dev is a void pointer to allow future transports to keep the reference to their own structures. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I68640c524426d885c20242146365ba242fa9df8e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11813 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-15 07:49:32 +00:00
Or Gerlitz	bfcfdb7903	nvmf/rdma: Use spdk allocation scheme for RDMA requests and receives In a similar manner for what we do for other per IO data-structures of cmds, cpls and bufs, use the conventional huge-pages based spdk allocation scheme for RDMA requests and receives. Change-Id: I4c2e86e928106e78c053f24915e2a9ce1a200c78 Signed-off-by: Or Gerlitz <ogerlitz@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12273 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-15 07:48:23 +00:00
Or Gerlitz	5edb8edca7	nvmf/rdma: use LIFO practice for incoming queue To maximize cache locality, use lifo and not fifo when managing objects which are used per IO such as the RDMA receive elements queue. Change-Id: Id8917558acc1bec29943fcbae6afe6b072bde6ac Reported-by: Jim Harris <james.r.harris@intel.com> Signed-off-by: Or Gerlitz <ogerlitz@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12272 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-15 07:48:23 +00:00
zhangduan	87cfed8442	sock: Add ack_timeout to spdk_sock_opts Due to the same reason as transport_ack_timeout for RDMA transport, TCP transport also needs ack timeout. This timeout in msec will make TCP socket to wait for ack util closes connection. Signed-off-by: zhangduan <zhangd28@chinatelecom.cn> Change-Id: I81c0089ac0d4afe4afdd2f2c7e5bff1790f59199 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12214 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-14 08:34:29 +00:00
paul luse	37b68d7287	accel: cleanup by getting rid of capabilties enum In support of upcoming patches and to greatly simplify things, the capabilites enum which held bit positions for each opcode has been removed. Only the opcodes enum remains and thus only opcodes are used throughout. For the capabiltiies bitmap a helper function is added to convert from opcode to bit position. Right now it is used in the IO path but in upcoming patches that goes away and the conversion is only done at init time. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ic4ad15b9f24ad3675a7bba4831f4e81de9b7bc70 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11949 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-14 08:32:50 +00:00
Shuhei Matsumoto	0d29a988be	bdev: Use spdk_for_each_bdev() for bdev_get_io_stat and get_bdevs RPCs spdk_for_each_bdev() can be applied simply to rpc_bdev_get_bdevs() because the callback function rpc_dump_bdev_info() is synchronous. spdk_for_each_bdev() cannot be applied simply to rpc_bdev_get_iostat(). The factored-out callback function _bdev_get_device_stat() is asynchronous. Add desc pointer to struct spdk_bdev_io_stat and open before and close after executing spdk_bdev_get_device_stat(). Replace spdk_bdev_get_by_name() by spdk_bdev_open_ext() when a bdev name is specified. spdk_bdev_register() checks if the name of each bdev is set and each bdev is opened while collecting its stats now. Hence it is not possible that spdk_bdev_get_name() returns NULL. Simplify rpc_bdev_get_iostat_cb() based on this fact. Furthermore, we want to fail the RPC for all failures. The callback function to spdk_bdev_get_device_stat() is executed after stack unwinding if successful. Defer starting RPC response until it is ensured that all spdk_bdev_get_device_stat() calls will succeed or there is no bdev. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I7b036d6d707c49d19c8922a159b12b5b5ce7ca41 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12089 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-14 08:32:12 +00:00
Ziv Hirsch	e749fa9c27	nvmf: fix buffer overflow on admin commands When req->iovcnt is bigger than 1, `memset(req->data, 0, req->length)` is wrong. Signed-off-by: Ziv Hirsch <zivhirsch13@gmail.com> Change-Id: Ie53eba686b4c5889bbde3b3644d51acbef303b42 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12216 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-14 08:31:35 +00:00
Jim Harris	92f0be87a0	iscsi: use EVP APIs for md5 calculations OpenSSL 3.0 deprecated the MD5_xxx APIs, so switch the md5 code in the iscsi library to use the EVP APIs recommended by OpenSSL instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic5e3cd6e30ebc8b027f0715434cc3be045f1b770 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12240 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-13 08:33:51 +00:00
Jim Harris	7778bc3a33	vhost: copy virtio_blk_outhdr to local struct Some SeaBIOS versions are not aligning virtio_blk_outhdr on 8-byte boundary, causing ubsan failures. To be safe, let's just make sure on our end that we only access a properly aligned structure by copying this small (16-byte) data structure to a local structure variable. Fixes issue #2452. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iacad72c3a1759fb8dc5ba411272a34d93ef2a6fc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12238 Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-13 08:33:51 +00:00
Changpeng Liu	1bac8afd02	nvmf/vfio-user: unlink created files Only FDs are used for passing them to another process, we can unlink them after creation. Here we only unlink the files created in vfio-user, and there is still one file created via libvfio-user, it will be fixed via https://github.com/nutanix/libvfio-user/issues/660. Partly fix issue #2449. Change-Id: Ie27640e0cb85f44596e9d0ad5a2b67adf0419f5c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12195 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-12 13:59:57 +00:00
Changpeng Liu	c47b7b0276	nvme/vfio-user: use API to setup BAR0 doorbells We can use lib/vfio-user API to setup BAR0 doorbells, existing implementation is redundant. Change-Id: Ib880d167c84c6b8482bf1a35559a34c939f6a02d Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12211 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-12 07:24:22 +00:00
Konrad Sztyber	91aee82d74	vmd: use config_bus_number when resetting root ports The config_bus_number is an offset within the config space reserved for the devices behind the VMD, while bus_number refers to the actual bus number assigned by VMD that depend on the VMCAP and VMCONFIG registers. So, to access the mapped config space we have to use config_bus_number. We didn't do that when resetting root ports', which could lead to segfaults if these values were different, as we'd access unmapped memory. Fixes #2451 Change-Id: I4e7bbb81400462284014565099bec98f6171c8c9 Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12208 Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-11 07:44:23 +00:00
Tomasz Zawadzki	f9fccbae63	lib/vhost: separate out vhost_user backend callbacks Previously spdk_vhost_dev_backend held callbacks for vhost_blk and vhost_scsi functionality, along with ones that are called by the vhost_user backend. This patch separates out those callbacks into two structures: - spdk_vhost_dev_backend - to be implemented by vhost_blk and vhost_scsi - spdk_vhost_user_dev_backend - is only implemented by vhost_user backend, callbacks for session managment specific to that transport Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I348090df5dddeb2b1945b082b85aec53d03c781b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11812 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-11 07:44:09 +00:00
Tomasz Zawadzki	c010a71f27	lib/vhost: make packed_ring_recovery per controller Previously g_packed_ring_recovery was set globally. Setting that during controller creation, would affect all previously created controllers. This is now set on per-controller basis and only enabled if packed ring feature is used. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Idcc7231471446c805154648ab835a6af78f6543c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12040 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-11 07:44:09 +00:00
sberbz	fd53562a87	vhost: parse JSON vhost_blk devices specific params Separate parsing generic rpc vhost params form device specific, this solution allow to create various device which share common parameters. Change-Id: I50b1a89a8260fb1394880a750591e95539995288 Signed-off-by: Sebastian Brzezinka <sebastian.brzezinka@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12026 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-11 07:44:09 +00:00
Jim Harris	5d4b553cd9	vmd: change DEBUGLOGs to INFOLOG None of the current DEBUGLOGs are in any kind of performance critical code path. Making them INFOLOG means that we can enable them even in release builds to get additional information if needed. (Note: planning to do this with other libraries and modules as well.) Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I61765fab843a06c36ac1979151589e8f57fea76e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12209 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-11 07:43:30 +00:00
John Levon	32e54c6b16	nvmf/vfio-user: refactor suppressed IRQ handling No functional change; this just makes the poll code a little easier to read. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: If6d1dcd940ed5b461856b535b1bf01c4efa8612a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12076 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-05 11:48:45 +00:00
Shuhei Matsumoto	f41248ffde	bdev: Use spdk_bdev_open_ext() for some simple RPCs Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ie13f3153ea711f64f2099b2e4d37855b79977f82 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12148 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-05 11:48:31 +00:00
Jim Harris	183c348557	nvmf/rdma: issue fused cmds consecutively to tgt layer RDMA READs can cause cmds to be submitted to the target layer in a different order than they were received from the host. Normally this is fine, but not for fused commands. So track fused commands as they reach nvmf_rdma_request_process(). If we find a pair of sequential commands that don't have valid FUSED settings (i.e. NONE/SECOND, FIRST/NONE, FIRST/FIRST), we mark the requests as "fused_failed" and will later fail them just before they would be normally sent to the target layer. When we do find a pair of valid fused commands (FIRST followed by SECOND), we will wait until both are READY_TO_EXECUTE, and then submit them to the target layer consecutively. This fixes issue #2428 for RDMA transport. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I01ebf90e17761499fb6601456811f442dc2a2950 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12018 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-05 08:32:06 +00:00
Shuhei Matsumoto	428b17a0a8	bdev: Add spdk_for_each_bdev/bdev_leaf for clean up and further improvements To execute a callback function for each registered bdev or unclaimed bdev, add new public APIs, spdk_for_each_bdev() and spdk_for_each_bdev_leaf(). These functions are safe for race conditions by opening before and closing after executing the provided callback function. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I59b702ffec7b4fc5e9779de5a3a75d44922b829b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12088 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-05 07:30:47 +00:00
Shuhei Matsumoto	941ca7e09e	bdev: Factor out bdev close operation from spdk_bdev_close() Bdev open/close will be done for each bdev when traversing the bdev list. This patch is a preparation. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2486bd823953fe020ed6106844877e1cf49d8a0d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12126 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-05 07:30:47 +00:00
Shuhei Matsumoto	b4bcf7721d	bdev: bdev_close() unlock g_bdev_mgr.mutex after spdk_io_device_unregister() spdk_io_device_unregister() send message to call its callback. So to make the following patches easier, consolidate g_bdev_mgr.mutex unlocks to the end of spdk_bdev_close(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib3b5c72be06e764918da30d7aa9fbc2ccd33956e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12125 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-05 07:30:47 +00:00
Shuhei Matsumoto	ced08048ee	bdev: Factor out descriptor allocation from spdk_bdev_open_ext() Bdev open/close will be done for each bdev when traversing the bdev list. This patch is a preparation. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4e4fe6f1248176631a74c09585c931b21eb49d2b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12124 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-05 07:30:47 +00:00
Alexey Marchuk	3185d3c92f	bdev: Report memory domains in bdev_get_bdevs RPC This change will simplify development/debugging. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ibde374089057a0684391f6519fa4e878d007408d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11049 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-04 09:57:56 +00:00
Alexey Marchuk	d7ac3d92e4	bdev/part: Use ext bdev API in IO path That will allow to pass ext opts to bdev layer Since in ext API metadata is passed as part of ext IO opts structure and ext opts can be NULL (e.g. upper layed used regular API), in that case we use *blocks_with_md API Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I1bfb3fcb11bf42e100ecc7e4058087f12086db3a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11048 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-04 09:57:56 +00:00
Alexey Marchuk	1299439f3d	bdev: pull/push data if bdev doesn't support memory domains If bdev doesn't support any memory domain then allocate internal bounce buffer, pull data for write operation before IO submission, push data to memory domain once IO completes for read operation. Update test tool, add simple pull/push functions implementation. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie9b94463e6a818bcd606fbb898fb0d6e0b5d5027 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10069 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-04-04 09:57:56 +00:00
Shuhei Matsumoto	96c007d301	bdev: Add spdk_bdev_unregister_by_name() to handle race condtions To unregister a bdev more correctly, we had to call spdk_bdev_open_ext(), spdk_bdev_desc_get_bdev(), spdk_bdev_unregister(), and then spdk_bdev_close(). This was correct but complicated. Hence add a new public API spdk_bdev_unregister_by_name() which does the whole correct sequence of bdev unregistration. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I9068d4ac49dca944436e0ba587308fd356dfef75 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12065 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-04 09:57:43 +00:00
Tomasz Zawadzki	6301f8915d	lib/sock: provide a hint to picking optimal poll group The process of matching qpair to poll group is split into two distinct parts that occur on different threads. See spdk_nvmf_tgt_new_qpair(). This results in a race condition for TCP between spdk_sock_map_lookup() and spdk_sock_map_insert(), which are called in spdk_nvmf_get_optimal_poll_group() and spdk_nvmf_poll_group_add() respectively. Fixes #2113 This patch picks a hint from nvmf_tcp for next poll group, which is then passed down to spdk_sock_map_lookup(). When matching placement_id exists, but does not have a poll group assigned - the hint will be used. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I4abde2bc9c39225c9f5dd7c3654fa2639bb0a27f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10271 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-04-01 12:41:26 +00:00
Chunsong Feng	0db0c443df	nvmf/rdma: Improve read performance in DIF strip mode The rdma buffer for stripping DIF metadata is added. CPU strips the DIF metadata and copies it to the rdma buffer, improving the rdma write bandwith. The network bandwidth during 4KB random read test is increased from 79 Gbps to 99 Gbps, the IOPS is increased from 2075K to 2637K. Fixes issue #2418 Signed-off-by: Chunsong Feng <fengchunsong@huawei.com> Change-Id: If1c31256f0390f31d396812fa33cd650bf52b336 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11861 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-01 11:19:18 +00:00
paul luse	75209b1d53	lib/idxd: fail init if IOMMU is not enabled Currently the idxd driver requires VFIO so avoid unexpected errors if someone tries without it (with UIO). Temp workaround for issue #2316 Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I430cd2193bc8dbd6939af7d0ca799832e7a73213 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11816 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-04-01 08:30:46 +00:00
Kozlowski Mateusz	adc36d5be3	lib/vhost: Fix ENOMEM resubmission for vhost_blk In the current behavior the iovcnt is lowered before sending it to the next BDEV in the stack - however if the returned value is ENOMEM (due to eg. not enough bdev requests in the pool), the request needs to be returned to its original state, as it would be resubmitted with skipped iov entries. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I7240510a2ec04594b248f7347e86ac11ecfd26a0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11976 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-01 08:30:28 +00:00
Chunsong Feng	05dd3c0bb2	dif: enhance copy API to support block-aligned bounce_iov When iovs are copied from bounce or to bounce, the bounce is usually alloced from data_buf_pool for better performance, and is multi iovs instead of a single buffer. Therefore, block-aligned bounce are supported. Signed-off-by: Chunsong Feng <fengchunsong@huawei.com> Change-Id: If56b21d9e46c73d4c956c227bec33ddd0ab9745b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11860 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:29:12 +00:00
Shuhei Matsumoto	e48475b776	nvmf/rdma: Move get length with DIF from parse_sgl() to fill_iovs() This is another small code cleanup. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I49ed19d025c96c87be3b7782536fd98570bd2569 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11966 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: fengchunsong <fengchunsong@huawei.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:29:12 +00:00
Shuhei Matsumoto	9db2571d32	nvmf/rdma: Split fill_wr_sgl() between DIF is enabled or not Extract the code for DIF from nvmf_rdma_fill_wr_sgl() into nvmf_rdma_fill_wr_sgl_with_dif(). Then clean up nvmf_rdma_request_fill_iovs() and nvmf_rdma_request_fill_iovs_multi_sgl(). Additionally, this patch has a bug fix. nvmf_rdma_fill_wr_sgl_with_dif() returned false if spdk_rdma_get_translation() failed. However, the type of return value of nvmf_rdma_fill_wr_sgl_with_dif() is not bool but int. The boolean false is 0 in integer. Hence in this case, nvmf_rdma_fill_wr_sgl_with_dif() returned 0 even if it failed. Change nvmf_rdma_fill_wr_sgl_with_dif() to return rc as is if spdk_rdma_get_translation() returns non-zero rc. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I71cc186458bfe8863964ab68e2d014c495312cd3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11965 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: fengchunsong <fengchunsong@huawei.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:29:12 +00:00
John Levon	45b55f6012	nvmf/vfio-user: regularize debug messages To help grep, use a standard sqid:%d style format for identifying queue IDs. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ib82c81939f85f9beb333a4db10d006524522a1d9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11822 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com>	2022-04-01 08:28:55 +00:00
John Levon	f49b1724ba	nvmf/vfio-user: move io_q_exists() As a general utility function, move it up with the others. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I32881c01afd9819c889730d7c09163c95fbb827e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11790 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:55 +00:00
John Levon	172ea8381a	nvmf/vfio-user: track doorbell pointers per queue For each queue, track its doorbell location individually, rather than needlessly recalculating it every time we look up the doorbell value. This will also greatly simplify shadow doorbell support. Co-authored-by: Andreas Economides <andreas.economides@nutanix.com> Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I6882d2f92ee2f2b2b90c54ee14e5f6b41ecca85d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11789 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:55 +00:00
Shuhei Matsumoto	0a61427ecc	nvme_rdma: Start qpair after resolving address and route when poll group is used Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0b0f314c98368247582f2dfcaf69f78e24d715f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11366 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	531c1b0f04	nvme_rdma: Make nvme_rdma_process_event() asynchronous Separate nvme_rdma_process_event() into nvme_rdma_process_event_start() and nvme_rdma_process_event_poll(). Use nvme_rdma_process_event_start() and nvme_rdma_process_event_poll() in nvme_rdma_process_event() to ensure compatibility. Change-Id: Idc960fab2540efec612dcf22f156acabd2e2874e Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10594 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	791ee7deb4	nvme_rdma: nvme_rdma_process_events() returns negated errno It will be convenient for the following patches to return negated errno directly. Change-Id: Ic80181b2ee449946dd60ad0c97a325fd48b92231 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10990 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	cf7f253302	nvme_rdma: Add callback to nvme_rdma_process_event() Change-Id: I66aa89dc54d5aaedbe2f06239cbf04aeeb2c739e Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11359 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	bcf0845727	nvme_rdma: Make CM event operations callback functions Change-Id: I9f2551a07187400dd9ef624348cd465e64557e1b Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11138 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	e5927c02e9	nvme_rdma: Remove cm_channel param from process_event() nvme_rdma_poll_events() gets the cm_channel pointer itself. Before calling nvme_rdma_process_event(), we checks the rctrlr is valid. Hence we do not have to pass the cm_channel pointer to nvme_rdma_process_event() via a parameter. This simplifies the code and makes the following patches a little easier. Change-Id: I03f095833469c5b64592264d63a592106d49e13b Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11167 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:45 +00:00
Shuhei Matsumoto	29974dc882	nvme_rdma: Make fabric_qpair_connect() asynchronous Replace nvme_fabric_qpair_connect() by nvme_fabric_qpair_connect_async() and nvme_fabric_qpair_connect_poll(). The following is a detail. Define state of the nvme_rdma_qpair and each rqpair holds it. Initialize rqpair->state by INVALID at nvme_rdma_ctrlr_create_qpair(). _nvme_rdma_ctrlr_connect_qpair() sets rqpair->state to FABRIC_CONNECT_SEND instead of calling nvme_fabric_qpair_connect(). Then the new function nvme_rdma_ctrlr_connect_qpair_poll() calls nvme_fabric_qpair_connect_async() at FABRIC_CONNECT_SEND and nvme_fabric_qpair_connect_poll() until it returns 0 at FABRIC_CONNECT_POLL. nvme_rdma_qpair_process_completions() or nvme_rdma_poll_group_process_completions() calls nvme_rdma_ctrlr_connect_qpair_poll() if qpair->state is CONECTING. This patter follows the TCP transport. Change-Id: I411f4fa8071cb5ea27581f3820eba9b02c731e4c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11334 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-04-01 08:28:45 +00:00
Tomasz Zawadzki	1ca04a1d7a	lib/sock: refactor allocation of sock_map entry At this time only spdk_sock_map_insert() allocates entries for the sock_map. Next patch will introduce it to lookup too. So now this part is refactored out to separate function. This patch should not introduce any functional change. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I46ac88aedebffe0cbc1f4616dc1fcfaf7f950b05 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10726 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-04-01 08:28:25 +00:00
Tomasz Zawadzki	91f2725291	lib/sock: fix lookup on placement_id with NULL sock_group spdk_sock_map_insert() allows for allocating a sock_map entry, without assigning any sock_group. This is useful for cases where placement_id determined by the component using spdk_sock_map_*. See PLACEMENT_MARK mode. Placement_id's are allocated first, then an empty one is found using spdk_sock_map_find_free(). Since the above is a valid use case, then entry in sock_map can exist without a group assigned. spdk_sock_map_lookup() has to handle such cases, rather than trigger an assert. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: Ia717c38fef5e71fe44471ea12f61a5548463f0cf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10725 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: wanghailiang <hailiangx.e.wang@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-04-01 08:28:25 +00:00
paul luse	b9d44da07d	lib/idxd: Further simplify WQ configuration code As we now only support a single WQ, there's no need for a teble of them and no need to assert that the stride from WQ to WQ is the same as the WQ struct size. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I205f36aae22070f532653726dd75249bbafbe3ef Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12081 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-03-31 17:59:21 +00:00
Evgeniy Kochetov	a2d4ddb3b1	nvme: Prioritize user provided trstring for transport lookup This patch fixes the issue with custom nvme transport. It is possible to register custom nvme transport with arbitrary name but it is not usable because 'spdk_nvme_trid_populate_transport' call in probe function will always set trstring to 'CUSTOM' and transport lookup will fail. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I83fd24dd8732ac0a21e22435e0acff20ab0e7521 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9557 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-31 10:31:20 +00:00
paul luse	e68aebd50b	lib/accel: remove public API for getting capabilities First in a series of patches that will enable multiple engines to exist at once and choose the best one based on their priorities and capabilites, the public API will no longer be needed. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ia87b83aa2263745a94a822a160b6e97bb2e0dc19 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11948 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-31 09:36:25 +00:00
GangCao	fb1e12491c	lib/vhost: update the error message Fix issue #2441 Change-Id: Iabd773c4bfd4769cd21c3ed7f8a53e8690b0a35f Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12087 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-31 09:35:40 +00:00
Alexey Marchuk	42f59f5006	lib/reduce: Copy user's buffers if SGL is not supported In the compression operation we may have SGL input if user's buffer is fragmented or less than chunk_size. If the backing device doesn't support SGL input then we should copy user's buffers into decomp_buffer (including paddings if any). In the decompression operation, if the backing device doesn't support SGL output, we use a single output buffer which is pointing to decomp_buffer. Once the operation completes, we should copy the result into user's buffers. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ic7fddd38374bb6898256633eacd192dbaf36541a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11970 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-03-31 09:34:52 +00:00
Jim Harris	c81c10c529	nvmf/tcp: issue fused cmds consecutively to target layer R2Ts can cause cmds to be submitted to the target layer in a different order than they were received from the host. Normally this is fine, but not for fused commands. So track fused commands as they reach nvmf_tcp_req_process(). If we find a pair of sequential commands that don't have valid FUSED settings (i.e. NONE/SECOND, FIRST/NONE, FIRST/FIRST), we mark the requests as "fused_failed" and will later fail them just before they would be normally sent to the target layer. When we do find a pair of valid fused commands (FIRST followed by SECOND), we will wait until both are READY_TO_EXECUTE, and then submit them to the target layer consecutively. This fixes issue #2428 for TCP transport. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8a9e13690ecb16429df68ae41b16b439a0913e4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12017 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-30 08:02:20 +00:00
Konrad Sztyber	fa649869b9	bdev: add timeout option to bdev_get_bdevs RPC This opption allows the bdev_get_bdevs RPC to block until a bdev with specified name appears. It can be useful, when a bdev is created asynchronously and the exact moment at which it appears is not known. For instance, with a discovery service, a bdev is created when a namespace on a remote NVMeoF target is added, but it's not possible to specify when that happens exactly. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6c1f974fba445376ca9d45aac2639202547410cc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11960 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-03-30 08:02:08 +00:00
Jim Harris	7039639063	nvme: read full discovery page after reading header Some targets report they support log page offset, but then fail GET_LOG_PAGE commands that specify a non-zero offset, or report the wrong number of discovery entries when reading more than the discovery log page header but not the entire log page. So just revert to reading the entire discovery log page, after we've read the header to know how big the log page will be. This means that when we read the log page initially (without the individual entries), we need to save off the genctr, since it will get overwritten when we read the log page again. We can just store this in the discovery context, and compare it to the genctr that we read with the whole log page. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I34929253312fed9924db58904a051f3979283730 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11478 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-03-28 17:10:04 +00:00

1 2 3 4 5 ...

9336 Commits