ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
John Levon	dda78a882f	nvmf/vfio-user: fix _free_ctrlr() In _free_ctrlr(), ->endpoint can never be NULL, and the code was self-contradictory; assume it's not NULL. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I81a449123ca05f64460380dc3a8ad8af2143d166 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15831 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-12-12 09:26:34 +00:00
John Levon	05edb4d69b	nvmf/vfio-user: correct log message Use standard "sqid" naming for a log message. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Icca8415cd17272ca7bd82667721c4131dd1df7f1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15828 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-12-12 09:26:34 +00:00
Konrad Sztyber	0db7a0dc7f	vhost: add (set\|get)_coalescing to virtio_blk transport This fixes the behavior of spdk_vhost_(set\|get)_coalescing() on non-vhost-user devices. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia17cd4c0ed4bad262090e05f83727c1516c21f92 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15772 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-12-12 09:26:22 +00:00
Konrad Sztyber	25d55f48c1	vhost: add (set\|get)_coalescing to backend interface The current code for setting/getting coalescing setting only works with vhost-user devices, while users can create virtio-blk devices with non-vhost-user transport. Calling spdk_vhost_(set\|get)coalescing() on such device results in a segfault. So, spdk_vhost_dev_backend interface is extended with methods to set / get coalescing parameters. In the following patch, the virtio_blk interface will be also extended with similar callbacks allowing us to pipe coalescing settings to the appropriate transport. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ide5d5f633b17dcdbedb4b7804d5e45bf41373eca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15771 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-12-12 09:26:22 +00:00
Konrad Sztyber	a64acd100c	nvmf: return error on invalid req length for copy commands Both the length of a request and the number of ranges to copy are controlled by the user, so we should check them and return an error instead of asserting that they're correct. This fixes the `test/nvmf/target/fabrics_fuzz.sh` test. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I3481c4bb1f2c7676df81f41dfc95ef063924222e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15805 Reviewed-by: Pawel Piatek <pawelx.piatek@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-12-09 08:16:50 +00:00
Michal Berger	3f912cf0e9	misc: Fix spelling mistakes Found with misspell-fixer. Signed-off-by: Michal Berger <michal.berger@intel.com> Change-Id: If062df0189d92e4fb2da3f055fb981909780dc04 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15207 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-09 08:16:18 +00:00
Mike Gerdts	9d06166f5b	nvme: annotate and log existing deprecation Use the deprecation API to annotate and log the deprecation of spdk_nvme_ctrlr_prepare_for_reset() using the tag "nvme_ctrlr_prepare_for_reset". Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I98fd840aa9acc028a49bb47daf4ab7e88f1eb818 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15756 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-12-08 12:59:32 +00:00
Jim Harris	af8d147328	iscsi: only define srandomdev when arc4random not available srandomdev is only used to emulate arc4random, so only bother defining it on Linux when it's needed. This avoids unused errors on newer distros packaging glibc versions that now defined arc4random. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6e64a697d9633709cedd0198f75cf094d514562d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15814 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-12-08 12:56:30 +00:00
John Kariuki	1d4628efc9	lib/idxd: change max idxd completions processed This patch fixes issue # 2809, by changing the max completions processed per poll. A new parameter called IDXD_MAX_COMPLETIONS is used to set maximum completions processed per poll to 128 because we observed performance degradation on a system with 16 NVMe SSDs at a queue depth of 64 per SSD. When using DSA to compute the data digest, the target application can issue upto 1024(16x64) request to compute data digest concurrently to DSA. Limiting the maximum completions processed per poll to 32 using DESC_PER_BATCH cause up to 43% IOPS degradation. Use IDXD_MAX_COMPLETIONS to control the number of completions proccessed per poll in spdk_idxd_process_event based on your workload. For example, if your application is issuing 1000s of concurrent request to DSA you might want to set IDXD_MAX_COMPLETIONS to a value higher than 128. Change-Id: I2a1db993283a83a20266f40dac851728d63e6127 Signed-off-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15801 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-08 12:55:58 +00:00
paul luse	19e2dc3853	configure: rename --with-reduce --with-vbdev-compress This is in prep for adding a new compressDev accel_fw module that will contain all of the DPDK compressDev specifics on it, the vbdev will make calls to the accel_fw instead. As the accel_fw has SW based compression, we want the configure option to apply to building the vbdev module but not the accel_sw software implementation or the upcoming compressdev module. Renamed to "compress" as reduce is a term specific to the vbdev implementation of the compression to be provided by the accel_fw and thus the same reason why we leave the test flag called REDUCE because it's controlling tests for the reduce library as well as the vbdev module that is using reduce. The flag does not apply to the SW implementation of compression. This does not affect upcoming accel_fw compressdev module, that will have its own configure option. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: If8ed3e48e1e3dabcaad1cd161289e78122cd9d58 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15179 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:55:27 +00:00
paul luse	0b7138e97f	lib/idxd: use physical address for IAA aecs table Per specification. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ic93349c7d3ed50fa6e502e39db0347141804d4c4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-12-08 12:55:27 +00:00
Mike Gerdts	6580f654fc	lvol: remove unused lvs->destruct While lvs->destruct is set in a few places, it is never read. Since it is not used, it is removed. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Iee21e92c9049d143fca13930b4b5f328f9ec38f0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15716 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:55:07 +00:00
Evgeniy Kochetov	b7bfa50468	blob: Use bdev copy command in CoW flow if supported Copy-on-write happens when cluster is written for the first time for thin provisioned volume. Currently it is implemented as two separate requests to underlying bdev: read of the whole cluster to bounce buffer and then write of this buffer to the new location on the same underlying bdev. This patch improves copy-on-write flow by utilizing copy command of underlying bdev if it is supported. In this case we have just one request to bdev and don't need the bounce buffer. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I92552e0f18f7a41820d589e7bb1e86160c69183f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14351 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:54 +00:00
Evgeniy Kochetov	9e843fdbd1	blob: Add translate_lba operation New `translate_lba` operation allows to translate blob lba to lba on the underlying bdev. It recurses down the whole chain of bs_dev's. The operation may fail to do the translation when blob lba is not backed by the real bdev. For example, when we eventually hit zeroes device in the chain. This operation is used in the next commit to get source LBA for copy operation. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I89c2d03d1982d66b9137a3a3653a98c361984fab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14528 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:54 +00:00
Shuhei Matsumoto	1c57fa1a95	nvme_rdma: Rename poll_group_set_cq() by qpair_set_poller() In the following patches, nvme_rdma_poll_group_set_cq() will touch not only CQ but also SRQ and receive WR objects. All these resources are of a poller. Hence for clarification, rename nvme_rdma_poll_group_set_cq() by nvme_rdma_qpair_set_poller(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic59ba5a45833e39b1b2647c000c8b953f1031d6b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14910 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	e22dcc075a	nvme_rdma: Factor out reset failed sends/recvs operation Factor out reset failed recvs operation into a helper function nvme_rdma_reset_failed_recvs(). This will make the following patches simpler. For send operation, this change is not required yet, but in future we may support something like shared SQ. Hence, we do this change for send operation too. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ib44acebe63e97e5a60ea6fa701b49278c7f44b45 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14171 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	4cef00cbbf	nvme_rdma: Merge alloc_ and register_reqs/rsps into create_reqs/rsps functions In the following patches, poll group will have rsps objects and to share the code between poll group and qpair, option for creation will be used. As a preparation, merge nvme_rdma_alloc_rsps() and nvme_rdma_register_rsps() into nvme_rdma_create_rsps(). For consistency, merge nvme_rdma_alloc_reqs() and nvme_rdma_register_reqs() into nvme_rdma_create_reqs(). Update unit tests accordingly. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I92ec9e642043da601b38b890089eaa96c3ad870a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14170 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	8e48517f96	nvme_rdma: Defer send/recv objects allocation until connection is established When SRQ is supported, recv objects will be allocated by poll group and qpair will associated and use them. In this case, we do not want qpair to allocate and free recv objects. When connection is established, it will be decided if SRQ is used or not. Hence, defer recv objects allocation until connection is established. Send objects are not affected directly by SRQ, but nvme_rdma_register_reqs() no longer does any registration and deferring send objects allocation makes the code more consistent. Hence, defer send objects allocation until connection is established too. Even after this patch, we rely on nvme_rdma_ctrlr_delete_io_qpair() to free resources completely. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ic151fad01009d92a7fc809a730e6e9dff1a365f3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14169 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	6602291766	nvme_rdma: Move submit_recvs() from register_rsps() to connect_established() Response objects will be in poll group when SRQ is enabled. But we want to share the code to allocate and register response objects between SRQ is enabled or disabled. To do it cleanly, move nvme_rdma_qpair_submit_recvs() from nvme_rdma_register_rsps() to nvme_rdma_connect_established(). A few clean up of error handling are done in this patch. Unregistration will be done when qpair is disconnected. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I38dc5a6cb84a6bf56c01d5fb7f2cf3d3b63918e0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14168 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	cd640f6275	nvme_rdma: Inline qpair_queue_send/recv_wr() This will make the following patches simpler. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id3d7c025525b35c1c2b96027430789a8d8f2697b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14422 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	6275f8445f	nvme_rdma: Inline post_recv() Inline nvme_rdma_post_recv() into the callers. We do not have any similar helper function for posting send WR. This will make the following patches simpler and will be reasonable. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ia95a4b350942d20bdb65e84f7575c2dcf67c149b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14421 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	ecd9234d4d	nvme_rdma: Extract conditional submit_sends/recvs from queue_send/recv_wr Extract and inline the conditional nvme_rdma_qpair_submit_sends() and nvme_rdma_qpair_submit_recvs() calls. This will cralify the logic and make the following patches simpler. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibe217c6f4fb2880af1add8c0429f92b4de107da8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14420 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	851a8dfe42	nvme_rdma: rdma_req caches rdma_rsp and rdma_rsp caches recv_wr When SRQ is supported, rsp array will be in either qpair or poller. To make this difference transparent, rdma_req caches rdma_rsp and rdma_rsp caches recv_wr directly instead of caching indecies. Additionally, do a very small clean up together. spdk_rdma_get_translation() gets a translation for a single entry of a rsps array. It is more intuitive to use rsp. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I61c9d6981227dc69d3e306cf51e08ea1318fac4b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13602 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	cce990607b	nvme_rdma: Factor out send/recv completion from cq_process_completions() Factor out processing recv completion and send completion into helper functions to make the following patches simpler. Additionally, invert if condition to check if both send and recv are completed to make the following patches simpler. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Idcd951adc7b42594e33e195e82122f6fe55bc4aa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14419 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:40 +00:00
Shuhei Matsumoto	d7ad7bca3c	bdev: Add mode to bdev_reset_iostat RPC to reset only max/min fields Both max and min should be reset periodically. We can use the queue depth sampling poller to reset these but the queue depth sampling poller is optional. We extend the bdev_reset_iostat RPC to support mode to reset all or only max/min fields. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I9ce54892f6e808f6a82754b6930092f3a16d51ff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15444 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	15040628ec	bdev: Add min/max_latency_read/write/unmap_ticks into I/O statistics Add max/min_read/write/unmap_latency_ticks into the struct spdk_bdev_io_stat. When initializing or resetting the instance of the struct spdk_bdev_io_stat, initialize max to 0 and min to UINT64_MAX. Then update max if a new value is larger than the current max, and update min if a new value is smaller than the current min. For the bdev_get_iostat RPC, it prints max and prints min if min is not UINT64_MAX or 0 if min is UINT64_MAX. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I1b30b3825c15e37e9f0cf20104b866186de788a2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14825 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	cf4e8664bb	bdev: Add bdev_reset_iostat RPC Add a helper function bdev_reset_device_stat() to reset I/O statistics. This funciton is used for the bdev_reset_iostat RPC. We do not have any plan to use bdev_reset_device_stat() outside lib/bdev. Hence, we do not add this as a public API. Then, add a new RPC bdev_reset_iostat to reset I/O statistics of a single bdev or all bdevs. Resetting I/O statistics affects all consumers. Add a note to CHANGELOG and doc/jsonrpc.md. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I97af09107b5c3ad1f9c19bf3cbf027457c4fbae7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15350 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	319d1cbb4e	bdev: Store bdev_io data into local variables to update I/O statistics Hold not only io_stat pointer but also num_blocks and blocklen in local variables. This will shorten and simplify bdev_io_update_io_stat(), and improve readability and changeability. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I527b72538a169a1faafd32863ff539306a8763a9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15732 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	8985382b96	bdev: Factor out I/O trace update at completion into a helper function The following patches will add max/min latencies and more optional counters. This factorization will improve the readability. In addition to factorization, add spdk_likely to check if completed successfully or not. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I57581ece2b73d486aa138f8d26a5afaf6953a322 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15480 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	fab3558f2e	bdev: Change name and parameter order of function to dump I/O statistics For consistency, rename a JSON dump function by bdev_io_stat_dump_json() and change the parameter order. Other public APIs and function pointers in the generic bdev layer, spdk_bdev_dump_info_json(), spdk_bdev_fn_table::dump_info_json, and spdk_bdev_fn_table::write_config_json have a json_write_ctx pointer as the last parameter. For consistency, swap a statistics pointer and a json_write_ctx pointer. This is another preparation to extend I/O statistics to include error counters and module specific counters to output these via the bdev_get_iostat RPC. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I6f3bb6f2752f7da856d4fe66c0f1f8a2eedc176b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15731 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	5d269efe96	bdev: Move helper function to dump I/O statistics into bdev.c Move a JSON dump functionbdev_get_iostat_dump() for I/O statistics into lib/bdev/bdev.c. The next patch will rename the function and change the parameter order. This is another preparation to extend I/O statistics to include error counters and module specific counters to output these via the bdev_get_iostat RPC. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I6a90d15fcbaa2e2a250167754135623bc9e7f362 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14837 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	24eab32532	bdev: Add helper functions to allocate/free/get spdk_bdev_io_stat Add helper functions, bdev_io_stat_alloc(), bdev_io_stat_free(), and bdev_io_stat_get() for struct spdk_bdev_io_stat. Then replace a bdev_io_stat_add() call by bdev_io_stat_get() at spdk_bdev_get_device_stat() because the saved data is queried first. This is another preparation to extend I/O statistics to include error counters and module specific counters to output these via the bdev_get_iostat RPC. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I9547757421a1de1b8cb44e0f8ade4b5c2bcad4e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15443 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	571638b9b9	bdev: Alloc spdk_bdev_io_stat dynamically for spdk_bdev The following patches will extend I/O statistics to include error counters and module specific counters to output these via the bdev_get_iostat RPC. In this case, the size of the struct spdk_bdev_iostat will be variable. As a preparation, allocate spdk_bdev_io_stat dynamically. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I1979a9d867859d5cb5d05717bfcc677f07fa03f8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15479 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	e84bc517c3	bdev: Alloc spdk_bdev_io_stat dynamically for spdk_bdev_channel The following patches will extend I/O statistics to include error counters and module specific counters to output these via the bdev_get_iostat RPC. In this case, the size of the struct spdk_bdev_iostat will be variable. As a preparation, allocate spdk_bdev_io_stat dynamically. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I50b57f792b451cf748ea8eb0611fe65d693d5a14 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15478 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-08 12:54:23 +00:00
Shuhei Matsumoto	04786a73c3	bdev: Alloc spdk_bdev_io_stat dynamically for bdev_get_iostat_ctx The following patches will extend I/O statistics to include error counters and module specific counters to output these via the bdev_get_iostat RPC. In this case, the size of the struct spdk_bdev_iostat will be variable. As a preparation, allocate spdk_bdev_io_stat dynamically. For the per_channel mode, we can share the bdev_ctx->stat because spdk_bdev_get_io_stat() always overwrites stat. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I51cd550f52dc3b7d0f3f825fd48bcbeb3ecdcff2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14836 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-12-08 12:54:23 +00:00
Mike Gerdts	5b50d3e8b7	log: add deprecated tracking API When use of deprecated featues is encountered, SPDK now calls SPDK_LOG_DEPRECATED(). This logs the use of deprecated functionality in a consistent way, making it easy to add further instrumentation to catch code paths that trigger deprecated behavior. Change-Id: Idfd33ade171307e5e8235a7aa0d969dc5d93e33d Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15689 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-12-07 17:45:53 +00:00
lizengwu	93967961c8	iscsi: fix the abnormal connection exit the mobj is allocating from pdu_data_out_pool, if pdu_data_out_pool is exhausted, when the pdu is polled next time, because data_buf_len is modified, iscsi_pdu_payload_read return -1, and the connection will be released. Signed-off-by: lizengwu <786436671@qq.com> Change-Id: I3ee65472f7ddaa357d7952a5b734540f0bc0b216 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15626 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-07 08:48:28 +00:00
Changpeng Liu	8c6de5ebfd	lib/vhost: move `registered` flag to vhost-user device Previously we use this flag to avoid to call `vhost_dev_unregister` twice in `subsystem_fini`, but DPDK vhost library will check it, we don't need this flag actually, but there is one race condition between adding a new connection and unregistering the socket file in different threads, so here we just move it to vhost-user device as the first patch, and then use this flag in coming patch. Change-Id: I658712dd20331a2e2eb5f4758bf76f748036a131 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15482 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-12-07 00:49:35 +00:00
Changpeng Liu	558638003a	lib/vhost_scsi: remove unnecessary checks `vhost_user_dev_unregister` will check if the device is busy, so we don't need to check `user_dev->pending_async_op_num` here. For `vdev->registered`, with this check here, we can remove a device even it didn't have a valid QEMU connection, and since vhost-scsi supports hotplug feature, we don't need to check this flag either when it have a valid QEMU connection. Change-Id: I50cdeb5ca544e2ed93a1bc99ec3da8787a9e5df5 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15481 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Feng Li <lifeng1519@gmail.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-12-07 00:49:35 +00:00
Mike Gerdts	6e140e3544	bdev: enforce documented lock requirements Replace comments saying that particular locks must be held with assertions that enforce that those locks are held. Remove the comments so that there is no chance of comments and code getting out of sync in the future. This also fixes a caller of bdev_close() that did not hold a required lock. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I3a540f1ad9b9826f925c523986334aa8fcd302f2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15440 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-12-06 21:20:17 +00:00
Mike Gerdts	0dc6aac101	bdev: use SPDK spinlocks Transition from pthread spinlocks to SPDK spinlocks for improved error checking. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I7877c3a4601d7d5cf03e632df493974f97782272 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15439 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-12-06 21:20:17 +00:00
Mike Gerdts	0f73e7664d	thread: test SPDK spinlocks in an application This exercises the parts of spdk_spin_*() that are difficult to test in unit tests. In particular, it tests multiple SPDK threads running on different pthreads contending for a lock and it tests pollers and messages going off CPU with a lock held. Change-Id: I5cd6ce29c92c44ba63f47332fe339e59eed81553 Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15534 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-12-06 21:20:17 +00:00
Mike Gerdts	cd2bcf1061	thread: SPDK spinlocks This introduces an enhanced spinlock that adds safeguards compared to the default pthread_spinlock_t. In particular: - A pthread_spinlock_t is still used, but additional error checking is performed to ensure there is no undefined behavior on relock, unlocking when not the owner, or destoying a locked lock. - The SPDK concurrency model allows an SPDK thread to be migrated between pthreads. Releasing a pthread spinlock on a different thread from where it is taken is undefined behavior. If an SPDK spinlock is held at a time that a time when a poller or message returns control to thread_poll(), the program will abort. - SPDK spinlocks can only be obtained from an SPDK thread. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I6dd6493ab5f5532ae69e20654546405a507eb594 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15277 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2022-12-06 21:20:17 +00:00
Ben Walker	73b02ffdc3	nvme: In nvme_tcp_qpair_process_completions, do not call nvme_tcp_read_pdu in a loop nvme_tcp_read_pdu itself has a loop in it that runs until no more data is available, so the extra loop does nothing. Change-Id: I1471018e396c43187d1f06bd18ce8a6846a71c94 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15139 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-12-05 22:52:20 +00:00
Konrad Sztyber	9e647c1f46	bdev: disallow get_buf() calls from other threads This is unsafe, because we touch need_buf_* queues, which aren't thread-safe. Also, documented this requirement in spdk_bdev_io_get_buf()'s description. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Iabc141e051c543fdd51f079ae212f69e980d8148 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15668 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-05 09:51:26 +00:00
Xinrui Mao	cd4ac9c792	lib/trace: add trace_get_info RPC Add rpc method trace_get_info to show name of shared memory file, list of the available trace point groups and mask of the available trace points for each group. Fixes #2747 Signed-off-by: Xinrui Mao <xinrui.mao@intel.com> Change-Id: I2098283bed454dc46644fd2ca1b9568ab2aea81b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15426 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-12-05 09:50:38 +00:00
wanghailiangx	c680e3a05b	lib/map file: Optimized some indentation formats Change-Id: I071ecc0422f8fd5b889927c249e8cb6484489cd3 Signed-off-by: Hailiang Wang <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14053 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-12-05 09:43:30 +00:00
Konrad Sztyber	35156582a7	nvme/tcp: add an errlog when sock_flush fails Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic14a1ff1120272a3afc86971b9670c10ef66523f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15643 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-01 12:49:04 +00:00
Konrad Sztyber	0cae873b78	sock: set errno in spdk_sock_flush() All the other spdk_sock_* functions return -1 and set errno appropriately, so we should do the same in flush(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I51cda2c51974c72e82531f06fa31ab89b2329c91 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15642 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-01 12:49:04 +00:00
Konrad Sztyber	3bc7e8f091	nvmf/tcp: print more details when sock_writev fails Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I2e9f1d0819bff43156e0847149d91cbfa79eb1cd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15641 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-12-01 12:49:04 +00:00
Jim Harris	30c8b17f1f	nvmf/rdma: account for unassociated qpairs when picking pg If a lot of qpairs are connected all at once, the RDMA optimal_poll_group logic does not work correctly, because it only accounts for qpairs that received their CONNECT capsule. Now that we have a counter for a poll group's unassociated qpairs, use that value to supplement the current io qpair count. We can just assume for now that all of these unassociated qpairs are io qpairs. That won't always be true, but for purposes of picking the optimal poll group it is sufficient. Note that for RDMA, we could increment the counters based on the RDMA qpair ID in the private data in the rdmacm connect, but to keep the code simpler and common across all transports, we defer the accounting until after receiving the CONNECT command, so that it is the same for all transports. Fixes issue #2800. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5897d6ebac23d3b78b100e3fef5a7f9fb5304820 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15695 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-12-01 10:57:29 +00:00
Jim Harris	30020c2ffc	nvmf/rdma: simplify get_optimal_poll_group logic Use a local variable to hold the qpair count. While here, also use pg_current to get the min_value, this is a bit simpler to read than things like (*pg)->group. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I65771fb469f021e9e77b8a6c117841b8f4b66af5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15694 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-12-01 10:57:29 +00:00
Jim Harris	bb926e803d	nvmf: make poll groups count unassociated qpairs We make decisions on how to pick a poll group for a new qpair by looking at each poll group's current_io_qpairs count. But this count isn't always accurate since it doesn't get updated until after the CONNECT has been received. This means that if we accept a bunch of connections all at once, they may all get assigned the same poll group, because the target poll groups counter doesn't get immediately incremented. So add a new counter, current_unassociated_qpairs, to account for these qpairs. We protect this counter with a lock, since the accept thread will increment the counter, and the poll group thread will decrement it when the qpair receives the CONNECT allowing us to associated with a subsystem/controller.. If the qpair gets destroyed before the CONNECT is received, we can use the qpair->connect_received flag to decrement current_unassociated_qpairs. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8bba8da2abfe225b3b9f981cd71b6f49e2b87391 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15693 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-12-01 10:57:29 +00:00
Jim Harris	f3e197ff18	nvmf: add qpair->connect_received Currently we use qpair->ctrlr at qpair destroy time to decide if we need to decrement the qpair's poll group's qpair count. But this is not correct - these counters get incremented when the CONNECT is received, but qpair->ctrlr doesn't get set until later. So add a new connect_received bool to the spdk_nvmf_qpair. Use this instead to determine when we should decrement the poll group qpair counters. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I174a0fda36c4558171953bf58f2f5117bc074f76 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15692 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-12-01 10:57:29 +00:00
John Levon	478c0fa852	lib/nvmf: don't report invalid identify controller CNS At least recent Linux guest VMs send SPDK_NVME_IDENTIFY_CTRLR_IOCS as a matter of course. While this isn't supported in lib/nvmf, as this doesn't represent an error, reduce the log level of the error message so we don't spam the logs. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I095de3e4331b3912cbc457da6d722b9883ec7884 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15646 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-30 08:51:00 +00:00
GangCao	cebb63a7a7	lib/virtio: add the ctx NULL check before dereferencing it Issue is found in the virtio_pci_scsi_dev_create() whose error path is setting the vdev->ctx to NULL before the destruct operation. Change-Id: I4ab0fbe300f7413ad4503833088856aa3f4c0734 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15676 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-30 08:50:48 +00:00
Artur Paszkiewicz	fed1f52b9e	nvmef: don't set optimal I/O boundary if write_unit_size != 1 Optimal I/O boundary causes I/O to be split in the nvme driver. This is a problem for writes if write_unit_size > 1 because the split I/O may not match the write_unit_size. Fixes: #2791 Change-Id: I437e6cb6d8e2415658d5b46539feeacb5363fd46 Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15627 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-11-30 08:50:29 +00:00
Evgeniy Kochetov	8305e49b07	nvmf: Add copy command support NVMf target reports copy command support if all bdevs in the subsystem support copy IO type. Maximum copy size is reported for each namespace independently in namespace identify data. For now we support just one source range. Note, that command support in the controller is initialized once on controller create. If another namespace which doesn't support copy command is added to the subsystem later, it will not be reflected in the controller data structure and will not be communicated to the initiator. Attempt to execute copy command on such namespace will fail. This issue is not specific to copy command and applies also to write zeroes and unmap (dataset management) commands. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I5f06564eb43d66d2852bf7eeda8b17830c53c9bc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14350 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-30 08:50:06 +00:00
Thanos Makatos	6be6e9f298	nvmf/vfio-user: drop thread from struct nvmf_vfio_user_cq The correct SPDK thread is already contained in the poll group. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I4eefe2ba60c77c01a866a693bccbb8affc8262ed Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15546 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-30 08:47:31 +00:00
Thanos Makatos	79abd0f034	nvmf/vfio-user: use define instead of hardcoded value Change-Id: Ia24ba290da3476d452974bfe08e2e93ae44f954e Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15544 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-11-30 08:47:31 +00:00
Thanos Makatos	954b145ba1	nvmf/vfio-user: add poll group stats This patch adds some basic stats for nvmf/vfio-user poll groups. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: Ifd9621a8dd4f5f89713582ee5c7b408ff49f43bb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15390 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-30 08:47:31 +00:00
melon.masou	565a44628d	iscsi: fix segfault when r2t Fixes #2781 This patch fixes two issue causing segfault on r2t: 1. pdu buffer is allocated from immediate_data_pool, but data_buf_len is set as data_out_pool 2. task->desired_data_transfer_length is rewrite by iscsi_send_r2t, which causes a wrong calculated pdu->data_buf_len Signed-off-by: melon.masou <melon.masou@outlook.com> Change-Id: I151859afff7104f29ad7f0ec57a8479d88b742bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15542 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-11-29 17:21:18 +00:00
GangCao	99a43e75ed	lib/sock: use_after_free of the group_impl point Change-Id: I9d19e469b4c84b09de5a3938238687f7650452ef Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15619 Reviewed-by: wanghailiang <hailiangx.e.wang@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-11-29 08:30:29 +00:00
Richael Zhuang	f192c11bbf	bdev: support to get histogram per channel Added new API 'spdk_bdev_histogram_get_channel' to get histogram of a specified channel for a bdev. A callback function is passed to it to process the histogram. Change-Id: If5d56cbb5fe6c39cda7882f887dcc9c6afa769ac Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15539 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-11-29 08:28:57 +00:00
wanghailiangx	0da97a15cc	lib/bdev: print num_blocks and the write_unit_size in SPDK_ERRLOG Print out the specific values in this SPDK_ERRLOG, this can help to find where the error is. Change-Id: I2a38aa2d4270e0bbf554ddb348a73d40967d1b16 Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15618 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-11-28 09:46:12 +00:00
GangCao	c85df53551	lib/virtio: handle double free of virtio_dev device Change-Id: I76a3f9125d05aa6ca0c31e8220036cf853a24619 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15617 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-11-25 08:14:25 +00:00
Ben Walker	85478eccc9	thread: Fix error handling in spdk_interrupt_register If the calloc failed, the fd was left in the fd_group. Change-Id: Ie68426a13d342756c20315656f0309440fda6e02 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15475 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-11-24 10:08:31 +00:00
Mike Gerdts	8dbaca1300	bdev: use spinlock instead of mutex SPDK threads generally run on dedicated cores and locks should be rarely contended. Thus, putting a thread to sleep while waiting on a mutex does not free up CPU cycles for other pthreads or processes. Even when running in interrupt mode, lock contention should be low enough that spinlocks are a net win by avoiding context switches. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I6e2e78b2835bbadb56bbec34918d998d75280dfd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15438 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-11-24 10:08:17 +00:00
Jim Harris	2be196c609	nvme/pcie: validate that mptr is iova contiguous Also add unit tests that explicitly test this condition. They fail without the nvme driver changes in this patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaa369be341eb4eba394f248990e56dce001d3940 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15579 Reviewed-by: Mariusz Barczak <mariusz.barczak@intel.com> Reviewed-by: Wojciech Malikowski <wojciech.malikowski@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-23 08:23:15 +00:00
Jim Harris	1d2700d4c1	event: check that all non-app threads have exited at shutdown For now, just print a loud warning when this case is violated. We will add a hard assertion and cause the app to exit with error status in a later release. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic9226f76a4729820f13a2728bea977b6a54f48ee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15513 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-23 08:22:04 +00:00
Jim Harris	8203e68e24	thread: add spdk_thread_is_running() This function can be useful to query if a thread had spdk_thread_exit() called on it yet. Internally we have both EXITING and EXITED state - so !spdk_thread_is_running() can be used to detect a thread that is in either of those states. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2f6fb024a6b1bc895fdc5132c722abc10f5d30f9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15512 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-23 08:22:04 +00:00
Jim Harris	98ceddb47c	rocksdb: remove spdk_thread This was an accidental remnant from the original check-in, when we did not have a clear differentiation between the event and thread libraries. The rocksdb plugin code will send events to an lcore - not an SPDK thread. But originally the two were combined though an API called spdk_allocate_thread. Once the differentiation was clearly made, we moved to using spdk_event_allocate() to send events to a specific lcore, but never removed the spdk_thread. So now let's just remove the spdk_thread_create since it is not needed. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5c6a3c304b7b4183eee90038367fdea7ebd7280f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15504 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-23 08:22:04 +00:00
Jim Harris	0d3b54825e	subsystem: assert all subsystems initialized on app thread This requires creating and setting SPDK threads in the subsystem unit tests as well. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I31acfb1d7e418f011acc9b48933032d8bf8a1c53 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15511 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-23 08:22:04 +00:00
Jim Harris	327d1c988d	vhost: defer vhost_dev_unregister until scsi tgts removed Currently when a vhost-scsi controller is removed, it calls spdk_vhost_scsi_dev_remove_tgt on all remaining targets, and then immediately calls vhost_dev_unregister. But this path goes into vhost_user_dev_unregister which immediately returns with error if there are any pending async operations - and there are since scsi_dev_remove_tgt is asynchronous. So instead add the vhost_dev_unregister call to remove_scsi_tgt, so that the unregister only happens after the last ref goes away. This requires changing vhost_fini() to no longer assume that spdk_vhost_dev_remove() will immediately unregister the device, since it now happens asynchronously. Previously vhost_fini() was making this assumption erroneously - it would call g_fini_cb without actually checking that the devices had been unregistered. Because of that incorrect assumption, we need to do both the vhost and vhost-scsi changes in the same patch. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9577901266975447f9acfe53475221113f02fea3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15510 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-11-23 08:22:04 +00:00
Jim Harris	85d70c03c5	thread: don't move to EXITED if there are pending messages At end of spdk_thread_poll(), if thread is in EXITING state, we call thread_exit() to see if the thread can move to EXITED state. If there are any pollers, io_channels or pending device unregistrations in progress, thread_exit() will keep the thread in EXITING mode for this iteration. But a thread may post messages to itself during this cleanup process, so thread_exit() should also check if there are any messages on its queue. Found during testing of spdk_thread lifetime patch set. rbd bdev module will send messages to itself like this during cleanup. Without this change, rbd module testing with bdevperf would cause an spdk_thread to move to EXITED state prematurely. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie611026a67b7fa48640ae83be03e29a9c64883a2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15533 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-23 08:22:04 +00:00
Jim Harris	b35aceb8cf	iscsi: unregister login_timer when destroying connection If a connection is established and we receive a bad PDU before successful login, the login_timer would not get unregistered. So ensure the login_timer is always unregistered in _iscsi_conn_destruct(). Found with Calsoft tests during new spdk_thread_exit() assertion testing. Lack of unregistration would result in its associated spdk_thread being unable to exit cleanly due to the unexpired timer. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I79d427512f7829ad76bf89155e0e14c7bce3a7d7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15499 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-11-23 08:22:04 +00:00
Jim Harris	090b8af12b	thread: add spdk_thread_get_app_thread The "app thread" will always be the first thread created using spdk_thread_create(). There are many operations throughout SPDK that implicitly expect to happen in the context of this app thread, so by formalizing it we can start to make assertions on this to help clarify and simplify locking and synchronization through the code base. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7133b58c311710f1d132ee5f09500ffeb4168b15 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15497 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-11-23 08:22:04 +00:00
Jim Harris	db18916f29	thread: move _free_thread() earlier in file Next patch will add a new caller to this function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54374c0af3a4a0fdcc5ac9ca25e2c7ef03e99829 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15576 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-11-23 08:22:04 +00:00
Changpeng Liu	b45556e2b2	include/bdev_module.h: add `SPDK_` prefix to macros `BDEV_IO_NUM_CHILD_IOV` and `BDEV_RESET_IO_DRAIN_RECOMMENDED_VALUE` are public macro definitions without `SPDK_` prefix, so we add the `SPDK_` prefix to them. Change-Id: I4be86459f0b6ba3a4636a2c8130b2f12757ea2da Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15425 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-11-22 10:03:57 +00:00
yupeng	c0c333e2ed	bdev: provide all available bdevs when loop bdevs The bdev hot remove might be an async process. The bdev_open will return an error during the hot remove process. If someone invoke the bdev_get_bdevs API when a bdev is in the middle of a hot remove process, the spdk_for_each_bdev function will stop its loop when a bdev_open return an error. Thus the bdev_get_bdevs will only return partual bdevs or even return an empty list if the hot remove bdev is the first bdev in the loop. When spdk_for_each_bdev and spdk_for_each_bdev_leaf loop for each bdevs, if a bdev returns an error, we skip that bdev instead of stop the whole loop. Signed-off-by: Peng Yu <yupeng0921@gmail.com> Change-Id: Ib35b817e23e47569fc5762a883b4ff8e322ae173 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15322 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Mike Gerdts <mgerdts@nvidia.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-22 10:03:48 +00:00
Thanos Makatos	70f185ea51	json: add spdk_json_write_named_double Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Suggested-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2439cd739240fb2d95c5cdaccc557ba9a8f6501b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15490 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-22 10:01:43 +00:00
Thanos Makatos	4475295e15	nvmf/vfio-user: add some unlikely on the hot path Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: Ib7977f34fc2fc312f0a502405dcd1b5266a22d3f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15430 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-22 10:01:43 +00:00
Thanos Makatos	6b71006dfe	nvmf/vfio-user: refactor nvmf_vfio_user_prop_req_rsp Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: Id6b0a4bc12aa8799fdb1ce1b286c308c9a79083b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15389 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-11-22 10:01:43 +00:00
Thanos Makatos	82b2c1923f	nvmf/vfio-user: refactor duplicate code Change-Id: If501002e9ed110f77a4ece9f026ecfc4e53dee27 Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15388 Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-22 10:01:43 +00:00
Thanos Makatos	a7885283b3	nvmf/vfio-user: delete CQ on vfio-user client disconnect If the guest performs a hard shutdown we're not deleting the CQs: nvmf_vfio_user_close_qpair calls delete_sq_done, which won't delete the CQ because vu_ctrlr->reset_shn is false. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I383fb985340a0d9d0eb7fea7403372cbdc55a089 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15387 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-11-22 10:01:43 +00:00
Thanos Makatos	e398dcdadb	nvmf/vfio-user: don't use uninitialized refcount for admin CQ Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I16d511ac10b8ba4dfb2f7a7e5c144e2f2fe1bad5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15386 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-11-22 10:01:43 +00:00
Thanos Makatos	25440c3bdb	nvmf/vfio-user: don't blindly drain poll group eventfd This eventfd may be passed by libvfio-user to the remote process which might remove the EFD_NONBLOCK flag, in which case we would block indefinitely. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: If9826cd700b4a7b3458a0a8278a96322d99ac08e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15385 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: John Levon <levon@movementarian.org> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-11-22 10:01:43 +00:00
Thanos Makatos	7f23638550	util: add function spdk_fd_group_get_epoll_event This patch introduces function spdk_fd_group_get_epoll_event, which returns the epoll(7) event that caused the file descriptor group callback function to execute. Rather than changing the signature of spdk_fd_fn in order to pass the struct epoll_event, which would result in a gigantic patch where there vast majority of users would simply have to ignore the new argument, we introduce this new API that allows to return the epoll_event only when really needed. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Suggested-by: John Levon <john.levon@nutanix.com> Change-Id: I3debe1382d1c2bfec6ae4fea274ee38ed0b135fe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14935 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Levon <levon@movementarian.org>	2022-11-22 10:01:43 +00:00
Kozlowski Mateusz	304f0802d1	lib/ftl: Fix segfault in recovery path of unmap The ftl_md_get_buffer_size returns the buffer size in bytes, so we should divide by the block size, instead of this smaller value. Risks touching bad memory during dirty shutdown recovery, especially in >16TiB drives. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Mariusz Barczak <mariusz.barczak@intel.com> Change-Id: I4095b00a79a1bdbce5046dc46349a9670e41b18e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15259 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-22 10:01:30 +00:00
Kozlowski Mateusz	6a26cb6053	lib/ftl: Fix findings of static code analysis A metadata region without mirror should have the INVALID enum set, otherwise it risks touching invalid parts of the array. The sb_shm_md not being set to NULL could cause the code to touch this freed pointer in the error path in ftl_md_create -> ftl_md_create_shm -> ftl_md_invalidate_shm calls. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Mariusz Barczak <mariusz.barczak@intel.com> Change-Id: I7fe9694dad535de5f6b2a4af27400fa125480605 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15258 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-22 10:01:30 +00:00
Kozlowski Mateusz	646b851e75	lib/ftl: Update FTL IO activity statistics Bumping the IO activity statistics during relocation, compaction, L2P cache processing and user IO handling. This makes sure poller busy counter is more accurate. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Iabf8ec7ca41c01d7a00d3a70825b8d5283ab2bf1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15257 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-11-22 10:01:30 +00:00
Kozlowski Mateusz	a7f4a2db7f	lib/ftl: Validate l2p_dram_limit parameter Disallow 0 value as parameter - avoids a segmentation fault. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Mateusz Brenk <mateusz.brenk@intel.com> Change-Id: I492256ff621da3be11239d2fd705d8cc54bfe7b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15256 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-11-22 10:01:30 +00:00
Konrad Sztyber	72a6cd5381	nvme: execute hotplug monitor even if hotplug_fd < 0 NVMe controllers can be marked as removed even if we cannot receive uevents (e.g. by the VMD driver), so we should process them regardless of hotplug_fd. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Iaaf13a136929200e824f7a6dd3b5584998801630 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15547 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-21 16:15:44 +00:00
Konrad Sztyber	0a672ea974	rpc: print device type in framework_get_pci_devices Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I2d3825ffcce098909745ba949cdde3eb7f71c703 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15545 Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-21 16:15:44 +00:00
Konrad Sztyber	806c100595	rpc: extend bdf buffer in framework_get_pci_devices The previous 14B buffer was too small for VMD devices. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ib3984f7104fadbb2fbf7ec56932675d73eda1456 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15532 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-11-21 16:15:44 +00:00
Konrad Sztyber	86ba16c39c	build: compile API functions with missing deps We should always build all function that are part of the API, even if some of the libraries they depend on are missing. In that case, they can return an error instead. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I72b450b3a1d62e222bd843e45be547d926414775 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15414 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-18 08:40:05 +00:00
Krzysztof Karas	a1c7ae2d3f	bdev: remove generation of UUIDs for bdevs that do not provide one Remove automatic generation of UUIDs for bdevs that do not provide this value themselves. This is to clarify whether this field can be depended upon. Modified match files to reflect change in UUID generation. Disabled nullglob shell option, as it deletes empty arrays during word splitting. Bdevs with no aliases would instead of "[]", have nullpointer printed, which makes resulting JSON invalid. Part of enhancement proposed in #2516. Change-Id: Ic1d5f8f8d001ae1a219e876aef2a19b1ff0b2f2c Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15150 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-11-18 08:38:13 +00:00
wanghailiangx	f6a256c013	lib/accel: set RPC accel_get_opc_assignments as SPDK_RPC_RUNTIME Add the processing of returning 0 for spdk_accel_get_opc_module_name(), and remove SPDK_RPC_STARTUP, because this will cause core dumped when run nvmf_tgt with --wait-for-rpc and no RPC framework_start_init. Fixes issue: 2770 Change-Id: I1c53ccb8caa52f2eaa0b8b560a021bded49d8fed Signed-off-by: wanghailiangx <hailiangx.e.wang@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15377 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-17 08:57:53 +00:00
Shuhei Matsumoto	2356d1d6f3	bdev: Add helper functions to allocate/free bdev_get_iostat_ctx Add helper functions, bdev_iostat_ctx_alloc() and bdev_iostat_ctx_free() for the bdev_get_iostat RPC. The following patches will allocate spdk_bdev_io_stat dynamically for bdev_get_iostat_ctx. This is a preparation for that. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ib71d6fb92d8134d2282507e62874f19045b630b7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15442 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-17 08:56:49 +00:00
Shuhei Matsumoto	7c687dfcbd	bdev: Clarify bdev_ctx and rpc_ctx for bdev_get_iostat RPC The bdev_get_iostat RPC uses two types of contexts, one to manage the progress of the bdev_get_iostat RPC and another to call spdk_bdev_get_device_stat(). However, this was hard to find from the source code. To make us easier to find this, rename the former by rpc_ctx and the latter by bdev_ctx. Then rename related functions and variables accordingly. Furthermore, relocate request and decoder declaration to improve readability. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3472c87fe4ec1f5981a49ef79148534fbb1d46c4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15349 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-17 08:56:49 +00:00
Shuhei Matsumoto	038fb90350	bdev: Move down RPC parameters and decoders for bdev_get_iostat RPC RPC parameters and decoders for the bdev_get_iostat RPC are used only by rpc_bdev_get_iostat(). Locating RPC parameters and decoders close to rpc_bdev_get_iostat() clarifies it. Furthermore, this will simplify code review for the next patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I1b1b428e3eb3bb4422e490c5f4324f0e40f9710f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-17 08:56:49 +00:00
Shuhei Matsumoto	0ac95a684b	bdev: Consolidate two TRACE_BDEV_IO_START calls into a single call For I/Os controlled by QoS, TRACE_BDEV_IO_DONE is collected after redirecting to the original thread. Hence, TRACE_BDEV_IO_START should be collected on the original thread too. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I15411be823450ee5ddaa7582509a7aa068476fc5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14824 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-17 08:56:49 +00:00
Jim Harris	8dc878483d	env_dpdk: allow 2211.c file to build against older DPDK The 2211 implementation only gets used when runtime detects the DPDK version is DPDK 22.11. But we still compile this file even if it gets built against an older DPDK. This is typically fine, except there are some interrupt APIs that changed in DPDK 21.11, so older DPDKs don't have some of the functions used in this file. We need to use ifdefs to allow this to compile. We will need some more work to handle this case properly, but this patch at least fixes the 2211.c case for now. We will probably need a 2108.c file that exactly matches the 2207.c file except for this interrupt API changes. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I6055694ccbb79845798e750ebb7127ec6c160e2e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15236 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michal Berger <michal.berger@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-11-15 08:31:28 +00:00
Michael Piszczek	1473d3b8c2	env_dpdk: fix check for AMD iommu Update code for read the virtual address width to use glob to locate the Intel and AMD iommu capability registers. This code should work for all AMD numa configurations. Fixes issue 2730 Signed-off-by: Michael Piszczek <mpiszczek@ddn.com> Change-Id: Ibf5789087b7e372d892b53101e4c0231809053f0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14961 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-11-15 08:31:13 +00:00
John Levon	0d0de8e7d9	lib/rpc: add RPC allow list Add an optional allowlist for RPC methods: if the method is not listed, it is not allowed to be called or visible. This can be used to restrict accidental mis-configurations, and generally helps locking down the configuration surface. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ied78fc4b14b60cb94ed0852b92deb6df545cbec4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15275 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-15 08:31:02 +00:00
John Levon	1139cb1415	lib/util: add strarray utility functions Add some basic utilities for handling arrays of strings. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I2333f3e4605175b1717a7f289847ff2d48745e8d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15274 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-15 08:31:02 +00:00
paul luse	a6dbe3721e	update Intel copyright notices per Intel policy to include file commit date using git cmd below. The policy does not apply to non-Intel (C) notices. git log --follow -C90% --format=%ad --date default <file> \| tail -1 and then pull just the 4 digit year from the result. Intel copyrights were not added to files where Intel either had no contribution ot the contribution lacked substance (ie license header updates, formatting changes, etc). Contribution date used "--follow -C95%" to get the most accurate date. Note that several files in this patch didn't end the license/(c) block with a blank comment line so these were added as the vast majority of files do have this last blank line. Simply there for consistency. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Id5b7ce4f658fe87132f14139ead58d6e285c04d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15192 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-11-10 08:28:53 +00:00
Krzysztof Karas	344249069d	event: add runtime cpu lock configuration Allow CPU core locks to be enabled and disabled during runtime. This feature will be useful in cases like SPDK hot upgrade, where locking should be disabled temporarily. Change-Id: I9bc7292fd964abffc7214d074d191f38b13583c3 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15031 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-11-09 08:18:32 +00:00
Krzysztof Karas	0af934b38c	event: add CPU lock files When running SPDK application on a given set of CPU cores, create lock files for each of them. This wil prevent user misconfiguration and assigning a core to more than one SPDK instance. The introduced mechanism is based on device locks implemented in spdk_pci_device_claim() function. Add a command line option to disable lock files. This feature will be useful in cases where differing CPU cores is impossible (eg. setup with only one core available). The patch also fixes all existing cases of overlapping core masks. Change-Id: Ie9aacb7523a3597b9aa20f2c3fa9efe4db92c44c Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14919 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-11-09 08:18:32 +00:00
Konrad Sztyber	cff39ee7d5	nvme: add missing \n in ctrlr init fail log Additionally, print the string representation of the ctrlr state, as it makes debugging init failures much easier. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I572ef3d6f7d5bbd52039a8872733578c92be4c4a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15305 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-11-08 08:20:26 +00:00
Richael Zhuang	cabbb25d5d	bdev: add API to get submit tsc of a bdev I/O Add API spdk_bdev_io_get_submit_tsc to get submit tsc of a bdev I/O, which can be used in bdev modules to avoid calling expensive spdk_get_ticks(). Change-Id: Ifbcecb1bc663344997c5e73b72a1dfb5d0422946 Signed-off-by: Richael Zhuang <richael.zhuang@arm.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14989 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-04 10:15:46 +00:00
Denis Nagorny	c273513401	nvme/rdma: Allows to use PCI Express Relaxed Ordering This fix allows to use relaxed ordering feature where it is supported. libibversb checks with the driver if relaxed ordering access flag is supported and ignores it if not. Experiments show that set by default it doesn't spoil performance but allows to reach desired one on AMD EPYC systems. For example fio read test (ConnectX-6, AMD EPYC 7763, two jobs, queue depth 32, block size 32K) can starve down to 6-7 GiB/s without it. Enabling this option allows to get bandwidth more than 21 GiB/s. Change-Id: I5983aed5d1f38ee7bec9c310597731c9a6a329da Signed-off-by: Denis Nagorny <denisn@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14885 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-11-04 10:15:31 +00:00
Thanos Makatos	b8fc75c36e	nvmf/vfio-user: ensure BAR5 isn't 0 Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I60a39c8a311879b7d6c7c82df0abd7a69f9a2778 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14933 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-04 10:10:33 +00:00
Thanos Makatos	bad452d25e	nvmf/vfio-user: calculate doorbells based on number of queue pairs It doesn't make sense to have the size of the doorbells fixed and then calculate the maximum number of queue pairs based on it, do it the other way round. Also, add some sanity checks based on the spec. Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com> Change-Id: I17e3509fb0a011128ca089ce78b7a296262e6f8e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14932 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-11-04 10:10:33 +00:00
Alexey Marchuk	0fec09fc50	bdev/part: Call bdev_with_md even if md is NULL The bdev_with_md APIs now allow to pass NULL md pointer, so calling this function without checking for metadata simplifies code Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: I364a646630bd36120231ea87a41fea05df51befb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15090 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-11-03 14:54:41 +00:00
Shuhei Matsumoto	d683d7b792	bdev/part: Modify spdk_bdev_part_submit_request() to use custom completion callback In the following patches, we will add a feature to inject data corruption to the error bdev module. For read I/O, we will have to inject data corruption at completion. However, if we use spdk_bdev_part_submit_request(), it will not be possible because we cannot add any custom operation into the completion callback. To fix the issue, modify spdk_+bdev_part_submit_request() and rename it to spdk_bdev_part_submit_request_ext(). Fortunately, we can use stored_user_cb in struct spdk_bdev_io. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I46d3c40ea88a3fedd8a8fef6b68ee417c814a7a1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15002 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-11-03 14:54:28 +00:00
Changpeng Liu	fabf6a83cc	lib/vhost: remove session `initialized` flag Session in vhost means an active socket connection from client(e.g: QEMU or SPDK vhost initiator), but the device state could be `started` or `stopped` because users may remove the driver of the device in VM, so in `foreach_session` we can always call the callback function without checking the session state, and the callback function may check the device state if necessary. Change-Id: Id0fc8c7f6f0915a55a738f0c87ebe6539f7fb2db Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15038 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	9da4e15c5c	lib/vhost: start device asynchronously Now we will start the device(virtio-blk and virtio-scsi) when there is a valid I/O queue(VRING_KICK message), the backend device `start_session` callback will ensure this check, so when processing VRING_KICK messages for each vring, we can just call `new_device` if `started` is false, and if `started` is true, it means the device is already started, it's safe for us to add one more vring even the device is started. With this change, we don't need to wait for the return value of `start_session` in synchronous mode, just return is OK. Fix #2518. Change-Id: I92ba3d4e5c38422d7697c1d13180a4a48f0dd4cd Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14981 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	23baa6761d	lib/vhost: don't restart device multiple times We will stop/start the device multiple times when a new vring is added, and also stop/start the device when set vring's callfd, actually we only need to start the device after a I/O queue is enabled, DPDK rte_vhost will not help us to start the device in some scenarios, so this is controlled in SPDK. Now we improve the workaround to make it consistent with vhost-user specification. For each SET_VRING_KICK message, we will setup the new added vring, and then we try to start the device. For each SET_VRING_CALL message, we will add one more interrupt count, previously this is done when enable the vring, which is not accurate. For each GET_VRING_BASE message, we will stop the device before the first message. With above changes, we will start/stop the device once, any new added vrings after starting the device will be polled in next `vdev_worker` poller. Change-Id: I5a87c73d34ce7c5f96db7502a68c5fa2cb2e4f74 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14928 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	b7facb30f8	lib/vhost_scsi: don't start device before a valid I/O queue is enabled Change-Id: I407c62df2117069ad1d8f6aba18cf316a3cf47bf Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14980 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	9cdd1a8a2c	lib/vhost: remove `vhost_session_used_signal` function `vdev_worker` in vhost-scsi is used to process request queues, and `vdev_mgmt_worker` is used to process the event and control queue, so we don't need to call `vhost_session_used_signal` in `vdev_worker`, just remove it. Change-Id: I86f3e90890e6defba69b01fec131afe1adad3a49 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14927 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	7fcbd0220e	lib/vhost: alloc VQ tasks in VQ setting function Currently we will allocate all VQ's tasks when starting the device, it will not allow us to add new VQ after starting the device, so here, we move it to VQ setting function. Change-Id: I59cfc393d66779ab8a0eb704bc73bcede3f0a2a0 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14926 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	d55bf60a89	lib/vhost: move vq settings into a function With this change, then we can call vq settings after the VRING_KICK message, currently we will stop/start device multiple times when a new vq is added. Change-Id: Icba3132f269b5b073eaafaa276ceb405f6f17f2a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14925 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	a1cd28c6f3	lib/vhost: get negotiated features after SET_FEATURES message Feature negotiation is done after SET_FEATURES message, here we move it in this message context, so that we can use the negotiated features before starting the device. Change-Id: Ic6388dbcebd72bc5ef182e65798d34c07f6fc35c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14924 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	835490b1d5	lib/vhost: check memory table earlier Before starting a device, the memory table is already there, so we can check it earlier. Change-Id: I4996705501577cfa78c89621f7081eb0c3d4dd78 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14923 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
Changpeng Liu	d941d138ad	lib/vhost: merge vq settings into a single loop Change-Id: I5a9ef59adcd383e2fae746a434dda10893a3b84a Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14922 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-11-03 14:53:55 +00:00
GangCao	7f7b468b48	lib/bdev: new __io_ch_to_bdev_ch and __io_ch_to_bdev_mgmt_ch utilities Change-Id: Ie7d818a9a648e28cd191588164420173149af38b Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15167 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-02 15:25:21 +00:00
GangCao	cb55e8493f	Lib/Bdev: update calling to spdk_bdev_for_each_channel Change-Id: I541ccffc90e7dc54b416da385e862e952d9db71d Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14638 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-11-02 15:25:21 +00:00
Jim Harris	5497616e8f	env_dpdk: add support for DPDK 22.11 DPDK has merged changes which hide remove some DPDK object such as rte_device and rte_driver from the public API. So we add copies of the necessary header files into our tree, along with a 22.11-specific pci_dpdk implementation. These files are copied over exactly, except for one #include which needs to change from <> to "" so that it picks up the header in our tree instead of looking for it in system headers. Longer-term we may want to look at ways to automated checking and updating of these header files. DPDK 22.11 isn't officially released yet, so the header files could change, but we want to get this in now since without it SPDK cannot build against DPDK tip at all. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I89ffd0abab52c404cfff911c1c9b0cd9e889241d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14570 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-11-02 10:50:23 +00:00
Evgeniy Kochetov	8c3590a983	bdev: Add copy IO statistics Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Id51ac80bce33a27a8ccea273c076f39019b98339 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14348 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-02 10:33:00 +00:00
Evgeniy Kochetov	a383a15fb1	bdev/part: Add copy IO type support Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I9e2dcf29794fdb9535a4f0282b3046602f09188e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14385 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-02 10:33:00 +00:00
Evgeniy Kochetov	d14afd5000	bdev: Add copy IO type Copy operation is defined by source and destination LBAs and LBA count to copy. For destiantion LBA and LBA count we reuse exiting fields `offset_blocks` and `num_blocks` in `struct spdk_bdev_io`. For source LBA new field `src_offset_blocks` was added. `spdk_bdev_get_max_copy()` function can be used to retrieve maximum possible unsplit copy size. Zero values means unlimited. It is allowed to submit larger copy size but it will be split into several bdev IOs. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I2ad56294b6c062595c026ffcf9b435f0100d3d7e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14344 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot	2022-11-02 10:33:00 +00:00
GangCao	e28e247954	RPC/Bdev: display the per channel IO statistics for required Bdev Add a new parameter "-c" to display the per channel IO statistics for required Bdev ./scripts/rpc.py bdev_get_iostat -b Malloc0 -h usage: rpc.py [options] bdev_get_iostat [-h] [-b NAME] [-c] optional arguments: -h, --help show this help message and exit -b NAME, --name NAME Name of the Blockdev. Example: Nvme0n1 -c, --per-channel Display per channel IO stats for specified device This could give more intuitive information on each channel's processing of the IOs with the associated thread on the same Bdev. Please also be aware that the IO statistics are collected from SPDK thread's related channel's information. So that it is more relating to the SPDK thread. And in the dynamic scheduling case, different SPDK thread could be running on the same Core. In this case, any seperate channel's IO statistics are returned to the RPC call and if needed, further parse of the data is needed to get the per Core information although usually there is one thread per Core. On the other hand, user could run the framework_get_reactors RPC method to get the relationship of the thread and CPU Cores so as to get the precise information of IO runnings on each thread and each Core for the same Bdev. Change-Id: I39d6a2c9faa868e3c1d7fd0fb6e7c020df982585 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13011 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-10-28 06:51:19 +00:00
GangCao	f0494649e3	Lib/Bdev: add the new API spdk_bdev_for_each_channel And also related function pointers and APIs: spdk_bdev_for_each_channel_msg; spdk_bdev_for_each_channel_done; spdk_bdev_for_each_channel_continue; Change-Id: I52f0f6f27717d53c238faf2f998810c9c5ee45d4 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14614 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot	2022-10-28 06:51:19 +00:00
Shuhei Matsumoto	6a5ecb3276	bdev/part: Consolidate all I/O types into bdev_part_complete_io() The following patches will allow the caller to specify a custom completion callback to spdk_bdev_part_submit_request(). To do it easily, consolidate completions of all I/O types into bdev_part_complete_io(). Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I083695189daa7e5271787c50947e428d01a83677 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15001 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-10-28 06:49:40 +00:00
Shuhei Matsumoto	ab839831f1	nvme_rdma: Remove workaround for Soft RoCE's bug from cq_process_completions() We do not support Soft RoCE anymore. Remove a workaround for Soft RoCE's bug that we amy receive a completion without error status after qpair is disconnected/destroyed. Then add a assert to check if rdma_req->req is not NULL. This will simplify the code and the following patches. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I80c349053adc0f79679eaf8a5d7265d555d3c2b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14909 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-10-28 06:27:19 +00:00
Shuhei Matsumoto	1439f9c773	nvme_rdma: Pass poller instead of poll_group to cq_process_completions() The following patches will support SRQ and SRQ will be per poller. We will need SRQ in nvme_rdma_cq_process_completions(). It is not possible to identify poller if poll_group is passed to nvme_rdma_cq_process_completions(). Based on these thoughts, add poll_group pointer to poller and pass poller to nvme_rdma_cq_process_completions() instead of poll_group. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I322a7a0cc08bdcc8e87e720ad65dd8f0b6ae9112 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14282 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-28 06:27:19 +00:00
Shuhei Matsumoto	194047249b	nvme_rdma: Get qpair from poll group using WC NVMe-RDMA target has a helper function get_rdma_qpair_from_wc() and uses it to identify a qpair from a WC. NVMe-RDMA initiator has a similar function nvme_rdma_poll_group_get_qpair_by_id(). NVMe-RDMA initiator will support SRQ in the following patches, and it will want to identify a qpair from a WC. get_rdma_qpair_from_wc() of NVMe-RDMA target uses wc->qp_num internally anyway. However, the upcoming custom transport for RDMA will have to use other variables of WC. Hence, it will be convenient to pass WC instead of qp_num if we consider future enhancements. Based on these thoughts, for NVMe-RDMA initiator rename nvme_rdma_poll_group_get_qpair_by_id() by get_rdma_qpair_from_wc(). remove unnecessary declaration, and pass WC instead of qp_num. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I01ead4730207e2c6ac53b83f151bd5f977a11465 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14279 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-28 06:27:19 +00:00
Shuhei Matsumoto	6ea9de5fc8	nvme_rdma: Factor out poller destroy operation Poller will have more shared resources when SRQ is supported. This is a preparation. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Signed-off-by: Denis Nagorny <denisn@nvidia.com> Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: Ic3d1cb93dde3f53653a9536a103e5518cebd58e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14173 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-28 06:27:19 +00:00
Shuhei Matsumoto	6a59daad2b	nvme_rdma: Poll disconnect until completion if async mode is disabled nvme_rdma_ctrlr_disconnect_qpair() does not poll the qpair until it is actually disconnected if it is in a poll group even if its async mode is disabled. Hence, spdk_nvme_ctrlr_free_io_qpair() removes the qpair from a poll group when it is being disconnected. On the other hand, I/O qpair is destroyed after it is actually disconnected. When SRQ is enabled and used, a SRQ is destroyed if the corresponding poller does not have any I/O qpair after an I/O qpair is removed from the poller. In particular, if we use spdk_nvme_ctrlr_free_io_qpair(), a SRQ is destroyed before the corresponding I/O qpairs are destroyed. Destroying a SRQ failed because it is still referenced by I/O qpairs. This bug was found when running the SPDK NVMe perf tool with SRQ. The reason was we had nvme_rdma_poll_group_process_completions() to call disconnected_qpair_cb after the qpair is actually disconnected. However, it is ensured that nvme_rdma_poll_group_process_completions() calls disconnected_qpair_cb for any disconnected qpair. Hence, remove a check if qpair->poll_group is not NULL from nvme_rdma_ctrlr_disconnect_qpair() and update the comment. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0fde0d827eec3280e1cc5a0fce34d163a6069bc4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14908 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-28 06:27:19 +00:00
Vasuki Manikarnike	3fcee8ddcc	lib/nvme: Do not submit queued aborts if adminq is in failed state. With RDMA, the admin poller can experience a remote disconnect when processing completions. The admin qpair will be disconnected to handle this. The disconnect code path will manually complete queued aborts. However, the completion callback for the abort will attempt to resubmit other queued aborts from the queue, which will result in a very large stack and can eventually cause a segfault. The fix is to not resubmit queued aborts if the admin qpair is in any kind of failed state. Change-Id: I4a6f959232c8a1bd30c87ca50459014e556cbaa0 Signed-off-by: Vasuki Manikarnike <vasuki.manikarnike@hpe.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15114 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>	2022-10-28 06:26:20 +00:00
Szulik, Maciej	51ae6d4002	nvme/tcp: add max_completion exit condition to loop inside read_pdu A loop inside 'nvme_tcp_qpair_process_completions' makes 'max_completions' actually behaving like a minimum: do { rc = nvme_tcp_read_pdu(tqpair, &reaped); [...] } while (reaped < max_completions); Before this change 'max_completion' constraint, in its true sense, was actually not respected and a loop inside 'nvme_tcp_read_pdu' could be executed indefinitely as long as a recv state changed. To prevent this behavior, max_completion must be passed to 'nvme_tcp_read_pdu' and used as an additional exit condition. Signed-off-by: Szulik, Maciej <maciej.szulik@intel.com> Change-Id: I28da962f4a62f08ddb51915b5d0dae9611a82dee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15136 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-26 07:35:21 +00:00
John Levon	36dfcca2b4	nvmf/vfio-user: switch from shadow doorbells when freeing Some reset/disable paths are freeing the shadow doorbells without switching the SQs back to BAR0. Fix this up, and add a small cleanup when initializing the shadow doorbells. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: Ia5e5b91b7dc696a558eb0ad59cc554abced47cca Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14901 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-10-26 07:32:54 +00:00
John Levon	64db53f1aa	nvmf/vfio-user: support multiple poll groups in interrupt mode To support SQs allocated to a poll group other than the controller's main poll group, we need to make sure to poll those SQs when we wake up and handle the controller interrupt. As they will be running in a separate SPDK thread, we will arrange for all poll groups to wake up when we receive an interrupt corresponding to a vfio-user message arriving. This can mean needless wakeups: we don't (yet) have a mechanism to only wake up the poll groups that correspond to a particular SQ write. Additionally, as we don't have any notion of a poll group per controller, this ends up polling all SQs in the entire poll group, not just the ones corresponding to the controller we were handling. As this has potential performance issues in many cases, it defaults to disabled. Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: I3d9f32625529455f8d55578ae9cd7b84265f67ab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14120 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-10-26 07:32:54 +00:00
liu.darong	7e17de3d81	bdev/trace: add support to trace with bdev name Fixes #2585 Signed-off-by: liu.darong <liu.darong@xsky.com> Change-Id: I3f9b6d4719b5eed004f383e86db8a17b8b0287f5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13823 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-10-25 07:12:52 +00:00
Anton	7ba33f49f0	lib/idxd: fix use after free due to stale crc_dst in chained ops When crc32c is invoked with a multiple entry input iov, only the last op has crc_dst set in order to write the final crc value into the user supplied location. spdk_idxd_process_events() for every successfully completed CRC op writes the value into *op->crc_dst UNLESS it is NULL. The problem is that _idxd_prep_batch_cmd() that allocates new ops left op->crc_dst uninitialized. This results in a memory corruption (use after free) in the following scenario: 1) op A is allocated an crc_dst is set to point to user memory X. 2) Op A is compeleted 3) User memory X is freed. 4) Ops B and C are allocated (chained), C has crc_dst set. => B reused op A memory and crc_dst still points to the now stale user location (1) 5) B is complered, spdk_idxd_process_events() writes into X as B->crc_dst = X. Fix: _idxd_prep_batch_cmd() should initialize crc_dst to NULL. Signed-off-by: Anton Eidelman <anton@lightbitslabs.com> Change-Id: I9e7d57ec43a8fbcb3750906015a5cb7291278c35 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15115 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-10-25 07:10:55 +00:00
paul luse	13597fd4f1	accel_sw: add extra check on compression We were missing a check when ISAL uses the complete output buffer on compression to determine whether it was s perfect fit or if simply not enough buffer was provided. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I73532666f50cb9fbef3c42f6bfb25fc5c7de01c6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14874 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-10-25 07:09:37 +00:00
Krzysztof Karas	a74c8c2e8c	scheduler: prevent user from switching back to static Prevent user from switching back to static scheduler after different scheduler has been selected. Currently we do not have a way to save initial thread distribution configuration, so each time user switches from dynamic scheduler back to static, the SPDK threads may end up on different reactors. This would cause discrepancy in performance statistics of SPDK managed by static scheduler. Change-Id: Ic17a6be55eaea0e1a748f92e01f7075540403637 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15055 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-10-21 07:33:06 +00:00
Jim Harris	a9be4f2c2f	trace: add likely/unlikely hints to _spdk_trace_record This helps generate slightly better code in this function, which can have a noticeable impact for high trace event workloads. Tested with bdevperf, single malloc or null bdev, qd=32, 512B randreads on a single Xeon core. Specify "-e bdev" to enable bdev trace events. Null: Before: 8.09M/s (123ns per IO) After: 8.68M/s (115ns per IO) Malloc: Before: 4.21M/s (237ns per IO) After: 4.34M/s (230ns per IO) Note that each bdev I/O generates two trace events (START and END) - meaning this change removes 7-8ns of overhead for every 2 trace events, at least on my system. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7021b7f9e28b4a7cb16f8a97b4d4004ae165efd2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15096 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-10-21 07:18:37 +00:00
Alexey Marchuk	c77b537786	accel: Save overridden options in json config file Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com> Change-Id: Ida2c6f1c460c2b66d2d4159d225036377e488e62 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14856 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-10-19 07:47:58 +00:00
Anton Eidelman	c2c8b4ebc7	lib/idxd: fix bug in crc32c with chained ops When spdk_idxd_submit_crc32c() handles input with multiple iovs (or multiple ops are generated due to physically discontinuous buffers), the first op has the original seed, while the subsequent ops instruct the hardware to to fetch the seed from the output of the previous op (op->hw.crc32c_val): void *prev_crc; ... desc->flags \|= IDXD_FLAG_FENCE \| IDXD_FLAG_CRC_READ_CRC_SEED; desc->crc32c.addr = (uint64_t)prev_crc; <<< virtual addr The problem is the prev_crc is a virtual address, so the hardware (at least with no IOMMU configured) reports: DSA_COMP_HW_ERR1 spdk_idxd_process_events: Completion status 0x20 Solution: Set crc32c.addr to the physical address of the crc32c_val field in the previous desc. Since desc->completion_addr already holds the physical address of the dsa_hw_comp_record, we use this with the crc32c_val offset. Signed-off-by: Anton Eidelman <anton@lightbitslabs.com> Change-Id: I330e98c2f3fd6da5cb4fc03d0745df09a9ff0e0c Signed-off-by: Anton Eidelman <anton@lightbitslabs.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14954 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-10-18 07:24:55 +00:00
Konrad Sztyber	1f3a6b0398	rpc: use rw access when creating RPC lock file It allows the users to specify the path to the RPC socket on a NFS mounted filesystem. This is necessary, because flock(2) on NFS requires write access to place an exclusive lock. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If197498ed5bdcb4e02c5f2f2b2c1ef388872c457 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14993 Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-10-18 07:23:28 +00:00
GangCao	f20b99bbb3	lib/nvme/vfio: destruct ctrlr in failed cases Change-Id: Ie7d7ab25055c26ea1c2ae4997bf7197a170de989 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15005 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-10-17 12:52:55 +00:00
Szulik, Maciej	dcf30711ef	build: add explicit vars init to silence LTO related warning When Link Time Optimization is enabled, compiler can sometimes produce additional warnings saying that some variables may be uninitialized. To supress the warning it is enough to add explicit initialization of the variable causing the issue, in this case 'module_name = NULL' and "writer = NULL". Signed-off-by: Szulik, Maciej <maciej.szulik@intel.com> Change-Id: I30492115b28a18554b08a6f575cbcc9538f3b848 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14849 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-10-05 10:24:53 +00:00
GangCao	8afb3d0037	lib/bdev: return error when failing to get resource To fix issue: 2719 Change-Id: I983ef607fad154608fff9bb9355645968caf0c5a Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14746 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-10-04 07:07:04 +00:00
Tomasz Zawadzki	f98ac63ea7	reactor: do not switch mode for threads in non interrupt tgt Fixes #2693 spdk threads should not be placed in interrupt mode if the application does not have interrupt mode enabled. This resulted in race condition, while reactor was placed in interrupt mode, thread was scheduled on it. Such operation is a valid one, but never should be attempt to change the threads mode in this case. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I10b0bbacac1df812badb91b37064528f66743e51 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14815 Reviewed-by: Michal Berger <michal.berger@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-30 16:14:10 +00:00
Tomasz Zawadzki	c34f15e09c	env_dpdk: keep DPDK 20.11 compatiblity Patch below added copies of pci realted headers to keep compatiblity with <= DPDK 22.07. (`1eb35ac`) env_dpdk: add copies of 22.07 pci-related header files Unfortunetly the rte_bus/bus_pci/dev headers from DPDK 22.07 are not compatibile going back to DPDK 20.11. The issues are: - lack of RTE_TAILQ_ENTRY defined in rte_os.h - rte_intr_handle being part of rte_pci_device rather than pointer pci_dpdk_2207.c even before this patch is not binary compatible with DPDK 20.11 - see pci_device_*_interrupt_2207() functions. There would need to be another copy of headers matching that version of DPDK to resolve this issue. SPDK supports up to two latest LTS releases. Which right now includes DPDK 20.11, but soon will be dropped due to DPDK 22.11 release. Having compile time defines here, keeps the older DPDK working. Meanwhile backwards compatiblity in SPDK is no worse than before. The recent changes to env_dpdk, are aiming to improve support with newer versions of DPDK. Change-Id: If4dc601cb03e18c2cad61f3a93080e8265ca5fcc Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14795 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-30 15:56:33 +00:00
Artur Paszkiewicz	a51649faf6	bdev: use write_unit_size for acwu and write_zeroes Change-Id: Idbcfc110c153a62082f84f3304f1e245f2fc3daf Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14716 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	69c448a30e	lib/util: add ISA-L accelerated xor generation Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I3ef9dadb4c68e92760c8426f0fffb7b249829e2b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12080 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-29 22:52:45 +00:00
Artur Paszkiewicz	d6e9827e9f	bdev: split writes based on write_unit_size Add new bdev property split_on_write_unit which, if set to true, causes writes to be split to match write_unit_size and fail if not aligned to or not multiple of write_unit_size. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Id49f58a3288ddf5cfe4921ce4020ae4bcdd67298 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11390 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-29 22:52:45 +00:00
Changpeng Liu	295e54d144	virtio/vfio_user: add virtio_blk device support Add vfio-user transport support based on existing virtio client library. Test steps using bdevperf: Start `spdk_tgt` with created virtio_blk device: 1. build/bin/spdk_tgt 2. scripts/rpc.py bdev_malloc_create -b malloc0 $((512)) 512 3. scripts/rpc.py vfu_virtio_create_blk_endpoint vfu.0 --bdev-name malloc0 \ --cpumask=0x1 --num-queues=2 \ --qsize=256 --packed-ring Start `bdevperf`: 1. test/bdev/bdevperf/bdevperf -r /var/tmp/spdk.sock.1 -g -s 2048 -q 128 -o 4096 \ -w randread -t 30 -m 0x2 2. scripts/rpc.py -s /var/tmp/spdk.sock.1 bdev_virtio_attach_controller --dev-type blk \ --trtype vfio-user --traddr vfu.0 VirtioBlk0 3. test/bdev/bdevperf/bdevperf.py -s /var/tmp/spdk.sock.1 perform_tests Change-Id: I368c4becebbca57328a25fc750e41c353420e481 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13896 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 19:42:56 +00:00
Changpeng Liu	e50ade3153	vfio_user: remove CONFIG_VFIO_USER flag for client library The client vfio_user library doesn't require this flag as it is totally owned in SPDK, so remove it. Change-Id: I8f7b1df18017ceac24dbb8a0417871f25f6bee0d Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13895 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 19:42:56 +00:00
Changpeng Liu	da231290b2	lib/vfu_tgt: add library for PCI device emulation Previously SPDK use libvfio-user library to provide emulated NVMe devices to VM, but it's limited to NVMe device type only. Here we add SPDK vfu_target library abstraction based on libvfio-user which supports more PCI device types. We will add virtio-blk and virtio-scsi devices emulation based on vfu_tgt library in following patches, actually this library can support NVMe emulation too, due to the fact that the NVMe emulation is already exist, so we will keep the NVMe emulation which based on libvfio-user directly as it is. Change-Id: Ib0ead6c6118fa62308355fe432003dd928a2fae9 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12597 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-29 19:42:56 +00:00
Jim Harris	c7f5010984	env_dpdk: add dpdk_pci_device_get_mem_resource This allows eliminating dpdk_pci_device_vtophys and dpdk_pci_device_map_bar, reducing the amount of code we need to maintain in the per-DPDK version implementations. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I73d15eb75bf7fe8340d85494425e15651fec5425 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14722 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-29 15:32:24 +00:00
Jim Harris	5be703ef35	env_dpdk: break up dpdk_pci_device_copy_identifiers Break this function up into three APIs instead: * dpdk_pci_device_get_addr * dpdk_pci_device_get_id * dpdk_pci_device_get_numa_node This more clearly delineates the requirements we have from the DPDK PCI device/driver APIs. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie585c8252d63c15c6e6884d60f8a064c3f0ab94f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14684 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-29 15:32:24 +00:00
Jim Harris	1eb35ac7e3	env_dpdk: add copies of 22.07 pci-related header files Moving forward, we want to still be able to run against <= 22.07 versions of DPDK, which exposed the necessary data structures in public header files. But since we will be building against newer versions of DPDK which don't expose them publicly, we need a copy of the 22.07 header files in our tree. Exclude these header files from astyle and POSIX include file checks in check_format.sh Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Icd8a067af41a2ba031ce8f875a8a2b63f722ab69 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14683 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-29 15:32:24 +00:00
Jim Harris	25f89bd584	check_format.sh: remove "rte_vhost" exclusions This was a remnant from ages ago when we had rte_vhost DPDK code copied into our repo. We actually have a file named rte_vhost_user.c which is not DPDK code that was getting excluded from astyle checking. So this also includes the astyle violations that had crept into this file. In a couple of places, change the enum return type to int, this reduces astyle confusion on function and if brace style. Same applies to POSIX include checking - we don't need to exclude rte_vhost_user.c from this either. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: If3a25011ad54c694c15a91f7be66d862c765c5db Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14688 Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-29 15:32:24 +00:00
GangCao	13c7a98d33	thread: add assert for io_channel_iter allocation failure For example, in the calling from spdk_bdev_get_current_qd(), if spdk_for_each_channel() failed to allocate struct spdk_io_channel_iter, it will just return and the ctx allocated in spdk_bdev_get_current_qd() is not released. Instead to change the public API of spdk_for_each_channel() to return the failed status to let the caller properly handle the NOMEM case and release the allocation, it just adds the assert here. Change-Id: I6a95207dd390586bdae4e86e5d550cdac709e10a Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14657 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-29 07:27:27 +00:00
MengjinWu	f1bec928d1	nvmf/tcp: add admin queue depth check before init max_aq_depth should be not smaller than 2 or greater than 4096 Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I205fbb4345cfdc41ebaf30c953da263fe9f0e9a8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14691 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-09-28 06:39:14 +00:00
MengjinWu	bf887576cb	nvmf/tcp: add IO queue depth check before init max_queue_depth should be not smaller than 2 or greater than 65536 Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I0f2a4b8df6eb1b140a11936fc6929f1285a7d717 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14619 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-09-28 06:39:14 +00:00
MengjinWu	5eb3239cdf	nvmf/tcp: Refine the macro definition of queue depth Refine the macro definition name about queue depth and prepare for next patch. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I85bee2528ae4ab70292fc11aa62d05bae0c28a77 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14664 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-28 06:39:14 +00:00
Krzysztof Karas	19c1d632f1	trace: update trace help inside SPDK target Delete bit masks from trace help (found inside build/bin/spdk_tgt -h help text), as they do not provide useful information, are much harder to remember and use, and migh leave user confused. Since we provide trace group names anyway, bit masks are excessive. Change --tpoint-group-mask parameter name to --tpoint-group, because we do not provide bit masks anymore. Drop "default" tpoint group mask from help text, since it does not enable any tracepoints and may confuse the user. Change-Id: I2ca780883dfa7822e76523e9ba1fc65a7bfe5a99 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14656 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-27 19:41:17 +00:00
Szulik, Maciej	1b575d831d	lib/nvmf: add explicit iovcnt init to silence LTO related warning When Link Time Optimization is enabled, compiler can sometimes produce additional warnings saying that some variables may be uninitialized. To supress the warning it is enough to add explicit initialization of the variable causing the issue, in this case 'iovcnt = 0'. Signed-off-by: Szulik, Maciej <maciej.szulik@intel.com> Change-Id: I080b20a6008643ae78c8e3a6c2d183193ef6c1bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14674 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Community-CI: Mellanox Build Bot	2022-09-26 15:36:23 +00:00
Liu Xiaodong	b6bb252e23	lib/nvmf: fix async_events index When data_local.num_async_events > SPDK_NVMF_MIGR_MAX_PENDING_AERS, data_local.async_events was already indexed by 256, and it was out of bounds. Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Change-Id: I15cfdeb9bc165de0c73fbc9171b0ce6d8689c0aa Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14666 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-26 11:49:41 +00:00
Ben Walker	2371a070c8	idxd: For kernel mode, handle IOMMU+SM mode If the kernel is booted with the IOMMU enabled and Shared Memory mode enabled (which are the expected boot parameters for production servers), then the kernel idxd driver will automatically register a dedicated work queue with the PASID for the process that opens it. This means that the descriptors written into the portal for that work queue should be virtual addresses. If the IOMMU is enabled but Shared Memory mode is disabled, then the kernel has registered the device with the IOMMU and assigned it I/O virtual addresses. We have no way to get those addresses from user space, so we cannot use the kernel driver in this mode. Add a check to catch that. If the IOMMU is disabled, then physical addresses are used everywherre. Change-Id: I0bf079835ad4df1128ef9db54f5564050327e9f7 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14019 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-26 11:48:39 +00:00
Ben Walker	1c098401d8	idxd: Correctly memory barrier prior to submitting descriptors The DSA specification calls out that software must use a memory barrier such as sfence prior to writing a descriptor or incorrect data may be transferred during the operation. Change-Id: I12f20e5a748e41616c7a542ccdb158c6b548eea4 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14018 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-26 11:48:39 +00:00
Ben Walker	a36bc251df	env_dpdk: Automatically map PCI BARs into VFIO By doing the registration immediately upon mapping the BAR instead of when the memory is inserted into the spdk_mem_map, we're able to register BARs that are not 2MB multiples in size and alignment. The SPDK API for registering a BAR already returns the physical/io address in the map call, and it can be used directly without a call to spdk_mem_register(). If the user does elect to later register the BAR using spdk_mem_register(), we attempt to insert the 2MB aligned segments we can into the spdk_mem_map. Users may still need to register memory for a few reasons, such as making spdk_vtophys() work, or for setting up the BAR as a target for RDMA. These cases still require 2MB aligned and sized segments. Change-Id: I395ae8803ec4bf22703f6f76db54200949e82532 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14017 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-26 11:48:39 +00:00
Jim Harris	3d5971ecc6	env_dpdk: do not use rte_version_xxx() variants These variants did not exist in DPDK 20.11 which is still supported by SPDK. So we will instead need to scan the rte_version() string to get these values. Fixes issue #2715. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I79657002a7a605a38a0d98b944ac53c02fa6d78c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14661 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Pawel Piatek <pawelx.piatek@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-26 11:16:42 +00:00
MengjinWu	8d1c4f74d4	nvmf/tcp: Check if In-capsule Data length and sgl data length are equal In-capsule data length should be the same with the SGL data length. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I7eefecb8baebb76850a48689907aff27a8946f98 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14602 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-23 18:26:36 +00:00
MengjinWu	8ed53eee32	nvmf/tcp: Fixed error handle in 'nvmf_tcp_req_parse_sgl' Fixed error handles which are violated with spec: 1. 'data length > MAXH2CDATA' is a fatal error. 2. 'ICDOFF != 0' should abort the IO. Other errors which are not defined in spec: 1. invalid sgl type 2. In-capsule Data length > In-capsule Data size Because this function runs before data part receiving, it is hard to skip the following data segment if we want to handle some error as non-fatal. Currently, we have to handle all undefined errors as fatal errors. I think after this release, we can change receving process. This will be helpful for error handling. But this work is not small. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I8fc0d2d743505e49a93be19fd217e7ad6ca06622 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14580 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-23 18:26:36 +00:00
Sebastian Brzezinka	5fb57441ec	lib/vfio-user: add spdk_vfio_user_dev_send_request as public function Fuzzing vfio-user require access to send request api Signed-off-by: Sebastian Brzezinka <sebastian.brzezinka@intel.com> Change-Id: I6c58b8ab4fd3394150bbb3e64b4f95bff93dae6e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13881 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-23 15:16:01 +00:00
Sebastian Brzezinka	ef73f559e6	lib/nvmf: test if client and server are runing in same process During fuzzing vfio-user client and server are started from same process causing deadlock. SO_PEERCRED return pid of process connected to vfio endpoint. Signed-off-by: Sebastian Brzezinka <sebastian.brzezinka@intel.com> Change-Id: I6fc2db5d58a459a30fec116a9de3c69d48acf75e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14559 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-23 15:16:01 +00:00
Jim Harris	936726f847	env_dpdk: add dpdk_pci_init() This checks the current version to make sure we have a dpdk_fn_table that supports it. This is easy for now, since the DPDK PCI API is public. Moving forward, DPDK 22.11 will likely make these APIs private, requiring us to carry header file copies for different DPDK versions so that we can not only build against DPDK but also use the correct data strucures and APIs to interact with those private DPDK interfaces. We will also need to consider minor (i.e. stable or point) releases since they could technically change PCI ABI as well - the current year + month checks won't be sufficient. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ic9f41d9d13778f3d078b20b08da48d8d16362b11 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14637 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 08:01:01 +00:00
Jim Harris	52c674d23a	env_dpdk: make pci_env_init() return int This allows it to return error codes. Have the init code check the return value and fail the init process when pci_env_init() returns error. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7c8a4f9a6da6b3438ed09a881153b7a4ceef3a83 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14635 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 08:01:01 +00:00
Jim Harris	a25a834ae1	env_dpdk: move <=22.07 specific code to pci_dpdk_2207.c Get ready to have multiple implementations of the dpdk_fn_table. We could do some fancy self-registering constructor functions, but let's just keep it simple for now and extern declare each implementation in the pci_dpdk.h header file. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I8f5621412d1c8bd22c95ab74ef66c5bcc41d1380 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14636 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 08:01:01 +00:00
Jim Harris	53cd692b95	env_dpdk: add struct dpdk_fn_table This is the next step in supporting multiple DPDK PCI device/driver ABIs once those APIs are no longer public and subject to ABI versioning rules. This patch does the following: 1) introduce dpdk_fn_table 2) rename the existing dpdk_xx functions to xx_2207, to denote these functions are valid for DPDK versions up to and including 22.07 3) create a dpdk_fn_table pointing to the xx_2207 functions 4) create a global dpdk_fn_table pointer that points directly to the 2207 fn_table 5) create new dpdk_xx functions that just redirect to the associated dpdk_fn_table function pointer Future patches will add the machinery to register multiple function tables and pick the one to use at run time based on rte_version() calls. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I1171fbdb4f72ff117416ac1fb282ff6f9fa5cadf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14634 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 08:01:01 +00:00
paul luse	850cd90082	accel/idxd/iaa: Convert to use iovecs In prep for upcoming iovec based compression/decompression patches. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I413493f764bead9e56266e488b74f8bca979e225 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14633 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 00:10:08 +00:00
paul luse	28886ac352	lib/accel: rename iovec elements with src prefix In prep for adding both src and dst iovec support for compression. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I704b8d2bd459de03deb7f8ee45d76261910a3727 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13746 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-23 00:10:08 +00:00
MengjinWu	100c53718d	nvmf/tcp: add in_capsule_data_size check before init in_capsule_data_size should not be larger than max_io_size. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I636724c888b9e5abc4cffac96bff24021e172498 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14618 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-22 22:13:19 +00:00
Krzysztof Karas	dfc9894396	bdev: send bdev reset based on outstanding IO and a new timeout parameter A new parameter io_drain_timeout has been added to spdk_bdev structure. If this value is unset, the bdev reset behavior does not change. The io_drain_timeout controls how long a bdev reset must wait for IO to complete prior to issuing a reset to the underlying device. If there is no outstanding IO at the end of that period, the reset is skipped. Change-Id: I585af427064ce234a4f60afc3d69bc9fc3252432 Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14501 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-22 19:18:30 +00:00
Jim Harris	11313c2090	env_dpdk: move dpdk pci code to pci_dpdk.c/h Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54489903f48a8a2e500f64c2e7f8530eed1e6882 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14548 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	7a7fd57715	env_dpdk: add dpdk_device_* functions Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I89dbf50821a3843b861629c195f2f9e8dfdc59a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14569 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	89e56a49d3	env_dpdk: create dpdk_bus_probe and dpdk_bus_scan Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I514b99e0cc887ca9243ccf212d0b7a0304bed45a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14568 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	34ff0cb6aa	env_dpdk: add dpdk_pci_device interrupt functions Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia707870591b1e82e25bb3294b176f47d7e46483f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14547 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	44caf7fdfb	env_dpdk: rename register_rte_driver Rename it to dpdk_pci_driver_register. This way we follow the dpdk_pci_xxx naming convention for all DPDK PCI structure/API dependent functions. Also move it to the end of the file, to prepare for moving it into the separate file. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ifca4110f737095a94f9db3d27525f5b9af0546c9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14546 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	84c34e64a3	env_dpdk: add dpdk_pci_device functions for bars and cfg Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2f65adaead06d2443f634d8d905c780ad38ec454 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14545 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	0c6a7b9153	env_dpdk: add dpdk_pci_device_copy_identifiers Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I2821cbfc58829e2b7f71d2700e102e8fd6c6c322 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14544 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	dabd899365	env_dpdk: add dpdk_pci_device_get_devargs Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54bdd891f99b53fbc3111f1a51c2f73f7a73b92a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14543 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	db531332cf	env_dpdk: add dpdk_pci_device_get_name This touches the rte_pci_device structure, so let's make a separate accessor function just for that. We will start putting the definitions for these new dpdk_pci_device_xxx functions at the end of pci.c. At the end of this series, we will then just lop off the end of pci.c containing all of the dpdk_pci_device functions and move them to a DPDK-dependent pci_22_07.c file. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I0323fc19b51d21d1bac899df21d6ebf4354ab339 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14542 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Jim Harris	ce63b017b8	env_dpdk: don't embed rte_pci_driver directly struct rte_pci_driver will become private, and its size may change between DPDK releases. But we want to keep the spdk_pci_driver structure generic. So allocate 256 bytes of space for the rte_pci_driver structure, which is far more than the 104 bytes it currently occupies. We will keep a struct rte_pci_driver pointer to this memory in spdk_pci_driver which can be set up in the generic code. This will make it easier in future patches to make sure that anything actually touching the rte_pci_driver structure will be in the separate DPDK dependent files. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I29aa7e71137da25a5480b34c71f2e0d5c9c02eae Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14541 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-22 12:38:25 +00:00
Xinrui Mao	4a9209bf1d	lib/nbd: return nbd_poll idle or busy accordingly The previous version missed the case of return value of _nbd_poll equals to 0,and thus,when using nbd with no io,spdk_top shows high cpu utilization.Return idle when _nbd_poll return 0. Fixes #2697 Signed-off-by: Xinrui Mao <xinrui.mao@intel.com> Change-Id: Ifa2ca3010e10250b5320a8282dfed3d97bea5105 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14615 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Community-CI: Mellanox Build Bot	2022-09-22 07:51:46 +00:00
MengjinWu	4c33c7ae20	nvmf/tcp: inline function 'nvmf_tcp_req_set_state' Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie3af436411da9e3f3ad1ec159f0fbf59c4901983 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14598 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-09-22 07:45:56 +00:00
MengjinWu	1d7230285b	nvmf/tcp: add hpda value check in 'nvmf_tcp_icreq_handle' hpda value should be in range of 0 to 31. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie1329c831af06ccc8943a562c3f6396b635be518 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14575 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-22 07:45:03 +00:00
MengjinWu	f8dd380b33	nvmf/tcp: eliminate function nvmf_tcp_set_in_capsule_data This function is small and called only once. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie4b11668e42a8920b3a9a11aa8cb83512f32942c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14576 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-09-22 07:38:25 +00:00
MengjinWu	b5aeff1dba	nvmf/tcp: 'nvmf_tcp_send_c2h_term_req' should set fes Set the fes in nvmf_tcp_send_c2h_term_req. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I457e102d9329e5624c738c5cf2e7fe411106f30b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14583 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-22 07:37:56 +00:00
Kozlowski Mateusz	be61c92a6d	FTL: close ftl bdev in original thread spdk_bdev_close should be called on the caller thread. Saving the thread now for both unmap and get stats, and executing the close in the appropriate context. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I82192817d6012b0d41bbe2078fbd3f7dc01a7282 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14597 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-22 07:10:03 +00:00
Kozlowski Mateusz	691504a314	FTL: Fix error path for initializing mempools If both allocation paths would fail, then the same mngt path would execute rollback twice, leading to use after free error. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I55c9ea5131faabc930fd8ff92ddd9f8d0fd9a0b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14596 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-22 07:10:03 +00:00
MengjinWu	03843f73cb	lib/nvme: disable multi c2hs crc32 offload at host An example: There are 3 c2h data PDUs for one read request. Data digest is enabled, accel_poller is enabled. The first PDU will be offload to accel_poller. Then the others will use CPU to calc the crc32c. If the last PDU is calc done and the first PDU is not calc down, SPDK will direct success the read request, and free some objects. When accel_poller calc down, it will find the request is freed, and abort the SPDK. Disable multi c2hs async process to prevent this situation. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I03c9e5b30622bbe84523c0836aa93cfed672896 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14079 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-21 17:01:46 +00:00
Jim Harris	9633d482a7	nvmf: emit add_listeners RPCs after add_ns RPCs When emitting the JSON-RPC text for saving the current configuration, add the listeners last. This is usually the preferred order when configuring a new subsystem - it is better to have all of the namespaces and hosts added to the subsystem before adding the listener to allow hosts to connect to it. We support namespace hotplug but there's no need to unnecessarily generate hotplug events if we can avoid it. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I79e8a0a496eeb128efbb7e314ac835b6110d3cc8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14586 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-21 08:50:08 +00:00
MengjinWu	00005ed8d5	nvmf/tcp: eliminate function 'nvmf_tcp_pdu_payload_insert_dif' This function is called only once and can be eliminated. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I0b3e80c025b60a816e2113f859907f95e96dd183 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14578 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-21 08:18:56 +00:00
MengjinWu	252c053e6f	nvmf/tcp: insert dif after all payload received 'nvmf_tcp_pdu_payload_insert_dif' can be done after receiving whole payload data as an optimization. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I3054079427c25d102477ef8ec1b288631741d7a3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14577 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-21 08:18:56 +00:00
Ben Walker	712e8cb7ef	accel: Refer to plugins as 'modules' instead of 'engines' This is consistent with the use of terms in other parts of SPDK and fits with the code living under module/ Change-Id: If182f7cf2d160d57443a1b5f24e0065f191b59b2 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13919 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-21 08:17:48 +00:00
MengjinWu	e4569bd421	test/nvme_tcp: Correct the psh_len in nvme_tcp unittest psh len is not the same with header len. Add an assert in nvme_tcp.c to prevent this happen again. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ibc250752bedf3da8994f79c51fb01577a222d364 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14521 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 20:29:40 +00:00
MengjinWu	0b7f5a57ac	nvme/tcp: remove unnecessary if check in nvme_tcp_read_pdu This "if" is of no use here. The state machine has the "NVME_TCP_PDU_RECV_STATE_AWAIT_PDU_CH" state means the pdu does not receive enough length of header. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Id50943f77b570fd337e2bb4e3b45281018d159e4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14504 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 20:29:40 +00:00
Aleksey Marchuk	bf41b46c4e	nvmf: Don't reg additional MRs RDMA transport registers MRs for in-capsule data buffers, commands and completions. Since these structures are allocated using huge pages, MR for these buffers are already registered, we only need to translate addresses. Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I90c53d8276d72077f7983e9faf9160e9ede52a7d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14430 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 20:27:52 +00:00
Aleksey Marchuk	c66b68e94e	nvme/rdma: Inline nvme_rdma_calloc/free These functions used to allocate resources using calloc/spdk_zmalloc depending on the g_nvme_hooks pointer. Later these functions were refactored to always use spdk_zmalloc, so they became simple wrappers of spdk_zmalloc and spdk_free. There is no sense to use them, call spdk memory API directly. Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I3b514b20e2128beb5d2397881d3de00111a8a3bc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14429 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 20:27:52 +00:00
Aleksey Marchuk	77aef307fd	nvme/rdma: Don't reg MRs for cmds and rsps Since now cmds and rsps buffers are allocated from huge pages, there are already registered MR for this memory. In that way we can avoid registering 2 additional MRs per qpair, just perform memory translation to get lkey. Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I2cb39a15e5d224698c293ac18af00a909840eaa8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14428 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 20:27:52 +00:00
Kozlowski Mateusz	920c1cca18	ftl: Change metadata to use structure packing Don't rely on compiler for metadata packing to 4KiB size and add reserved fields manually. For compatibility reasons against metadata relying on automatic padding the reserved fields are also added in-between existing fields as needed. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5e342d5bf5948c213d455590d09597ae120b3c62 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14307 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	c332181331	FTL: Move base device sb to LBA 0 Moving the superblock of the base device to sector 0, in order to prevent other bdevs (e.g. GPT or blobstore) from potentially hijacking the base device during startup (if their metadata by 'luck' manages to find itself at sector 0 of band 0, which depending on the order of operations could be very likely). Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I8a6eb3c89a229f443ef23d975a8ff0880ba65b08 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14143 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	759e176927	lib/ftl: Don't retry on write failure Retrying on write errors is generally not needed, by default FTL will fail now in such cases. If retry is preferable, an additional build flag must be supplied. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I8ed1fe140564f08905bdf7fc6d6aa86a7585693a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14114 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Artur Paszkiewicz	d1dd6ca814	ftl: check structure sizes for future ABI compatibility Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ic32f6fe085d94b00d025b6cab7e5073341169a73 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13677 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	4759b0b6a6	ftl: Add explicit values to the ftl_layout_region_type This should prevent accidental reordering/removal of regions from causing problems after loading against such changed metadata. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I75c62810157db4bb0de4dfc84f5656fd187befde Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13614 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Artur Paszkiewicz	63b2fecb3f	ftl: nv cache write throttling Adds user write throttling - since writing to cache must be balanced against the ability to compact the data to the base device, this throttling mechanism allows for a smoother, more stable performance levels - tying the user write speed to the compaction drain speed. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ia85efeb387f17c6c080b23ae4e658a6d7e47a2fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13392 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Artur Paszkiewicz	8a76d5500d	ftl: I/O tracing Adds tracepoints in FTL. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I661703e42b8b531822a2ba74a09cdc716daa1c46 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13391 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Artur Paszkiewicz	1790ee8a8d	ftl: I/O statistics Add gathering of some performance counters and RPC for printing them. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I2e77d37fb66459240ff2e241f2b1f77c60f4eef4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13390 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	d748bc41e2	ftl: Add layout upgrade to management path Execute the upgrade management path during startup. Will attempt to update metadata and verify layout validity. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I2cff15cbe87836ca8b7700d0e3f4eee0f331ac56 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14450 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	8c41c40331	ftl: Add md upgrade templates for P2L/Band/Chunk from version 0 to version 1 Since P2L, Band, Chunks start at version 1, adding some code blocking the loading of version 0 for them. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4f5d3a8bb3ed1e39bea18803ffb8ba319a815ae8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13387 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	c8ab874d7c	ftl: Add upgrade of superblock from version 2 to version 3 Layout of metadata will be part of the superblock at the end of the upgrade. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: If888866806e948ee07f0777612da73ab8b7548b1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13385 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	7ff285193f	ftl: Add metadata upgrade framework Added the ability for minor metadata upgrade - updating the internal fields of metadata structures, without changing the overall layout. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Iec98c62b45b099d6d476d486ba7e4ff6b648bb95 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13384 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 19:24:26 +00:00
Artur Paszkiewicz	44b6d585ca	FTL: Add helper functions for superblock upgrade Adds extra functions which will be used during upgrade (changing versions) of superblock metadata. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I08642deaf509f613cc8b22043dcdded6c329daa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13383 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 19:24:26 +00:00
Kozlowski Mateusz	1bc356bb21	ftl: Fix abort in compaction retry path Don't try to abort when return code is actually 0. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Id93a43173ae54324dc61ba419d929fdec4d90264 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14449 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 19:24:26 +00:00
Jim Harris	b313652b30	env_dpdk/pci: Refactor PCI bus scan Preparing for potential 22.11 changes, refactor this code using DPDK api: - a bus device list can be walked through via RTE_DEV_FOREACH, - a reference to the bus object is directly available under the device, Signed-off-by: David Marchand <david.marchand@redhat.com> Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id3a21a6e62dfa1619a92465fac5a82afb9b43cb0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14532 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-20 10:19:22 +00:00
Jim Harris	36644ef32f	env_dpdk: move spdk_pci_driver definition to pci.c Also remove all pci-related DPDK includes from env_internal.h, and add rte_bus_pci.h to pci.c only. Now pci.c has all references to DPDK pci-related header files and data structures. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5f1727d465eaa73cf71d2f3589cecd3ebb83eb85 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14531 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 10:19:22 +00:00
Jim Harris	2bb7185f1b	env_dpdk: add dpdk_pci_device_vtophys() This moves the only references to the rte_pci_device data structure from memory.c to pci.c. This helps prepare SPDK for possible changes to DPDK around visibility of these DPDK data structures, making it easier for SPDK to manage if only one file is affected. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I26b1907fabd7a6c23701523811abd1ce12606683 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14530 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 10:19:22 +00:00
Jim Harris	92e63a9cc6	env_dpdk: remove unused SPDK_PCI_DRIVER_MAX_NAME_LEN Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I7b6f8d165b56b079fbab0f9dd4a354bf82533d59 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14529 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 10:19:22 +00:00
paul luse	dd2c08d2d1	configure/misc: make ISA-L a hard dependency Following discussion in a recent SPDK community meeting, it was determined that we no longer need to carry ISA-L as a user configuration option. It will be enabled by default. If running on an architecture that ISA-L isn't fully supported on, the configure script will disable associated features and display a warning and will also not build ISA-L. Same case if there are issues with dependencies. Note that --without-isal is no longer supported as a configure option. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ibd1e5e9454d1b090462c3e757b2f51c52e6cb774 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14393 Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-20 10:18:54 +00:00
Jim Harris	18c8b52afa	trace: allocate shm filesize based on number of cores used Previously we would always allocate the shm file based on max (128) cores which is unnecessary. So use spdk_env APIs to only allocate shm file size based on the cores we might possible use. With default settings, an shm file was 135MB before this change, now an app using cores 0-7 will just use about 9MB. A lot of the trace-related code depended on there always being a history for every core, even unused ones, so a few additional changes were needed, mainly the trace_parser library. Tested by starting an app using a 0x4 core mask and enabling a trace mask, generating some events, then checking both the size of the shm file and that spdk_trace works properly with the resulting file. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ie868b3e3658d6f82b2fea37cb87453e8a9e0abc4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14044 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-20 10:17:45 +00:00
Changpeng Liu	982c25feef	nvmf: add spdk_nvmf_ctrlr_[save\|restore]_migr_data() APIs When doing live migration, there are some spdk_nvmf_ctrlr internal data structures which need to be saved/restored, these data structures are designed only for vfio-user transport, for the purpose to extend them to support other vendor specific transports, here we move them as public APIs, users can use SAVE\|RESTORE to restore a new nvmf controller based on original one. And remove the register from vfio-user transport, these registers are stored in the common nvmf library. Change-Id: I9f5847ef427f7064f8e16adcc963dc6b4a35f235 Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11059 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-20 10:17:24 +00:00
Liu Xiaodong	762db2a4f4	vhost: register memtable once if unchanged Move memtable register out of start_device, into post_handler for vhost-msg SET_MEMTABLE; And unregister memtable in destroy_connection instead of destroy_device If memtable info not changed in the msg, then we don't need to register it multi times. Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com> Change-Id: I0f8c76c1ee43b6f981d703beeba92da5dac4dbd6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14263 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-19 13:12:24 +00:00
Xinrui Mao	c3f628f141	lib/nbd:export bdev flush and trim ability Fix mkfs fail when using lvol as backend of nbd.Predefined NBD_FLAG_SEND_FLUSH and NBD_FLAG_SEND_TRIM are defined by default, so the operations of trim and flush are supported,but in fact lvol doesn't support trim and flush operations.Therefore add judgement for NBD_FLAG_SEND_FLUSH and NBD_FLAG_SEND_TRIM to check. Signed-off-by: Xinrui Mao <xinrui.mao@intel.com> Change-Id: I3d21034d12a038c8fc694d3383028103239ea6bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14099 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-09-16 13:32:13 +00:00
MengjinWu	48312019c8	nvme/tcp: Remove duplicate code in nvme_tcp_read_pdu Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: I63f51ecba2b4d40579d2592d2c85a7aefdacf7e7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14503 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-15 19:25:02 +00:00
MengjinWu	31fc5f196f	nvme/tcp: simplify state change function state change function do not need to use swtich to do some work. Do memset in state machine. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie66454d8f31860f403171f20858a6b4a24e3c76f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14502 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com>	2022-09-15 19:25:02 +00:00
Aleksey Marchuk	7a7f21b6fe	init: Avoid calling RPC methods twice Some methods are allowed to be run in both STARTUP and RUNTIME states and current implementation calls such methods twice. That can be a problem in some cases, so use the new spdk_rpc_get_method_state_mask function to skip such methods in RUNTIME state. Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I0a109805db428f60072a8c82161805dcde763da7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14407 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-15 08:25:18 +00:00
Aleksey Marchuk	515419ac66	rpc: Add API to get method state mask The new API will be used in the next patch to prevent calling metods for the seconds time when subsystem is initialized with config file Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com> Change-Id: I60ac8196e46ccb3b22b3af0607e1ba35a11a66a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14406 Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-15 08:25:18 +00:00
Damiano	6defafc913	bdev: Add functions to [hole,data] seek These functions start from a given offset and seek for next data or for next hole. For bdevs that do not support seeking, it is assumed that only data and no holes are present Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: I6bc831970223333b25683f60ce3fcbbfebb5bb81 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-09-15 08:23:56 +00:00
Damiano	d8a3dee1c1	blob: Add functions to find [un]allocated io_unit These functions start from a given offset and seek for first io_unit belonging to an allocated cluster or first io_unit belonging to an unallocated cluster Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: I0c632e2b3dfd2e96aa22e21796e25a36f2f55f9f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14360 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-09-15 08:23:56 +00:00
Damiano Cipriani	ddf5a8da90	blobstore: Add function to get io_unit per cluster This function returns the number of io_units per cluster Signed-off-by: Damiano Cipriani <damiano.cipriani@suse.com> Change-Id: I8f33d24a63876a0a918830b9eeaa69a91ff21193 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14431 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-09-15 08:23:56 +00:00
Boris Glimcher	35f7f0ce1e	nvme/tcp: Allow to choose SSL socket implementation Adding `psk` field to `spdk_nvme_ctrlr_opts` Adding `psk` parameter to `bdev_nvme_attach_controller` RPC Change-Id: Ie6f0d8b04ce472e6153934e985c026acded6cdfc Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14046 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-14 07:44:53 +00:00
Kefu Chai	39ecb61ade	event: pass "const struct option" to spdk_app_parse_args() before this change, we cannot pass a `const struct option` to spdk_app_parse_args() even the callee does not mutate the value pointed by the pointer. in other words, we are not able to write something like: static const option g_options[] = {...}; // ... spdk_app_parse_args(argc, argv, &opts, "", g_options, app_parse_arg, app_usage); after this change, the requirement of the type of the `option` argument is relaxed, so we can pass a `const struct option*` to this function now. Signed-off-by: Kefu Chai <tchaikov@gmail.com> Change-Id: I8794fcf92090f538743850a28ef4a2a8c357f121 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14082 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-13 10:48:58 +00:00
MengjinWu	12807c5bc6	lib/nvmf: Do one memset per new PDU recv While waiting for a new PDU, target will not do too many useless memcpy. Signed-off-by: MengjinWu <mengjin.wu@intel.com> Change-Id: Ie0825c2b1e44444b210040c4a1761010e0e4cfe5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14444 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-09-13 07:29:38 +00:00
Kozlowski Mateusz	630922e825	ftl: Add lazy unmap process Since only L2P pages as a whole are marked as invalid during trim, the specific L2P entries won't be updated until someone touches that page. The unmap process will slowly invalidate pages during runtime, by paging them in. This will allow compaction and relocation to benefit from the trim as the user data gets invalidated. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I239b9adf0aaaeac58f440145f4ab78b0d78d98b0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13381 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	b3e5d8a723	ftl: Add recovery and restart path for trim Restores necessary metadata and sets L2P during clean/dirty shutdown recovery process. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Iaa44025250b44f424ac9de5859d1db82900ecaa9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13380 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	2c7c8b6ceb	ftl: Add rpc functionality for unmap Trim is now also available as a management operation via RPC. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I05b778a611e9809a14bfed50b01986bb4649a35c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13379 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	66fe5f75bb	ftl: Unmap functionality Adds ability to send trim commands to FTL - only 4MiB aligned requests (both for offset and length of request) will be processed. During a trim operation an L2P page (containing 1024 4B entries, 1 per user LBA; which is where the 4MiB alignment comes from) will be marked as unmapped. After this point any L2P access to that page will actually set the entries themselves as FTL_ADDR_INVALID. This is done to make the trim as fast as possible, since for large requests it's probable that most of the L2P pages aren't actually in DRAM. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4a04ee9498a2a6939af31b06f2e45d2b7cccbf19 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13378 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Artur Paszkiewicz	78c3cbf4c9	ftl: metadata for unmap support Setup trim metadata layout. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I9395119cb8d5f7a5de4fde7b3f9506eb06452d7b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13377 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	c7c9211ee0	Ftl: Open chunk recovery At the end of the recovery step, all chunks will be transferred to closed state. Missing write pointer data filled with LBA_INVALID Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Id496e465e46fa24b04b30f2558bdacfdd668e8a4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13375 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Mellanox Build Bot	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	5c5587d805	FTL: L2P chunk recovery Recover L2P from chunks' P2L. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I039cfc54374fad0ba584d6029b752ca2f31925cf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13374 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	d1462266ce	FTL: Recover chunk state Recovers the free/open/close chunk state, initializing them to any specific lists. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Idf689f4fbcd6fc6bd986104dc89f5079c758845a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13373 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 19:44:29 +00:00
Kozlowski Mateusz	ca53f5a6df	FTL: Band L2P recovery Recovers L2P based on all non-free bands' P2L. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ice9e77b00161b031c795570baf3ed8c92dfecef0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13372 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 19:44:29 +00:00
Changpeng Liu	40f556ca38	vhost: don't kick VM when there are outstanding vhost-user messages For all the vhost-user messages processed in SPDK except VHOST_USER_GET_VRING_BASE, DPDK rte_vhost "vhost-events" thread already holds all VQ's access lock, before return response to "vhost-events" thread, SPDK should not call `rte_vhost_vring_call`, here we set a flag to TRUE for these vhost-user messages, and avoid to kick VM. The deferred IRQs will be posted in next round poll or after restarting the device. Fix issue #2518. Change-Id: I82f14b97d0b0ce602a93fd66d5fdeef64f07d179 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14402 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-09 15:31:06 +00:00
Changpeng Liu	097691fc18	vhost: do `rte_vhost_vring_call` from spdk context Currently we will call `rte_vhost_vring_call` in the DPDK "vhost-events" thread context when starting the device, and DPDK vhost library already holds all VQ's access lock when starting device, with new DPDK/dpdk@c573699 commit, it will cause deadlock to call `rte_vhost_vring_call` in "vhost-events" context, so here we increase 1 to `used_req_cnt` to make sure one more `rte_vhost_vring_call` will be executed later in SPDK thread context. Signed-off-by: Jim Harris <james.r.harris@intel.com> Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Change-Id: Iab53941942335744bf25ab6e9b8747bd08b0c698 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14328 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-09 15:31:06 +00:00
Changpeng Liu	9b74b4a3de	lib/vhost: don't clear interrupt counter for error case `rte_vhost_vring_call` may return error, then we can try to call it in next poll. Change-Id: I8f6a591837225079e004c6f57f2d7b01063f87a1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14342 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 15:31:06 +00:00
Jim Harris	75cc6fd62f	vhost: move the session_start_done calls to common layer Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I355790f87ef148af85d5c13002260f1120749ae5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14340 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 15:31:06 +00:00
Jim Harris	f869197b76	virtio: assert and ERRLOG for virtio-user dynamic mem allocations We do not support dynamic memory allocation with the virtio-user library - it results in SET_MEM_TABLE vhost messages for every change which is not supported by the vhost target. Add '-s 256' to vhost fuzz script, to ensure it does not violate the new restriction. This is a follow-on patch for issue #2596. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: If851f53d7d670ac8443f0d9c8f4e3cbe82e0df7c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14249 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-09 13:06:15 +00:00
Michael Piszczek	9ffb0497c1	iommu: Read AMD iommu address width Add code needed to read the virtual address width for AMD processors Fixes issue 2686 Signed-off-by: Michael Piszczek <mpiszczek@ddn.com> Change-Id: I44f988e60d7bbfb1cb137b3cbc4ac44dbb693d35 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14416 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 13:06:05 +00:00
Michal Berger	59c10a2fa2	lib/ftl: Fix -Wunused-function under clang utils/ftl_mempool.c:131:1: error: unused function 'ftl_mempool_is_initialized' [-Werror,-Wunused-function] ftl_mempool_is_initialized(struct ftl_mempool *mpool) ^ 1 error generated. Signed-off-by: Michal Berger <michal.berger@intel.com> Change-Id: I81076fb9c931fe63c79241f80584502a1ce56be9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Dong Yi <yidong0635@126.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-09 13:02:07 +00:00
Kefu Chai	5a6f3a6f91	event: accept negative --shm-id as a valid option Before this change, a negative `--shm-id` value is rejected by `spdk_app_parse_args()` and this function simply errors out after detecting it. However, `build_eal_cmdline()` has a dedicated branch checking for a negative `opts->shm_id` and passes `--no-shconf` down to DPDK as a parameter, so we cannot disable the shared config support in DPDK. After this change, a negative value `--shm-id` is accepted, but if it cannot be parsed as an integer, `spdk_app_parse_args()` errors out as before. In result we can disable shared config support in DPDK by passing `--shm-id=-1` to SPDK application. Signed-off-by: Kefu Chai <tchaikov@gmail.com> Change-Id: Ibe089f13638eefa9ac28c5c99e303bcc3102f307 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-09 12:57:01 +00:00
Shuhei Matsumoto	cad6f55e33	bdev: Add spdk_bdev_get_current_qd to measure and return current value The generic bdev layer has a public API spdk_bdev_get_qd() but its value is the most recently measured value and it requires qd sampling to be enabled. We will have bdev modules to want to wait until all bdev_ios are aborted by a reset. Unfortunately, spdk_bdev_get_qd() is not suitable for the custom bdev module. Furthermore, spdk_bdev_channel::io_outstanding is not accessible from bdev modules. Hence, add a new public API spdk_bdev_get_current_qd(). This function should be used only from the bdev module and it should be ensured that the bdev is not unregistered during execution. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ica30a8d8fe3264e28f0772a39bdf5f9ba72933e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12791 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-09 12:55:39 +00:00
Shuhei Matsumoto	1212b53fb8	bdev: Add spdk_bdev_for_each_bdev_io() to execute function for each bdev_io Some use cases want to abort every bdev_io submitted to the bdev by traversing the bdev channels. However, struct spdk_bdev_channel is private in lib/bdev/bdev.c. Hence, add a helper function spdk_bdev_for_each_bdev_io() to execute the function on the appropriate thread for every bdev_io submitted to the bdev. This function should be used only from the bdev module and it should be ensured that the bdev is not unregistered during execution. We keep this function as generic as possible because we may have other use cases in future. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ic0209361bd1228ea8d4cb3241d0df07106be58d9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12751 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-09 12:55:39 +00:00
GangCao	3851a64f9f	Lib/Bdev: add the new utility function For the iostat change, add a new utility function: rpc_bdev_get_iostat_dump() Change-Id: I5883fc3eb8c73a0dc2bf41c7889100e0e492359a Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14418 Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-08 07:23:07 +00:00
yidong0635	9e81535efe	reactor: Encapsulate a function _event_call. Former code, there're many repeated defines. And some add asserts checking valid event and some don't add. To get the right reports from debugging mode and catch the errors, so encapsulate a common function to do these. And add assert in this function. This will help get the right failure point. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I23d71eac6652c4104ceff80419f39634ac5ce395 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14335 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-08 07:17:34 +00:00
John Levon	654738ff45	lib/nvmf: small cleanup in vfio_user_qpair_delete_cb() We already define a convenient variable for the admin CQ: use it. Suggested-by: Alexis Lescouet <alexis.lescouet@nutanix.com> Signed-off-by: John Levon <john.levon@nutanix.com> Change-Id: If6570f30844a52113633bdb5f3543eec700f05d7 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14391 Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-07 07:04:44 +00:00
Kozlowski Mateusz	bcdedd1a2b	FTL: Add recovery iterations In order to fit inside the maximum memory usage limit, recovery needs to be split into multiple parts. During each iteration, part of L2P needs to be read, modified as necessary and saved back to the cache. This patch introduces the load/save steps, initialization of seq_id array and valid map recovery. The actual L2P recovery is done in the followup patch. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I8ceadc5ef280542a173d83b932a983d5d86604a1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13371 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Kozlowski Mateusz	8786f3b465	FTL: Open band recovery Adds recovery of open bands from P2L metadata region. Recovers the commited P2Ls and write pointers for them. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I943c53f55e653dd075035cef7ddba448c990be87 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13370 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Kozlowski Mateusz	0e0f3d9af2	FTL: Shared memory recovery Adds valid map and L2P restroration for shared memory (crash) recovery. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ia4e0cc6cd552ea61dca8985a26aa55c84a1233db Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13369 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Kozlowski Mateusz	764a3675a9	Ftl: Add band state recovery after dirty shutdown Recovers the open/close/free state of bands after shutdown, initializing necessary lists. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4a6bd4ed1013ce8d04f44d1772dcd1f0e4e365bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13368 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Artur Paszkiewicz	1738488e41	ftl: p2l checkpointing Since base device doesn't require VSS, FTL introduces a mechanism that will allow for recovering both the P2L and write pointer of open bands after a dirty shutdown. After writing 1MiB of data to a band, a 4KiB block describing the P2L will be persisted to cache device, effectively emulating VSS for the base device. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ic6be52dc09b237297a5cda3e752d6c038e98b70e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13367 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-07 00:08:34 +00:00
Artur Paszkiewicz	36049672a3	ftl: sequence id tracking Track the relative sequence of opening and closing bands and chunks. Necessary for detecting the most recent user data during dirty shutdown recovery. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I682030e58284d7b090667e4e5a9f4bbc7615708a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13366 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-07 00:08:34 +00:00
GangCao	b50af42b62	lib/virtio: return error if CMSG_FIRSTHDR returns NULL Fix issue: potential NULL pointer dereference Change-Id: I623096c49e7a75e66404666a2f502ba3209e3530 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14330 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-09-06 07:17:26 +00:00
Blachut, Bartosz	503835ee63	util: made hexlify and unhexlify functions public hexlify and unhexlify utils from vbdev_crypto.h have been moved so that they could be included and reused outside of vbdev_crypto module. Signed-off-by: Blachut, Bartosz <bartosz.blachut@intel.com> Change-Id: Ia074250176907f4803b84024239ecd4e9d8a5fc1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-06 07:17:13 +00:00
Ben Walker	34c48f1b3b	accel: Do not refer to the "framework" as "engine" The word engine was both used (interchangeably with module) to refer to the things that plug into the framework and to the framework itself. This patch eliminates all use of the word engine that meant the framework. It leaves uses of the word that meant "module". Change-Id: I6b9b50e2f045ac39f2a74d0152ee8d6269be4bd1 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13918 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-06 07:16:17 +00:00
Ben Walker	dd7140e627	accel: Rename spdk_accel_engine_module_finish to spdk_accel_module_finish Also move it into the internal header that defines the interface used by modules. Change-Id: I3aeb41e643f27a69556099cb8d166f64c9e5d67f Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13917 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-09-06 07:16:17 +00:00
GangCao	0b9ba6a330	lib/vmd: return -1 if NVMe driver is not found Fix issue: potential NULL pointer dereference Change-Id: I23f90616661fdebaacb041bc9f47284231601136 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14329 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Mellanox Build Bot	2022-09-05 12:50:06 +00:00
Shuhei Matsumoto	cdf61c2f22	nvme: Polls only the qpair if ctrlr is not fabrics when connecting synchronously For non-fabric controllers, the corresponding I/O qpairs are simply re-enabled at controller reset. This had a issue when I/O qpairs span multiple threads and poll group is used. spdk_nvme_ctrlr_reconnect_poll_async() calls nvme_transport_ctrlr_connect_qpair() with qpair->async being false. Then nvme_transport_ctrlr_connect_qpair() calls spdk_nvme_poll_group_process_completions() until the qpair is connected. spdk_nvme_poll_group_process_completions() may poll other qpairs. This may cause I/O to complete on a wrong thread. For PCIe controller, spdk_nvme_poll_group_process_completions() calls spdk_nvme_qpair_process_completions() simply for each qpair. Hence change nvme_transport_ctrlr_connect_qpair() to call spdk_nvme_qpair_process_completions() if the controller is non-fabrics. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ieb270c2fb154124021ef6d25577b817d05e5ca9e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14295 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-05 12:50:00 +00:00
Evgeniy Kochetov	2e7a7fe530	blob: Optimize copy-on-write flow for clusters backed by zeroes device Writing to unallocated cluster triggers copy-on-write sequence. If this cluster is backed by zeroes device we can skip the copy part. For a simple thin provisioned volume copy this shortcut is already implemented because `blob->parent_id == SPDK_BLOBID_INVALID`. But this will not work for thin provisioned volumes created from snapshot. In this case we need to traverse the whole stack of underlying `spdk_bs_dev` devices for specific cluster to check if it is zeroes backed. This patch adds `is_zeroes` operation to `spdk_bs_dev`. For zeroes device it always returns 'true', for real bdev (`blob_bs_dev`) always returns false, for another layer of `blob_bs_dev` does lba conversion and forwards to backing device. In blobstore's cluster copy flow we check if cluster is backed by zeroes device and skip copy part if it is. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I640773ac78f8f466b96e96a34c3a6c3c91f87dab Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13446 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-05 12:49:46 +00:00
Konrad Sztyber	ab58ddf107	sock: make impl_name const char * in all functions There's no reason for this parameter to be non-const and it makes this functions pain to use when you want to hardcode a specific sock implementation. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifed4426a02ab54cbd51c8a2051b1eac010f86db9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14303 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-05 12:49:28 +00:00
Shuhei Matsumoto	b3e1db32a3	nvmf/rdma: Ignore async_event if its qp_context is NULL If initiator and target run on the same application, and initiator uses SRQ, target may get async events for initiator, e.g., IBV_EVENT_QP_LAST_WQE_REACHED unexpectedly. The reason is initiator and target may use the same device simultaneously and only target polls async events. Target sets attr.qp_context to rqpair when creating QP, but initiator sets attr.qp_context to NULL when creating QP. Hence one simple fix is to ignore async events whose qp_context is NULL. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Id9ead1934f0b2ad1e18b174d2df2f1bf9853f7e1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14297 Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	0e4b13dc53	nvme_rdma: Destroy qpair after it is disconnected and drained By the previous patches, a qpair is destroyed after it is actually disconnected. But after the qpair is destroyed, it is checked if drained by using rqpair->current_num_sends and rqpair->current_num_recvs. However, if the qpair is the last of a poller of a poll group, CQ is destroyed before checking if the qpair is drained. If CQ is destroyed, at least rqpair->current_num_recvs is not updated, and we may get one second timeout. This should be avoided. Hence, destroy the qpair after it is disconnected and drained. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Ibd6c83e8a3e7b6e11e9b45cee42669da6d42a621 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14278 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	1d58eb038b	nvme_rdma: Release poller from poll group when qpair is actually disconnected If the being disconnected qpair is the last of a poller of a poll group, CQ is destroyed and the poller is released before the qpair is actually disconnected. This patch destroy CQ and release the poller after the qpair is actually disconnected. One exception is when spdk_nvme_ctrlr_free_io_qpair() is called to a connected qpair. In this case, the qpair is removed from a poll group before the qpair is actually disconnected. In this case, destroy CQ and release the poller when the qpair is removed from the poll group. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: Idf266bbb6dbb40f04ae6313db724fabf80865763 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14253 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Shuhei Matsumoto	80d75fda06	nvme_rdma: Clean up releasing poller from poll group We have two cases to call nvme_rdma_poll_group_put_poller(). For consistency, make the two cases the same sequence. This will make the next patch easier. The next patch will release poller from poll group when qpair is actually disconnected as possible as we can. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4178113d5277240e287e83a57e97cf32fd0f7457 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14252 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-05 12:49:11 +00:00
Kozlowski Mateusz	86619848ec	Ftl: Add clean restore management path Adds ability for FTL to startup after clean shutdown. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I2f1b83bb3eb1487b6665c95e76c48881e8899b16 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13364 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	d4b9f2c68b	FTL: Add metadata self test Adds additional debugging functionality - ability to check the validity of all L2P entries and valid map to check for inconsistencies after FTL startup. Since this is a very time consuming process, it's controlled by an environment variable and not executed during normal operations. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4766a1576c058f69fa047f45d2d8be6d0ad0b3cd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13363 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	cbd7ae6df7	FTL: Add metadata restore functionality Adds necessary functions for setting up the state of FTL components based on loaded in metadata. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I3a4c05230c877850e61d4f31d495d38121d27b3f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13362 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	55147295d7	FTL: Add L2P restore path Adds initialization code for L2P done after shutdown (both clean and dirty). Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I7a938b298467c96d68f40cb14c3171d1533e1a08 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13361 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	b5e2c59ad6	FTL: Add fast shutdown path Adds the ability to persist only the most important metadata. The rest is stored in shared memory. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I4084c04ba09115a7a08ff66fd33552a2ec60d801 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13360 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	ef93cc38ee	FTL: Persist metadata on clean shutdown Add an extra step during FTL shutdown to save all metadata. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Idc2f77e15bbd02028548cc88355cd450175830e8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13359 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	b4b70e8303	FTL: Make L2P caching default mode Flat L2P (all L2P in memory) needs to be specifically built against, due to large memory consumption for big devices. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ib8906e10868455f88725b69b2b033b70a9f7256c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13358 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	94b7f8d82d	FTL: Add L2P cache eviction logic Adds eviction of least recently used pages from the L2P cache - dirty pages will be persisted. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ic646f7e9da777d077b5cb9b409c3f03ef05b1273 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13357 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	73f9b4f5fe	FTL: L2P cache page in logic Adds paging in from the cache device to memory. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I250009d12e9ed5ad52ee861ec5157cf983cf8cfc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13356 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	905fbf946c	ftl: Add L2P cache pin/unpin logic There is a set amount of pinned pages available. If exceeded they will be deferred and processed in the future, using eviction logic. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ic642a5870db009ccf57152dd8a4178a6b2098ee1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13355 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	db65602a39	FTL: Add l2p cache get/set logic This commit also introduces ranking pages, based on usage for determining the least used page to be evicted. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Iaf3812177b61376bb38aa209e4ba8576d784ffb5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13354 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-02 17:40:09 +00:00
Kozlowski Mateusz	e7e5bc07b2	FTL: Add initial L2P cache logic L2P cache allows for partial storing of L2P in memory, paging in and out as necessary, lowering the total memory consumption. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I727fec9d2f0ade4ca73e872d62a2ec10cfdb0a88 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13353 Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-02 17:40:09 +00:00
Jim Harris	01cec2499f	vhost: add start_session vhost_blk_start and vhost_scsi_start are now just a single vhost_user_session_send_event() call, so make this more generic by adding a top-level start_session function. Now this function will do the vhost_user_session_send_event(), using the user_dev_backend's start_session function pointer. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia89ba15011e231f0474405fb7225e713dcc920bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14327 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-02 07:32:54 +00:00
Jim Harris	f8df19a49f	vhost: assign svdev from spdk thread context Currently scsi sets it's svdev from the vhost thread context, while blk does it from the spdk thread context. Make scsi match what blk does, to make the code more consistent. This also will allow for an upcoming simplification. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I609513bc8e05b49dd9455f2f61ba0cedc35236e6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14326 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-02 07:32:54 +00:00
tongkunkun	bb432b4eea	json: fix parsing json problems when json config is invalid. Add parsing json as invalid cases: 1.json content that not enclosed in {}, it should be parsed as invalid, e.g. "abc":"not encloesed in {}" 2.json content that 'subsystems' not associate with array, it will report error and return failure, e.g. {"subsystems":"123"} 3.handle other invalid json formats, report and return failure, e.g. duplicate keys. Added `spdk_json_find` API return errcode: EPROTOTYPE - json not enclosed in {}. json config with content: 1."not enclosed in {}" 2."'subsystems' not be an array" 3."duplicate key in json" and some other invaild cases will be regarded as invalid json config, and will fail to start app. Fixes #2599 Signed-off-by: tongkunkun <tongkunkun_yewu@cmss.chinamobile.com> Change-Id: I02574c9acd7671e336d4c589ebbff8ed21eb3681 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13754 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-02 07:32:21 +00:00
Konrad Sztyber	4cbd23e28b	vmd: method for forcing a rescan Added a new RPC, vmd_rescan, that forces the VMD driver to do a rescan of all devices behind the VMD. A device that was previously removed via spdk_vmd_remove_device() will be found again during vmd_rescan. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ide87eb44c1d6d524234820dc07c78ba5b8bcd3ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13958 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	052ea0baac	vmd: method for removing devices behind VMD Added new RPC, vmd_remove_device, that allows users to remove a PCI device managed by the VMD library simulating a hot-remove. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifb84818ce8d147d1d586b52590527e85fe9c10de Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13957 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	9a9aed4e7b	env/pci: use TAILQ_FOREACH_SAFE in pci_foreach_device() It'll make it possible to remove a PCI device from within the callback. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I4cea2207a29bb145aee968715e873076a8c0993c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13956 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	4c482a623b	vmd: don't create new buses in hotplug This doesn't work anyway and can cause creating duplicate bus objects if vmd_scan_single_bus() is called on a parent bus with previously allocated child buses. Also, while here, removed a few unused functions and flags in struct vmd_adapter. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ic757070188157d9851f648acd074ca4943a14c39 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13955 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	ee1ab6f6be	vmd: increment dev_cnt once device is initialized This is done in order to avoid having to decrement this counter in case of a failure. Also, it makes the result valid for the few error cases when we didn't decrement it. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia944fb8b810ce69caa8db5bc7c941e0905c9d3bd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13954 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	55bdd88506	env/pci: add detach() callback to pci_device_provider This makes it possible to notify other PCI device providers (VMD) that a PCI device is no longer used. The VMD will driver will unhook that device and free any resources tied to it. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I42752afbb371a1d33972dac50fd679f68d05b597 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13887 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	690eebb447	vmd: extract removing devices to separate function Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Idc9c7d0e5d0ebce8278e089bcfe5b7f76b86c270 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13953 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	ffa9953a14	vmd: add attach_device() This patch implements the callback for attaching devices behind the VMD with a given PCI address. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I07cf92c94cc7e6d3c8e31af7a8615e9a4ca641bf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13886 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	3b2097f313	vmd: use vmd_container.count when iterating over domains It makes it possible to call this function even if the VMD library wasn't initialized. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I3d0f4677c4a1189f9d8acf07baee50a4e2050459 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14260 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	4b08c07a62	env/pci: call driver callback in pci_hook_device Now that we have a attach_device() callback, the devices can be hooked during spdk_pci_device_attach(). With DPDK, driver->cb_fn() is called in pci_device_init(), so we need to do the same in spdk_pci_hook_device(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Iada8b83ce7592aa62561530192072a50ec3a904b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13884 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	ac8b65bdd2	vmd: extract freeing device resources to vmd_dev_free This allows to free resources tied to a vmd_pci_device that isn't on the dev_list or wasn't hooked to the PCI driver. Also, use that function whenever a vmd_pci_device is freed instead of regular free(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ifca177a7eb6d8180d6f2ee2a9d9e36d58810e8ad Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14259 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	3f4e968dab	vmd: add device to dev_list after initialization is complete That way, we don't have to do TAILQ_REMOVE if vmd_assign_base_addrs() fails. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id7a5df2093e4f9dfc95ee1fe415eb644c61bc971 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14258 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	35f8bd2a13	vmd: move pci_hook_device to vmd_dev_init_end_device Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I79c35600fc9a758bbd9d58393b7eb98c8ac82acc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14257 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	2dfd36772f	vmd: extract end device initialization It'll make it easier to reuse this part of the code. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Id26f3f00abeeea6205df4f44689ffab1d367d777 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13885 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-09-01 08:48:32 +00:00
Konrad Sztyber	b20f3678dd	env/pci: method for registering PCI device providers The primary motivation for this patch is to allow the VMD driver to be notified of when users wants to attach a device under a given BDF and to make it more similar to the regular PCI path. Currently, the way the VMD driver scans for the devices is a little bit different. The initial scan is done during initialization and there's a separate poller for checking hotplugs. Also, there's no device_attach() interface, so with hotplug poller disabled, it isn't possible to attach to a device not present in the initial scan, even if the BDF is known. This causes a few issues. First of all, the VMD library isn't notified when a device is stopped being used (i.e. user calls spdk_pci_device_detach()), so when such a device is hotremoved, it never gets unhooked. But we cannot simply add a spdk_pci_device.detach() callback, as this would break cases when user detaches a device (without hotremove) and then tries to reattach it again (via spdk_pci_device_attach()), as the VMD doesn't get notified about the device_attach() call. So, in order to resolve this, a device_attach() callback is added, which will notify the VMD library that the user wants to attach a device under a specific PCI address. Then, in subsequent patches, a spdk_pci_device_provider.detach_cb() callback is added to make sure that devices are unhooked once they're no longer used. Once that is done, it'll be also possible to get rid of the VMD hotplug poller by adding something like scan_cb() to spdk_pci_device_provider and call it from spdk_pci_enumerate(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I084a27dcd12455f0f841440b7692375e80d07e84 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13883 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tom Nabarro <tom.nabarro@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-09-01 08:48:32 +00:00
Jim Harris	b90d7b5b43	nvme: add admin queue size quirk for Hyper-V Hyper-V NVMe SSD controllers require admin queue size to be even multiples of a page. Add quirk to adjust the admin queue size if user overrides the default value to something other than an even multiple. As part of this change, set the quirks earlier when constructing a pcie controller, so that the quirks value can be used in the generic nvme_ctrlr_construct() function. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I417cd3cdc7e3ba512ec412f4876b0e0b7432341c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14220 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-09-01 08:31:46 +00:00
yidong0635	0447dca450	include: Remove the last line break. The last line doesn't need the line break, otherwise it will wrongly include the next line. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I06257b18d25c060b7c6bb00853fa44963fe5b439 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14241 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-09-01 08:30:24 +00:00
yidong0635	b813f998ea	nvme_pcie_common: Move group right before using. Better not to cache a value especially for there's an error return. Signed-off-by: yidong0635 <dongx.yi@intel.com> Change-Id: I3b243a66f4db9af34bc2ea01bafdac33004be128 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13650 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-09-01 08:26:34 +00:00
Jim Harris	3d59045a2a	nvme: remove incorrect comment about spdk_nvme_ctrlr structs This was correct back when we only supported PCIe, but doesn't in the newfangled world of fabrics and vfio-user. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I565edd2dab1eff862844585df8c25da508e4816d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14136 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-30 16:20:23 +00:00
Artur Paszkiewicz	8fad5718e1	ftl: validate band metadata in debug mode Adds a debug function, that scans the whole P2L of band, when it's getting closed. The P2L is compared against both L2P and valid map to check for any discrepancies. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ia4d7be65415e6af3752d676de69b6fdcb73effb4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13352 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	57cfab6808	ftl: use valid map to optimize compaction and reloc Utilize the valid map when picking physical blocks to compact/relocate, speeding up the process. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I860e3cf25a5907591e4f3043def67156fec8b0df Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13351 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	cea8dadecf	ftl: valid map Adds P2L validity map tracking - a bitmap marking all physical LBAs as containing valid (current) user data or not. A clear bit denotes the location has no valid data and may be skipped during relocation or compaction. A set bit means it may have valid data (it's still necessary to do the necessary comparision against L2P). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I6a831a97b3080eb7c880d9c4feab41b523467885 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13350 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	1e904e2b75	ftl: fast startup Adding API for the bringup part of fast shutdown/startup. Adds shared memory utilization for necessary functions during initialization. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Iab2da102fd0ccaa56fbdb9b3c765be5eeefff145 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13349 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	0e33da4974	ftl: fast shutdown Adds API for fast shutdown - the ability for FTL to skip most of the metadata persists made during clean shutdown, and relying on their representation in shared memory instead. This allows for faster update of SPDK (or just FTL, assuming no metadata changes), with downtime reduction from 2-5 seconds to 500-1000 ms (for 14TiB+800GiB base and cache drives). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5999d31698a81512db8d5893eabee7b505c80d06 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13348 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	811a027e43	ftl: Add helper functions for creating md regions Helper functions which determine which md regions will be stored in shm. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I94cbfca66dfb56457a350874dbd1de63a2e07661 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14159 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	101a039923	ftl: p2l map on shm Stores P2L map of open bands in shared memory, allowing for faster recovery times from application crash. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I519441af05e4d0f57768835bf01c800556873c58 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13347 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	71a1762821	ftl: mempool support for durable format objects Allows for using shared memory in memory pools. Adds API for accessing such pools after dirty shutdown (claiming them, ie. marking an entry as actively used; calling the ftl_mempool_initialize_ext will reclaim all unused entries back to the pool). Also introduces API for accessing objects, since using direct pointers is not possible (as addresses may change inbetween application startups). Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I5325b39d68aef7e231945cee9d92c925cab2fb2a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13346 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Artur Paszkiewicz	f1b079b49f	ftl: bitmap on external memory Main use case is to allow for keeping it in shared memory, to speed up the recovery time after application crash. Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I36b6b8331cd6483c5bd202e5f9103c351d705da8 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13345 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Kozlowski Mateusz	43a4d47a1c	FTL: Add relocation logic Relocation will 1. Read LBA map of a given band 2. Pin the LBAs 3. Issue writes of valid LBAsto the new location Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ie753a790e56a86bfa1e451b5eda78b88eeacd3cb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13344 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-30 14:48:50 +00:00
Jim Harris	ffa823557a	blob: add assert that cluster_sz > 0 Avoids divide-by-zero scanbuild warning on Fedora36. Fixes issue #2667. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ib2793c793725e8bb8ba25fb779ffc14334929da0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14238 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>	2022-08-29 11:41:50 +00:00
Konrad Sztyber	475b86aa8d	print better errors when creating mempools from secondary process Multiprocess is only supported by a few libraries (e.g. NVMe driver). Other libraries that don't support it will often fail on mempool initialization when running as a secondary process, as the mempools are already created by the primary process. But the error messages are vague and don't indicate why this happened. So, this patch adds a check to see if a mempool exists after spdk_mempool_create() fails and prints an error message informing users that multiprocess is unsupported. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6f915a94266e64dda380e3b269424cc579372a10 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14234 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-29 11:41:32 +00:00
Shuhei Matsumoto	4a6f858872	nvme_rdma: Set REUSEADDR to reuse source address among multiple CM IDs When we specify source address for admin and I/O qpairs, rdma_resolve_addr() succeeded only for admin qpair and failed for following all I/O qpairs because rdma_resolve_addr() returned -EADDRINUSE. To reuse source address among multiple qpairs, set the REUSEADDR option for each CM ID before executing rdma_resolve_addr() if source address is specified. We may miss something. Even if rdma_set_option() fails, execute rdma_resolve_addr(). Fixes issue #2604 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: If03f82d4499cf83c0e428a62e91c9d9e6aad28e0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14229 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: GangCao <gang.cao@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-08-29 11:41:17 +00:00
Jonas Pfefferle	29977e8506	bdev: add additional io types in dump bdev info Add indication of support for compare, compare & write and abort in json bdev info dump. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: Ifc8dc1a1b180f08fcd9e9d58684eab1fd50356ff Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14137 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-29 10:51:31 +00:00
Jim Harris	4300c62167	nvme: add spdk_nvme_ctrlr_disable_read_changed_ns_list_log_page() Commit `a119799b` ("test/nvme/aer: remove duplicated changed NS list log") changed the nvme driver to read the CHANGED_NS_LIST log page before calling the application's AER callback (previously it would read it after). Commit `b801af090` ("nvme: add disable_read_changed_ns_list_log_page") added a new ctrlr_opts member to allow the application to tell the driver to not read this log page, and will read the log page itself instead to clear the AEN. But we cannot add this option to the 22.01 LTS branch since it breaks the ABI. So adding this API here, which can then be backported manually to the 22.01 branch for LTS users that require it. Restoring the old behavior is not correct for applications that want to consume the CHANGED_NS_LIST log page contents itself to know which namespaces have changed. Even if the driver reads the log page after the application, that read could happen during a small window between when a namespace change event has occurred and the AEN has been sent to the host. The only safe way for the application to consume ChANGED_NS_LIST log page contents itself is to make sure the driver never issues such a log page request itself. Fixes issue #2647. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Iaeffe23dc7817c0c94441a36ed4d6f64a1f15a4e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14134 Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-25 07:31:44 +00:00
liuqinfei	cd1b7ab0e7	nvmf: balance the get optimal poll group Fixes #issue 2636. The existing allocation method (nvmf_rdma_get_optimal_poll_group()) is traversal and unperceived link disconnection. A more fair method considering the number of real-time connections to allocate a poll group is implemented. Signed-off-by: liuqinfei <18138800392@163.com> Signed-off-by: luo rixin <luorixin@huawei.com> Change-Id: Ic1e6283e386dbb0dd6655bedebe26aeedb16c333 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14002 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-23 07:46:03 +00:00
Jonas Pfefferle	9e50d53b1a	bdev: add compare fall-back separate md support If the bdev does not natively support compare we use the fall-back which performs a read instead of a compare operation. We then compare the results of the read with the buffer provided by the user. In case the bdev has metadata, there are two options: 1) md is interleaved -> the md will be part of the data buffer allocated for the read and compared accordingly 2) md is separate -> currently we do not compare the metadata but just ignore it. This patch fixes 2) by comparing the md buffer after the read is done. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: I1018b8c02540bffcba69408eb283bdc8f06bb747 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14132 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-23 07:18:56 +00:00
Jonas Pfefferle	7ba89d1e48	bdev: set ext_opts=NULL if not used bdev_io is allocated from a memory pool and is not zeroed on reuse. So set bdev_io->u.bdev.ext_opts = NULL for io ops where it is not supported (yet) so we can test against it. Signed-off-by: Jonas Pfefferle <pepperjo@japf.ch> Change-Id: Ia579ea6b0787cf62572ea3a6bf2251867602e952 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14056 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Community-CI: Mellanox Build Bot	2022-08-23 07:18:56 +00:00
Kozlowski Mateusz	711759a029	FTL: Add reloc helper functions Adds functions for reading end metadata and initializing band reloc state. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I3d12c4a7edd36f0437bf10316114c83efe449f0f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13343 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-22 20:21:15 +00:00
Artur Paszkiewicz	f45c007512	ftl: superblock in shared memory Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I86e2cbf364ae3075aad2e09429754027df33eadf Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13342 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-22 20:21:15 +00:00
Artur Paszkiewicz	818b9c053b	ftl: support for metadata on shared memory Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: Ibc259f61f0ef2aeadb0e5ac7230969e29d77f184 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13340 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-08-22 20:21:15 +00:00
Kozlowski Mateusz	19613862ae	FTL: Add free chunk logic After chunk is compacted it can be moved to the free state, able to be used for new user IO again. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I7f9c341169b171ee246c5aa161d74903b91bdc2f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13338 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-19 17:37:14 +00:00
Kozlowski Mateusz	71f20c9a74	FTL: Add compaction logic During compaction FTL moves valid user data from the nv cache drive to the bottom device. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ia200af39cec80014fac3a10f20d2859b10a81088 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13337 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-08-19 17:37:14 +00:00
Artur Paszkiewicz	1dadcd8786	ftl: ftl_rq helpers for compaction Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Change-Id: I614b29e7bc7f6db20b10395bc780ff633c497b59 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13336 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Kozlowski Mateusz	31cf633679	FTL: Add writer logic Add writer - tracks and manages band state transitions and write pointer as IO is issued to it. Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: I5f878dc15bc1c1ac84835f75fe440672fad541d5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13335 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00
Artur Paszkiewicz	0291b2845a	FTL: Add read path Signed-off-by: Kozlowski Mateusz <mateusz.kozlowski@intel.com> Signed-off-by: Artur Paszkiewicz <artur.paszkiewicz@intel.com> Change-Id: Ib5bac109b59d5a21a7dad1f8e79b5da7633ffa9d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13334 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-08-19 17:37:14 +00:00

... 5 6 7 8 9 ...

10193 Commits