Commit Graph

2051 Commits

Author SHA1 Message Date
Thanos Makatos
25440c3bdb nvmf/vfio-user: don't blindly drain poll group eventfd
This eventfd may be passed by libvfio-user to the remote process which
might remove the EFD_NONBLOCK flag, in which case we would block
indefinitely.
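As a rough illustration (not the actual SPDK change), an eventfd can
be drained safely by polling with a zero timeout before each read, so
the read never blocks even if the peer cleared EFD_NONBLOCK; the
helper name below is made up:

    #include <poll.h>
    #include <stdint.h>
    #include <unistd.h>

    /* Drain 'fd' without blocking, even if the remote process removed
     * EFD_NONBLOCK from the shared file description. */
    static void
    drain_eventfd_nonblocking(int fd)
    {
        struct pollfd pfd = { .fd = fd, .events = POLLIN };
        uint64_t val;

        /* Zero timeout: read only while the counter is non-zero. */
        while (poll(&pfd, 1, 0) > 0 && (pfd.revents & POLLIN)) {
            if (read(fd, &val, sizeof(val)) != sizeof(val)) {
                break;
            }
        }
    }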

Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com>
Change-Id: If9826cd700b4a7b3458a0a8278a96322d99ac08e
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15385
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: John Levon <levon@movementarian.org>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
2022-11-22 10:01:43 +00:00
paul luse
a6dbe3721e update Intel copyright notices
per Intel policy, to include the file commit date using the git
command below.  The policy does not apply to non-Intel (C) notices.

git log --follow -C90% --format=%ad --date default <file> | tail -1

and then pull just the 4-digit year from the result.

Intel copyrights were not added to files where Intel either had
no contribution or the contribution lacked substance (i.e. license
header updates, formatting changes, etc).  The contribution date used
"--follow -C95%" to get the most accurate date.

Note that several files in this patch didn't end the license/(c)
block with a blank comment line, so this line was added, as the vast
majority of files do have this last blank line.  It is simply there
for consistency.

Signed-off-by: paul luse <paul.e.luse@intel.com>
Change-Id: Id5b7ce4f658fe87132f14139ead58d6e285c04d4
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/15192
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Community-CI: Mellanox Build Bot
2022-11-10 08:28:53 +00:00
Thanos Makatos
b8fc75c36e nvmf/vfio-user: ensure BAR5 isn't 0
Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com>
Change-Id: I60a39c8a311879b7d6c7c82df0abd7a69f9a2778
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14933
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
2022-11-04 10:10:33 +00:00
Thanos Makatos
bad452d25e nvmf/vfio-user: calculate doorbells based on number of queue pairs
It doesn't make sense to have the size of the doorbells fixed and then
calculate the maximum number of queue pairs based on it; do it the
other way round. Also, add some sanity checks based on the spec.

Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com>
Change-Id: I17e3509fb0a011128ca089ce78b7a296262e6f8e
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14932
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-11-04 10:10:33 +00:00
John Levon
36dfcca2b4 nvmf/vfio-user: switch from shadow doorbells when freeing
Some reset/disable paths are freeing the shadow doorbells without
switching the SQs back to BAR0. Fix this up, and add a small cleanup
when initializing the shadow doorbells.

Signed-off-by: John Levon <john.levon@nutanix.com>
Change-Id: Ia5e5b91b7dc696a558eb0ad59cc554abced47cca
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14901
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-10-26 07:32:54 +00:00
John Levon
64db53f1aa nvmf/vfio-user: support multiple poll groups in interrupt mode
To support SQs allocated to a poll group other than the controller's
main poll group, we need to make sure to poll those SQs when we wake up
and handle the controller interrupt. As they will be running in a
separate SPDK thread, we will arrange for all poll groups to wake up
when we receive an interrupt corresponding to a vfio-user message
arriving.

This can mean needless wakeups: we don't (yet) have a mechanism to only
wake up the poll groups that correspond to a particular SQ write.

Additionally, as we don't have any notion of a poll group per
controller, this ends up polling all SQs in the entire poll group, not
just the ones corresponding to the controller we were handling.

As this has potential performance issues in many cases, it defaults to
disabled.

Signed-off-by: John Levon <john.levon@nutanix.com>
Change-Id: I3d9f32625529455f8d55578ae9cd7b84265f67ab
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14120
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-10-26 07:32:54 +00:00
MengjinWu
f1bec928d1 nvmf/tcp: add admin queue depth check before init
max_aq_depth must not be smaller than 2 or greater than 4096.
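A minimal sketch of this kind of range check (the constant and field
names here are illustrative, not the exact SPDK identifiers):

    #include <stdbool.h>
    #include <stdint.h>
    #include <stdio.h>

    #define MIN_ADMIN_QUEUE_DEPTH 2
    #define MAX_ADMIN_QUEUE_DEPTH 4096

    /* Reject out-of-range admin queue depths before transport init. */
    static bool
    admin_queue_depth_valid(uint32_t max_aq_depth)
    {
        if (max_aq_depth < MIN_ADMIN_QUEUE_DEPTH ||
            max_aq_depth > MAX_ADMIN_QUEUE_DEPTH) {
            fprintf(stderr, "max_aq_depth %u out of range [%d, %d]\n",
                    max_aq_depth, MIN_ADMIN_QUEUE_DEPTH,
                    MAX_ADMIN_QUEUE_DEPTH);
            return false;
        }
        return true;
    }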

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I205fbb4345cfdc41ebaf30c953da263fe9f0e9a8
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14691
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Community-CI: Mellanox Build Bot
2022-09-28 06:39:14 +00:00
MengjinWu
bf887576cb nvmf/tcp: add IO queue depth check before init
max_queue_depth must not be smaller than 2 or greater than 65536.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I0f2a4b8df6eb1b140a11936fc6929f1285a7d717
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14619
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Community-CI: Mellanox Build Bot
2022-09-28 06:39:14 +00:00
MengjinWu
5eb3239cdf nvmf/tcp: Refine the macro definition of queue depth
Refine the macro definition names for queue depth in preparation for
the next patch.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I85bee2528ae4ab70292fc11aa62d05bae0c28a77
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14664
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Krzysztof Karas <krzysztof.karas@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
2022-09-28 06:39:14 +00:00
Szulik, Maciej
1b575d831d lib/nvmf: add explicit iovcnt init to silence LTO related warning
When Link Time Optimization is enabled, the compiler can sometimes
produce additional warnings saying that some variables may be
uninitialized.

To suppress the warning it is enough to add explicit initialization
of the variable causing the issue, in this case 'iovcnt = 0'.

Signed-off-by: Szulik, Maciej <maciej.szulik@intel.com>
Change-Id: I080b20a6008643ae78c8e3a6c2d183193ef6c1bf
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14674
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com>
Community-CI: Mellanox Build Bot
2022-09-26 15:36:23 +00:00
Liu Xiaodong
b6bb252e23 lib/nvmf: fix async_events index
When data_local.num_async_events >
SPDK_NVMF_MIGR_MAX_PENDING_AERS, data_local.async_events
was indexed at 256, which is out of bounds.
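A simplified sketch of the bounds handling implied by the fix; the
structure below is a stand-in, only the constant name comes from the
commit message:

    #include <stdint.h>

    #define SPDK_NVMF_MIGR_MAX_PENDING_AERS 256

    struct migr_aer_data {
        uint32_t num_async_events;
        uint32_t async_events[SPDK_NVMF_MIGR_MAX_PENDING_AERS];
    };

    /* Drop the event instead of writing past the end of the array. */
    static void
    save_async_event(struct migr_aer_data *data, uint32_t event)
    {
        if (data->num_async_events >= SPDK_NVMF_MIGR_MAX_PENDING_AERS) {
            return;
        }
        data->async_events[data->num_async_events++] = event;
    }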

Signed-off-by: Liu Xiaodong <xiaodong.liu@intel.com>
Change-Id: I15cfdeb9bc165de0c73fbc9171b0ce6d8689c0aa
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14666
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
2022-09-26 11:49:41 +00:00
MengjinWu
8d1c4f74d4 nvmf/tcp: Check if In-capsule Data length and sgl data length are equal
The in-capsule data length should be the same as the SGL data length.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I7eefecb8baebb76850a48689907aff27a8946f98
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14602
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-09-23 18:26:36 +00:00
MengjinWu
8ed53eee32 nvmf/tcp: Fixed error handle in 'nvmf_tcp_req_parse_sgl'
Fixed error handling that violated the spec:
1. 'data length > MAXH2CDATA' is a fatal error.
2. 'ICDOFF != 0' should abort the IO.

Other errors which are not defined in the spec:
1. invalid SGL type
2. In-capsule Data length > In-capsule Data size

Because this function runs before the data part is received, it is
hard to skip the following data segment if we want to handle some
error as non-fatal.

Currently, we have to handle all undefined errors as fatal errors.

I think that after this release we can change the receiving process,
which will be helpful for error handling.  But this work is not small.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I8fc0d2d743505e49a93be19fd217e7ad6ca06622
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14580
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-09-23 18:26:36 +00:00
Sebastian Brzezinka
ef73f559e6 lib/nvmf: test if client and server are running in the same process
During fuzzing, the vfio-user client and server are started from the
same process, causing a deadlock.  SO_PEERCRED returns the pid of the
process connected to the vfio endpoint.
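A sketch of how SO_PEERCRED can be used for this check on Linux (the
helper below is illustrative, not the SPDK function):

    #define _GNU_SOURCE
    #include <stdbool.h>
    #include <sys/socket.h>
    #include <unistd.h>

    /* Return true if the peer on a connected UNIX socket is this
     * same process. */
    static bool
    peer_is_same_process(int sock)
    {
        struct ucred cred;
        socklen_t len = sizeof(cred);

        if (getsockopt(sock, SOL_SOCKET, SO_PEERCRED, &cred, &len) != 0) {
            return false;
        }
        return cred.pid == getpid();
    }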

Signed-off-by: Sebastian Brzezinka <sebastian.brzezinka@intel.com>
Change-Id: I6fc2db5d58a459a30fec116a9de3c69d48acf75e
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14559
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
2022-09-23 15:16:01 +00:00
MengjinWu
100c53718d nvmf/tcp: add in_capsule_data_size check before init
in_capsule_data_size should not be larger than max_io_size.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I636724c888b9e5abc4cffac96bff24021e172498
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14618
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-09-22 22:13:19 +00:00
MengjinWu
4c33c7ae20 nvmf/tcp: inline function 'nvmf_tcp_req_set_state'
Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: Ie3af436411da9e3f3ad1ec159f0fbf59c4901983
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14598
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Community-CI: Mellanox Build Bot
2022-09-22 07:45:56 +00:00
MengjinWu
1d7230285b nvmf/tcp: add hpda value check in 'nvmf_tcp_icreq_handle'
The hpda value should be in the range of 0 to 31.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: Ie1329c831af06ccc8943a562c3f6396b635be518
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14575
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
2022-09-22 07:45:03 +00:00
MengjinWu
f8dd380b33 nvmf/tcp: eliminate function nvmf_tcp_set_in_capsule_data
This function is small and called only once.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: Ie4b11668e42a8920b3a9a11aa8cb83512f32942c
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14576
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Community-CI: Mellanox Build Bot
2022-09-22 07:38:25 +00:00
MengjinWu
b5aeff1dba nvmf/tcp: 'nvmf_tcp_send_c2h_term_req' should set fes
Set the fes in nvmf_tcp_send_c2h_term_req.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I457e102d9329e5624c738c5cf2e7fe411106f30b
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14583
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
2022-09-22 07:37:56 +00:00
Jim Harris
9633d482a7 nvmf: emit add_listeners RPCs after add_ns RPCs
When emitting the JSON-RPC text for saving the
current configuration, add the listeners last.

This is usually the preferred order when
configuring a new subsystem - it is better to have
all of the namespaces and hosts added to the subsystem
before adding the listener to allow hosts to connect
to it.  We support namespace hotplug, but there's no need to generate
hotplug events if we can avoid it.

Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: I79e8a0a496eeb128efbb7e314ac835b6110d3cc8
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14586
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
2022-09-21 08:50:08 +00:00
MengjinWu
00005ed8d5 nvmf/tcp: eliminate function 'nvmf_tcp_pdu_payload_insert_dif'
This function is called only once and can be eliminated.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I0b3e80c025b60a816e2113f859907f95e96dd183
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14578
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
2022-09-21 08:18:56 +00:00
MengjinWu
252c053e6f nvmf/tcp: insert dif after all payload received
'nvmf_tcp_pdu_payload_insert_dif' can be done after receiving the
whole payload data, as an optimization.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I3054079427c25d102477ef8ec1b288631741d7a3
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14577
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
2022-09-21 08:18:56 +00:00
Aleksey Marchuk
bf41b46c4e nvmf: Don't reg additional MRs
The RDMA transport registers MRs for in-capsule
data buffers, commands and completions.  Since
these structures are allocated using huge pages,
MRs for these buffers are already registered; we
only need to translate addresses.

Signed-off-by: Aleksey Marchuk <alexeymar@nvidia.com>
Change-Id: I90c53d8276d72077f7983e9faf9160e9ede52a7d
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14430
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-09-20 20:27:52 +00:00
Changpeng Liu
982c25feef nvmf: add spdk_nvmf_ctrlr_[save|restore]_migr_data() APIs
When doing live migration, there are some spdk_nvmf_ctrlr internal
data structures which need to be saved/restored.  These data
structures were designed only for the vfio-user transport; to extend
them to support other vendor-specific transports, we move them into
public APIs, so users can use SAVE|RESTORE to restore a new nvmf
controller based on the original one.

Also remove the registers from the vfio-user transport; these
registers are now stored in the common nvmf library.

Change-Id: I9f5847ef427f7064f8e16adcc963dc6b4a35f235
Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11059
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-09-20 10:17:24 +00:00
Boris Glimcher
35f7f0ce1e nvme/tcp: Allow to choose SSL socket implementation
Adding `psk` field to `spdk_nvme_ctrlr_opts`

Adding `psk` parameter to `bdev_nvme_attach_controller` RPC

Change-Id: Ie6f0d8b04ce472e6153934e985c026acded6cdfc
Signed-off-by: Boris Glimcher <Boris.Glimcher@emc.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14046
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
2022-09-14 07:44:53 +00:00
MengjinWu
12807c5bc6 lib/nvmf: Do one memset per new PDU recv
While waiting for a new PDU, the target will no longer perform
unnecessary memcpy operations.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: Ie0825c2b1e44444b210040c4a1761010e0e4cfe5
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14444
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
2022-09-13 07:29:38 +00:00
John Levon
654738ff45 lib/nvmf: small cleanup in vfio_user_qpair_delete_cb()
We already define a convenient variable for the admin CQ: use it.

Suggested-by: Alexis Lescouet <alexis.lescouet@nutanix.com>
Signed-off-by: John Levon <john.levon@nutanix.com>
Change-Id: If6570f30844a52113633bdb5f3543eec700f05d7
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14391
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
2022-09-07 07:04:44 +00:00
Ben Walker
34c48f1b3b accel: Do not refer to the "framework" as "engine"
The word engine was both used (interchangeably with module) to refer to
the things that plug into the framework and to the framework itself.
This patch eliminates all use of the word engine that meant the
framework. It leaves uses of the word that meant "module".

Change-Id: I6b9b50e2f045ac39f2a74d0152ee8d6269be4bd1
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13918
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
2022-09-06 07:16:17 +00:00
Shuhei Matsumoto
b3e1db32a3 nvmf/rdma: Ignore async_event if its qp_context is NULL
If the initiator and target run in the same application, and the
initiator uses SRQ, the target may unexpectedly get async events for
the initiator, e.g., IBV_EVENT_QP_LAST_WQE_REACHED.

The reason is that the initiator and target may use the same device
simultaneously and only the target polls async events.

The target sets attr.qp_context to rqpair when creating a QP, but the
initiator sets attr.qp_context to NULL when creating a QP.

Hence one simple fix is to ignore async events whose qp_context is
NULL.
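A sketch of the filtering this describes, using the libibverbs async
event API (simplified; the real handler does more):

    #include <stddef.h>
    #include <infiniband/verbs.h>

    /* Ignore QP async events that carry a NULL qp_context, i.e. QPs
     * that were not created by this target. */
    static void
    handle_one_async_event(struct ibv_context *ctx)
    {
        struct ibv_async_event event;

        if (ibv_get_async_event(ctx, &event) != 0) {
            return;
        }
        if (event.event_type == IBV_EVENT_QP_LAST_WQE_REACHED &&
            event.element.qp->qp_context == NULL) {
            ibv_ack_async_event(&event);
            return;
        }
        /* ... target-side handling for its own QPs ... */
        ibv_ack_async_event(&event);
    }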

Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Change-Id: Id9ead1934f0b2ad1e18b174d2df2f1bf9853f7e1
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14297
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Mellanox Build Bot
2022-09-05 12:49:11 +00:00
Konrad Sztyber
475b86aa8d print better errors when creating mempools from secondary process
Multiprocess is only supported by a few libraries (e.g. NVMe driver).
Other libraries that don't support it will often fail on mempool
initialization when running as a secondary process, as the mempools are
already created by the primary process.  But the error messages are
vague and don't indicate why this happened.  So, this patch adds a check
to see if a mempool exists after spdk_mempool_create() fails and prints
an error message informing users that multiprocess is unsupported.
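A sketch of the described check using the public SPDK env API (the
wrapper function and pool parameters here are made up for
illustration):

    #include "spdk/env.h"
    #include "spdk/log.h"

    static struct spdk_mempool *
    create_pool_or_explain(const char *name, size_t count, size_t ele_size)
    {
        struct spdk_mempool *mp;

        mp = spdk_mempool_create(name, count, ele_size,
                                 SPDK_MEMPOOL_DEFAULT_CACHE_SIZE,
                                 SPDK_ENV_SOCKET_ID_ANY);
        if (mp == NULL && spdk_mempool_lookup(name) != NULL) {
            /* The pool exists already: a secondary process hit a
             * library that does not support multi-process. */
            SPDK_ERRLOG("mempool '%s' already exists; this library "
                        "likely does not support multi-process\n", name);
        }
        return mp;
    }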

Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com>
Change-Id: I6f915a94266e64dda380e3b269424cc579372a10
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14234
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
2022-08-29 11:41:32 +00:00
liuqinfei
cd1b7ab0e7 nvmf: balance the get optimal poll group
Fixes issue #2636.

The existing allocation method (nvmf_rdma_get_optimal_poll_group())
simply traverses the poll groups in turn and does not notice link
disconnections.  A fairer method that considers the number of live
connections when allocating a poll group is implemented.

Signed-off-by: liuqinfei <18138800392@163.com>
Signed-off-by: luo rixin <luorixin@huawei.com>
Change-Id: Ic1e6283e386dbb0dd6655bedebe26aeedb16c333
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14002
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-08-23 07:46:03 +00:00
John Levon
2eaae37ded nvmf/vfio-user: complete queue deletion on correct thread
If the queue was on another poll group, we need to send a message back
to the admin CQ's thread to post the completion from the correct
context.

Signed-off-by: John Levon <john.levon@nutanix.com>
Change-Id: I997987d5d6b822a1a5124f54fc29ce5d7f03190d
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/14057
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Swapnil Ingle <swapnil.ingle@nutanix.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Community-CI: Mellanox Build Bot
2022-08-17 07:19:03 +00:00
Ben Walker
081f080a49 accel: Rename public header to accel.h
The public interface of lib/accel is now include/spdk/accel.h

Change-Id: Id94f623a494eb1b524b060f4413f633073ea7466
Signed-off-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13916
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Paul Luse <paul.e.luse@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
2022-08-16 10:22:55 +00:00
Changpeng Liu
91eb10b4be nvmf/vfio-user: only kick controller when in interrupt mode
There is a race condition if we call this function in polling mode
when running with multiple cores.  As in the other places where the
function is called, we only kick the controller in interrupt mode.
Also, in `vfio_user_ctrlr_intr`, `ctrlr->sqs[0]` may be set to NULL
after the controller poll call, so return early in this case.

Change-Id: I03a7b74a39c966a2b8be610bca0e492d902f6b08
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13696
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
2022-08-15 19:10:54 +00:00
Changpeng Liu
d0cf194bc4 nvmf/vfio-user: only relisten accept poller when connection is disconnected
For the case of an `nvmf_subsystem_remove_listener` RPC call while a
VM is connected, we should not restart the accept poller, because the
endpoint will be destroyed in this case.

Change-Id: Icf8299f26a3bbf7bbe44fd01edb4ede344692d25
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13548
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: John Levon <levon@movementarian.org>
2022-08-12 09:00:50 +00:00
Changpeng Liu
c60cb1a8be lib/nvmf: don't raise assertion in nvmf_tgt_destroy_cb
When running into this function, even if the subsystem can't be
destroyed due to an error subsystem state, it's better to continue
execution.

This continues the fix for #2590: QEMU is stuck in the failure case,
and the nvmf target should handle such errors because it may be
serving other normal subsystems at the same time.

Change-Id: Ib05e24996378b52070d2b760519f476f9b2d7e76
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13839
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
2022-08-04 07:29:27 +00:00
Changpeng Liu
78ca4b27c5 nvmf: don't raise assertion when destroying a non-inactive subsystem
Sometimes a VM may get a kernel panic when starting, and SPDK CI will
kill `nvmf_tgt` after 60 seconds.  For this exception, SPDK would
raise an assertion when destroying the subsystem; here we remove the
assertion and print the error information instead.

CI will still mark this case as failed, and we can then use this
error information to understand the error subsystem state in
vfio-user.

Fixes issue #2590.

Change-Id: I20b16f9e96a566730eca2dd9ea165645bd9160bd
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13773
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-08-02 01:26:10 +00:00
Jacek Kalwas
8c35e1bd79 nvmf/rdma: remove lock on few transport ops
It simplifies the code and improves readability.
Synchronization is done at the generic layer.

Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: If324039ef2b26fa8ba026b80ec49788a7b2dcaa3
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13667
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-07-29 16:34:41 +00:00
Jacek Kalwas
c7ac84d1f2 nvmf/tcp: remove lock on few transport ops
It simplifies the code and improves readability.
Synchronization is done at the generic layer.

Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I75753511842dff237bb27561e406c43ea68269fe
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13666
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Community-CI: Mellanox Build Bot
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-07-29 16:34:41 +00:00
Jacek Kalwas
b17919d8bc lib/nvmf: add lock around few transport ops
This is prework for further changes - with the lock at the generic
layer, the lock at the specific transport (e.g. tcp, rdma) layer
becomes optional.

Possibly it won't be required at all if some contract is introduced
on the public interfaces (to be considered):
- spdk_nvmf_poll_group_[create|destroy]
- spdk_nvmf_tgt_listen_ext, spdk_nvmf_tgt_stop_listen
- spdk_nvmf_get_optimal_poll_group

Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: Ib132babf9e7022342129fe795991cdad834e7f53
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13665
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-07-29 16:34:41 +00:00
Alexey Marchuk
7fbda6d916 nvmf/rdma: Fix data_wr_pool corruption
When there are not enough transport buffers for a
multi-SGL request in the NEED_BUFFER state, WRs
received from the data_wr_pool are returned back
to the pool.  However, the rdma_req->data.wr.next
pointer still points to the first WR from the pool.
Usually this doesn't cause any problems since
rdma_req will try to fill buffers again, but when
the qpair is being destroyed, all requests are
completed forcefully.  When a request is completed
and its data.wr.next pointer is not NULL, we'll try
to put already released WRs into the pool one more
time.  That corrupts the pool and leads to undefined
behavior.

Fixes #2541
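A simplified illustration of the fix: when chained WRs are returned
to the pool early, also clear the request's next pointer so a forced
completion cannot return the same WRs twice.  The types below are
stand-ins, not the real SPDK structures:

    #include <stddef.h>

    struct wr_node {
        struct wr_node *next;
    };

    struct fake_rdma_req {
        struct wr_node wr;          /* first WR, embedded in the request */
    };

    static void
    release_chained_wrs(struct fake_rdma_req *req,
                        void (*put_wr)(struct wr_node *))
    {
        struct wr_node *cur = req->wr.next;

        while (cur != NULL) {
            struct wr_node *next = cur->next;
            put_wr(cur);            /* return WR to the data_wr_pool */
            cur = next;
        }
        /* The fix: drop the dangling link so the WRs can't be freed
         * a second time by a forced completion. */
        req->wr.next = NULL;
    }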

Signed-off-by: Alexey Marchuk <alexeymar@nvidia.com>
Change-Id: I238b92eec132d8d845330362af6f335421177454
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13760
Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
2022-07-28 07:12:43 +00:00
MengjinWu
7fc2c0856e lib/nvmf: use DSA to offload recv data digest crc32 in nvmf-TCP
Allow a DSA device to asynchronously offload crc32 calculation in
nvmf-TCP.

This patch uses DSA to accelerate crc32 computation, making the I/O
performance of TCP paths that use crc32 approach the I/O performance
of TCP paths that do not.

An SLIST is used to minimize the performance drop; SLIST needs fewer
operations than TAILQ.

To limit memory thrashing, we should reuse the same memory as much as
possible to receive new PDUs, so inserting a newly freed PDU at the
head is better.

The performance drop is within 1% compared to the TCP path without
crc32.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I480eb8db25f0e730cb198ca5ec19dbe3b4d38440
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11708
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: GangCao <gang.cao@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
2022-07-27 08:39:35 +00:00
Changpeng Liu
6abb4764ad nvmf: check interleaved metadata size when adding NS
When doing DIF insert and strip, we reserve extra
buffer space in the block device layer to save DIF
information, so when attaching a device to a
namespace, we first check that the reserved buffer
size isn't smaller than the metadata size.
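A rough sketch of such a check using the public bdev API (the
required-size parameter and the helper are illustrative, not the
exact SPDK code):

    #include <stdbool.h>
    #include "spdk/bdev.h"
    #include "spdk/log.h"

    /* Reject the attach if the bdev's metadata area is too small to
     * hold the DIF information the namespace needs. */
    static bool
    bdev_md_size_sufficient(const struct spdk_bdev *bdev,
                            uint32_t required_md_size)
    {
        uint32_t md_size = spdk_bdev_get_md_size(bdev);

        if (md_size < required_md_size) {
            SPDK_ERRLOG("bdev metadata size %u < required %u\n",
                        md_size, required_md_size);
            return false;
        }
        return true;
    }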

Change-Id: Id9272886ce8a7c01271279686730af4e5b24f35a
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12188
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
2022-07-19 12:31:59 +00:00
Changpeng Liu
a438718fc2 nvmf: don't report E2E Protection Capabilities to client
When `dif_insert_or_strip` is enabled, the NVMf library does DIF
insert and strip automatically and the client isn't aware of it.
When `dif_insert_or_strip` is disabled, we used to report the
Namespace E2E Protection Capabilities to the client, but we don't
process the PRACT and PRCHK flags in the NVMf library, so now we
don't report the capabilities to the client and leave the use of the
extended LBA buffer to users.

Change-Id: Ic610dc65fef210a7799c6ab693d89138b99e1193
Signed-off-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12165
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
2022-07-19 12:31:59 +00:00
Jacek Kalwas
0adabc9eb1 lib/nvmf: rm nvmf_poll_group_add_transport from internal header
It is implemented and used only in the nvmf.c source file.

Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com>
Change-Id: I1236f9ede28c5da313d118ce73e1da64381379c5
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13664
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Ben Walker <benjamin.walker@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>
2022-07-18 10:18:19 +00:00
Jim Harris
9cb5f885df nvmf: decrement mgmt_io_outstanding for all AER cases
We cannot count AERs as outstanding IO for purposes
of subsystem pause, because we cannot expect them
to be completed. Previously we would account for this
in nvmf_ctrlr_async_event_request() by decrementing
the counter, but this did not consider cases in the
calling function (nvmf_ctrlr_process_admin_cmd) where
an AER might complete with error before this function,
resulting in the counter getting stuck indefinitely
with a >0 value.

Rather than adding a decrement in all of those
error cases, do a single check at the beginning
of nvmf_ctrlr_process_admin_cmd, and remove the
one from nvmf_ctrlr_async_event_request.

Fixes issue #2215.

Signed-off-by: Jim Harris <james.r.harris@intel.com>
Change-Id: Ica969f116d80dfba0168369ff2fba9a4a42fc076
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13678
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Dong Yi <dongx.yi@intel.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
2022-07-15 12:51:31 +00:00
BinYang0
20cd4841f1 lib/nvmf: set low water mark in NVMe/TCP target to 1 byte
In the NVMe/TCP target, the socket low water mark is set to
sizeof(struct spdk_nvme_tcp_common_pdu_hdr), which is 8 bytes.
In a corner-case test, a 4-byte data packet might be sent to the
NVMe/TCP target; after that, if no more data is sent to the same
socket, the 4 bytes won't be read by the NVMe/TCP target qpair
thread.  Because of this, an IO request did not complete on the
initiator.  If readv is then called manually in the target to read
the 4 bytes of the pdu, the IO request completes normally on the
initiator.  It seems the pdu might be split, and in this situation
the IO request will not complete until a new IO request arrives.

After setting the low water mark in the NVMe/TCP target to 1 byte,
just like the iscsi target does, the issue disappears immediately.
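A sketch of the socket-level change this describes, using the plain
POSIX setsockopt() call (SPDK goes through its sock layer, so treat
this only as an illustration):

    #include <stdio.h>
    #include <sys/socket.h>

    /* Report the socket as readable as soon as a single byte arrives. */
    static int
    set_recvlowat_one(int sock)
    {
        int lowat = 1;

        if (setsockopt(sock, SOL_SOCKET, SO_RCVLOWAT,
                       &lowat, sizeof(lowat)) != 0) {
            perror("setsockopt(SO_RCVLOWAT)");
            return -1;
        }
        return 0;
    }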

Signed-off-by: BinYang0 <bin.yang@jaguarmicro.com>
Change-Id: I59d3d900f0b25632d786ef25ab096eabe43476bd
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/13633
Reviewed-by: <chuanwei.ji@jaguarmicro.com>
Reviewed-by: Qingmin Liu <qingmin.liu@jaguarmicro.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Community-CI: Mellanox Build Bot
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
2022-07-13 07:28:52 +00:00
MengjinWu
427cbb46a3 lib/nvmf: optimize the performance for h2c handle
We no longer search the tailq for the h2c-related requests; we can
get them from tqpair->reqs directly.

Signed-off-by: MengjinWu <mengjin.wu@intel.com>
Change-Id: I25f0900e875b054d7617450477e9719e7a59aa18
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/12861
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>
Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com>
2022-07-11 07:40:53 +00:00
Thanos Makatos
caadae6c10 nvmf/vfio-user: briefly explain live migration
Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com>
Change-Id: I08d3aa90ec4f3e29bece820919bd39d20c74c6cf
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11745
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: John Levon <levon@movementarian.org>
2022-07-11 07:38:04 +00:00
Thanos Makatos
50a4875255 nvmf/vfio-user: ensure migration data are generated in stop-and-copy state
Currently we initialize pending_bytes only in pre-copy state. This is
pointless since we don't generate any migration data in this state, so
if the vfio-user client reads migration data it will be garbage. Even
worse, we don't re-initialize pending_bytes in stop-and-copy state, so
if the vfio-user client reads the entire migration data in pre-copy state
then there will be nothing left to read in the stop-and-copy state,
which is where we actually produce the migration data. This results in
corruption of the controller's state (e.g. queues).

This patch ensures that migration data are available in the
stop-and-copy state, by setting pending_bytes accordingly only in that
state.

Signed-off-by: Thanos Makatos <thanos.makatos@nutanix.com>
Change-Id: I0b215e64cd1f58f254e1079f06402d196f984099
Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11718
Community-CI: Mellanox Build Bot
Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>
Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>
Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>
Reviewed-by: Jim Harris <james.r.harris@intel.com>
2022-07-11 07:38:04 +00:00