ivampiresp/Spdk - Spdk - Leaflow Developers

Author	SHA1	Message	Date
Mike Gerdts	b66f8df748	blob_ut: bs_opts initialized with wrong size An spdk_bs_opts structure is sometimes partially initialized due to using sizeof(opts) (struct spdk_blob_opts, 64 bytes) rather than sizeof(bs_opts) (struct spdk_bs_opts, 72 bytes). Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Iaaa89bb419f66969d0888f49f8991c35b3dc5ea4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11268 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-02-02 08:25:02 +00:00
Mike Gerdts	9f9c7161c9	bdev_ut: test read-only bdev claim While not documented as such, spdk_bdev_module_claim_bdev() has always allowed a bdev that is opened read-only to remain read-only when claimed. This occurs when NULL is passed in place of an spdk_bdev_desc. This change updates the function's documentation to match the implementation and adds a unit test to ensure the current behavior remains. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: Ief26de60e4408bfe1aa60b7a4e1d8adf273470b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11267 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-02-02 08:25:02 +00:00
Shuhei Matsumoto	cc797456f4	ut: Use unit/lib/json_mock.c for stubs Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I2cd488c17dbc92c381cd956ae0d6f5ca709a24dc Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11263 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-31 09:44:28 +00:00
Shuhei Matsumoto	def45b4c07	ut/json_mock: Add stubs for json_write_uint8 and _uint16 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I260b958e0640f737ab77654fedc8007f92eec325 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11262 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-31 09:44:28 +00:00
Mike Gerdts	96212d45cc	lvol: lvol_get_xattr_value failure undetectable When an unexpected xattr name is passed to lvol_get_xattr_value(), no error is returned to the caller. The one caller, blob_set_xattrs() via the xattrs->get_value callback, makes the reasonable assumption that a lookup that fails to find a value returns a NULL value. This updates lvol_get_xattr_value() to match that expectation. Signed-off-by: Mike Gerdts <mgerdts@nvidia.com> Change-Id: I5c7a740f2757e6d8265ba2637afecb729acfcdd4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11326 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-31 09:44:16 +00:00
Changpeng Liu	d1c2027d01	nvmf/vfio-user: add NVMe live migration support finally VFIO in QEMU uses region 9 as the PCI passthrough devices' migration channel. The format of the region 9 migration region is as follows: ------------------------------------------------------------------ \|vfio_device_migration_info\| data section \| ------------------------------------------------------------------ QEMU will access vfio_device_migration_info to controll the migration process. For SPDK vfio-user target, we also implement the BAR9 via libvfio-user, and we also define the NVMe device specific migration data stored in data section of BAR9. QEMU doesn't care about the format in data section, it will help us to gather the NVMe specific migration data in source VM and then restore the migration date to data section of BAR9 in destination VM. The core idea to implement live migration will following the device state change which is controlled by QEMU. First QEMU will try to STOP the device in the source VM, and set the destination VM to RESUME state, SPDK will save NVMe devic state data structure to BAR9 in the source VM once the subsystem is paused, then QEMU will read BAR9 in source VM and restore the content of BAR9 in destination VM, finally in the destination VM, we will restore the NVMe device state include BARs/PCI CFG/queue pairs in the destination VM. Change-Id: I42e38f28c3ff59831be63290038b50d199d06658 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7617 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Thanos Makatos <thanos.makatos@nutanix.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-27 20:55:16 +00:00
Evgeniy Kochetov	08f9b40113	bdev/nvme: Fix namespace comparison This patch aligns namespace comparison with Linux kernel implementation: - UUID is optional and may be NULL - command set (CSI) should be the same Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I8f889989f24cd51b104057217f87eb303b30fa68 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11312 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-27 18:53:41 +00:00
Nick Connolly	968371131e	ut/nvme_ctrlr: initialize mutex for portability For correct behaviour, pthread_mutex must be initialized before use and destroyed afterwards. An already initialized mutex should not be re-initialized. Add calls to nvme_ctrlr_construct where nvme_ctrlr_destruct is called without a matching construct. Add missing calls to mutex_init and mutex_destroy as required. Signed-off-by: Nick Connolly <nick.connolly@mayadata.io> Change-Id: I9753fa7fbd77402f23a08a66f4b489a5c229487a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11298 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Community-CI: Mellanox Build Bot	2022-01-27 08:15:26 +00:00
Shuhei Matsumoto	c8f986c7ee	Revert "nvme/rdma: Correct qpair disconnect process" This reverts commit `eb09178a59`. Reason for revert: This caused a degradation for adminq. For adminq, ctrlr_delete_io_qpair() is not called until ctrlr is destructed. So necessary delete operations are not done for adminq. Reverting the patch is practical for now. Change-Id: Ib55ff81dfe97ee1e2c83876912e851c61f20e354 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10878 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-26 08:09:15 +00:00
paul luse	c501d2b37c	idxd: fix issue w/multiple WQ config Found via inspection during spec review of latest HW. We were using the wrong stride for the WQCFG regsiter when configuring but it just so happened to be the right value for the current DSA version. We were mixing up the size of the WQCFG register with the stride value used to configure the next WQCFG regsiter as they are not contiguous in HW, we need to read another capabilities bit to determine the address of the next wqcfg to configure.. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I14d1ff95e0131fd30121aa955bfbc8c8fb3fc512 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10968 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-20 21:39:56 +00:00
paul luse	026f003154	idxd: update structures based on latest public DSA spec Compliant with both current and next gen DSA. Note: some fields in gencap were mapped incorrectly previously, but this did not impact the SPDK driver because the only times those values (max_xfer_shift and max_batch_shift) were used were in asserts. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: I9648184670f661166136e7898d0d8c7e07d8c746 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10966 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-20 21:39:56 +00:00
Tomasz Zawadzki	1e080e5e67	lib/vhost: move dev_dirname to rte_vhost_user Creation of sockets is specific to rte_vhost, so it functionality responsible for setting path for them. dev_dirname is renamed to g_vhost_user_dev_dirname and its definition is moved to rte_vhost_user. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I9bae67667b0f6624f2daf3244a048d10e94e553c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10631 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-01-20 19:09:20 +00:00
Tomasz Zawadzki	ef873d21e3	ut/vhost: add rte_vhost_user.c to UT vhost.c contains a lot of functionality that is rte_vhost specific. This series is moving rte_vhost specific functionality to rte_vhost_user.c. UT for vhost didn't make a distinction for either. So starting with this patch the rte_vhost_user.c is now included in the UT, only stubing out rte_vhost functions. Signed-off-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Change-Id: I0d5f62ad47d1261bbb44c0aa23400d94ece4564e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10743 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com>	2022-01-20 19:09:20 +00:00
GangCao	6b7e9d0af2	Lib/iSCSI: add the LUN Resize support From SAM-4, section 5.13 (Sense Data); “When a command terminates with a CHECK CONDITION status, sense data shall be returned in the same I_T_L_Q nexus transaction (see 3.1.50) as the CHECK CONDITION status. After the sense data is returned, it shall be cleared except when it is associated with a unit attention condition and the UA_INTLCK_CTRL field in the Control mode page (see SPC-4) contains 10b or 11b.” SPDK does not set UA_INTLCK_CTRL to 10b or 11b, so we set the unit attention condition immediately against a single IO or Admin IO after reporting it via a CHECK CONDITION. Once the failed IO received at iSCSI initiator side, it will be retried. In the case of resize operation, if there is no IO from iSCSI initiator side, the unit attention condition will be delayed to report until the first IO is received at the iSCSI target side. Meanwhile, we clear the resizing (newly added) flag on our SCSI LUN structure after first time we report the resize unit attention condition. The kernel initiator won’t actually resize the corresponding block device automatically. It will report a uevent, and then you can set up udev rules to trigger a rescan. SPDK iSCSI initiator will automatically report the LUN size change. Change-Id: Ifc85b8d4d3fbea13e76fb5d1faf1ac6c8f662e6c Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11086 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Xiaodong Liu <xiaodong.liu@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-20 07:56:23 +00:00
Ben Walker	86bb0df191	idxd: Bump batch size to 32 Increase the batch size and with it the effective queue depth per channel to 512. Change-Id: Ide665e92d47ee753c141f34dd6a8bc4d040fe8db Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11031 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com>	2022-01-20 07:54:55 +00:00
Changpeng Liu	b3cd421ffd	nvmf/vfio-user: implement device quiesce APIs libvfio-user will call quiesce callback when there are memory region add/remove and device state change requests from client, and in the quiesce callback, we will pause the subsystem so that it's safe to do everything after it, then after quiesce callback, we will resume the subsystem. The quiesce callback is also used in live migration, each device state change will quiesce the device first. Change-Id: I3a6a0320ad76c6b2d1d65c754b9f79cce5c9c683 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10620 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-20 00:13:42 +00:00
Konrad Sztyber	a7d61bef5a	nvme: guard admin qpair error injection queue Admin commands can be sent and polled from any thread, which also means that the error injection queue on the admin qpair can be accessed from multiple threads. Therefore, any modifications to that queue should be done under the ctrlr lock. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ib1ed194405cb5b93f65a007b9749fd4433dc367d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11099 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Michael Haeuptle <michaelhaeuptle@gmail.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-01-19 09:05:36 +00:00
Changpeng Liu	f63c0899a2	nvmf: add reset/shutdown timeout process There is an error case that the block device didn't complete outstanding IOs during the controller reset or shutdown, so the NVMf library will wait until all the IOs returned from the backend, however, so here we added a timeout timer, when the time expired, we will try to reset the block device which hold the outstanding IOs. Fix #2194. Change-Id: I8d0746335e1f20a09e6a9ea87730551808a898d1 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9909 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Dong Yi <dongx.yi@intel.com>	2022-01-19 09:04:50 +00:00
GangCao	7b67a696da	UT/NVMe: Fix compilation warnings Fix warning: missing braces around initializer This issue is seen with gcc (GCC) 4.8.5 20150623. Warning like below: nvme_tcp_ut.c:243:9: warning: (near initialization for ‘ctrlr.ns’) [-Wmissing-braces] nvme_tcp_ut.c: In function ‘test_nvme_tcp_req_init’: nvme_tcp_ut.c:525:9: warning: missing braces around initializer [-Wmissing-braces] struct spdk_nvme_ctrlr ctrlr = {0}; ^ nvme_tcp_ut.c:525:9: warning: (near initialization for ‘ctrlr.ns’) [-Wmissing-braces] And more information from below link: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53119 Change-Id: I88b5b3908d5d0daa9383e47a1ed53288f342ca3b Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11137 Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2022-01-19 09:04:27 +00:00
Shuhei Matsumoto	3185df9057	ut/bdev_nvme: Manage adminq's state and return -ENXIO if adminq is disconnected Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I81d4a8ce5c487449ab634bcd4f984d6867febf35 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10949 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	49b8d1f33a	ut/bdev_nvme: Delete qpair after unwiding context from process_completions() This is the same effort as the last patch. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I94ef08abdbb2bd2e07d0cd1e552c5d05c805233e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10817 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	5485f55dc1	ut/bdev_nvme: Separate disconnected and connected qpair in poll_group More precise stubs for spdk_nvme_poll_group are critically important to verify upcoming changes. Add a flag is_failed to struct spdk_nvme_qpair separately from is_connected. This is used to inject error to a connection. Replace a single list qpairs by two lists, connected_qpairs and disconnected_qpairs for struct spdk_nvme_poll_group. Then utilize these to manage qpair in poll group. spdk_nvme_ctrlr_reconnect_io_qpair() is not used in the NVMe bdev module now. Remove the corresponding stub. Adjust polling count accordingly. Change-Id: I4d867c56ae518276813f6f96d23a5f6933364fd4 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10816 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	728e3721a4	nvme_rdma: Remove a guard for recursive calls from poll_group_disconnect_qpair() nvme_poll_group_disconnect_qpair() is called only by a single place now. We do not need the flag poll_group_disconnect_in_progress any more. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I8f9c0f14baa8fcb9b0637635a5bb3d34a8b11af5 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10673 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	7ae79a38a5	nvme: Limit spdk_nvme_poll_group_remove() to use only for disconnected qpairs Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I3c06c41664ee757423641474141439f9c32fc0b6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10671 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	e021cc0147	nvme: Swap ctrlr_disconnect_qpair() and poll_group_remove() in nvme_ctrlr_free_io_qpair() nvme_ctrlr_disconnect_qpair() calls nvme_poll_group_disconnect_qpair() if the qpair uses a poll group, and nvme_poll_group_disconnect_qpair() calls nvme_ctrlr_disconnect_qpair() if the state of the qpair is not DISCONNECTING. This relationship made the code very complex. A few patches starting from this patch simplifies disconnect and free qpair operations. This patch swaps the ordering of nvme_ctrlr_disconnect_qpair() and spdk_nvme_poll_group_remove() in spdk_nvme_ctrlr_free_io_qpair(). This ensures the qpair is disconnected when spdk_nvme_ctrlr_free_io_qpair() calls spdk_nvme_poll_group_remove(). This enables us to limit spdk_nvme_poll_group_remove() to be available only for disconnected qpairs. Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I0601a74f953a2efc4f177a51a4450baea33533d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10670 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-19 08:44:09 +00:00
Shuhei Matsumoto	80e81273e2	bdev/nvme: Do not use ctrlr for I/O submission if reconnect failed repeatedly If ctrlr_loss_timeout_sec is set to -1, reconnect is tried repeatedly indefinitely, and I/Os continue to be queued. This patch adds another option fast_io_fail_timeout_sec, a flag fast_io_fail_timedout to nvme_ctrlr. If the time fast_io_fail_timeout_sec passed after starting reset, set fast_io_fail_timedout to true not to use the path for I/O submission. fast_io_fail_timeout_sec is initialized to zero as same as ctrlr_loss_timeout_sec and reconnect_delay_sec. The name of the parameter follows the famous DM-multipath, its fast_io_fail_tmo. Change-Id: Ib870cf8e2fd29300c47f1df69617776f4e67bd8c Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10301 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	ae4e54fdc3	bdev/nvme: Retry reconnecting ctrlr after seconds if reset failed Previously reconnect retry was not controlled and was repeated indefinitely. This patch adds two options, ctrlr_loss_timeout_sec and reconnect_delay_sec, to nvme_ctrlr and add reset_start_tsc, reconnect_is_delayed, and reconnect_delay_timer to nvme_ctrlr to control reconnect retry. Both of ctrlr_loss_timeout_sec and reconnect_delay_sec are initialized to zero. This means reconnect is not throttled as we did before this patch. A few more changes are added. Change nvme_io_path_is_failed() to return false if reset is throttled even if nvme_ctrlr is reseting or is to be reconnected. spdk_nvme_ctrlr_reconnect_poll_async() may continue returning -EAGAIN infinitely. To check out such exceptional case, use ctrlr_loss_timeout_sec. Not only ctrlr reset but also non-multipath ctrlr failover is controlled. So we need to include path failover into ctrlr reconnect. When the active path is removed and switched to one of the alternative paths, if ctrlr reconnect is scheduled, connecting to the alternative path is left to the scheduled reconnect. If reset or reconnect ctrlr is failed and the retry is scheduled, switch the active path to one of alternative paths. Restore unit test cases removed in the previous patches. Change-Id: Idec636c4eced39eb47ff4ef6fde72d6fd9fe4f85 Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10128 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com>	2022-01-17 14:25:15 +00:00
Shuhei Matsumoto	962c4c3800	bdev/nvme: Fix a degradation that I/O gets queued infinitely We noticed the difference between the SPDK 21.10 and the latest master in a test. The simplified scenario is as follows: 1. Start SPDK NVMe-oF target 2. Run bdevperf for the target with -f parameter to suppress exit on failure. 3. Kill the target after I/O started. With the SPDK 21.10, bdevperf retries failed I/Os and exits after the test time is over. With the latest SPDK master, bdevperf hungs and does not exit even after the test time is over. The cause was as follows: reset ctrlr is repeated very quickly (once per 10ms by default) and hence I/Os were queued infinitely because nvme_io_path_is_failed() returned false if nvme_ctrlr is resetting. We should queue I/O when nvme_ctrlr is resetting only if reset is throttoled and fail-fast for the repeated failures is supported. Hence in this patch, fix the degradation and remove the related unit test cases. Reported-by: Evgeniy Kochetov <evgeniik@nvidia.com> Signed-off-by: Shuhei Matsumoto <smatsumoto@nvidia.com> Change-Id: I4047d42dc44488a05264c6a841d101a7c371358b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/11062 Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-17 14:25:15 +00:00
Ahriben Gonzalez	0345729e00	nvme: Add metadata support to io commands Adding metadata support for io commands. Currently metadata is ignored even if present in the cmd struct. Making metadata adress readable/writable depending on data transfer bits. Adding extra unit test to make sure metadata fields are populated. Signed-off-by: Ahriben Gonzalez <ahribeng@gmail.com> Change-Id: I1d01974a6b2831c82b43e94073065d235eea429a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10854 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2022-01-14 11:10:13 +00:00
Ben Walker	517b557226	nvme: Do not track a separate active namespace list We only populate active namespaces into the main namespace tree, so we don't need a separate list of active namespaces too. Change-Id: Iaf194f806cc1d9672f5567cff3dffafff3165069 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10034 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:10 +00:00
Ben Walker	e7602c158f	nvme: Hold namespaces in an RB_TREE Since this is now sparsely populated, a tree is a better choice. Change-Id: Ie66d913fa1d298de56a7d22ef55f0adf7f8803b8 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10031 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-14 08:35:10 +00:00
Ben Walker	b4dace738e	nvme: Do not allocate inactive namespace objects Some subsystems report a very large maximum value for the number of namespaces, but in essentially every case the subsystem is sparsely populated with active namespaces. To save memory, don't allocate objects for the inactive ones. Change-Id: I4cbeb5a7a898d3c685f4a3a9ec4c2ce45efffb92 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9898 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2022-01-14 08:35:10 +00:00
Ben Walker	1cfae16563	accel: Use vectored crc32 operations instead of chaining Chaining may be faster, but this is really an implementation detail of the idxd driver. Push the decision on how to implement a vectored crc down into the individual drivers and eliminate it from the generic framework. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Iedbdc5a6dbd3f7d1674d0a83f6827588f4b6b2fb Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10291 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2022-01-12 08:20:39 +00:00
Konrad Sztyber	6631c2a8aa	nvmf/tcp: initialize zcopy phase in nvmf_tcp_req_get Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ia74148fb36733deaf7b2f833ac0247859311a805 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10794 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	a50a70ecdf	nvmf: abort outstanding zcopy reqs in qpair disconnect Zero-copy requests are kept on the outstanding queue for the whole duration of the request - from the initial zcopy_start submission to the completion of zcopy_end. This means, that there's a period in which a request doesn't wait for a completion from the bdev layer, but is still on the oustanding queue (after zcopy_start callback, before zcopy_end submit). If a qpair gets disconnected while a request is in this state, we need to manually force its completion, as otherwise it might hang indefinitely (e.g. waiting for host data). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I53731b8e363b725efa564ca3c7d89b46f5fb2a24 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10793 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com>	2022-01-12 08:20:11 +00:00
Konrad Sztyber	974a32b72e	nvmf: resume queued zcopy requests The zero-copy requests can also be queued when a subsystem is paused, so we need to properly resume and submit them by using zcopy_start. Since only requests that haven't received the zero-copy buffer (i.e. before zcopy_start was called) can be queued, we don't need to bother with checking zcopy_phase. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ie629688f6961eb2ae05741df496720b91be4d80d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10792 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-12 08:20:11 +00:00
Shuhei Matsumoto	521a9bb22c	bdev/nvme: Fix race between failover and add secondary trid We sort secondary trids to avoid using disconnected trids for failover. However the sort had a bug. This bug was found by running test/nvmf/host/multipath.sh in a loop. Verify the fix by adding unit test. Fixes #2300 Signed-off-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Change-Id: I22b0ede4d2ef98b786c3e0d1f5337a2d568ba56d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10921 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com>	2022-01-10 22:18:46 +00:00
Jim Harris	b68f2eeb0b	bdev_nvme: add bdev_nvme_start_discovery RPC This patch adds the framework for a discovery service in the bdev/nvme module. Users can specify an IP/port of a discovery service. The bdev/nvme module will connect to a discovery controller, get the discovery log page, and then register for AERs. It will connect to each subsystem specified in the initial log page. AER completions will trigger fetching the log page again, at which point new subsystems will be connected to, or removed subsystems will be detached. This patch does the following: * Adds the new start_discovery RPC * Connects to the discovery controller * Gets the discovery log page * Registers for AERs * Detach from discovery controllers at shutdown Subsequent patches in this series will: * Connect to subsystems listed in discovery log page * Detach from subsystems that were listed in earlier discovery log pages but subsequently removed * Add a stop_discovery RPC Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I54bfa896a48c5619676f156b5ea9f2d1f886c72f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10694 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com>	2022-01-10 15:23:39 +00:00
Konrad Sztyber	7a374fbc0b	nvmf: make zcopy_end void Since spdk_bdev_zcopy_end() cannot really fail (it only fails if we pass a bad bdev_io), we can simplify the nvmf zcopy_end functions by making them void and always expect asynchronous completion. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I6e88ac28aba13acadea88489ac0dd20d1f52f999 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10790 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	92d7df1f47	nvmf: use spdk_nvmf_request_exec to submit zcopy_start Since this path now supports sending zero-copy, use it for zcopy_start. Additionally, it makes it possible make zcopy_start void, as it reports all errors asynchronously via request_complete(), and remove some of the duplicated error checks. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I41f43ce1651432d9a7d74e3680d4a3f780128a1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10789 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	686b9984b3	nvmf: return async/complete status in bdev zcopy operations Additionally, the NVMe completion status is now updated and the IOs are queued if the bdev layer doesn't have enough IO descriptors. It makes the zcopy operations behave similarly to the other IO operations. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I455ae781e32aa6e60d144d2c91f109bd8be46664 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10787 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	0e09df57dd	nvmf: rename zcopy operations to zcopy_(start\|end) It makes their names consistent with the bdev API. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I314051f0980b46959d6560aa25885f13b4c28f2a Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10786 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	f65099d378	nvmf: remove zcopy check in spdk_nvmf_request_exec It will make it possible to submit zero-copy requests through spdk_nvmf_request_exec(). Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: Ibc14fe77cd477b11ed55d1350a7486caaad81add Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10783 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	7d23ac8657	nvmf: remove zcopy phase checks from IO functions The code should never reach these functions for requests using zero-copy. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: If9f30e05a43b340a982604d5b985242d63ce252b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10782 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Konrad Sztyber	aa1d039836	nvmf: zero-copy enable flag in transport opts It makes it possible for the user to specify whether a transport should try to use zero-copy to execute requests when possible. Signed-off-by: Konrad Sztyber <konrad.sztyber@intel.com> Change-Id: I40a92b0d7a6707f4c9292795f380846acb227200 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10780 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2022-01-06 18:53:42 +00:00
Changpeng Liu	2a6c2c289c	nvmf: support static CNTLID SPDK NVMf subsystem supports dynamic controller model, for transports other fabrics, users should use static controller model. Change-Id: I364ea61a71b04d51932fd9e0e16f401a383ff67c Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10149 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2022-01-06 01:20:32 +00:00
Alexey Marchuk	3c4a68cafc	nvme: Do not create IO qpair during ctrlr initialization If nvme ctrlr is resetting or initializing, free_io_qids bitmap is already freed or not created yet. In that case an attempt to create IO qpair leads to segmentation fault. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I6a97bf81d5a568db20d23b3f88cf01e994ba42e3 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10827 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com>	2021-12-27 08:43:03 +00:00
Alexey Marchuk	eb09178a59	nvme/rdma: Correct qpair disconnect process In current implementation RDMA qpair is destroyed right after disconnect. That is not graceful qpair shutdown process since there can be requests submitted to HW and we may receive completions for already destroyed/freed qpair. To avoid this, only disconnect qpair in ctrlr_disconnect_qpair transport callback, all other resources will be released in ctrlr_delete_io_qpair cb. This patch is useful when nvme poll groups are used since in that case we use shared CQ, if the disconnected qpair has WRs submitted to HW then qpair's destruction will be deferred to poll group. When nvme poll groups are not used, this patch doesn't change anything, in that case destruction flow is still ungraceful. However since CQ is destroyed immediately after qpair, we shouldn't receive any requests which point to released resources. A correct solution for non-poll group case requires async diconnect API which may lead to significant rework. There is a bug when Soft Roce is used - we may receive a completion with "normal" status when qpair is already disconnected and all nvme requests are aborted. Added a workaround for it. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: I0680d9ef9aaa8737d7a6d1454cd70a384bb8efac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10327 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Shuhei Matsumoto <shuheimatsumoto@gmail.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-23 08:44:40 +00:00
GangCao	10f32b9f19	lib/blob: do not assume realloc(NULL, 0) returns a not-NULL value There is situation that num_extent_pages is zero and original pointer is also NULL, the realloc() could return a Not NULL pointer. Related UT has been added and updated. 1) In the default allocation (num_clusters == 0), the extent_pages is not allocated as expected. 2) In the thin provisioning allocation (num_clusters != 0), the extent_pages will be allocated if extent_table is used. More related information as below: The crux of the problem is that according to POSIX: realloc: "If ptr is NULL, then the call is equivalent to malloc(size)" malloc: "If size is 0, then malloc returns either NULL or a unique pointer value that can later be successfully passed to free" blobstore was relying on realloc(NULL, 0) always return a unique pointer value, and not NULL. This is not portable behavior. Change-Id: Ibc28d9696f15a3c0e2aa6bb2371dc23576c28954 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10470 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-20 18:14:06 +00:00
Ben Walker	fca4262987	nvme: Remove nvme_ns_update In the one place this was called, we can call nvme_ns_construct instead. There's no harm in re-fetching the identify pages. Change-Id: I91292ff9650bdc7edd5588a05837b671dcac1922 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10102 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-20 08:49:41 +00:00
Peng Lian	4c1757ffb9	nvmf: update discovery log when removing hostnqn In NVMF Revision spec 1.1a, discovery log should be updated when removing hostnqn of subsystem. Update unit test to check the discovery log when removing hostnqn and destroying subsystem. Signed-off-by: Peng Lian <peng.lian@smartx.com> Change-Id: I51c597a2493295a677a7aa68e4f13a887f7e1140 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10668 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-16 08:52:20 +00:00
Anil Veerabhadrappa	68f0c6160a	ut/fc : fix fc_ls_ut compilation failure This regression was introduced when 'accept' was removed from spdk_nvmf_transport_ops structure. Signed-off-by: Anil Veerabhadrappa <anil.veerabhadrappa@broadcom.com> Change-Id: I5d880791db258a97a1861dbd841e97a7c068ce12 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10676 Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-16 08:43:39 +00:00
Changpeng Liu	723adbaf32	UT/vfio-user: fix clang-12 compilation error Add missed STUBs. Change-Id: I20989bf4ea66720d62f8ecc9668bb8f74e459666 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10638 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-12-15 04:32:05 +00:00
Jacek Kalwas	43022da379	nvmf: remove accept poller from generic layer Not every transport requires accept poller - transport specific layer can have its own policy and way of handling new connection. APIs to notify generic layer are already in place - spdk_nvmf_poll_group_add - spdk_nvmf_tgt_new_qpair Having accept poller removed should simplify interrupt mode impl in transport specific layer. Fixes issue #1876 Change-Id: Ia6cac0c2da67a298e88956734c50fb6e6b7521f1 Signed-off-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/7268 Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-14 13:18:33 +00:00
Jim Harris	59f3cdacb1	nvmf: don't always update discovery log when adding hosts If a subsystem has no listeners, then there is no need to update the discovery log when adding a host, or setting a subsystem to allow all hosts. This eliminates some unnecessary discovery log update notifications, especially when setting 'allow any hosts' on a subsystem immediately after it is created (and before it has any listeners). Update unit test to check the adding a host to a subsystem without listeners does not rev the genctr. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I63dab5df564269e574bb925890088f52063aa378 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10546 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-10 17:32:18 +00:00
Jim Harris	3867f83dea	test/nvmf: add local var for hostnqn string Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia967512bfcc5d7b1df15b6f6b5c132f21d601dce Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10563 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-10 17:32:18 +00:00
Jim Harris	9ac2cf7ff0	nvmf: don't update discovery log on subsystem create/delete The discovery log isn't updated when a subsystem is created or deleted, it's only updated when a listener for a subsystem is added or removed. So remove the nvmf_update_discovery_log() in the subsystem create and delete paths. They just generate extra AER completions that potentially cause the host to do unneeded work. Note that if a subsystem is deleted with active listeners, the subsystem delete path will remove each of the listeners before deleting the subsystem itself. So the discovery log will still get updated when those listeners are removed. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Id01bbfa3b24d3e1279a614a2fd60be41387a03b1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10545 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Dong Yi <dongx.yi@intel.com> Reviewed-by: Jacek Kalwas <jacek.kalwas@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-10 17:32:18 +00:00
paul luse	fbb24d0ebe	lib/accel: remove batching from the framework and plug-in modules Batching will be made available for DSA specifically through the new idxd_perf tool. Signed-off-by: paul luse <paul.e.luse@intel.com> Change-Id: Ic51d9ad3692074805b1ffa705cea8be35737c778 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9846 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Monica Kenguva <monica.kenguva@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-08 16:35:40 +00:00
Shuhei Matsumoto	215518069a	bdev/nvme: nvme_ctrlr_create() gets prchk_flags from nvme_async_probe_ctx Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id3deca8e0aba23299347a6aee6f0f44ee683556e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10555 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	696ad465d7	bdev/nvme: Remove the failover_in_progress flag from struct nvme_ctrlr The failover_in_progress flag is used to decide the return value of bdev_nvme_failover(). bdev_nvme_delete() calls bdev_nvme_failover() with remove=true to remove nvme_ctrlr->active_path_id. However bdev_nvme_failover() returns zero if nvme_ctrlr->failover_in_progress is true. bdev_nvme_failover() may return zero even if it does not remove nvme_ctrlr->active_path_id. The following will be better. bdev_nvme_failover() returns -EBUSY if nvme_ctrlr->resetting is true, and the caller repeats calling bdev_nvme_failover() until the target trid becomes alternative path or bdev_nvme_failover() returns zero. To do that, the failover_in_progress flag is not necessary any more. Removing the failover_in_progress will also simplify the following patches to unify ctrlr reset and failover. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I57ab944beb1d06ea4def144c81c69705860de35f Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10441 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	7cc66c0ab1	bdev/nvme: Check if ns can be shared when configuring multipath We had not checked the bit 0 of the Namespace Multipath I/O and Namespace Sharing Capabilities (NMIC) field in the Identify Namespace data structure. If the bit 0 of the NMIC is zero, it is likely that namespaces are not identical. We should check if the value of the NMIC first, and do it in this patch. Additionally, it is not usual if the bit 0 of the CMIC and the bit 0 of the NMIC do not match. So in unit tests rename the parameter multi_ctrlr by multipath for ut_attach_ctrlr() and use it for the value of the NMIC. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I6aa7cbcc99be2507dbf18930f7b585a9ea7d0f90 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10380 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-08 08:31:24 +00:00
Shuhei Matsumoto	8afa746b4d	bdev/nvme: Use new APIs in a reset ctrlr sequence Replace the spdk_nvme_ctrlr_reset_async() and spdk_nvme_reset_poll_async() calls by the spdk_nvme_ctrlr_disconnect(), spdk_nvme_ctrlr_reconnect_async(), and spdk_nvme_ctrlr_reconnect_poll_async() calls in a reset ctrlr sequence. spdk_nvme_ctrlr_disconnect() can fail if ctrlr is already resetting or removed. But both cases are not possible. reset is controlled and the callback to the hot remove is called when the ctrlr is hot removed. So we assume spdk_nvme_ctrlr_disconnect() always succeed. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1299e198597b2a2110f80b9a868e2dae015682ee Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10092 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-08 08:31:24 +00:00
Changpeng Liu	632c8d5613	nvme: make get INTEL log pages can be executed asynchronously Also we don't treat exceptions when getting INTEL log pages as a fatal error, the initialization will still contine. Change-Id: Ic2fd2be510fde2679c1546482934d0a180266936 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10341 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-12-06 23:17:07 +00:00
Evgeniy Kochetov	1fd2af0150	nvmf/ctrlr_bdev: Set DNR bit in status for failed NVMe passthru When NVMe passthru command (IO or admin) fails on submission (e.g. it is not supported), set DNR bit in completion status field. There is no sense in retrying the command in this case. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I55960c128bd9fc31f6defef0b9832259a71684b1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8578 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-12-03 08:13:52 +00:00
Evgeniy Kochetov	d03b31c61f	nvmf/ctrlr_bdev: Fix status code for failed admin passthru command If NVMe admin passthru command is not supported by underlying bdev, set status code in NVMe completion to INVALID_OPCODE. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I29c4e1f8263b76b27c199cfd2d9b2474432ec70b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10517 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com>	2021-12-03 08:13:52 +00:00
Evgeniy Kochetov	a9593c7981	bdev: Fail nvme passthru command if not supported by bdev The originally detected problem is that SPDK NVMf target fails command with invalid opcode with status code INTERNAL_DEVICE_ERROR instead of INVALID_OPCODE. All unknown commands on IO queue are passed to underlying block device layer as NVME_IO type. It is not checked if this type of commands is supported and, when command fails, INTERNAL_DEVICE_ERROR is set as status code. If command fails on submission, status code is set to INVALID_OPCODE which is more relevant. This patch adds check if command type is supported to bdev_nvme_*_passthru functions. If not supported, it is failed with ENOTSUP. Signed-off-by: Evgeniy Kochetov <evgeniik@nvidia.com> Change-Id: I4d7f7639da17dd3b1dc3eee7eb1b4a4f876117a2 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/8567 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Paul Luse <paul.e.luse@intel.com>	2021-12-03 08:13:52 +00:00
Josh Soref	c9c7c281f8	spelling: test Part of #2256 * achieve * additionally * against * aliases * already * another * arguments * between * capabilities * comparison * compatibility * configuration * continuing * controlq * cpumask * default * depends * dereferenced * discussed * dissect * driver * environment * everything * excluded * existing * expectation * failed * fails * following * functions * hugepages * identifiers * implicitly * in_capsule * increment * initialization * initiator * integrity * iteration * latencies * libraries * management * namespace * negotiated * negotiation * nonexistent * number * occur * occurred * occurring * offsetting * operations * outstanding * overwhelmed * parameter * parameters * partition * preempts * provisioned * responded * segment * skipped * struct * subsystem * success * successfully * sufficiently * this * threshold * transfer * transferred * unchanged * unexpected * unregistered * useless * utility * value * variable * workload Change-Id: I21ca7dab4ef575b5767e50aaeabc34314ab13396 Signed-off-by: Josh Soref <jsoref@gmail.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10409 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-12-03 08:13:22 +00:00
Jim Harris	7e68d0baca	nvme: configure AER for discovery controllers Move the CONFIGURE_AER state before SET_KEEP_ALIVE to make sure that we run the CONFIGURE_AER state for discovery controllers. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: Ia4e24f6507c43e3fece06b9161ff8e0b8fa0e97d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10332 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-02 04:02:29 +00:00
Shuhei Matsumoto	f9fba507fe	bdev/nvme: Redirect the reset ctrlr operation into nvme_ctrlr->thread In the following patches, we want to retry reconnect if reconnect failed in a reset ctrlr sequence but we want to delay the retry. While we wait the delayed retry, we want to quiesce ctrlr completely. As part of quiesce ctrlr operations, we want to pause adminq poller but we need to do it on the nvme_ctrlr->thread. If a reset ctrlr sequence runs on the nvme_ctrlr->thread, we can avoid redirecting the pending destruct request at completion too. So we redirect the reset ctrlr sequence into the nvme_ctrlr->thread. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I538b962e2a7b5cf00ebbac2a1e888482ddeeee61 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10075 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-12-01 09:20:09 +00:00
Jim Harris	1c083e6200	nvme: set keep alive for discovery controllers Discovery services using the SPDK nvme driver may use long-lasting connections that detect AER completions to determine when there are changes in the discovery log. This means that we still need to send keep alives on discovery controller admin queues. So move the SET_KEEP_ALIVE_TIMEOUT state immediately after IDENTIFY, and run the SET_KEEP_ALIVE_TIMEOUT state even for discovery controllers. Note, we need the IDENTIFY's KAS value to properly set the keep alive timeout, so we have to keep the IDENTIFY state before SET_KEEP_ALIVE_TIMEOUT. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I5c6403c28fb72d42629c5f9009a89c4bfd44d162 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10329 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-24 08:34:58 +00:00
Shuhei Matsumoto	50b10bc20e	bdev/nvme: bdev_nvme_reset_io() redirect to the orig_thread at completion In the following patches, bdev_nvme_reset() will execute the reset ctrlr operation on the nvme_ctrlr->thread until completion as bdev_nvme_admin_passthru() does. Hence change the callback bdev_nvme_reset_io_continue() to redirect to the orig_thread by using bio. Furthermore, use bio->cpl.cdw0 to store the completion status of the reset processing. bdev_nvme_reset() does not use bio->cpl. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I361cc44494190ba83ad6e360788d78851416c46c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10074 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	b4447abf70	bdev/nvme: Retry failed admin passthru up to retry_count times This patch supports admin passthrough retry when we get any error with DNR=0 but ABORTED_BY_REQUEST up to retry_count times. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I1bf29570791fdbe8651fa70c4c8685bb740fb86b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9944 Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	a9a86a14c1	bdev/nvme: Retry admin passthru immediately if it got ctrlr path error This patch supports admin passthrough retry when we get ctrlr path error at completion. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ice0045b84054ec66a9db9ef23e21786d2c082b1d Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9943 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Shuhei Matsumoto	35a2f4e22e	bdev/nvme: Retry admin passthru a second later if any ctrlr may become available When resetting ctrlr, adminq is disconnected first. If adminq is disconnected, admin passthrough request is rejected with -ENXIO. But resetting ctrlr may succeed. If resetting ctrlr succeeds, adminq is connected again, and admin passthrough request will be submitted successfully. On the other hand, if ctrlr is failed, admin passthrough request is rejected with -ENXIO. But when resetting ctrlr, ctrlr is set to unfailed. Hence bdev_nvme_admin_passthru() skips any ctrlr which is resetting or failed, and calls bdev_nvme_admin_passthru_complete() with -ENXIO if no available ctrlr is found. bdev_nvme_admin_passthru_complete() queues admin passthrough request and retry it one second later if ctrlr is resetting. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ic748dc4faf29ebf717ae5c29dcf7c55fe2ea9243 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9942 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-23 08:46:36 +00:00
Changpeng Liu	0af4a7cd84	nvme: abort outstanding requests case by case For DSM command, the NVMe drive may take a long time to finish it, if we set a small timeout value for DSM command, the bdev/nvme module will try to reset the IO queue pair when timeout happens, in `spdk_nvme_ctrlr_free_io_qpair`, we will abort the outstanding IO requests first, then in the `nvme_pcie_ctrlr_delete_io_qpair`, we will poll the CQ for any requests that have been completed by the NVMe controller, if there are NVMe completions in the CQ, we will finish them again, thus double completions happened. Here we rename `nvme_qpair_abort_reqs` to `nvme_qpair_abort_all_queued_reqs`, so the common layer will just abort queued request, and let each transport to abort outstanding requests case by case. Fix #2233. Change-Id: Icae6214239160c615418cb514fc51cfe77b59211 Signed-off-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10233 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-22 08:35:35 +00:00
Jim Harris	d810a7458d	idxd: change NOTICELOGs to DEBUGLOGs The NOTICELOGs really clutter the output during application start - it's better to make these DEBUGLOGs instead. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3ae37d5d057d7b972017befbc0834de414b9710b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10191 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-17 10:58:17 +00:00
Shuhei Matsumoto	7b8e7212a6	bdev/nvme: Abort the queued I/O for retry The NVMe bdev module queues retried I/Os itself now. bdev_nvme_abort() needs to check and abort the target I/O if it is queued for retry. This change will cover admin passthrough requests too because they will be queued on the same thread as their callers and the public API spdk_bdev_reset() requires to be submitted on the same thread as the target I/O or admin passthrough requests. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: If37e8188bd3875805cef436437439220698124b9 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9913 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-17 10:58:12 +00:00
Shuhei Matsumoto	72e4a4d46a	bdev/nvme: Each nvme_bdev_channel caches its current io_path Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I3ec3a588ff741cf04383e89f5a701e33bf1987a6 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9894 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-17 10:58:12 +00:00
Shuhei Matsumoto	ae7019417e	iscsi: Merge immediate data into the following R2T data The recent changes merged multiple Data-OUT PDUs within the same sequence into a single subtask up to 64KB. However, they were not enough. For a large write operation, the hardware iSCSI HBA host sent an immediate data whose size was not block size multiples and then more solicit data through R2T exchanges. One example for a 64KB write operation was as follows: host sent SCSI Write with 5792 bytes and F = 1 target replied a R2T host sent Data-OUT with 15880 bytes host sent Data-OUT with 11536 bytes host sent Data-OUT with 2848 bytes host sent Data-OUT with 11536 bytes host sent Data-OUT with 5744 bytes host sent Data-OUT with 12200 bytes and F = 1 The hardware iSCSI HBA host can decide the size of the unsolicited data but the SPDK iSCSI target can require the host to send the solicited data whose size is block size multiples. Hence we merge immediate data to the following R2T data if the immediate data is not more than 64KB and more R2T data come. Add another test case to check if the fix works for the above example. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I4906b4e1a8b61e08862f4ccc27a6caf165126530 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9708 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-16 09:08:27 +00:00
Alexey Marchuk	f72cab94dd	lib/vhost: Fix compilation with dpdk 21.11 Structure vhost_device_ops was renamed to rte_vhost_device_ops Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie9601099d47465536500aa37fc113aeae03a8254 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10223 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: John Kariuki <John.K.Kariuki@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Tomasz Zawadzki <tomasz.zawadzki@intel.com>	2021-11-16 09:06:54 +00:00
Ben Walker	84688fdb1c	nvme: Rename max_active_ns_idx to active_ns_count This was sometimes used as the maximum array index and sometimes as the maximum count. Make it consistent everywhere and give it a better name. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I518efd99a7d36584624490b0b3497bb6e81ce9ac Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10101 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-15 11:59:59 +00:00
Kai Li	8f633fa1c3	bdev/nvme: display all ctrlrs for this bdev when dump bdev nvme controller After multipath feature is supported, one bdev will have more than one nvme ctrlr. Fore ease of view, display each ctrlr's trid info. Moreover, rename nvme_bdev_ctrlr_get as nvme_bdev_ctrlr_get_by_name here to keep consistent with nvme_ctrlr_get_by_name. Signed-off-by: Kai Li <lik271@chinatelecom.cn> Change-Id: I417506699bbea6ed13dac0fee942749757d2ae47 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10129 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-11 23:24:26 +00:00
Niklas Cassel	b7ad5b0b90	bdev/zone: add support for get zone id In the bdev-zone API, there are a few functions that takes a zone_id: spdk_bdev_get_zone_info(), spdk_bdev_zone_management(), and the spdk_bdev_zone_append() functions. The way a zoned application is usually written is that it starts off by getting the zone report for all zones (zone_id will be sent in as 0), and then the application will keep the whole zone report in memory. Therefore, an application usually have access to the zone_id/zslba for all zones. However, there are cases, e.g. when getting an error on write, where the completion callback will only have the lba of the write that failed. Add a helper function that can be used to get the zone_id/slba for a given lba. Having this helper in bdev-zone will avoid SPDK applications needing to provide their own implementation for this. Signed-off-by: Niklas Cassel <niklas.cassel@wdc.com> Change-Id: I978335f87f7d49bc33aed81afcaa6d9f0af8a1e4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10180 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-11-11 23:23:35 +00:00
Shuhei Matsumoto	eb739d0364	iscsi: Fix the case that incoming data is split between data segment and data digest When data segment size is 64KB and data digest is enabled, if data segment and data digest are split into different two packets, - pdu->mobj[0] became full first when reading data semgment, - pdu->mobj[1] was allocated but unused and data digest was read. In this case, two SCSI write tasks were submitted by mistake and the second SCSI write task had no data. Fix the bug in this patch. When iscsi_pdu_payload_read() is called and pdu->mobj[0] is full, allocate pdu->mobj[1] only if any of data segment remains to read. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9a0c36c05f90092c3c2122a7eb91e10976830b40 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9965 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-11-11 23:22:57 +00:00
Ben Walker	2dbdb9945c	test/nvme: Only test non-contiguous namespaces for NVMe 1.2 or higher This wasn't supported before NVMe 1.2 Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: Ibf19cd77e522eb11c2091a9f4956f5616876986b Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10097 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Mellanox Build Bot Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-11-10 19:36:27 +00:00
Ben Walker	52e432dff2	test/nvme: Fix buffer zeroing math This meant to zero the entire active namespace list. Signed-off-by: Ben Walker <benjamin.walker@intel.com> Change-Id: I2da2293b53acd57d3480cf93b052eb1520de35d4 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10028 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-11-10 19:36:27 +00:00
Jim Harris	ec2ad00c92	test/unit/raid: fix set-but-not-used error verify_io() keeps track of a buf pointer, but the buf pointer never actually gets used. So remove this buf pointer. Found by clang-13. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I79dfeac7f004b56f7d4404f41b2ff18b96968a20 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/10056 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-11-03 18:30:55 +00:00
Shuhei Matsumoto	84ac18e545	bdev/nvme: Update ANA state if I/O failed by ANA error If I/O got ANA error, ANA state may be out of date. So in this case read ANA log page and update ANA states. Mark nvme_ns to be updating to avoid using while updating ANA state. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Ia43d38b3a589c84d6d0479dedcced033e76fb194 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9458 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-27 11:53:31 +00:00
Shuhei Matsumoto	f3fec96c20	bdev/nvme: Protect ANA log page from concurrent reads by using an new flag If an I/O failed by ANA error, the corresponding ANA state might be out of date. In the following patches, for this case, read the latest ANA log page and update the ANA state. Such reading ANA log page may be done on multiple threads concurrently including AER ANA change. Hence protect ANA log page by adding an new flag ana_log_page_updating to struct nvme_ctrlr and using it. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I8bb84091d50a5fdc0d9893b585be972dfd31c0f1 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9526 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-27 11:53:31 +00:00
Shuhei Matsumoto	43adb646b8	bdev/nvme: Retry failed I/O up to retry_count times Add bdev_retry_count to spdk_bdev_nvme_opts and retry_count to nvme_bdev_io, respectively. Set type of both to int because we want use -1 for infinite retry. Set the default value of bdev_retry_count to zero for the backward compatibility. bdev_retry_count is configurable by the RPC bdev_nvme_set_options. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I9bc746fcea54aa8722c76f79c70c2ae2b375aa53 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9864 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-27 11:53:31 +00:00
Alexey Marchuk	3d8904c66b	nvmf: Add discovery filtering rules SPDK nvmf target reports all listeners on all subsystems in discovery pages, kernel target reports only subsystems listening on a port where discovery command is received. NVMEoF specification allows to specify any addresses/ transport types. Ch 5: The set of Discovery Log entries should include all applicable addresses on the same fabric as the Discovery Service and may include addresses on other fabrics. To align SPDK and kernel targets behaviour, add filtering rules to allow flexible configuration of what should be listed in discovery log page entries. Fixes #2082 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ie981edebb29206793d3310940034dcbb22c52441 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9185 Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-10-25 22:57:48 +00:00
Jim Harris	e40bd53175	nvme/pcie: only set qpair state from qpair's thread The qpair's state member is only 3 bits of a uint8_t, and the in_completion_context bit is another bit in that same uint8_t. We know that the qpair's state is only ever updated by one thread, but it is possible that the state could be modified by one thread, while another thread is modifying in_completion_context. in_completion_context is only modified by the thread that is polling the qpair (or the qpair's poll group). But with async mode, another thread that has a qpair on the same PCIe controller could poll its adminq and reap the SQ completion for the qpair that's owned by the other thread. So do not set the generic qpair state to CONNECTED from the SQ completion callback. Instead just set the pcie_state to READY, and let the thread that owns the qpair detect the qpair is READY and set the state to CONNECTED itself. Fixes issue #2157. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I9efc0c954504f1841e1c3890ae78211ad0d1990e Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9975 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-10-25 19:53:14 +00:00
GangCao	9072c4ad0d	accel: create SW Engine Channel if HW Engine not supports Currently either HW Engine Channel or SW Engine Channel will be used. In the case that HW Engine Channel is used while does not support related operations like IOAT for CRC, it will shift back to the SW Engine's handle. So that this is an issue that it still refers to the HW Engine Channel while needs SW Eninge Channel to handle. This patch introduces the SW Eninge Channel and always initializes there in case that HW Engine does not support some operations. Related UT also added to simulate the case the IOAT does not support CRC and then SW Eninge needs to properly handle it. Change-Id: I4ecdcd09ab669a616b37c567b45b1e6499800ec9 Signed-off-by: GangCao <gang.cao@intel.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9874 Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot	2021-10-20 23:04:38 +00:00
Alexey Marchuk	2696886c75	dma: Update translation result to hold iovec pointer In some cases a single virtually contriguos memory buffer can be translated to several chunks of memory. To make such translation possible, update structure spdk_memory_domain_translation_result to use a pointer to iovec. Add a single iov structure or cases where translation is always 1:1, it will make easier translation callback implementation. For RDMA transport translation of address is always 1:1, so treat iovcnt other than 1 as an error. Change-Id: I65605575d43a490490eba72c1eb19f3a09d55ec6 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9779 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-20 22:55:52 +00:00
Alexey Marchuk	549bcdc0a4	dma: Update memory domain context structure Instead of a union with domain type specific parameters, store an opaque pointer to user context. Depending on the memory domain type, this context can be cast to a specific struct, e.g. to spdk_memory_domain_rdma_ctx for RDMA memory domains. This change provides more flexibility to applications to create and manage custom memory domains Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Change-Id: Ib0a8297de80773d86edc9849beb4cbc693ef5414 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9778 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com>	2021-10-20 22:55:52 +00:00
Alexey Marchuk	0ecbe09bc1	dma: Add infrstructure for push operation Push operation complements existing pull operation and allows to implement read data flow using memory domains. Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Signed-off-by: Max Gurtovoy <mgurtovoy@nvidia.com> Change-Id: I0a3ddcb88c433dff7a9c761a99838658c72c43fd Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9701 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-20 22:55:52 +00:00
Shuhei Matsumoto	a59b3f9236	bdev/nvme: Retry I/O immediately if it got I/O path error The previous patch supported I/O retry when no available io_path was found at submission. This patch supports I/O retry when we get I/O path error at completion. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: I93a1664944b15ab0a826a321e2ea7a2574263afe Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9850 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-20 07:23:40 +00:00
Shuhei Matsumoto	ef409194a1	bdev/nvme: Retry I/O a second later if any I/O path may become available If ANA state is inaccessible or qpair is disconnected, I/O cannot be submitted. But if qpair is connected, ANA state may become accessible, or if qpair is disconnected, it may become connected via resetting. Hence even if find_io_path() returned NULL, queue I/O and retry it one second later if qpair is connected or ctrlr is resetting. Sort retried I/Os by expiration values in ticks, and activate a timed poller per nvme_bdev_channel only if there is any retried I/O. So the poller function bdev_nvme_retry_ios() always returns BUSY because if the poller runs earlier than the closest retried I/O or runs when there is no retried I/O, it is more like a bug of the framework. Signed-off-by: Shuhei Matsumoto <shuhei.matsumoto.xt@hitachi.com> Change-Id: Id28110a0d63ebc1c5772814e2ff8a47934df1644 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9830 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@mellanox.com>	2021-10-20 07:23:40 +00:00
Alexey Marchuk	d47893607b	test/scsi: Fix uninitialized variable dev_ut.c:667:30: error: ‘prev_lun’ may be used uninitialized in this function [-Werror=maybe-uninitialized] 667 \| struct spdk_scsi_lun lun, prev_lun; \| ^~~~~~~~ gcc (Ubuntu 9.3.0-17ubuntu1~20.04) 9.3.0 aarch64 Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Id6608620ef6f18002ff7b7cc6de3e1361be762d0 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9860 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Community-CI: Mellanox Build Bot Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Jim Harris <james.r.harris@intel.com>	2021-10-18 21:49:20 +00:00
Alexey Marchuk	9efad7468f	dma: Rename fetch operation to pull The new name suits better to the following "data push" operation Signed-off-by: Alexey Marchuk <alexeymar@mellanox.com> Change-Id: Ic3249f65de203f375477f8e87b0749b9502d165c Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/9878 Community-CI: Broadcom CI <spdk-ci.pdl@broadcom.com> Reviewed-by: Changpeng Liu <changpeng.liu@intel.com> Reviewed-by: Ben Walker <benjamin.walker@intel.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com>	2021-10-18 07:56:57 +00:00

1 2 3 4 5 ...

2533 Commits