If we want to test something like 512 qpairs, with qd = 8 for each, you need to specify -q 4096 -P 512. Then those 4096 I/O are spread across the 512 qpairs, to get qd = 8 for each qpair.. But currently it ends up also allocating 4096 num_io_requests for each qpair which is a huge waste. We need to instead base the num_io_requests on the effective queue depth for each of the qpairs. Signed-off-by: Jim Harris <james.r.harris@intel.com> Change-Id: I3ec0f4d9ab94388bf980c0b0439790847161ec12 Reviewed-on: https://review.spdk.io/gerrit/c/spdk/spdk/+/17620 Reviewed-by: Konrad Sztyber <konrad.sztyber@intel.com> Reviewed-by: Aleksey Marchuk <alexeymar@nvidia.com> Tested-by: SPDK CI Jenkins <sys_sgci@intel.com> Reviewed-by: Shuhei Matsumoto <smatsumoto@nvidia.com> |
||
---|---|---|
.. | ||
.gitignore | ||
Makefile | ||
perf.c | ||
README.md |
Compiling perf on FreeBSD
To use perf test on FreeBSD over NVMe-oF, explicitly link userspace library of HBA. For example, on a setup with Mellanox HBA,
LIBS += -lmlx5