text-generation-inference/backends/trtllm/include
2024-07-17 13:55:29 +00:00
backend.h  compute the number of maximum new tokens for each request independently  2024-07-17 13:55:29 +00:00
ffi.h      impl RwLock scenario for TensorRtLllmBackend                             2024-07-16 20:08:10 +00:00