text-generation-inference/backends/trtllm
2024-07-17 13:55:29 +00:00
cmake impl RwLock scenario for TensorRtLllmBackend 2024-07-16 20:08:10 +00:00
include compute the number of maximum new tokens for each request independently 2024-07-17 13:55:29 +00:00
lib compute the number of maximum new tokens for each request independently 2024-07-17 13:55:29 +00:00
scripts Overall build TRTLLM and deps through CMake build system 2024-07-02 17:16:27 +02:00
src compute the number of maximum new tokens for each request independently 2024-07-17 13:55:29 +00:00
tests First version loading engines and making it ready for inference 2024-07-03 21:12:24 +00:00
build.rs impl RwLock scenario for TensorRtLllmBackend 2024-07-16 20:08:10 +00:00
Cargo.toml impl RwLock scenario for TensorRtLllmBackend 2024-07-16 20:08:10 +00:00
CMakeLists.txt use external fmt lib 2024-07-12 19:24:59 +00:00