text-generation-inference/backends/trtllm
2024-07-22 11:32:31 +00:00
cmake           simplify prebuilt trtllm libraries name definition  2024-07-22 11:32:31 +00:00
include         define a shared struct to hold the result of a decoding step  2024-07-18 21:33:04 +00:00
lib             forward tgi parameters rep/freq penalty  2024-07-18 20:56:58 +00:00
scripts         Overall build TRTLLM and deps through CMake build system  2024-07-02 17:16:27 +02:00
src             make sure executor_worker is provided  2024-07-19 11:57:10 +00:00
tests           First version loading engines and making it ready for inference  2024-07-03 21:12:24 +00:00
build.rs        add initial Dockerfile for TRTLLM backend  2024-07-19 22:08:12 +00:00
Cargo.toml      make sure the context is not dropped in the middle of the async decoding.  2024-07-17 21:56:50 +00:00
CMakeLists.txt  add some more information in CMakeLists.txt to correctly find and install nvrtc wrapper  2024-07-22 09:33:38 +00:00
Dockerfile      add initial Dockerfile for TRTLLM backend  2024-07-19 22:08:12 +00:00