text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-10 23:45:23 +00:00

History

Adrien Gallouët 8ed362d03a Clear request cache after completion Signed-off-by: Adrien Gallouët <angt@huggingface.co>		2025-02-04 13:32:59 +00:00
..
.cargo	Add llamacpp backend	2025-02-04 13:32:56 +00:00
src	Clear request cache after completion	2025-02-04 13:32:59 +00:00
build.rs	backend(llama): add CUDA Dockerfile_llamacpp for now	2025-02-04 13:32:58 +00:00
Cargo.toml	Auto-detect n_threads when not provided	2025-02-04 13:32:59 +00:00
requirements.txt	backend(llama): add CUDA Dockerfile_llamacpp for now	2025-02-04 13:32:58 +00:00