text-generation-inference/backends/llamacpp
Adrien Gallouët 8ed362d03a
Clear request cache after completion
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:59 +00:00
..
.cargo Add llamacpp backend 2025-02-04 13:32:56 +00:00
src Clear request cache after completion 2025-02-04 13:32:59 +00:00
build.rs backend(llama): add CUDA Dockerfile_llamacpp for now 2025-02-04 13:32:58 +00:00
Cargo.toml Auto-detect n_threads when not provided 2025-02-04 13:32:59 +00:00
requirements.txt backend(llama): add CUDA Dockerfile_llamacpp for now 2025-02-04 13:32:58 +00:00