text-generation-inference/backends/llamacpp/src
Adrien Gallouët 3eb4823f3e
Use max_batch_total_tokens
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:58 +00:00
..
backend.rs Use max_batch_total_tokens 2025-02-04 13:32:58 +00:00
main.rs Use max_batch_total_tokens 2025-02-04 13:32:58 +00:00
wrapper.h Add llamacpp backend 2025-02-04 13:32:56 +00:00