text-generation-inference/backends/llamacpp/src
Adrien Gallouët 30cd3cf510
Enable mmap, offload_kqv & flash_attention by default
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-03-05 11:08:17 +00:00
..
backend.rs [Backend] Add Llamacpp backend (#2975) 2025-02-14 13:40:57 +01:00
main.rs Enable mmap, offload_kqv & flash_attention by default 2025-03-05 11:08:17 +00:00