text-generation-inference/backends/llamacpp
Adrien Gallouët ae5bb789c2
Enable flash attention by default
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:58 +00:00
..
.cargo Add llamacpp backend 2025-02-04 13:32:56 +00:00
src Enable flash attention by default 2025-02-04 13:32:58 +00:00
build.rs Add llamacpp backend 2025-02-04 13:32:56 +00:00
Cargo.toml Add llamacpp backend 2025-02-04 13:32:56 +00:00