Commit Graph

8 Commits

Author SHA1 Message Date
Adrien Gallouët
c52f08351f
Set TGI_LLAMA_PKG_CUDA from CUDA_VERSION
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-05 10:57:50 +00:00
Adrien Gallouët
906c265aef
Cleanup Dockerfile
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 17:53:47 +00:00
Adrien Gallouët
df2a4fbb8a
Update Dockerfile_llamacpp
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:59 +00:00
Adrien Gallouët
207041a977
Bump llamacpp to b4623
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:59 +00:00
Morgan Funtowicz
e6a8d33902
backend(llama): add CUDA architectures build argument for Dockerfile 2025-02-04 13:32:59 +00:00
Morgan Funtowicz
960c12bd6e
backend(llama): add CUDA Dockerfile_llamacpp for now 2025-02-04 13:32:58 +00:00
Adrien Gallouët
8d2dfdf668
Handle ctx args & fix sampling
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:58 +00:00
Adrien Gallouët
95e221eece
Add llamacpp backend
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
2025-02-04 13:32:56 +00:00