Adrien Gallouët
|
c52f08351f
|
Set TGI_LLAMA_PKG_CUDA from CUDA_VERSION
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2025-02-05 10:57:50 +00:00 |
|
Adrien Gallouët
|
906c265aef
|
Cleanup Dockerfile
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2025-02-04 17:53:47 +00:00 |
|
Adrien Gallouët
|
df2a4fbb8a
|
Update Dockerfile_llamacpp
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2025-02-04 13:32:59 +00:00 |
|
Adrien Gallouët
|
207041a977
|
Bump llamacpp to b4623
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2025-02-04 13:32:59 +00:00 |
|
Morgan Funtowicz
|
e6a8d33902
|
backend(llama): add CUDA architectures build argument for Dockerfile
|
2025-02-04 13:32:59 +00:00 |
|
Morgan Funtowicz
|
960c12bd6e
|
backend(llama): add CUDA Dockerfile_llamacpp for now
|
2025-02-04 13:32:58 +00:00 |
|
Adrien Gallouët
|
8d2dfdf668
|
Handle ctx args & fix sampling
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2025-02-04 13:32:58 +00:00 |
|
Adrien Gallouët
|
95e221eece
|
Add llamacpp backend
Signed-off-by: Adrien Gallouët <angt@huggingface.co>
|
2025-02-04 13:32:56 +00:00 |
|