| custom_kernels | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| exllama_kernels | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| exllamav2_kernels | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| tests | Updated kv cache for starcoder (#128) | 2024-06-14 22:36:44 +02:00 |
| text_generation_server | BS round up to BUCKET_SIZE to prevent capture graph when graph input not change (#185) | 2024-07-16 09:42:46 +02:00 |
| .gitignore | Impl simple mamba model (#1480) | 2024-04-23 11:45:11 +03:00 |
| dill-0.3.7-patch.sh | Hgraph dill patch (#131) | 2024-04-26 11:08:15 +02:00 |
| dill-0.3.8-patch.sh | A patch to address HPU Graphs issue with DILL | 2024-05-06 09:15:46 +03:00 |
| Makefile | fix: fix CohereForAI/c4ai-command-r-plus (#1707) | 2024-04-25 17:51:35 +03:00 |
| Makefile-awq | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| Makefile-eetq | Upgrade EETQ (Fixes the cuda graphs). (#1729) | 2024-04-25 17:58:27 +03:00 |
| Makefile-flash-att | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| Makefile-flash-att-v2 | fix: fix CohereForAI/c4ai-command-r-plus (#1707) | 2024-04-25 17:51:35 +03:00 |
| Makefile-selective-scan | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| Makefile-vllm | (chore): torch 2.3.0 (#1833) | 2024-06-10 14:12:46 +03:00 |
| poetry.lock | Update to SynapseAI 1.16.0 (#167) | 2024-07-03 11:08:56 +02:00 |
| pyproject.toml | Update to SynapseAI 1.16.0 (#167) | 2024-07-03 11:08:56 +02:00 |
| README.md | chore: add pre-commit (#1569) | 2024-04-24 15:32:02 +03:00 |
| requirements_cuda.txt | (chore): torch 2.3.0 (#1833) | 2024-06-10 14:12:46 +03:00 |
| requirements_rocm.txt | (chore): torch 2.3.0 (#1833) | 2024-06-10 14:12:46 +03:00 |
| requirements.txt | Update to SynapseAI 1.16.0 (#167) | 2024-07-03 11:08:56 +02:00 |