.. |
custom_kernels
|
All integration tests back everywhere (too many failed CI). (#2428)
|
2024-09-25 06:10:59 +00:00 |
exllama_kernels
|
Update ROCM libs and improvements (#2579)
|
2024-10-25 09:01:04 +00:00 |
exllamav2_kernels
|
Update ROCM libs and improvements (#2579)
|
2024-10-25 09:01:04 +00:00 |
tests
|
Fix tokenization yi (#2507)
|
2024-09-25 06:15:35 +00:00 |
text_generation_server
|
Fix the loading issue of 90B (#283)
|
2025-02-28 11:20:55 +01:00 |
.gitignore
|
Impl simple mamba model (#1480)
|
2024-04-23 11:45:11 +03:00 |
dill-0.3.7-patch.sh
|
Make Gaudi adapt to the tgi 2.3.0
|
2024-09-26 06:04:55 +00:00 |
dill-0.3.8-patch.sh
|
Make Gaudi adapt to the tgi 2.3.0
|
2024-09-26 06:04:55 +00:00 |
Makefile
|
Add the no-deps in pip install
|
2024-12-08 12:14:38 +00:00 |
Makefile-awq
|
chore: add pre-commit (#1569)
|
2024-04-24 15:32:02 +03:00 |
Makefile-eetq
|
Upgrade EETQ (Fixes the cuda graphs). (#1729)
|
2024-04-25 17:58:27 +03:00 |
Makefile-exllamav2
|
Upgrading exl2. (#2415)
|
2024-09-25 06:07:40 +00:00 |
Makefile-fbgemm
|
Add Directory Check to Prevent Redundant Cloning in Build Process (#2486)
|
2024-09-25 06:14:07 +00:00 |
Makefile-flash-att
|
Hotfixing make install . (#2008)
|
2024-09-24 03:29:29 +00:00 |
Makefile-flash-att-v2
|
Update ROCM libs and improvements (#2579)
|
2024-10-25 09:01:04 +00:00 |
Makefile-flashinfer
|
Prefix test - Different kind of load test to trigger prefix test bugs. (#2490)
|
2024-09-25 06:14:07 +00:00 |
Makefile-lorax-punica
|
Enable multiple LoRa adapters (#2010)
|
2024-09-24 03:55:04 +00:00 |
Makefile-selective-scan
|
chore: add pre-commit (#1569)
|
2024-04-24 15:32:02 +03:00 |
Makefile-vllm
|
Update ROCM libs and improvements (#2579)
|
2024-10-25 09:01:04 +00:00 |
poetry.lock
|
Merge branch 'habana-main' into 2.3.0
|
2024-11-01 11:24:40 +08:00 |
pyproject.toml
|
Enable mllama (#272)
|
2025-02-27 16:12:15 +01:00 |
README.md
|
chore: add pre-commit (#1569)
|
2024-04-24 15:32:02 +03:00 |
requirements_cuda.txt
|
Mllama flash version (#2585)
|
2024-10-27 04:03:57 +00:00 |
requirements_intel.txt
|
Mllama flash version (#2585)
|
2024-10-27 04:03:57 +00:00 |
requirements_rocm.txt
|
Mllama flash version (#2585)
|
2024-10-27 04:03:57 +00:00 |
requirements.txt
|
Enable mllama (#272)
|
2025-02-27 16:12:15 +01:00 |