adapters
|
(fix) sliding window attention
|
2025-03-13 19:43:00 +00:00 |
layers
|
Update window size rocm flash decoding
|
2025-03-14 07:50:11 +00:00 |
models
|
(typo) collection link
|
2025-03-14 07:36:38 +00:00 |
pb
|
chore: add pre-commit (#1569)
|
2024-02-16 11:58:58 +01:00 |
utils
|
Update to kernels 0.2.1 (#3084)
|
2025-03-13 10:36:29 +01:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
Fixing TRTLLM dockerfile. (#2922)
|
2025-01-20 11:13:46 +01:00 |
interceptor.py
|
feat: prefill chunking (#2600)
|
2024-10-16 12:49:33 +02:00 |
server.py
|
Tmp tp transformers (#2942)
|
2025-01-23 18:07:30 +01:00 |