adapters | Bug Fix: Sliding Window Attention (#3112) | 2025-03-18 10:37:33 +01:00 |
layers | flashinfer: head_dim -> head_dim_qk | 2025-04-11 12:38:17 +00:00 |
models | Update transformers to 4.51 (#3148) | 2025-04-07 12:55:43 +02:00 |
pb | chore: add pre-commit (#1569) | 2024-02-16 11:58:58 +01:00 |
utils | xpu 2.6 update (#3051) | 2025-03-17 13:48:48 +01:00 |
__init__.py | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00 |
cli.py | Fixing TRTLLM dockerfile. (#2922) | 2025-01-20 11:13:46 +01:00 |
interceptor.py | feat: prefill chunking (#2600) | 2024-10-16 12:49:33 +02:00 |
server.py | Tmp tp transformers (#2942) | 2025-01-23 18:07:30 +01:00 |