__init__.py
|
Add support for FP8 KV cache scales (#2628)
|
2024-10-24 16:36:18 +02:00 |
common.py
|
feat: prefill chunking (#2600)
|
2024-10-16 12:49:33 +02:00 |
cuda.py
|
Bug Fix: Sliding Window Attention (#3112)
|
2025-03-18 10:37:33 +01:00 |
flash_attn_triton.py
|
feat: prefill chunking (#2600)
|
2024-10-16 12:49:33 +02:00 |
flashinfer.py
|
Merge 3d71c06aff into 84ab88d843
|
2025-04-17 18:42:51 +02:00 |
ipex.py
|
Bug Fix: Sliding Window Attention (#3112)
|
2025-03-18 10:37:33 +01:00 |
kv_cache.py
|
Use kernels from the kernel hub (#2988)
|
2025-02-10 19:19:25 +01:00 |
rocm.py
|
Bug Fix: Sliding Window Attention (#3112)
|
2025-03-18 10:37:33 +01:00 |