Default Branch

1b90c508af · Revert "Revert "feat: bump flake including transformers and huggingfa… (#3326) · Updated 2025-09-09 14:44:25 +00:00

Branches

8fad7ae5a2 · add some more basic info in README.md · Updated 2024-07-30 08:45:29 +00:00 · Leaf · 641 behind, 82 ahead
0b95693fb8 · fix: adjust test snapshots and small refactors (#2323) · Updated 2024-07-29 15:38:38 +00:00 · Leaf · 535 behind, 0 ahead · Included
12381b0b0e · delete the last no repeat processor from warpers · Updated 2024-07-26 09:22:46 +00:00 · Leaf · 586 behind, 5 ahead
169c8c2cf5 · token.to_str() returns result · Updated 2024-07-26 08:52:55 +00:00 · Leaf · 634 behind, 6 ahead
5afc98a7d7 · Snapshot update with vllm paged. · Updated 2024-07-25 10:17:40 +00:00 · Leaf · 560 behind, 3 ahead
344427b6ab · feat(router): drop permit after batching · Updated 2024-07-23 20:40:14 +00:00 · Leaf · 552 behind, 1 ahead
db7e043ded · New version. · Updated 2024-07-23 16:29:13 +00:00 · Leaf · 553 behind, 1 ahead
0c95f7a942 · Debug softcap flash decoding activation · Updated 2024-07-23 13:12:19 +00:00 · Leaf · 557 behind, 1 ahead
dee649c60c · Chore: Fix naming issues regarding head_size, there can only be one. · Updated 2024-07-23 09:26:53 +00:00 · Leaf · 558 behind, 1 ahead
82fc879e17 · feat: refactor lora linear and remove adapter layers · Updated 2024-07-18 19:58:55 +00:00 · Leaf · 580 behind, 1 ahead
a1b69a8cc5 · Completing development guide · Updated 2024-07-18 15:38:18 +00:00 · Leaf · 663 behind, 2 ahead
959b9dc25f · Fixup constructor arguments · Updated 2024-07-17 07:42:24 +00:00 · Leaf · 586 behind, 16 ahead
2967b8168c · fix post refactor · Updated 2024-07-16 13:16:27 +00:00 · Leaf · 580 behind, 51 ahead
f6ad3b3585 · Some MoE exploration · Updated 2024-07-15 11:47:52 +00:00 · Leaf · 586 behind, 1 ahead
5b27307438 · Don't error on OpenAI valid top_p values. · Updated 2024-07-12 20:22:23 +00:00 · Leaf · 586 behind, 1 ahead
5c69639f74 · add condition different than PR · Updated 2024-07-12 11:19:52 +00:00 · Leaf · 594 behind, 8 ahead
4dfdb481fb · Version 2.1.1 · Updated 2024-07-04 10:39:07 +00:00 · Leaf · 611 behind, 1 ahead
fe3991e857 · feat: add simple ttft load_test · Updated 2024-07-02 15:57:01 +00:00 · Leaf · 616 behind, 1 ahead
cb232a35a9 · feat: add test to view batch speedup amount · Updated 2024-07-02 13:33:26 +00:00 · Leaf · 616 behind, 1 ahead
0a5b19a3ed · updated doc · Updated 2024-07-02 13:10:26 +00:00 · Leaf · 660 behind, 22 ahead