Default Branch

bd1bdebb47 · doc: fix README (#3271) · Updated 2025-06-18 10:35:36 +00:00

Branches

d33fb9ed2c · extracting traceparent from header to span · Updated 2024-08-21 09:28:50 +00:00    Leaf

426 behind · 1 ahead

2652e209e7 · Updated flake lock · Updated 2024-08-21 07:15:10 +00:00    Leaf

424 behind · 15 ahead

b378fb4702 · Fixing exl2 (by disabling cuda graphs) · Updated 2024-08-14 17:44:54 +00:00    Leaf

457 behind · 2 ahead

89707adbbb · Fixing exl2 (by disabling cuda graphs) · Updated 2024-08-14 17:41:29 +00:00    Leaf

440 behind · 4 ahead

4b10c8c30b · fix: improve scales change and revert conditional · Updated 2024-08-14 16:38:15 +00:00    Leaf

441 behind · 2 ahead

b84bb19ece · fix: prefer recent gptq changes · Updated 2024-08-12 15:51:19 +00:00    Leaf

448 behind · 9 ahead

7bc16deb48 · wip: debug gemma and flash · Updated 2024-08-09 23:08:54 +00:00    Leaf

458 behind · 1 ahead

7735b385dc · Prefix caching WIP · Updated 2024-08-09 14:52:59 +00:00    Leaf

460 behind · 1 ahead

9f039ad4b3 · flake: use rust-overlay · Updated 2024-08-09 13:02:57 +00:00    Leaf

464 behind · 2 ahead

e219397ee1 · fix: adjust syntax typo again · Updated 2024-08-08 00:31:24 +00:00    Leaf

476 behind · 4 ahead

f230da8d63 · Keeping the benchmark somewhere · Updated 2024-08-06 12:36:15 +00:00    Leaf

485 behind · 17 ahead

4379f0650a · feat: add release and sha tagged images · Updated 2024-08-05 17:13:52 +00:00    Leaf

482 behind · 1 ahead

ab2ab2a0aa · pre-commit · Updated 2024-08-05 11:01:19 +00:00    Leaf

487 behind · 16 ahead

060b2db0df · add 'mamba' as model config · Updated 2024-08-01 16:16:32 +00:00    Leaf

484 behind · 1 ahead

8fad7ae5a2 · add some more basic info in README.md · Updated 2024-07-30 08:45:29 +00:00    Leaf

598 behind · 82 ahead

0b95693fb8 · fix: adjust test snapshots and small refactors (#2323) · Updated 2024-07-29 15:38:38 +00:00    Leaf

492 behind · 0 ahead · Included

12381b0b0e · delete the last no repeat processor from warpers · Updated 2024-07-26 09:22:46 +00:00    Leaf

543 behind · 5 ahead

169c8c2cf5 · token.to_str() returns result · Updated 2024-07-26 08:52:55 +00:00    Leaf

591 behind · 6 ahead

5afc98a7d7 · Snapshot update with vllm paged. · Updated 2024-07-25 10:17:40 +00:00    Leaf

517 behind · 3 ahead

344427b6ab · feat(router): drop permit after batching · Updated 2024-07-23 20:40:14 +00:00    Leaf

509 behind · 1 ahead