Default Branch

8f8819795f · Fixing CI (#3184) · Updated 2025-04-18 11:07:18 +00:00

Branches

afe3fed1a4 · Merge branch 'fix_rocm_fa' into rocm_6.2_fixes · Updated 2024-09-24 10:53:50 +00:00

329 behind · 21 ahead

38c625bfeb · Release 2.3.0 · Updated 2024-09-20 16:15:06 +00:00    Leaf

327 behind · 1 ahead

662e073668 · priv-cache. · Updated 2024-09-17 15:26:37 +00:00    Leaf

336 behind · 69 ahead

0dd6eef748 · Update runner · Updated 2024-09-17 14:01:52 +00:00    Leaf

337 behind · 3 ahead

c821a0ff76 · Tmp dump. · Updated 2024-09-17 09:19:03 +00:00    Leaf

336 behind · 2 ahead

10628e878a · Merge branch 'main' into gpt_awq_4 · Updated 2024-09-13 08:45:19 +00:00    Leaf

339 behind · 4 ahead

6b995cca30 · enable intel ipex cpu and xpu in python3.11 · Updated 2024-09-12 12:47:26 +00:00    Leaf

343 behind · 1 ahead

eabbbbda23 · Add Directory Check to Prevent Redundant Cloning in Build Process (#2486) · Updated 2024-09-07 11:19:43 +00:00    Leaf

346 behind · 0 ahead · Included

69dd51069f · unique hash for each image token · Updated 2024-09-03 12:56:02 +00:00    Leaf

360 behind · 5 ahead

a258e8f66a · fix: Fix PR comments · Updated 2024-09-02 07:36:23 +00:00    Leaf

361 behind · 12 ahead

5838f2139f · Tied embeddings in MLP speculator. · Updated 2024-08-29 10:30:26 +00:00    Leaf

367 behind · 47 ahead

e152cb022b · fix: also show total memory after full warmup · Updated 2024-08-22 17:57:51 +00:00    Leaf

372 behind · 2 ahead

d33fb9ed2c · extracting traceparent from header to span · Updated 2024-08-21 09:28:50 +00:00    Leaf

373 behind · 1 ahead

2652e209e7 · Updated flake lock · Updated 2024-08-21 07:15:10 +00:00    Leaf

373 behind · 15 ahead

b378fb4702 · Fixing exl2 (by disabling cuda graphs) · Updated 2024-08-14 17:44:54 +00:00    Leaf

406 behind · 2 ahead

89707adbbb · Fixing exl2 (by disabling cuda graphs) · Updated 2024-08-14 17:41:29 +00:00    Leaf

389 behind · 4 ahead

4b10c8c30b · fix: improve scales change and revert conditional · Updated 2024-08-14 16:38:15 +00:00    Leaf

390 behind · 2 ahead

b84bb19ece · fix: prefer recent gptq changes · Updated 2024-08-12 15:51:19 +00:00    Leaf

397 behind · 9 ahead

7bc16deb48 · wip: debug gemma and flash · Updated 2024-08-09 23:08:54 +00:00    Leaf

407 behind · 1 ahead

7735b385dc · Prefix caching WIP · Updated 2024-08-09 14:52:59 +00:00    Leaf

409 behind · 1 ahead