Default Branch

c6071749db · Fix mask passed to flashinfer (#3324) · Updated 2025-09-08 17:47:03 +00:00

Branches

c9e0f36dbc · Machete WIP · Updated 2024-10-14 13:46:00 +00:00    Leaf

377
1

99b1cf5948 · fix: rerun linter · Updated 2024-10-09 20:10:31 +00:00    Leaf

378
6

130f9d16b5 · fix: rerun black lint · Updated 2024-10-09 18:44:41 +00:00    Leaf

379
3

e618ce3ada · Fix: make moe_kernels imports conditional · Updated 2024-10-08 11:05:28 +00:00    Leaf

381
1

74489227e0 · Add Google Cloud in docs/source/references/api_reference.md · Updated 2024-10-05 14:54:17 +00:00    Leaf

385
2

11d7af730b · add cloning in Dockerfile · Updated 2024-10-04 17:41:02 +00:00    Leaf

395
6

3f07ddb469 · feat: support llama 3.1 tooling and remove grammar schema · Updated 2024-10-03 20:48:49 +00:00    Leaf

388
1

a094729386 · V2.3.1 · Updated 2024-10-03 12:49:40 +00:00    Leaf

388
1

7cb6abdf2f · Other dockerfile. · Updated 2024-10-01 15:07:36 +00:00    Leaf

402
5

a97931f3d8 · Only run 1 valid test. · Updated 2024-10-01 13:30:55 +00:00    Leaf

394
1

91656ff7a1 · Fix group. · Updated 2024-10-01 13:26:22 +00:00    Leaf

656
8

8cc2febdb6 · (fix) quantize=fp8 · Updated 2024-09-30 12:07:38 +00:00    Leaf

402
31

513ba5a0b4 · feat(tgi_common) continue more utility functions · Updated 2024-09-29 12:33:31 +00:00    Leaf

403
6

de38bf2664 · Updating Cargo lock · Updated 2024-09-25 18:51:29 +00:00    Leaf

405
9

afe3fed1a4 · Merge branch 'fix_rocm_fa' into rocm_6.2_fixes · Updated 2024-09-24 10:53:50 +00:00    Leaf

421
21

afe3fed1a4 · Merge branch 'fix_rocm_fa' into rocm_6.2_fixes · Updated 2024-09-24 10:53:50 +00:00

421
21

38c625bfeb · Release 2.3.0 · Updated 2024-09-20 16:15:06 +00:00    Leaf

419
1

662e073668 · priv-cache. · Updated 2024-09-17 15:26:37 +00:00    Leaf

428
69

0dd6eef748 · Update runner · Updated 2024-09-17 14:01:52 +00:00    Leaf

429
3

c821a0ff76 · Tmp dump. · Updated 2024-09-17 09:19:03 +00:00    Leaf

428
2