Default Branch

c6071749db · Fix mask passed to flashinfer (#3324) · Updated 2025-09-08 17:47:03 +00:00

Branches

48067e4a0d · fmt · Updated 2025-01-14 01:23:28 +00:00 · 239 behind, 3 ahead

c7b2e3f100 · chore: Enable blocking feature for reqwest · Updated 2025-01-09 10:07:49 +00:00 · 246 behind, 2 ahead

db6a9e1232 · add ats support · Updated 2025-01-08 00:23:16 +00:00 · 246 behind, 2 ahead

f89bdb72c8 · Fix runtime error when Qwen2-VL was prompted with multiple images · Updated 2024-12-16 21:15:43 +00:00 · 250 behind, 2 ahead

1fa9ca2f16 · add fix · Updated 2024-12-13 16:10:00 +00:00 · 254 behind, 1 ahead

182ffaf064 · misc: use return Ok(()) · Updated 2024-12-12 15:04:05 +00:00 · 318 behind, 92 ahead

1ca37d3353 · misc(ci): let's use the correct way to invoke sccache · Updated 2024-12-11 21:18:54 +00:00 · 261 behind, 15 ahead

bb9095aae3 · Updating lock. · Updated 2024-12-11 20:12:49 +00:00 · 259 behind, 2 ahead

b653605e54 · feat(trtllm): fix logits retrieval · Updated 2024-12-10 22:28:13 +00:00 · 275 behind, 30 ahead

8f326c9791 · Fixing lockfile. · Updated 2024-12-09 20:20:59 +00:00 · 264 behind, 2 ahead

600d7e6ece · Update server/text_generation_server/adapters/lora.py · Updated 2024-12-02 05:02:02 +00:00 · 282 behind, 2 ahead

d2ed52f531 · v2.4.1 · Updated 2024-11-22 17:28:39 +00:00 · 287 behind, 1 ahead

53b6f6e604 · Apply suggestions from code review · Updated 2024-11-18 11:28:07 +00:00 · 309 behind, 8 ahead

a604bfe450 · fix: run pre commit lints · Updated 2024-11-01 16:11:57 +00:00 · Leaf · 328 behind, 2 ahead

3bb78a8266 · misc(deps): update ompi from 4.1.6 to 4.1.7rc1 to avoid strange deadlock · Updated 2024-10-28 16:24:08 +00:00 · Leaf · 356 behind, 80 ahead

7bc2c97bd9 · Check if allowed tokens is None (#2694) · Updated 2024-10-28 04:10:55 +00:00 · Leaf · 334 behind, 3 ahead

0a655a0ab5 · v2.4.0 · Updated 2024-10-25 21:12:49 +00:00 · Leaf · 338 behind, 1 ahead

e3db525917 · Fix integration mt0 (transformers update). · Updated 2024-10-24 09:54:11 +00:00 · Leaf · 355 behind, 12 ahead

fe8d55dba9 · Clean both threads. · Updated 2024-10-21 12:49:07 +00:00 · Leaf · 355 behind, 2 ahead

b3917ff695 · fix: add limit to internal stream function too · Updated 2024-10-15 15:14:04 +00:00 · Leaf · 366 behind, 2 ahead