Default Branch

8f8819795f · Fixing CI (#3184) · Updated 2025-04-18 11:07:18 +00:00

Branches

b70f29d729 · Bypasse perm issue. · Updated 2025-01-24 11:12:47 +00:00

109
2

6d335ca7ce · Remove modifications in Lock. · Updated 2025-01-22 12:37:17 +00:00

119
2

17192c9a0e · fix: remove test debug params · Updated 2025-01-17 16:19:02 +00:00

154
54

b4187d6022 · Add tgi_batch_current_size and tgi_batch_current_size as response header · Updated 2025-01-17 14:48:02 +00:00

129
1

bde5f9ad82 · nix: update to PyTorch 2.5.1 · Updated 2025-01-17 06:44:21 +00:00

133
1

48067e4a0d · fmt · Updated 2025-01-14 01:23:28 +00:00

147
3

c7b2e3f100 · chore: Enable blocking feature for reqwest · Updated 2025-01-09 10:07:49 +00:00

154
2

db6a9e1232 · add ats support · Updated 2025-01-08 00:23:16 +00:00

154
2

f89bdb72c8 · Fix runtime error when Qwen2-VL was prompted with multiple images · Updated 2024-12-16 21:15:43 +00:00

158
2

1fa9ca2f16 · add fix · Updated 2024-12-13 16:10:00 +00:00

162
1

182ffaf064 · misc: use return Ok(()) · Updated 2024-12-12 15:04:05 +00:00

226
92

1ca37d3353 · misc(ci): let's use the correct way to invoke sccache · Updated 2024-12-11 21:18:54 +00:00

169
15

bb9095aae3 · Updating lock. · Updated 2024-12-11 20:12:49 +00:00

167
2

b653605e54 · feat(trtllm): fix logits retrieval · Updated 2024-12-10 22:28:13 +00:00

183
30

8f326c9791 · Fixing lockfile. · Updated 2024-12-09 20:20:59 +00:00

172
2

600d7e6ece · Update server/text_generation_server/adapters/lora.py · Updated 2024-12-02 05:02:02 +00:00

190
2

d2ed52f531 · v2.4.1 · Updated 2024-11-22 17:28:39 +00:00

195
1

53b6f6e604 · Apply suggestions from code review · Updated 2024-11-18 11:28:07 +00:00

217
8

a604bfe450 · fix: run pre commit lints · Updated 2024-11-01 16:11:57 +00:00    Leaf

236
2

3bb78a8266 · misc(deps): update ompi from 4.1.6 to 4.1.7rc1 to avoid strange deadlock · Updated 2024-10-28 16:24:08 +00:00    Leaf

264
80