Default Branch

8f8819795f · Fixing CI (#3184) · Updated 2025-04-18 11:07:18 +00:00

Branches

e2846f76fa · No root user TGI. · Updated 2025-03-07 10:23:02 +00:00

38
1

5a5a51217e · Stop being root in the docker. · Updated 2025-03-06 15:45:55 +00:00

60
1

c34bd9d8d9 · 3.1.1 Release. · Updated 2025-03-04 17:11:30 +00:00

67
1

ddf0b02240 · All the assertions. · Updated 2025-03-04 12:32:05 +00:00

117
2

f72547c9fb · feat(metrics): remove ngrok mandatory feature for backendv3 crate · Updated 2025-02-27 21:56:04 +00:00

77
5

efb20054aa · feat: consolidate streaming and event creation logic and add tests for streaming generations · Updated 2025-02-27 16:12:51 +00:00

55
22

16793c7f51 · ci: add missing needs for integration tests · Updated 2025-02-21 15:38:11 +00:00

65
28

7e60666711 · ?? · Updated 2025-02-21 09:18:56 +00:00

65
12

95d1172347 · fix: bump ci build yaml · Updated 2025-02-17 15:24:25 +00:00

82
5

b7250f0473 · Revert "fix: expand logic for different hardware" · Updated 2025-02-11 16:14:02 +00:00

86
4

09631bc8a2 · fix: bump prompt · Updated 2025-02-11 15:15:29 +00:00

85
2

eb0194a9c1 · fix qwen2 vl crash in continous batching · Updated 2025-02-10 09:54:45 +00:00

86
1

408663e61a · fix triton to 3.1.0 to fix ipex import issue · Updated 2025-02-06 08:54:03 +00:00

90
1

463228ebfc · Update version number. · Updated 2025-01-31 13:24:45 +00:00

95
1

50c8ebdef0 · CI must be green. · Updated 2025-01-31 12:16:29 +00:00

97
7

5452c1294c · backend(vllm): disable metrics for now · Updated 2025-01-31 09:56:54 +00:00

120
9

4e1c68e6f8 · Increase session time · Updated 2025-01-30 08:53:28 +00:00

102
8

b0b855fecd · update doc · Updated 2025-01-29 12:46:03 +00:00

125
5

c871d74b46 · More logs in the allocator. · Updated 2025-01-28 10:19:37 +00:00

106
1

bafbd06744 · Update transformers_flash_causal_lm.py · Updated 2025-01-24 14:06:50 +00:00

107
2