text-generation-inference/router/src
2023-12-14 15:59:38 +01:00
..
health.rs Rebased #617 (#868) 2023-08-28 11:43:47 +02:00
infer.rs feat: add more latency metrics in forward (#1346) 2023-12-14 15:59:38 +01:00
lib.rs fix: default max_new_tokens to 100 2023-12-13 09:19:19 +01:00
main.rs #1049 CI (#1178) 2023-10-20 10:28:45 +02:00
queue.rs Speculative (#1308) 2023-12-11 12:46:30 +01:00
server.rs feat: mixtral (#1328) 2023-12-11 14:43:40 +01:00
validation.rs feat: add more latency metrics in forward (#1346) 2023-12-14 15:59:38 +01:00