text-generation-inference/router/client/src
2023-12-18 16:07:05 +01:00
..
pb Init 2022-10-08 12:30:12 +02:00
client.rs fix: fix gpt-q with groupsize = -1 (#1358) 2023-12-18 16:07:05 +01:00
lib.rs Speculative (#1308) 2023-12-11 12:46:30 +01:00
sharded_client.rs feat: add more latency metrics in forward (#1346) 2023-12-14 15:59:38 +01:00