text-generation-inference/router/src
2023-07-13 18:59:38 +02:00
..
health.rs feat(server): only compute prefill logprobs when asked (#406) 2023-06-02 17:12:30 +02:00
infer.rs feat(server): add paged attention to flash models (#516) 2023-06-30 19:09:59 +02:00
lib.rs chore: update openapi schema 2023-06-05 18:16:08 +02:00
main.rs feat(router): explicit warning if revision is not set (#608) 2023-07-13 18:59:38 +02:00
queue.rs feat(server): add paged attention to flash models (#516) 2023-06-30 19:09:59 +02:00
server.rs feat: better errors for warmup and TP (#575) 2023-07-10 14:47:15 +02:00
validation.rs feat(launcher): add arg validation and drop subprocess (#595) 2023-07-13 14:22:37 +02:00