text-generation-inference/router/client
2024-03-27 11:59:51 +01:00
src Adjust warmup to all possible bucket sizes and decode batch size = 1 (#113) 2024-03-27 11:59:51 +01:00
build.rs feat: add distributed tracing (#62) 2023-02-13 13:02:45 +01:00
Cargo.toml Add warmup for all possible shapes for prefill #49 (#81) 2024-02-28 10:40:13 +01:00