text-generation-inference/router/client
OlivierDehaene 518d30dec4 feat(router): add max_batch_size (#1542)
Some hardware requires a maximum batch size.
2024-04-24 09:21:57 +00:00
src feat(router): add max_batch_size (#1542) 2024-04-24 09:21:57 +00:00
build.rs feat: add distributed tracing (#62) 2023-02-13 13:02:45 +01:00
Cargo.toml Add warmup for all possible shapes for prefill #49 (#81) 2024-02-28 10:40:13 +01:00