text-generation-inference/router/client
Wang, Yi 3d81a80577
Fix incorrect setting of max_new_tokens in warmup (#104)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2024-03-13 16:19:40 +01:00
..
src Fix incorrect setting of max_new_tokens in warmup (#104) 2024-03-13 16:19:40 +01:00
build.rs feat: add distributed tracing (#62) 2023-02-13 13:02:45 +01:00
Cargo.toml Add warmup for all possible shapes for prefill #49 (#81) 2024-02-28 10:40:13 +01:00