text-generation-inference/backends/client/src/v3
yuanwu 67ee45a270 Pass the max_batch_total_tokens to causal_lm
refine the warmup

Signed-off-by: yuanwu <yuan.wu@intel.com>
2024-10-23 08:28:26 +00:00
client.rs          Pass the max_batch_total_tokens to causal_lm  2024-10-23 08:28:26 +00:00
mod.rs             Rebase TRT-llm (#2331)                        2024-09-25 05:55:39 +00:00
sharded_client.rs  Pass the max_batch_total_tokens to causal_lm  2024-10-23 08:28:26 +00:00