text-generation-inference/backends
Yuan Wu 1d3a4ab851
Enable mllama (#272)
Signed-off-by: Yuan Wu <yuan.wu@intel.com>
2025-02-27 16:12:15 +01:00
..
client Pass the max_batch_total_tokens to causal_lm 2024-10-23 08:28:26 +00:00
grpc-metadata Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
trtllm More fixes trtllm (#2342) 2024-09-25 06:08:00 +00:00
v2 Fix the issues of tgi-gaudi for v.2.3.1 2024-10-27 20:40:36 +00:00
v3 Enable mllama (#272) 2025-02-27 16:12:15 +01:00