text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-04-24 00:12:08 +00:00

History

Yuan Wu 1d3a4ab851 Enable mllama (#272 ) Signed-off-by: Yuan Wu <yuan.wu@intel.com>		2025-02-27 16:12:15 +01:00
..
client	Pass the max_batch_total_tokens to causal_lm	2024-10-23 08:28:26 +00:00
grpc-metadata	Rebase TRT-llm (#2331 )	2024-09-25 05:55:39 +00:00
trtllm	More fixes trtllm (#2342 )	2024-09-25 06:08:00 +00:00
v2	Fix the issues of tgi-gaudi for v.2.3.1	2024-10-27 20:40:36 +00:00
v3	Enable mllama (#272 )	2025-02-27 16:12:15 +01:00