text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-10 23:45:23 +00:00

Morgan Funtowicz e6da212431 feat(trtllm): cache maxNumTokens to avoid calling JSON everytime		2024-10-21 14:51:58 +02:00
backend.cpp	feat(trtllm): cache maxNumTokens to avoid calling JSON everytime	2024-10-21 14:51:58 +02:00