This website requires JavaScript.
Explore
Help
Sign In
huggingface
/
text-generation-inference
Watch
5
Star
0
Fork
0
You've already forked text-generation-inference
mirror of
https://github.com/huggingface/text-generation-inference.git
synced
2025-04-23 16:02:10 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
67ee45a270
text-generation-inference
/
backends
History
yuanwu
67ee45a270
Pass the max_batch_total_tokens to causal_lm
...
refine the warmup Signed-off-by: yuanwu <yuan.wu@intel.com>
2024-10-23 08:28:26 +00:00
..
client
Pass the max_batch_total_tokens to causal_lm
2024-10-23 08:28:26 +00:00
grpc-metadata
Rebase TRT-llm (
#2331
)
2024-09-25 05:55:39 +00:00
trtllm
More fixes trtllm (
#2342
)
2024-09-25 06:08:00 +00:00
v3
Pass the max_batch_total_tokens to causal_lm
2024-10-23 08:28:26 +00:00