mirror of https://github.com/huggingface/text-generation-inference.git, synced 2025-04-19 22:02:06 +00:00
* adding max_token_capacity_metric
* added tgi to name of metric
* Adding max capacity metric.
* Add description for the metrics

---------

Co-authored-by: Edwinhr716 <Edandres249@gmail.com>

client
grpc-metadata
trtllm
v2
v3