mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-11-18 23:15:59 +00:00
* Gaudi: Use exponential growth to replace BATCH_BUCKET_SIZE Signed-off-by: yuanwu <yuan.wu@intel.com> * Remove debug modifications Signed-off-by: yuanwu <yuan.wu@intel.com> --------- Signed-off-by: yuanwu <yuan.wu@intel.com> |
||
|---|---|---|
| .. | ||
| client | ||
| gaudi | ||
| grpc-metadata | ||
| llamacpp | ||
| neuron | ||
| trtllm | ||
| v2 | ||
| v3 | ||