mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-04-19 13:52:07 +00:00
* feat(neuron): use AWS Neuron SDK 2.21.1 * feat(neuron): bump optimum-neuron version * feat(neuron): tag latest image for local tests * test(neuron): simplify sampling test |
||
---|---|---|
.. | ||
client | ||
gaudi | ||
grpc-metadata | ||
llamacpp | ||
neuron | ||
trtllm | ||
v2 | ||
v3 |