mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-10-20 12:25:23 +00:00
* chore(neuron): update to optimum-neuron 0.3.0 Dependencies were changed accordingly, because Neuron SDK was updated to v2.24. * test: sample is not deterministic Also modify the temperature in decode test to avoid granite early stopping. * test(neuron): adjust expectations after graph changes * test(neuron): use greedy for stop sequences --------- Co-authored-by: David Corvoysier <david@huggingface.co> |
||
---|---|---|
.. | ||
helpers.py | ||
test_cached_model.py | ||
test_continuous_batching.py | ||
test_decode.py | ||
test_generator_slot.py | ||
test_info.py | ||
test_prefill.py |