text-generation-inference/backends/neuron/tests/server
Alvaro Moran 8801ba12cf
Optimum neuron 0.3.0 (#3308)
* chore(neuron): update to optimum-neuron 0.3.0

Dependencies were updated accordingly because the Neuron SDK was updated
to v2.24.

* test: sample is not deterministic

Also modify the temperature in the decode test to avoid early stopping
with granite.

* test(neuron): adjust expectations after graph changes

* test(neuron): use greedy for stop sequences

---------

Co-authored-by: David Corvoysier <david@huggingface.co>
2025-08-26 11:07:47 +02:00
helpers.py fix: run linters and fix formatting (#3057) 2025-02-25 16:11:34 -05:00
test_cached_model.py Bump neuron SDK version (#3260) 2025-06-10 17:56:25 +02:00
test_continuous_batching.py Bump neuron SDK version (#3260) 2025-06-10 17:56:25 +02:00
test_decode.py Optimum neuron 0.3.0 (#3308) 2025-08-26 11:07:47 +02:00
test_generator_slot.py fix: run linters and fix formatting (#3057) 2025-02-25 16:11:34 -05:00
test_info.py Add Neuron backend (#3033) 2025-02-24 09:10:05 +01:00
test_prefill.py Optimum neuron 0.3.0 (#3308) 2025-08-26 11:07:47 +02:00