text-generation-inference/integration-tests/fixtures/neuron
David Corvoysier 3d2e7c8fce
Optimum neuron 0.2.2 (#3281)
* chore(neuron): use optimum-neuron 0.2.1

* test(neuron): adjust expectations

Since the latest optimum-neuron uses a new modeling for granite and
qwen, the greedy outputs are slighly different.

* test(neuron): add phi3 and qwen3 tests

* chore(neuron): use optimum-neuron 0.2.2
2025-07-03 07:59:25 +02:00
..
export_models.py Optimum neuron 0.2.2 (#3281) 2025-07-03 07:59:25 +02:00
service.py fix: run linters and fix formatting (#3057) 2025-02-25 16:11:34 -05:00