mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-10-18 19:35:23 +00:00
* chore(neuron): use optimum-neuron 0.2.1 * test(neuron): adjust expectations Since the latest optimum-neuron uses a new modeling for granite and qwen, the greedy outputs are slighly different. * test(neuron): add phi3 and qwen3 tests * chore(neuron): use optimum-neuron 0.2.2 |
||
---|---|---|
.. | ||
fixtures | ||
gaudi | ||
images | ||
models | ||
neuron | ||
conftest.py | ||
pyproject.toml | ||
pytest.ini | ||
requirements.txt | ||
uv.lock |