text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-08 22:45:23 +00:00

History

David Corvoysier ae37890eef fix(neuron): export models from container in test fixtures The neuron tests require models to have been previously exported and cached on the hub. This is done automatically by the neuron.model fixture the first time the tests are ran for a specific version. This fixture used to export the models using optimum-neuron directly, but this package is not necessarily present on the system. Instead, it is now done through the neuron TGI itself, since it contains all the tools required to export the models. Note that since the CI runs docker in docker (dind) it does not seem possible to share a volume between the CI container and the container used to export the model. For that reason, a specific image with a modified entrypoint is built on-the-fly when a model export is required.		2025-02-23 14:17:02 +01:00
..
fixtures/neuron	fix(neuron): export models from container in test fixtures	2025-02-23 14:17:02 +01:00
images	Pali gemma modeling (#1895 )	2024-05-16 06:58:47 +02:00
models	Improve tool call message processing (#3036 )	2025-02-21 10:30:29 +01:00
neuron/integration	test(neuron): merge integration tests and fixtures	2025-02-23 14:17:02 +01:00
conftest.py	test: add --neuron option	2025-02-23 14:17:02 +01:00
pyproject.toml	Having less logs in case of failure for checking CI more easily. (#3037 )	2025-02-19 17:01:33 +01:00
pytest.ini	chore: add pre-commit (#1569 )	2024-02-16 11:58:58 +01:00
requirements.txt	Having less logs in case of failure for checking CI more easily. (#3037 )	2025-02-19 17:01:33 +01:00
uv.lock	Having less logs in case of failure for checking CI more easily. (#3037 )	2025-02-19 17:01:33 +01:00