mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-06-03 13:12:10 +00:00
* Add more representative Llama GPTQ test The Llama GPTQ test is updated to use a model with the commonly-used quantizer config format and activation sorting. The old test is kept around (but renamed) since it tests the format produced by `text-generation-server quantize`. * Add support for manually triggering a release build |
||
---|---|---|
.. | ||
autodocs.yaml | ||
build_documentation.yaml | ||
build_pr_documentation.yaml | ||
build.yaml | ||
ci_build.yaml | ||
client-tests.yaml | ||
integration_tests.yaml | ||
load_test.yaml | ||
stale.yaml | ||
tests.yaml | ||
trufflehog.yaml | ||
upload_pr_documentation.yaml |