text-generation-inference/.github/workflows
Daniël de Kok 67ef0649cf
GPTQ CI improvements (#2151)
* Add more representative Llama GPTQ test

The Llama GPTQ test is updated to use a model with the commonly-used
quantizer config format and activation sorting. The old test is
kept around (but renamed) since it tests the format produced by
`text-generation-server quantize`.

* Add support for manually triggering a release build
2024-07-05 14:12:16 +02:00
..
autodocs.yaml feat: improve update_docs for openapi schema (#2169) 2024-07-03 09:53:35 +02:00
build_documentation.yaml New runner. Manual squash. (#2110) 2024-06-24 18:08:34 +02:00
build_pr_documentation.yaml New runner. Manual squash. (#2110) 2024-06-24 18:08:34 +02:00
build.yaml GPTQ CI improvements (#2151) 2024-07-05 14:12:16 +02:00
ci_build.yaml GPTQ CI improvements (#2151) 2024-07-05 14:12:16 +02:00
client-tests.yaml Removing IPEX_AVAIL. (#2115) 2024-06-25 13:20:57 +02:00
integration_tests.yaml Removing IPEX_AVAIL. (#2115) 2024-06-25 13:20:57 +02:00
load_test.yaml Removing IPEX_AVAIL. (#2115) 2024-06-25 13:20:57 +02:00
stale.yaml New runner. Manual squash. (#2110) 2024-06-24 18:08:34 +02:00
tests.yaml Removing IPEX_AVAIL. (#2115) 2024-06-25 13:20:57 +02:00
trufflehog.yaml New runner. Manual squash. (#2110) 2024-06-24 18:08:34 +02:00
upload_pr_documentation.yaml New runner. Manual squash. (#2110) 2024-06-24 18:08:34 +02:00