text-generation-inference/integration-tests/models/__snapshots__/test_flash_llama_gptq
Daniël de Kok 67ef0649cf
GPTQ CI improvements (#2151)
* Add more representative Llama GPTQ test

The Llama GPTQ test is updated to use a model with the commonly-used
quantizer config format and activation sorting. The old test is
kept around (but renamed) since it tests the format produced by
`text-generation-server quantize`.

* Add support for manually triggering a release build
2024-07-05 14:12:16 +02:00
test_flash_llama_gptq_all_params.json  GPTQ CI improvements (#2151)  2024-07-05 14:12:16 +02:00
test_flash_llama_gptq_load.json        GPTQ CI improvements (#2151)  2024-07-05 14:12:16 +02:00
test_flash_llama_gptq.json             GPTQ CI improvements (#2151)  2024-07-05 14:12:16 +02:00