text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-10 15:35:24 +00:00

History

Daniël de Kok 67ef0649cf GPTQ CI improvements (#2151 ) * Add more representative Llama GPTQ test The Llama GPTQ test is updated to use a model with the commonly-used quantizer config format and activation sorting. The old test is kept around (but renamed) since it tests the format produced by `text-generation-server quantize`. * Add support for manually triggering a release build		2024-07-05 14:12:16 +02:00
..
test_server_gptq_quantized_all_params.json	GPTQ CI improvements (#2151 )	2024-07-05 14:12:16 +02:00
test_server_gptq_quantized_load.json	GPTQ CI improvements (#2151 )	2024-07-05 14:12:16 +02:00
test_server_gptq_quantized.json	GPTQ CI improvements (#2151 )	2024-07-05 14:12:16 +02:00