text-generation-inference/server/text_generation_server/utils/gptq
Nicolas Patry 16d0fb04ae Santacoder GPTQ support (quantized model seems awful, not sure if it's
prompting or the quantization itself).
2023-06-15 16:59:31 +02:00
custom_autotune.py Functioning quantization script. 2023-06-14 09:42:55 +02:00
quant_linear.py Remove lots of dead code, move triton to hard requirement 2023-06-14 14:55:45 +02:00
quantize.py Santacoder GPTQ support (quantized model seems awful, not sure if it's 2023-06-15 16:59:31 +02:00