text-generation-inference/server/text_generation_server/utils/gptq
Nicolas Patry 732da6942b Remove lots of dead code, move triton to hard requirement
- Added option to upload to hub directly after quantizing.
2023-06-14 14:55:45 +02:00
custom_autotune.py Functioning quantization script. 2023-06-14 09:42:55 +02:00
quant_linear.py Remove lots of dead code, move triton to hard requirement 2023-06-14 14:55:45 +02:00
quantize.py Remove lots of dead code, move triton to hard requirement 2023-06-14 14:55:45 +02:00