Mirror of https://github.com/huggingface/text-generation-inference.git, synced 2025-04-25 01:42:12 +00:00
- Create `quantization_config` option in the model config.
- Don't store the quantizer config in tensors anymore.

| Name |
|---|
| `__init__.py` |
| `custom_autotune.py` |
| `exllama.py` |
| `exllamav2.py` |
| `quant_linear.py` |
| `quantize.py` |
| `utils.py` |
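
The commit note above moves the quantizer settings into a `quantization_config` option in the model config instead of storing them in tensors. A minimal sketch of how a loader might read that option is shown here. It is not the repository's code: the helper name `load_quantization_config`, the `config.json` file name, and the keys `quant_method`, `bits`, and `group_size` are assumptions chosen for illustration, since the commit note names only `quantization_config` itself.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional


def load_quantization_config(model_dir: str) -> Optional[dict]:
    """Return the `quantization_config` section of config.json, or None if absent.

    Hypothetical helper: the file name and lookup are assumptions for illustration.
    """
    config_path = Path(model_dir) / "config.json"
    if not config_path.is_file():
        return None
    with config_path.open(encoding="utf-8") as f:
        config = json.load(f)
    return config.get("quantization_config")


# Demo with a throwaway config.json; the field names and values are illustrative only.
with tempfile.TemporaryDirectory() as tmp:
    sample = {
        "model_type": "llama",
        "quantization_config": {"quant_method": "gptq", "bits": 4, "group_size": 128},
    }
    (Path(tmp) / "config.json").write_text(json.dumps(sample), encoding="utf-8")
    quant = load_quantization_config(tmp)
    missing = load_quantization_config(str(Path(tmp) / "does-not-exist"))

print(quant)
```

Keeping these settings in the model config means they can be inspected without opening the weight files, and a loader can return `None` and fall back to its own defaults when the section is missing.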