Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-04-24 08:22:07 +00:00)
* (feat) fp8 fnuz support for rocm
* (review comments) Fix compression_config load, type hints
* (bug) update all has_tensor
* (review_comments) fix typo and added comments
* (nit) improved comment

| File |
|---|
| `__init__.py` |
| `custom_autotune.py` |
| `exllama.py` |
| `exllamav2.py` |
| `quant_linear.py` |
| `quantize.py` |
| `utils.py` |