mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-06-07 18:02:07 +00:00
IDK what else to add in this guide, I looked for relevant code in TGI codebase and saw that it's used in quantization as well (maybe I could add that?) |
||
---|---|---|
.. | ||
flash_attention.md | ||
safetensors.md | ||
streaming.md |