mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-04-22 23:42:06 +00:00
IDK what else to add in this guide, I looked for relevant code in TGI codebase and saw that it's used in quantization as well (maybe I could add that?) |
||
---|---|---|
.. | ||
flash_attention.md | ||
safetensors.md | ||
streaming.md |