mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-04-21 23:12:07 +00:00
* Update Quantization docs and minor doc fix. * update readme with latest quants info * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * up --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co> |
||
---|---|---|
.. | ||
consuming_tgi.md | ||
gated_model_access.md | ||
launcher.md | ||
monitoring.md | ||
non_core_models.md | ||
preparing_model.md | ||
safety.md | ||
train_medusa.md | ||
using_cli.md | ||
using_guidance.md | ||
visual_language_models.md |