docs: link to internal Generation Utilities README

LogitsProcessor section
This commit is contained in:
Guspan Tanadi 2023-04-04 18:11:30 +07:00 committed by GitHub
parent 189465fd60
commit a70b555502
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -46,7 +46,7 @@ to power LLMs api-inference widgets.
- Quantization with [bitsandbytes](https://github.com/TimDettmers/bitsandbytes)
- [Safetensors](https://github.com/huggingface/safetensors) weight loading
- Watermarking with [A Watermark for Large Language Models](https://arxiv.org/abs/2301.10226)
- Logits warper (temperature scaling, top-p, top-k, repetition penalty, more details see [transformers.generation_logits_process](https://huggingface.co/transformers/v4.1.1/_modules/transformers/generation_logits_process.html))
- Logits warper (temperature scaling, top-p, top-k, repetition penalty, more details see [transformers.LogitsProcessor](https://huggingface.co/docs/transformers/internal/generation_utils#transformers.LogitsProcessor))
- Stop sequences
- Log probabilities
- Production ready (distributed tracing with Open Telemetry, Prometheus metrics)