style: Logits Warper mention top-p top-k README

more readability
Guspan Tanadi 2023-04-04 15:07:28 +07:00 committed by GitHub
parent 578dee03bf
commit 189465fd60


@@ -46,7 +46,7 @@ to power LLMs api-inference widgets.
- Quantization with [bitsandbytes](https://github.com/TimDettmers/bitsandbytes)
- [Safetensors](https://github.com/huggingface/safetensors) weight loading
- Watermarking with [A Watermark for Large Language Models](https://arxiv.org/abs/2301.10226)
-- Logits warper (temperature scaling, TopP, TopK, repetition penalty, more details see [transformers.generation_logits_process](https://huggingface.co/transformers/v4.1.1/_modules/transformers/generation_logits_process.html))
+- Logits warper (temperature scaling, top-p, top-k, repetition penalty, more details see [transformers.generation_logits_process](https://huggingface.co/transformers/v4.1.1/_modules/transformers/generation_logits_process.html))
- Stop sequences
- Log probabilities
- Production ready (distributed tracing with Open Telemetry, Prometheus metrics)