mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-04-22 15:32:08 +00:00
Adding some docs.
parent bf700e7eef
commit cea291718e
@ -52,6 +52,8 @@ Text Generation Inference (TGI) is a toolkit for deploying and serving Large Lan
- Logits warper (temperature scaling, top-p, top-k, repetition penalty; for more details, see [transformers.LogitsProcessor](https://huggingface.co/docs/transformers/internal/generation_utils#transformers.LogitsProcessor))
- Stop sequences
- Log probabilities
- [Speculation](https://huggingface.co/docs/text-generation-inference/conceptual/speculation): roughly 2x lower latency
- [Guidance/JSON](https://huggingface.co/docs/text-generation-inference/conceptual/guidance): Specify an output format to speed up inference and ensure the output is valid according to a given specification
- Custom Prompt Generation: Easily generate text by providing custom prompts to guide the model's output
- Fine-tuning Support: Utilize fine-tuned models for specific tasks to achieve higher accuracy and performance
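
To illustrate the logits warper bullet in the list above, here is a minimal sketch of temperature scaling followed by top-k and top-p filtering, written in plain Python. It is not TGI's implementation: the function name `warp_logits` and its parameters are hypothetical, and the repetition penalty is left out for brevity.

```python
import math


def warp_logits(logits, temperature=1.0, top_k=0, top_p=1.0):
    """Hypothetical helper: return {token_id: probability} after warping.

    Simplified sketch only; not how TGI or transformers implement it.
    """
    # Temperature scaling: values > 1 flatten the distribution, < 1 sharpen it.
    scaled = [x / temperature for x in logits]

    # Numerically stable softmax over the scaled logits.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Token ids ordered from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most likely tokens (0 disables the filter).
    if top_k > 0:
        order = order[:top_k]

    # Top-p: keep the smallest prefix whose renormalised mass reaches top_p.
    mass = sum(probs[i] for i in order)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i] / mass
        if cumulative >= top_p:
            break

    # Renormalise the surviving tokens so they sum to 1.
    kept_mass = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_mass for i in kept}


if __name__ == "__main__":
    example = [2.0, 1.0, 0.0, -1.0]
    print(warp_logits(example, temperature=0.7, top_k=3, top_p=0.9))
```

A sampler would then draw the next token from the returned distribution, for example with `random.choices` over its keys and values.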
@ -37,4 +37,8 @@
    title: Safetensors
  - local: conceptual/flash_attention
    title: Flash Attention
  - local: conceptual/speculation
    title: Speculation (Medusa, ngram)
  - local: conceptual/guidance
    title: Guidance, JSON, tools (using outlines)
  title: Conceptual Guides
1
docs/source/conceptual/guidance.md
Normal file
@ -0,0 +1 @@
## Guidance
1
docs/source/conceptual/speculation
Normal file
@ -0,0 +1 @@
## Speculation
1
docs/source/conceptual/speculation.md
Normal file
@ -0,0 +1 @@
## Speculation