mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-09-10 11:54:52 +00:00
Fix Tip formatting
This commit is contained in:
parent
21701d4b44
commit
197fbfde7c
@ -10,8 +10,11 @@ TGI supports [bits-and-bytes](https://github.com/TimDettmers/bitsandbytes#bitsan
|
||||
## RoPE Scaling
|
||||
|
||||
RoPE scaling can be used to increase the sequence length of the model during the inference time without necessarily fine-tuning it. To enable RoPE scaling, simply pass `--rope-scaling`, `--max-input-length` and `--rope-factors` flags when running through CLI. `--rope-scaling` can take the values `linear` or `dynamic`. If your model is not fine-tuned to a longer sequence length, use `dynamic`. `--rope-factor` is the ratio between the intended max sequence length and the model's original max sequence length. Make sure to pass `--max-input-length` to provide maximum input length for extension.
|
||||
|
||||
<Tip>
|
||||
|
||||
We recommend using `dynamic` RoPE scaling.
|
||||
|
||||
</Tip>
|
||||
|
||||
## Safetensors
|
||||
|
Loading…
Reference in New Issue
Block a user