mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-09-11 12:24:53 +00:00
fix: adjust docs after rebase
This commit is contained in:
parent
4d040b3478
commit
5d8ef9f913
@ -162,7 +162,7 @@ Options:
|
||||
This setting is only applied if there is room in the batch as defined by `max_batch_total_tokens`.
|
||||
|
||||
[env: WAITING_SERVED_RATIO=]
|
||||
[default: 0.3]
|
||||
[default: 1.2]
|
||||
|
||||
```
|
||||
## MAX_BATCH_PREFILL_TOKENS
|
||||
|
Loading…
Reference in New Issue
Block a user