mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-09-11 12:24:53 +00:00
Update docs after rebase.
This commit is contained in:
parent
4b06f318cb
commit
7143130ba4
@ -198,23 +198,22 @@ Options:
|
||||
[default: 20]
|
||||
|
||||
```
|
||||
|
||||
## MAX_BATCH_SIZE
|
||||
```shell
|
||||
--max-batch-size <MAX_BATCH_SIZE>
|
||||
Enforce a maximum number of requests per batch Specific flag for hardware targets that do not support unpadded inference
|
||||
|
||||
[env: MAX_BATCH_SIZE=]
|
||||
```
|
||||
|
||||
```
|
||||
## ENABLE_CUDA_GRAPHS
|
||||
```shell
|
||||
--enable-cuda-graphs
|
||||
Enable experimental support for cuda graphs
|
||||
|
||||
[env: ENABLE_CUDA_GRAPHS=]
|
||||
```
|
||||
|
||||
```
|
||||
## HOSTNAME
|
||||
```shell
|
||||
--hostname <HOSTNAME>
|
||||
|
Loading…
Reference in New Issue
Block a user