mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-09-11 12:24:53 +00:00
Fix docker run in README.md
parent 23bc38b10d
commit 08b39d1ae2
@@ -83,8 +83,7 @@ model=HuggingFaceH4/zephyr-7b-beta
 # share a volume with the Docker container to avoid downloading weights every run
 volume=$PWD/data
 
-docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data \
-3.0.0 ghcr.io/huggingface/text-generation-inference:3.0.0 --model-id $model
+docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingface/text-generation-inference:3.0.0 --model-id $model
 ```
 
 And then you can make requests like
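
Why the removed lines failed: `docker run` takes the form `docker run [OPTIONS] IMAGE [COMMAND] [ARG...]`, so the first token after the options is read as the image name. In the removed command that token was the stray `3.0.0`, which left the real image reference to be treated as the container command. The sketch below assembles the corrected invocation and prints it instead of running it. The `image` and `cmd` variables are not part of the README; they are introduced here only to keep the image reference visible as a single token.

```shell
model=HuggingFaceH4/zephyr-7b-beta
volume=$PWD/data
# Hypothetical helper variable (not in the README): the image reference as one token.
image=ghcr.io/huggingface/text-generation-inference:3.0.0

# Options first, then the image, then the arguments passed to the container.
cmd="docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data $image --model-id $model"

# Print the assembled command for review instead of executing it.
echo "$cmd"
```

Running the printed command for real requires Docker on the host, plus GPU support for `--gpus all`.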