Push users to streaming in the readme.

Nicolas Patry 2024-04-02 19:27:17 +00:00
parent 4ee0a0c401
commit 638685ea94

@@ -82,7 +82,7 @@ docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingf
And then you can make requests like
```bash
-curl 127.0.0.1:8080/generate \
+curl 127.0.0.1:8080/generate_stream \
-X POST \
-d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":20}}' \
-H 'Content-Type: application/json'