From 638685ea942ffcb5683ffcaf6d203aba9754c653 Mon Sep 17 00:00:00 2001 From: Nicolas Patry Date: Tue, 2 Apr 2024 19:27:17 +0000 Subject: [PATCH] Push users to streaming in the readme. --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 60fe83cd..bffe1e8a 100644 --- a/README.md +++ b/README.md @@ -82,7 +82,7 @@ docker run --gpus all --shm-size 1g -p 8080:80 -v $volume:/data ghcr.io/huggingf And then you can make requests like ```bash -curl 127.0.0.1:8080/generate \ +curl 127.0.0.1:8080/generate_stream \ -X POST \ -d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":20}}' \ -H 'Content-Type: application/json'