diff --git a/docs/source/conceptual/streaming.md b/docs/source/conceptual/streaming.md index 2e9f6b8b..49c48fa0 100644 --- a/docs/source/conceptual/streaming.md +++ b/docs/source/conceptual/streaming.md @@ -101,7 +101,7 @@ asyncio.run(main()) To use the OpenAI Chat Completions compatible Messages API `v1/chat/completions` endpoint with curl, you can add the `-N` flag, which disables curl default buffering and shows data as it arrives from the server ```curl -curl localhost:3000/v1/chat/completions \ +curl localhost:8080/v1/chat/completions \ -X POST \ -d '{ "model": "tgi",