Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-04-19 22:02:06 +00:00)
* change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"

  - Moving after tool_calls2
    Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
  - add in Buffering..
    Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
  - fix: handle usage outside of stream state and add tests
  - Simplifying everything quite a bit.
  - Remove the unused model_dump.
  - Clippy.
  - Clippy ?
  - Ruff.
  - Upgrade the flake for latest transformers.
  - Upgrade after rebase.
  - Remove potential footgun.
  - Fix completion test.
* Clippy.
* Tweak for multi prompt.
* Ruff.
* Update the snapshot a bit.

---------

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
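For reference, in the OpenAI Chat Completions streaming format that this change aligns with, intermediate chunks carry content deltas while token usage arrives only in a trailing chunk when `stream_options.include_usage` is requested. Below is a minimal sketch of the two chunk shapes; the field values are illustrative and not taken from this repository.

```python
# Illustrative ChatCompletionChunk shapes in the OpenAI streaming format.
# Values are made up for the example; only the structure matters here.

intermediate_chunk = {
    "object": "chat.completion.chunk",
    "choices": [
        {
            "index": 0,
            "delta": {"role": "assistant", "content": "Hel"},
            "finish_reason": None,
        }
    ],
    "usage": None,  # usage is not populated mid-stream
}

final_usage_chunk = {
    "object": "chat.completion.chunk",
    "choices": [],  # the usage-bearing chunk carries no choices
    "usage": {"prompt_tokens": 12, "completion_tokens": 5, "total_tokens": 17},
}
```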
16 lines · 315 B · Python
```python
import pytest


@pytest.fixture(scope="module")
def chat_handle(launcher):
    with launcher(
        "meta-llama/Meta-Llama-3.1-8B-Instruct",
    ) as handle:
        yield handle


@pytest.fixture(scope="module")
async def chat_client(chat_handle):
    await chat_handle.health(300)
    return chat_handle.client
```
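As a usage note, a test consuming these fixtures could stream a chat completion and check that usage appears only in the final chunk, matching the commit description above. This is a minimal sketch, not a test from the repository: the `base_url` attribute on the client, the `pytest-asyncio` marker, and the SSE parsing details are assumptions.

```python
# Hypothetical test sketch; assumes pytest-asyncio is configured and that
# chat_client exposes a base_url pointing at the launched server.
import json

import aiohttp
import pytest


@pytest.mark.asyncio
async def test_chat_stream_reports_usage(chat_client):
    payload = {
        "model": "meta-llama/Meta-Llama-3.1-8B-Instruct",
        "messages": [{"role": "user", "content": "Say hello."}],
        "stream": True,
        # Ask for a trailing usage chunk, as in OpenAI's streaming API.
        "stream_options": {"include_usage": True},
    }

    chunks = []
    async with aiohttp.ClientSession() as session:
        async with session.post(
            f"{chat_client.base_url}/v1/chat/completions", json=payload
        ) as resp:
            async for raw_line in resp.content:
                line = raw_line.decode("utf-8").strip()
                if not line.startswith("data:"):
                    continue
                data = line[len("data:"):].strip()
                if data == "[DONE]":
                    break
                chunks.append(json.loads(data))

    # Intermediate chunks carry no usage; only the last chunk does.
    assert all(chunk.get("usage") is None for chunk in chunks[:-1])
    assert chunks[-1]["usage"]["total_tokens"] > 0
```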