text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-05-22 02:02:07 +00:00

History

Nicolas Patry 818c8db29a change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API" Moving after tool_calls2 Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> add in Buffering.. Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> fix: handle usage outside of stream state and add tests Simplifying everything quite a bit. Remove the unused model_dump. Clippy. Clippy ? Ruff. Uppgrade the flake for latest transformers. Upgrade after rebase. Remove potential footgun. Fix completion test.		2025-03-07 19:48:04 +01:00
..
test_chat_hfhub_nousage.json	change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"	2025-03-07 19:48:04 +01:00
test_chat_hfhub_usage.json	change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"	2025-03-07 19:48:04 +01:00
test_chat_openai_nousage.json	change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"	2025-03-07 19:48:04 +01:00
test_chat_openai_usage.json	change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"	2025-03-07 19:48:04 +01:00
test_flash_llama_completion_many_prompts.json	change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"	2025-03-07 19:48:04 +01:00
test_flash_llama_completion_single_prompt.json	change ChatCompletionChunk to align with "OpenAI Chat Completions streaming API"	2025-03-07 19:48:04 +01:00
test_flash_llama_completion_stream_usage.json	Fix tool call2 (#3076 )	2025-03-07 19:45:57 +01:00