Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-05-22 02:02:07 +00:00)
Latest commit:

Moving after tool_calls2
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
add in Buffering..
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
fix: handle usage outside of stream state and add tests
Simplifying everything quite a bit.
Remove the unused model_dump.
Clippy.
Clippy ?
Ruff.
Upgrade the flake for latest transformers.
Upgrade after rebase.
Remove potential footgun.
Fix completion test.

Files:
test_chat_hfhub_nousage.json
test_chat_hfhub_usage.json
test_chat_openai_nousage.json
test_chat_openai_usage.json
test_flash_llama_completion_many_prompts.json
test_flash_llama_completion_single_prompt.json
test_flash_llama_completion_stream_usage.json