text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-09 06:55:24 +00:00

History

Wang, Yi A 201dc6294f clean cuda/rocm code in hpu backend, enable flat_hpu Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>		2025-03-14 01:25:31 -07:00
..
client	Revert "feat: improve qwen2-vl startup " (#2924 )	2025-01-17 12:09:05 -05:00
gaudi	clean cuda/rocm code in hpu backend, enable flat_hpu	2025-03-14 01:25:31 -07:00
grpc-metadata	Upgrading our rustc version. (#2908 )	2025-01-15 17:04:03 +01:00
llamacpp	Update the llamacpp backend (#3022 )	2025-03-11 09:19:01 +01:00
neuron	feat: add support for HF_HUB_USER_AGENT_ORIGIN to add user-agent Origin field in Hub requests. (#3061 )	2025-03-04 16:43:50 +01:00
trtllm	feat: add support for HF_HUB_USER_AGENT_ORIGIN to add user-agent Origin field in Hub requests. (#3061 )	2025-03-04 16:43:50 +01:00
v2	Add backend name to telemetry (#2962 )	2025-01-28 16:53:16 +01:00
v3	Making `tool_calls` a vector. (#3075 )	2025-03-05 22:32:31 +01:00