text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-05-24 04:22:10 +00:00

History

Wang, Yi f08b44ade5 Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239 ) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>		2025-05-22 15:29:16 +02:00
..
__init__.py	Deepseek R1 for Gaudi backend (#3211 )	2025-05-19 16:36:39 +02:00
common.py	Move input_ids to hpu and remove disposal of adapter_meta (#3237 )	2025-05-22 09:21:31 +02:00
hpu.py	Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239 )	2025-05-22 15:29:16 +02:00
kv_cache.py	Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239 )	2025-05-22 15:29:16 +02:00