text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-13 08:55:24 +00:00

Latest commit: 1e56e5fe5c by Wang, Yi A, 2025-06-12 22:15:33 -07:00
[gaudi] HuggingFaceM4/idefics2-8b issue fix
batch.prefill_cache_indices is reset in generate_token instead of forward, so that position_id could be updated correctly
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
custom_modeling	[gaudi] HuggingFaceM4/idefics2-8b issue fix	2025-06-12 22:15:33 -07:00
__init__.py	[Gaudi] Remove optimum-habana (#3261)	2025-06-12 22:35:36 +02:00
flash_causal_lm.py	[gaudi] Vlm rebase and issue fix in benchmark test (#3263)	2025-06-12 22:26:37 +02:00
flash_vlm_causal_lm.py	[gaudi] HuggingFaceM4/idefics2-8b issue fix	2025-06-12 22:15:33 -07:00
globals.py	[Gaudi] Remove optimum-habana (#3261)	2025-06-12 22:35:36 +02:00
mllama_causal_lm.py	[gaudi] Vlm rebase and issue fix in benchmark test (#3263)	2025-06-12 22:26:37 +02:00
model.py	Gaudi: clean cuda/rocm code in hpu backend, enable flat_hpu (#3113)	2025-04-14 15:58:13 +02:00
seq2seq_lm.py	Gaudi: clean cuda/rocm code in hpu backend, enable flat_hpu (#3113)	2025-04-14 15:58:13 +02:00
types.py	Add Gaudi Backend (#3055)	2025-02-28 12:14:58 +01:00