This website requires JavaScript.
Explore
Help
Sign In
huggingface
/
text-generation-inference
Watch
5
Star
0
Fork
0
You've already forked text-generation-inference
mirror of
https://github.com/huggingface/text-generation-inference.git
synced
2025-05-24 04:22:10 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
f08b44ade5
text-generation-inference
/
backends
/
gaudi
/
server
/
text_generation_server
/
layers
/
attention
History
Wang, Yi
f08b44ade5
Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (
#3239
)
...
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2025-05-22 15:29:16 +02:00
..
__init__.py
Deepseek R1 for Gaudi backend (
#3211
)
2025-05-19 16:36:39 +02:00
common.py
Move input_ids to hpu and remove disposal of adapter_meta (
#3237
)
2025-05-22 09:21:31 +02:00
hpu.py
Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (
#3239
)
2025-05-22 15:29:16 +02:00
kv_cache.py
Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (
#3239
)
2025-05-22 15:29:16 +02:00