This website requires JavaScript.
Explore
Help
Sign In
huggingface
/
text-generation-inference
Watch
5
Star
0
Fork
0
You've already forked text-generation-inference
mirror of
https://github.com/huggingface/text-generation-inference.git
synced
2025-04-21 23:12:07 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
a84da5b698
text-generation-inference
/
backends
/
gaudi
/
server
/
text_generation_server
/
layers
/
attention
History
Wang, Yi A
c55a8caea2
remove torch.where to fix incorrect output in hpu graph model
...
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2025-03-31 22:51:54 -07:00
..
__init__.py
clean cuda/rocm code in hpu backend, enable flat_hpu
2025-03-14 01:25:31 -07:00
common.py
clean cuda/rocm code in hpu backend, enable flat_hpu
2025-03-14 01:25:31 -07:00
hpu.py
remove block_tables and prefill_cache_indices which will lead to dynamic shape
2025-03-27 23:57:59 -07:00
kv_cache.py
remove torch.where to fix incorrect output in hpu graph model
2025-03-31 22:51:54 -07:00