Wang, Yi A
|
76cc129796
|
remove block_scales which is not needed anymore
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
|
2025-04-11 01:28:14 -07:00 |
|
Wang, Yi A
|
4cdc34ec4d
|
match the latest vllm_extension ops
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
|
2025-04-10 19:32:32 -07:00 |
|
Wang, Yi A
|
1508ee8de1
|
remove block_tables and prefill_cache_indices which will lead to dynamic shape
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
|
2025-03-27 23:57:59 -07:00 |
|
Wang, Yi A
|
201dc6294f
|
clean cuda/rocm code in hpu backend, enable flat_hpu
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
|
2025-03-14 01:25:31 -07:00 |
|