gptq
|
fix gptq issue
|
2025-03-22 20:58:50 -07:00 |
moe
|
add moe support, fix qwen/mistral/mixtral crash
|
2025-03-18 00:45:15 -07:00 |
__init__.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
bnb.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
conv.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
exl2.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
fp8.py
|
match the latest vllm_extension ops
|
2025-04-10 19:32:32 -07:00 |
lora.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
medusa.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
mlp.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
speculative.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |