text-generation-inference/server/text_generation_server/layers/moe
jiqing-feng b7bdbbd8c0 revert unquantized changes
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
2025-02-26 12:23:33 +00:00
__init__.py        Use kernels from the kernel hub (#2988)               2025-02-10 19:19:25 +01:00
fp8.py             Use kernels from the kernel hub (#2988)               2025-02-10 19:19:25 +01:00
fused_moe_ipex.py  fix moe in quantization path (#2935)                  2025-01-22 14:36:15 +01:00
gptq_marlin.py     Support sigmoid scoring function in GPTQ-MoE (#3017)  2025-02-14 11:33:49 +01:00
unquantized.py     revert unquantized changes                            2025-02-26 12:23:33 +00:00