text-generation-inference/server/text_generation_server/layers/moe
jiqing-feng 0bad926fb8 fix modules_to_not_convert
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
2025-02-24 16:11:48 +00:00
__init__.py        Use kernels from the kernel hub (#2988)               2025-02-10 19:19:25 +01:00
fp8.py             Use kernels from the kernel hub (#2988)               2025-02-10 19:19:25 +01:00
fused_moe_ipex.py  fix moe in quantization path (#2935)                  2025-01-22 14:36:15 +01:00
gptq_marlin.py     Support sigmoid scoring function in GPTQ-MoE (#3017)  2025-02-14 11:33:49 +01:00
unquantized.py     fix modules_to_not_convert                            2025-02-24 16:11:48 +00:00