text-generation-inference/server/text_generation_server/layers/moe
Wang, Yi d7a24c03cf
some minor fix (#3048)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2025-02-25 12:07:55 +01:00
__init__.py         Use kernels from the kernel hub (#2988)                 2025-02-10 19:19:25 +01:00
fp8.py              Use kernels from the kernel hub (#2988)                 2025-02-10 19:19:25 +01:00
fused_moe_ipex.py   fix moe in quantization path (#2935)                    2025-01-22 14:36:15 +01:00
gptq_marlin.py      Support sigmoid scoring function in GPTQ-MoE (#3017)    2025-02-14 11:33:49 +01:00
unquantized.py      some minor fix (#3048)                                  2025-02-25 12:07:55 +01:00