text-generation-inference/server/text_generation_server/layers/moe
Mohit Sharma 8e01191b4c add model
2025-04-01 16:11:19 +00:00
..
__init__.py Use kernels from the kernel hub (#2988) 2025-02-10 19:19:25 +01:00
fp8.py Use kernels from the kernel hub (#2988) 2025-02-10 19:19:25 +01:00
fused_moe_ipex.py fix moe in quantization path (#2935) 2025-01-22 14:36:15 +01:00
gptq_marlin.py Support sigmoid scoring function in GPTQ-MoE (#3017) 2025-02-14 11:33:49 +01:00
unquantized.py add model 2025-04-01 16:11:19 +00:00