text-generation-inference/server/text_generation_server/layers/moe
2025-07-07 07:35:41 +00:00
__init__.py        Use kernels from the kernel hub (#2988)  2025-02-10 19:19:25 +01:00
fp8.py             Use kernels from the kernel hub (#2988)  2025-02-10 19:19:25 +01:00
fused_moe_ipex.py  fix moe in quantization path (#2935)     2025-01-22 14:36:15 +01:00
gptq_marlin.py     Update quantization kernels              2025-07-07 07:35:41 +00:00
unquantized.py     some minor fix (#3048)                   2025-02-25 12:07:55 +01:00