text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-09-10 03:44:54 +00:00

History

Daniël de Kok a76ae953fe Update quantization kernels		2025-07-07 07:35:41 +00:00
..
__init__.py	Use kernels from the kernel hub (#2988 )	2025-02-10 19:19:25 +01:00
fp8.py	Use kernels from the kernel hub (#2988 )	2025-02-10 19:19:25 +01:00
fused_moe_ipex.py	fix moe in quantization path (#2935 )	2025-01-22 14:36:15 +01:00
gptq_marlin.py	Update quantization kernels	2025-07-07 07:35:41 +00:00
unquantized.py	some minor fix (#3048 )	2025-02-25 12:07:55 +01:00