text-generation-inference/server/text_generation_server/layers/marlin
2025-07-07 07:35:41 +00:00
..
__init__.py Handle GPTQ-Marlin loading in GPTQMarlinWeightLoader (#2300) 2024-07-31 13:08:41 +02:00
fp8.py Update quantization kernels 2025-07-07 07:35:41 +00:00
gptq.py Update quantization kernels 2025-07-07 07:35:41 +00:00
marlin.py Use kernels from the kernel hub (#2988) 2025-02-10 19:19:25 +01:00
util.py Use kernels from the kernel hub (#2988) 2025-02-10 19:19:25 +01:00