text-generation-inference/server/exllama_kernels/exllama_kernels/cuda_func
2024-02-16 11:58:58 +01:00
..
column_remap.cu feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
column_remap.cuh chore: add pre-commit (#1569) 2024-02-16 11:58:58 +01:00
q4_matmul.cu feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
q4_matmul.cuh feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
q4_matrix.cu feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
q4_matrix.cuh chore: add pre-commit (#1569) 2024-02-16 11:58:58 +01:00