text-generation-inference/server/exllama_kernels/exllama_kernels/cuda_func
OlivierDehaene 0d794af6a5
feat: experimental support for cuda graphs (#1428)
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2024-02-12 10:09:29 +01:00
..
column_remap.cu feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
column_remap.cuh feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00
q4_matmul.cu feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
q4_matmul.cuh feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
q4_matrix.cu feat: experimental support for cuda graphs (#1428) 2024-02-12 10:09:29 +01:00
q4_matrix.cuh feat(server): Add exllama GPTQ CUDA kernel support #553 (#666) 2023-07-21 10:59:00 +02:00