text-generation-inference/backends/gaudi/server/exllamav2_kernels/exllamav2_kernels/cuda
2025-02-25 12:08:42 +00:00
..
quant wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
compat.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
matrix_view.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
q_gemm_kernel_gptq.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
q_gemm_kernel.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
q_gemm.cu wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
q_gemm.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
q_matrix.cu wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
q_matrix.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
util.cuh wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00