text-generation-inference/server/exllama_kernels
2024-04-24 15:32:02 +03:00
..
exllama_kernels chore: add pre-commit (#1569) 2024-04-24 15:32:02 +03:00
setup.py feat: add cuda memory fraction (#659) 2023-07-24 11:43:58 +02:00