text-generation-inference/server/exllama_kernels
2024-02-16 11:58:58 +01:00
..
exllama_kernels  chore: add pre-commit ()           2024-02-16 11:58:58 +01:00
setup.py         feat: add cuda memory fraction ()  2023-07-24 11:43:58 +02:00