text-generation-inference/server/exllama_kernels
2024-04-18 23:31:28 +00:00
..
exllama_kernels at last working! 2024-04-18 23:31:28 +00:00
setup.py feat: add cuda memory fraction (#659) 2023-07-24 11:43:58 +02:00