text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-05-24 20:42:07 +00:00

History

Daniël de Kok e32528792c Switch to punica-sgmv kernel from the Hub (#3236 ) * Switch to punica-sgmv kernel from the Hub This also switches (temporarily) to the tgi-nix/kernel-builder merge branch, bumping up to CUDA 12.8 (same as non-Nix Torch). * nix: client depends on aiohttp This probably worked before the nixpkgs bump because a dependency propagated aiohttp.		2025-05-21 15:44:15 +02:00
..
__init__.py	Enable multiple LoRa adapters (#2010 )	2024-06-25 14:46:27 -04:00
config.py	feat: add ruff and resolve issue (#2262 )	2024-07-26 10:29:09 -04:00
lora.py	Switch to punica-sgmv kernel from the Hub (#3236 )	2025-05-21 15:44:15 +02:00
weights.py	fix: refactor adapter weight loading and mapping (#2193 )	2024-07-24 15:32:14 -04:00