text-generation-inference/server/text_generation_server/adapters
Daniël de Kok e32528792c
Switch to punica-sgmv kernel from the Hub (#3236)
* Switch to punica-sgmv kernel from the Hub

This also switches (temporarily) to the tgi-nix/kernel-builder merge
branch, bumping up to CUDA 12.8 (same as non-Nix Torch).

* nix: client depends on aiohttp

This probably worked before the nixpkgs bump because a dependency
propagated aiohttp.
2025-05-21 15:44:15 +02:00
..
__init__.py Enable multiple LoRa adapters (#2010) 2024-06-25 14:46:27 -04:00
config.py feat: add ruff and resolve issue (#2262) 2024-07-26 10:29:09 -04:00
lora.py Switch to punica-sgmv kernel from the Hub (#3236) 2025-05-21 15:44:15 +02:00
weights.py fix: refactor adapter weight loading and mapping (#2193) 2024-07-24 15:32:14 -04:00