text-generation-inference/server/tests/utils
drbh 5d85a958c9
fix: refactor adapter weight loading and mapping (#2193)
* fix: refactor adapter weight loading and mapping

* feat: enable lora load from directory

* fix: adjust launcher for local lora adapters

* feat: improve weight loading and add tests

* fix: improve logging and rebase syntax issue

* fix: improve adapter merge comments and remove unused conditional

* fix: improve get_model_with_lora_adapters naming

* fix: comment typo
2024-07-24 15:32:14 -04:00
test_adapter.py fix: refactor adapter weight loading and mapping (#2193) 2024-07-24 15:32:14 -04:00
test_convert.py fix(server): harden the weights choice to save on disk. (#561) 2023-07-07 14:50:12 +02:00
test_hub.py Fix local load for peft (#1373) 2023-12-21 17:29:23 +01:00
test_layers.py Move quantized weight handling out of the Weights class (#2194) 2024-07-09 20:04:03 +02:00
test_tokens.py feat(server): add frequency penalty (#1541) 2024-02-08 18:41:25 +01:00
test_watermark.py feat(server): add watermarking tests (#248) 2023-04-27 19:16:35 +02:00
test_weights.py Split up layers.marlin into several files (#2292) 2024-07-24 16:33:26 +02:00