text-generation-inference/server/text_generation_server
ssmi153 2c4bf88268
fix(server): Bug fixes for GPTQ_BITS environment variable passthrough (#590)
# What does this PR do?

This fixes a typo and extends the GPTP_BITS environment variables
through to the second method which requires the same logic. Please let
me know if there's anything I've misunderstood in this change.

Thanks @Narsil for the original fix.
2023-07-12 14:17:35 +02:00
..
models fix(server): Adding logger import to t5_modeling.py (#585) 2023-07-12 10:40:32 +02:00
pb feat(server): clear cache on error (#143) 2023-03-28 11:29:35 +02:00
utils fix(server): Bug fixes for GPTQ_BITS environment variable passthrough (#590) 2023-07-12 14:17:35 +02:00
__init__.py feat(clients): Python client (#103) 2023-03-07 18:52:22 +01:00
cache.py fix(server): decrease memory fragmentation (#557) 2023-07-06 14:28:33 +02:00
cli.py fix(server): harden the weights choice to save on disk. (#561) 2023-07-07 14:50:12 +02:00
interceptor.py feat(clients): Python client (#103) 2023-03-07 18:52:22 +01:00
server.py feat: Add the option to force another dtype than f16. (#513) 2023-06-30 20:30:09 +02:00
tracing.py feat(clients): Python client (#103) 2023-03-07 18:52:22 +01:00