text-generation-inference/server/text_generation_server
ssmi153 3628559516
GPTQ Env vars: catch correct type of error (#596)
# What does this PR do?

When passing in environment variables like gptq_bits, we still get
errors thrown from TGI because the try/catch block is catching the wrong
type of error. This PR aims to fix that.

@Narsil - let me know if this is how you want this formatted. My Python
is a little shaky, so I hope this syntax is correct.
2023-07-12 19:57:46 +02:00
..
models feat(server): empty cache on errors 2023-07-12 17:06:19 +02:00
pb feat(server): clear cache on error (#143) 2023-03-28 11:29:35 +02:00
utils GPTQ Env vars: catch correct type of error (#596) 2023-07-12 19:57:46 +02:00
__init__.py feat(clients): Python client (#103) 2023-03-07 18:52:22 +01:00
cache.py fix(server): decrease memory fragmentation (#557) 2023-07-06 14:28:33 +02:00
cli.py fix(server): harden the weights choice to save on disk. (#561) 2023-07-07 14:50:12 +02:00
interceptor.py feat(server): empty cache on errors 2023-07-12 17:06:19 +02:00
server.py feat(server): empty cache on errors 2023-07-12 17:06:19 +02:00
tracing.py feat(clients): Python client (#103) 2023-03-07 18:52:22 +01:00