text-generation-inference/backends/gaudi/server/text_generation_server/layers/gptq
2025-02-25 12:08:42 +00:00
__init__.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
custom_autotune.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
exllama.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
exllamav2.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
quant_linear.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
quantize.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
utils.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00