text-generation-inference/backends/gaudi/server
baptiste 7cdbd694b3 fix(gaudi): refactor server and implement requested changes
wip(gaudi): fix typos

wip(gaudi): refactor version numbers for pytorch and habana software to make it more flexible

wip(gaudi): debugging the refactored server

wip(gaudi): delete useless files

fix(gaudi): server working after refactoring

fix(gaudi): refactor and implement requested changes
2025-02-27 12:59:28 +00:00
..
tests wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
text_generation_server fix(gaudi): refactor server and implement requested changes 2025-02-27 12:59:28 +00:00
.gitignore wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
dill-0.3.7-patch.sh fix prehooks issues 2025-02-25 15:24:35 +00:00
dill-0.3.8-patch.sh fix prehooks issues 2025-02-25 15:24:35 +00:00
Makefile feat(gaudi): new gaudi backend working 2025-02-25 12:08:53 +00:00
Makefile-awq wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
Makefile-eetq wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
Makefile-fbgemm wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
Makefile-flash-att wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
Makefile-flash-att-v2 wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
Makefile-selective-scan wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
Makefile-vllm wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
poetry.lock wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
pyproject.toml wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
README.md wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
requirements.txt wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00

Text Generation Inference Python gRPC Server

A Python gRPC server for Text Generation Inference

Install

make install

Run

make run-dev