text-generation-inference/backends/gaudi/server/text_generation_server
baptiste 7cdbd694b3 fix(gaudi): refactor server and implement requested changes
wip(gaudi): fix typos

wip(gaudi): refactor version numbers for pytorch and habana software to make it more flexible

wip(gaudi): debugging the refactored server

wip(gaudi): delete useless files

fix(gaudi): server working after refactoring

fix(gaudi): refactor and implement requested changes
2025-02-27 12:59:28 +00:00
..
adapters wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
layers fix(gaudi): refactor server and implement requested changes 2025-02-27 12:59:28 +00:00
models fix(gaudi): refactor server and implement requested changes 2025-02-27 12:59:28 +00:00
pb wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
utils fix(gaudi): refactor server and implement requested changes 2025-02-27 12:59:28 +00:00
__init__.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
cache.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00
cli.py fix prehooks issues 2025-02-25 15:24:35 +00:00
habana_quantization_env.py fix prehooks issues 2025-02-25 15:24:35 +00:00
interceptor.py fix prehooks issues 2025-02-25 15:24:35 +00:00
server.py fix prehooks issues 2025-02-25 15:24:35 +00:00
tgi_service.py fix prehooks issues 2025-02-25 15:24:35 +00:00
tracing.py wip(gaudi): import server and dockerfile from tgi-gaudi fork 2025-02-25 12:08:42 +00:00