adapters
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
layers
|
enable dbrx remove some unused code
|
2025-03-19 03:16:41 -07:00 |
models
|
enable dbrx remove some unused code
|
2025-03-19 03:16:41 -07:00 |
pb
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
utils
|
clean cuda/rocm code in hpu backend, enable flat_hpu
|
2025-03-14 01:25:31 -07:00 |
__init__.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
cache.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
cli.py
|
fix TP in pageattn
|
2025-03-14 18:01:58 -07:00 |
habana_quantization_env.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
interceptor.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
server.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
tgi_service.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |
tracing.py
|
Add Gaudi Backend (#3055)
|
2025-02-28 12:14:58 +01:00 |