models
|
feat: update exllamav2 kernels (#1370)
|
2024-04-22 09:02:53 +03:00 |
pb
|
feat(server): clear cache on error (#143)
|
2023-03-28 11:29:35 +02:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
fix: fix local loading for .bin models (#1419)
|
2024-04-22 09:17:52 +03:00 |
habana_quantization_env.py
|
Add Habana copyright header (#122)
|
2024-04-08 18:06:21 +02:00 |
interceptor.py
|
Add Habana copyright header (#122)
|
2024-04-08 18:06:21 +02:00 |
server.py
|
fix: fix gpt-q with groupsize = -1 (#1358)
|
2024-04-19 15:05:50 +03:00 |
tgi_service.py
|
Speculative (#1308)
|
2024-04-18 12:39:39 +00:00 |
tracing.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |