models | feat: add cuda memory fraction (#659) | 2023-07-24 11:43:58 +02:00
pb | feat(server): clear cache on error (#143) | 2023-03-28 11:29:35 +02:00
utils | fix(server): fix exllama buffers (#689) | 2023-07-24 14:25:43 +02:00
__init__.py | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00
interceptor.py | feat(server): empty cache on errors | 2023-07-12 17:06:19 +02:00
server.py | fix(server): fix exllama buffers (#689) | 2023-07-24 14:25:43 +02:00
tracing.py | feat(clients): Python client (#103) | 2023-03-07 18:52:22 +01:00