layers
|
Fix text-generation-server quantize (#2103)
|
2024-09-24 03:46:09 +00:00 |
models
|
Support exl2-quantized Qwen2 models (#2085)
|
2024-09-24 03:46:09 +00:00 |
pb
|
chore: add pre-commit (#1569)
|
2024-04-24 15:32:02 +03:00 |
utils
|
Factor out sharding of packed tensors (#2059)
|
2024-09-24 03:46:09 +00:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
Fix text-generation-server quantize (#2103)
|
2024-09-24 03:46:09 +00:00 |
tracing.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |