layers
|
Add support for Marlin-quantized models
|
2024-09-24 03:38:05 +00:00 |
models
|
Add support for Marlin-quantized models
|
2024-09-24 03:38:05 +00:00 |
pb
|
chore: add pre-commit (#1569)
|
2024-04-24 15:32:02 +03:00 |
utils
|
marlin: support tp>1 when group_size==-1
|
2024-09-24 03:38:05 +00:00 |
__init__.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |
cli.py
|
Add support for Marlin-quantized models
|
2024-09-24 03:38:05 +00:00 |
server.py
|
Add support for exl2 quantization
|
2024-09-24 03:19:39 +00:00 |
tracing.py
|
feat(clients): Python client (#103)
|
2023-03-07 18:52:22 +01:00 |