awq
|
ROCm AWQ support (#1514)
|
2024-04-24 09:21:34 +00:00 |
gptq
|
ROCm AWQ support (#1514)
|
2024-04-24 09:21:34 +00:00 |
__init__.py
|
Add Habana copyright header (#122)
|
2024-04-08 18:06:21 +02:00 |
convert.py
|
fit for baichuan models (#981)
|
2023-09-08 16:51:34 +02:00 |
debug.py
|
Add Habana copyright header (#122)
|
2024-04-08 18:06:21 +02:00 |
dist.py
|
Add changes from Optimum Habana's TGI folder
|
2023-12-05 11:12:16 +01:00 |
hub.py
|
Fix local load for peft (#1373)
|
2024-04-22 09:03:34 +03:00 |
import_utils.py
|
Add RoCm support (#1243)
|
2023-11-27 14:08:12 +01:00 |
layers.py
|
ROCm AWQ support (#1514)
|
2024-04-24 09:21:34 +00:00 |
log.py
|
v1.3.4
|
2024-04-22 09:08:34 +03:00 |
medusa.py
|
chore: formatting
|
2024-04-18 16:26:00 +03:00 |
paged_attention.py
|
chore: formatting
|
2024-04-18 16:26:00 +03:00 |
peft.py
|
fix: fix local loading for .bin models (#1419)
|
2024-04-22 09:17:52 +03:00 |
speculate.py
|
chore: formatting
|
2024-04-18 16:26:00 +03:00 |
tokens.py
|
feat(server): add frequency penalty (#1541)
|
2024-04-24 08:43:50 +00:00 |