merges
|
fix: refactors and helpful comments
|
2024-06-24 13:39:56 +00:00 |
chunks.py
|
server: use chunked inputs
|
2024-06-07 08:09:04 +02:00 |
dist.py
|
add intel xpu support for TGI (#1475)
|
2024-04-26 15:48:58 +02:00 |
log.py
|
v1.3.4
|
2023-12-22 15:46:04 +01:00 |
logits_process.py
|
Fixing frequency penalty (#1811)
|
2024-04-30 12:13:23 +02:00 |
peft.py
|
fix: refactors and adjust flash llama lora logic
|
2024-06-19 16:13:42 +00:00 |
sgmv.py
|
fix: refactors and adjust flash llama lora logic
|
2024-06-19 16:13:42 +00:00 |
speculate.py
|
chore: formatting
|
2023-12-11 14:49:52 +01:00 |
tokens.py
|
Use the generation config. (#1808)
|
2024-04-25 19:41:50 +02:00 |
watermark.py
|
Fixing watermark. (#851)
|
2023-08-16 07:17:26 +02:00 |
weights.py
|
Add support for GPTQ Marlin (#2052)
|
2024-06-14 09:45:42 +02:00 |