client
|
Pass the max_batch_total_tokens to causal_lm
|
2024-10-23 08:28:26 +00:00 |
grpc-metadata
|
Rebase TRT-llm (#2331)
|
2024-09-25 05:55:39 +00:00 |
trtllm
|
More fixes trtllm (#2342)
|
2024-09-25 06:08:00 +00:00 |
v2
|
chore: Add old V2 backend (#2551)
|
2024-10-25 08:53:36 +00:00 |
v3
|
Pass the max_batch_total_tokens to causal_lm
|
2024-10-23 08:28:26 +00:00 |