text-generation-inference/backends
OlivierDehaene 73e6090d53 chore: Add old V2 backend (#2551)
* wip

* added v2
2024-10-25 08:53:36 +00:00
..
client Pass the max_batch_total_tokens to causal_lm 2024-10-23 08:28:26 +00:00
grpc-metadata Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
trtllm More fixes trtllm (#2342) 2024-09-25 06:08:00 +00:00
v2 chore: Add old V2 backend (#2551) 2024-10-25 08:53:36 +00:00
v3 Pass the max_batch_total_tokens to causal_lm 2024-10-23 08:28:26 +00:00