text-generation-inference/backends
2024-10-22 09:52:05 +02:00
Name | Last commit | Date
client | feat: prefill chunking (#2600) | 2024-10-16 12:49:33 +02:00
grpc-metadata | Rebase TRT-llm (#2331) | 2024-07-31 10:33:10 +02:00
trtllm | chore(trtllm): validate there are enough GPus on the system for the desired model | 2024-10-22 09:52:05 +02:00
v2 | feat: prefill chunking (#2600) | 2024-10-16 12:49:33 +02:00
v3 | feat: prefill chunking (#2600) | 2024-10-16 12:49:33 +02:00