mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-10-11 16:05:24 +00:00
250 B
250 B
sequenceDiagram
TensorRtLlmBackend -->> TensorRtLlmBackendImpl: New thread which instantiates actual backend impl
TensorRtLlmBackendImpl -->> TensorRtLlmBackendImpl.Receiver: Awaits incoming request sent throught the queue