huggingface/text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-11 07:55:24 +00:00

Morgan Funtowicz d0a34a95f2 adding missing ld_library_path for cuda stubs in Dockerfile

2024-07-22 15:16:39 +00:00


 ```mermaid
 sequenceDiagram
     TensorRtLlmBackend -->> TensorRtLlmBackendImpl: Spawns a new thread that instantiates the actual backend impl
     TensorRtLlmBackendImpl -->> TensorRtLlmBackendImpl.Receiver: Awaits incoming requests sent through the queue
 ```
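
The pattern in the diagram can be sketched with the Rust standard library alone. This is a minimal illustration, not the backend's actual code: `Backend`, `BackendImpl`, `Request`, and `generate` are hypothetical names, the "inference" step is a placeholder that echoes the prompt, and `std::sync::mpsc` is assumed as the queue.

```rust
use std::sync::mpsc::{channel, Receiver, Sender};
use std::thread::{self, JoinHandle};

/// Hypothetical request type: a prompt plus a channel for sending the reply back.
struct Request {
    prompt: String,
    reply: Sender<String>,
}

/// Stand-in for the real backend implementation; it owns the receiving end of the queue.
struct BackendImpl {
    receiver: Receiver<Request>,
}

impl BackendImpl {
    /// Blocks on the queue and handles requests until every sender has been dropped.
    fn run(self) {
        for request in self.receiver {
            // Placeholder for the actual inference work.
            let output = format!("echo: {}", request.prompt);
            // The caller may have gone away; ignore a closed reply channel.
            let _ = request.reply.send(output);
        }
    }
}

/// Front-facing handle: keeps the sending end of the queue and the worker thread.
struct Backend {
    sender: Option<Sender<Request>>,
    worker: Option<JoinHandle<()>>,
}

impl Backend {
    fn new() -> Self {
        let (sender, receiver) = channel::<Request>();
        // The implementation is instantiated inside the newly spawned thread.
        let worker = thread::spawn(move || {
            let backend_impl = BackendImpl { receiver };
            backend_impl.run();
        });
        Backend {
            sender: Some(sender),
            worker: Some(worker),
        }
    }

    /// Sends a request through the queue and waits for the worker's reply.
    fn generate(&self, prompt: &str) -> Option<String> {
        let (reply, response) = channel();
        let request = Request {
            prompt: prompt.to_string(),
            reply,
        };
        self.sender.as_ref()?.send(request).ok()?;
        response.recv().ok()
    }
}

impl Drop for Backend {
    fn drop(&mut self) {
        // Dropping the sender closes the queue, which ends the worker loop.
        self.sender.take();
        if let Some(worker) = self.worker.take() {
            let _ = worker.join();
        }
    }
}
```

In this sketch, calling `Backend::new()` and then `generate("hello")` sends one request through the queue and blocks until the worker thread replies. Dropping the `Backend` closes the queue so the worker thread exits and is joined.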