Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-09-11 12:24:53 +00:00)
Update docs

Signed-off-by: Adrien Gallouët <angt@huggingface.co>

parent 2b0d99c1cf
commit 8bc10d37ee
@@ -52,6 +52,8 @@
 - sections:
   - local: backends/trtllm
     title: TensorRT-LLM
+  - local: backends/llamacpp
+    title: Llamacpp
   title: Backends
 - sections:
   - local: reference/launcher
@@ -11,3 +11,5 @@ TGI remains consistent across backends, allowing you to switch between them seamlessly.
 * **[TGI TRTLLM backend](./backends/trtllm)**: This backend leverages NVIDIA's TensorRT library to accelerate LLM inference.
   It utilizes specialized optimizations and custom kernels for enhanced performance.
   However, it requires a model-specific compilation step for each GPU architecture.
+* **[TGI Llamacpp backend](./backends/llamacpp)**: This backend facilitates the deployment of large language models
+  (LLMs) by integrating [llama.cpp][llama.cpp], an advanced inference engine optimized for both CPU and GPU computation.