Update docs/source/supported_models.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-09-10 11:54:52 +00:00 · 2023-08-02 23:45:09 +03:00 · 2023-08-02 23:45:09 +03:00 · 4f1657418d
commit 4f1657418d
parent 0efe4384c0
1 changed files with 1 additions and 1 deletions
--- a/docs/source/supported_models.md
+++ b/docs/source/supported_models.md
@ -33,7 +33,7 @@ For the optimized models above, TGI uses custom CUDA kernels for better inferenc
 TGI optimized models are supported on NVIDIA [A100](https://www.nvidia.com/en-us/data-center/a100/), [A10G](https://www.nvidia.com/en-us/data-center/products/a10-gpu/) and [T4](https://www.nvidia.com/en-us/data-center/tesla-t4/) GPUs with CUDA 11.8+. Note that you have to install [NVIDIA Container Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html) to use it. For other hardware, continuous batching will still apply, but some operations like flash attention and paged attention will not be executed. 
 TGI is also supported on the following AI hardware accelerators:
- *Habana first-gen Gaudi and Gaudi2:* checkout [here](https://github.com/huggingface/optimum-habana/tree/main/text-generation-inference) how to serve models with TGI on Gaudi and Gaudi2 with [Optimum Habana](https://huggingface.co/docs/optimum/habana/index)
+- *Habana first-gen Gaudi and Gaudi2:* check out this [example](https://github.com/huggingface/optimum-habana/tree/main/text-generation-inference) how to serve models with TGI on Gaudi and Gaudi2 with [Optimum Habana](https://huggingface.co/docs/optimum/habana/index)