Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-04-19 22:02:06 +00:00)
Parent: a70dd2998b
Commit: b2fac5d947
@@ -72,7 +72,7 @@ Long: `MODEL_ID=$MODEL_ID HOST=localhost:8000 k6 run load_tests/long.js`
### Results

Our benchmarking results show significant performance gains, with a 13x speedup over vLLM with prefix caching, and up to 30x speedup without prefix caching. These results are consistent with our production data and demonstrate the effectiveness of our optimized LLM architecture.
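The hunk context above references the long-prompt benchmark invocation (`MODEL_ID=$MODEL_ID HOST=localhost:8000 k6 run load_tests/long.js`). As a rough sketch of what such a k6 script can look like — the endpoint path, payload shape, prompt, and VU/duration settings here are illustrative assumptions, not the repository's actual `load_tests/long.js`:

```js
// Hypothetical k6 load-test sketch against a TGI server.
// HOST and MODEL_ID come from the environment, matching the invocation
// shown in the hunk context; everything else is an assumed placeholder.
import http from 'k6/http';
import { check } from 'k6';

const HOST = __ENV.HOST || 'localhost:8000';
const MODEL_ID = __ENV.MODEL_ID || '';

export const options = {
  vus: 10,          // concurrent virtual users (assumed value)
  duration: '60s',  // test length (assumed value)
};

// A deliberately long prompt to exercise long-context / prefix-caching paths.
const LONG_PROMPT = 'Summarize the following text. ' + 'lorem ipsum '.repeat(500);

export default function () {
  const payload = JSON.stringify({
    model: MODEL_ID,
    messages: [{ role: 'user', content: LONG_PROMPT }],
    max_tokens: 200,
  });

  // Assumes the server exposes an OpenAI-compatible chat completions route.
  const res = http.post(`http://${HOST}/v1/chat/completions`, payload, {
    headers: { 'Content-Type': 'application/json' },
  });

  check(res, { 'status is 200': (r) => r.status === 200 });
}
```

It would be run exactly as the hunk context shows: `MODEL_ID=$MODEL_ID HOST=localhost:8000 k6 run load_tests/long.js`.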