fix: update docs for model addition

2025-09-09 03:14:53 +00:00 · 2024-08-07 23:47:37 +00:00 · 7372a0dc38
commit 7372a0dc38
parent 3e41ec28c7
2 changed files with 2 additions and 21 deletions
--- a/docs/source/basic_tutorials/launcher.md
+++ b/docs/source/basic_tutorials/launcher.md
@@ -272,7 +272,7 @@ Options:
      --huggingface-hub-cache <HUGGINGFACE_HUB_CACHE>
          The location of the huggingface hub cache. Used to override the location if you want to provide a mounted disk for instance
-          [env: HUGGINGFACE_HUB_CACHE=]
+          [env: HUGGINGFACE_HUB_CACHE=/nvme0n1/Models/]
 ```
 ## WEIGHTS_CACHE_OVERRIDE
@@ -349,12 +349,6 @@ Options:
      --cors-allow-origin <CORS_ALLOW_ORIGIN>
          [env: CORS_ALLOW_ORIGIN=]
 ```
 ## API_KEY
 ```shell
      --api-key <API_KEY>
          [env: API_KEY=]
 ```
 ## WATERMARK_GAMMA
 ```shell
@@ -430,20 +424,6 @@ Options:
          [env: LORA_ADAPTERS=]
 ```
 ## USAGE_STATS
 ```shell
      --usage-stats <USAGE_STATS>
           Control if anonymous usage stats are collected. Options are "on", "off" and "no-stack". Default is on
          [env: USAGE_STATS=]
          [default: on]
          Possible values:
          - on:       Default option, usage statistics are collected anonymously
          - off:      Disables all collection of usage statistics
          - no-stack: Doesn't send the error stack trace or error type, but allows sending a crash event
 ```
 ## HELP
 ```shell
--- a/docs/source/supported_models.md
+++ b/docs/source/supported_models.md
@@ -32,6 +32,7 @@ Text Generation Inference enables serving optimized models on specific hardware
 - [Mpt](https://huggingface.co/mosaicml/mpt-7b-instruct)
 - [Gpt2](https://huggingface.co/openai-community/gpt2)
 - [Gpt Neox](https://huggingface.co/EleutherAI/gpt-neox-20b)
 - [Gptj](https://huggingface.co/EleutherAI/gpt-j-6b)
 - [Idefics](https://huggingface.co/HuggingFaceM4/idefics-9b) (Multimodal)