Initial commit

2025-09-10 20:04:52 +00:00 · 2023-08-22 23:02:17 +03:00 · 2023-08-22 23:02:17 +03:00 · ee19513cf7
commit ee19513cf7
parent c4422e5678
2 changed files with 9 additions and 0 deletions
--- a/docs/source/_toctree.yml
+++ b/docs/source/_toctree.yml
@ -21,4 +21,6 @@
 - sections:
  - local: conceptual/streaming
    title: Streaming
  - local: conceptual/safetensors
    title: Safetensors
  title: Conceptual Guides
--- a/docs/source/conceptual/safetensors.md
+++ b/docs/source/conceptual/safetensors.md
@ -0,0 +1,7 @@
 # Safetensors
 Safetensors is a model serialization format for deep learning models. It is [faster](https://huggingface.co/docs/safetensors/speed) and safer compared to other serialization formats like pickle (which is used under the hood in many deep learning libraries). 
 TGI depends on safetensors format mainly to enable [tensor parallelism sharding](./tensor_parallelism). For a given model repository during serving, TGI looks for safetensors weights. If there are no safetensors weights, TGI converts the PyTorch weights to safetensors format. 
 You can learn more about safetensors by reading the [safetensors documentation](https://huggingface.co/docs/safetensors/index).