mirror of https://github.com/huggingface/text-generation-inference.git
synced 2025-09-10 20:04:52 +00:00
Added note on weight-cache-override
parent b03d2621a7
commit eb8f59083d
@@ -28,6 +28,10 @@ AutoModelForCausalLM.from_pretrained(<model>, device_map="auto")`
 AutoModelForSeq2SeqLM.from_pretrained(<model>, device_map="auto")
 ```
 
+If you wish to serve a different version of a model that exists in a local folder, you can use the `--weights-cache-override` flag as shown below 👇
+```bash
+text-generation-launcher --model-id bigscience/bloom --weights-cache-override <PATH-TO-LOCAL-BLOOM>
+```
 
 ## Supported Hardware
 
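The launcher invocation added in this hunk can also be assembled from a script, which makes it easy to check that the local folder exists before starting the server. The following is a minimal sketch under stated assumptions: `build_launcher_command` is a hypothetical helper and not part of text-generation-inference, and only the `--model-id` and `--weights-cache-override` flags are taken from the diff above.

```python
import shlex
from pathlib import Path


def build_launcher_command(model_id: str, local_weights_dir: str) -> list:
    """Build the argv list for text-generation-launcher with local weights.

    Hypothetical helper for illustration only. It does not start the
    server; it only validates the folder and assembles the arguments.
    """
    weights_path = Path(local_weights_dir)
    # Fail early if the folder passed to --weights-cache-override is missing.
    if not weights_path.is_dir():
        raise FileNotFoundError(f"weights directory not found: {weights_path}")
    return [
        "text-generation-launcher",
        "--model-id", model_id,
        "--weights-cache-override", str(weights_path),
    ]


if __name__ == "__main__":
    # "." stands in for the local model folder in this sketch.
    argv = build_launcher_command("bigscience/bloom", ".")
    # Print a shell-safe version of the command instead of running it.
    print(shlex.join(argv))
```

Returning an argument list rather than a single string keeps paths with spaces intact if the list is later handed to `subprocess.run`.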