Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-04-22 15:32:08 +00:00)
updated doc
parent 6d6b0bdcc4
commit 0a5b19a3ed
@@ -60,8 +60,6 @@ docker run --gpus all --shm-size 64g -p 8080:80 -v $volume:/data \
 --kv-cache-dtype fp8
 ```
 
-We strongly suggest referring to the detailed [installation instructions](https://github.com/Dao-AILab/flash-attention?tab=readme-ov-file#installation-and-features) to learn more about supported hardware and data types!
-
 </hfoption>
 <hfoption id="AMD">
 