Mirror of https://github.com/huggingface/text-generation-inference.git (synced 2025-04-20 06:12:07 +00:00)
* Prepare for release 3.1.0
* Back on main flake.
* Fixing stuff.
* Upgrade to moe-kernels 0.8.2 for Hip support.
* Deactivating the flaky test.

chunking.md
external.md
flash_attention.md
guidance.md
lora.md
paged_attention.md
quantization.md
safetensors.md
speculation.md
streaming.md
tensor_parallelism.md