mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-09-11 12:24:53 +00:00
Odd.
This commit is contained in:
parent
f65d06cfb6
commit
0b20661cb7
@ -1,5 +1,6 @@
|
||||
## Speculation
|
||||
|
||||
|
||||
Speculative decoding, assisted generation, Medusa, and others are a few different names for the same idea.
|
||||
The idea is to generate tokens *before* the large model actually runs, and only *check* if those tokens where valid.
|
||||
|
||||
|
Loading…
Reference in New Issue
Block a user