This commit is contained in:
Nicolas Patry 2024-04-22 21:28:08 +00:00
parent f65d06cfb6
commit 0b20661cb7

View File

@ -1,5 +1,6 @@
## Speculation
Speculative decoding, assisted generation, Medusa, and others are a few different names for the same idea.
The idea is to generate tokens *before* the large model actually runs, and only *check* if those tokens where valid.