Abel’s Substack
Subscribe
Sign in
Speeding up open source LLMs
Oct 22, 2023
Speculative decoding provides a 2-4x speedup
Read →
Comments
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Speeding up open source LLMs
Speculative decoding provides a 2-4x speedup