Adaptive speculative decoding: picking draft lengths at runtime
By hasheddan · 2026-06-22 · 2 points · 0 comments
https://fergusfinn.com/blog/adaptive-speculation/
By hasheddan · 2 points · 0 comments · on Hacker News, read on BetterNews.
Open the full discussion on BetterNews