I found the OpenAI page more interesting: https://platform.openai.com/docs/guides/latency-optimization...
This is like the likely() and unlikely() macros in the Linux kernel! Huge speedup if you're right; small penalty if you're not.
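For reference, the kernel defines these as thin wrappers around GCC's __builtin_expect (in include/linux/compiler.h). The zero_buffer() function below is just a toy illustration of the hot-path/cold-path trade-off, not kernel code:

    /* From the Linux kernel's include/linux/compiler.h: tell the compiler
       which way the branch is expected to go, so the common case falls
       through as straight-line code. */
    #define likely(x)   __builtin_expect(!!(x), 1)
    #define unlikely(x) __builtin_expect(!!(x), 0)

    /* Toy example: the NULL check is the cold path, the loop is the hot path.
       Guess right and the CPU stays on the fall-through code; guess wrong and
       you pay one branch mispredict. Same trade as speculative decoding. */
    int zero_buffer(char *buf, unsigned long len)
    {
        if (unlikely(buf == 0))
            return -1;

        for (unsigned long i = 0; i < len; i++)
            buf[i] = 0;
        return 0;
    }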
If you use the Cursor IDE: the folks who wrote it talked about their use of speculative decoding to make "Apply" faster on the Lex Fridman podcast last month.
Here it is on YouTube, although you can also find it on Spotify and other podcast platforms:
https://youtu.be/oFfVt3S51T4?t=1206