Launches
O
OpenAI WebSocket Mode for Responses API
Persistent AI agents. Up to 40% faster.
140 votes·6 comments·Mar 1, 2026
About
Every agent turn, you're resending the full context. Again. That overhead compounds fast. WebSocket Mode for the Responses API keeps a persistent connection, sends only incremental inputs, and cuts end-to-end latency by up to 40% on heavy tool-call workflows.