GPT-Realtime-2: OpenAI's Real-time Voice AI Revolution
Summary
OpenAI has just released GPT-Realtime-2, a new voice model that can reason in real-time, directly within an audio conversation. This flagship model, launched on May 7th, 2026, brings GPT-5 level intelligence to live voice interactions. Here's the thing: instead of stitching together different systems for transcription and understanding, GPT-Realtime-2 processes everything inside the audio loop. This means much faster and more natural conversations. It also boasts a massive 128,000 token context window, a big jump from the previous 32,000. What's interesting is it allows for features like preambles, parallel tool calls, and even tone control. OpenAI also launched companion models: GPT-Realtime-Translate, handling over 70 input languages, and GPT-Realtime-Whisper for streaming speech-to-text. Early results are impressive. Zillow saw a 26-point increase in call success rates, jumping from 69% to 95%. Pricing is competitive, with GPT-Realtime-2 costing about $0.048 per minute. The bottom line: this could revolutionize how we interact with AI through voice, making conversations much more human-like and efficient.
This is an AI-generated audio summary. Always check the original source for complete reporting.