OpenAI has officially launched GPT-4o, a new flagship model that significantly enhances the capabilities of ChatGPT. The “o” stands for “omni,” highlighting the model’s native ability to process and generate content across text, audio, and vision in real time. The announcement was made during the company’s live-streamed “Spring Update” event on May 13, 2024.
Perhaps the most significant aspect of the launch is that GPT-4o’s intelligence will be available to all users, including those on the free tier. This move democratizes access to state-of-the-art AI, a feature previously reserved for paid subscribers. While free users will have a limit on the number of messages they can send using the new model, this marks a major strategic shift for the company.
During the demonstration, OpenAI showcased the model’s remarkable speed and expressiveness. It responded to audio inputs in as little as 232 milliseconds, which is comparable to human conversational response times. The AI could understand and generate different emotional tones, interrupt users naturally, and even perform real-time language translation.
In addition to the model update, OpenAI introduced a new ChatGPT desktop application for macOS. This app allows the model to have visibility into a user’s screen, enabling it to answer questions about code, presentations, or other on-screen content. For developers, GPT-4o is not only faster and more capable but also 50% cheaper to use in the API compared to GPT-4 Turbo. This release positions OpenAI to compete aggressively with upcoming announcements from rivals like Google, setting a new baseline for what users and developers can expect from AI assistants.


