Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time:
Text and image input rolling out today in API and ChatGPT, with voice and video in the coming weeks. openai.com/index/hello-gp…
Two GPT-4os interacting and singing
Realtime translation with GPT-4o
Lullabies and whispers with GPT-4o
Happy birthday with GPT-4o
@BeMyEyes with GPT-4o
Dad jokes with GPT-4o
Meeting AI with GPT-4o
Sarcasm with GPT-4o
Math problems with GPT-4o and @khanacademy
Point and learn Spanish with GPT-4o
Rock, Paper, Scissors with GPT-4o
Harmonizing with two GPT-4os
Interview prep with GPT-4o
Fast counting with GPT-4o
Dog meets GPT-4o
Live demo of GPT-4o realtime conversational speech
Live demo of GPT-4o voice variation
Live demo of GPT-4o vision
Live demo of coding assistance and desktop app
Live audience request for GPT-4o realtime translation
Live audience request for GPT-4o vision capabilities
All users will start to get access to GPT-4o today. In the coming weeks, we’ll begin rolling out the new voice and vision capabilities we demoed today to ChatGPT Plus.
Together with researchers at Boston Children’s Hospital and Harvard, we published a study in NEJM AI showing how o3 Deep Research helped clinicians revisit previously unsolved rare pediatric disease cases, and find answers for families who had waited years.
The team reanalyzed 376 de-identified cases that had already gone through genetic testing and expert review, helping identify 18 diagnoses across neurodevelopmental disorders, rare neuromuscular disease, sudden unexpected death in pediatrics, and early-onset psychosis.
Rare disease diagnosis is challenging: sequencing can surface millions of variants, and medical knowledge changes constantly.
o3 Deep Research helped connect clinical features, inheritance patterns, variant evidence, and scientific literature into hypotheses for specialists to review.
Every result went through human adjudication and clinical confirmation. AI’s role here was to help experts reason through complex, fragmented evidence faster and more thoroughly.
Introducing Daybreak: frontier AI for cyber defenders.
Daybreak brings together the most capable OpenAI models, Codex, and our security partners to accelerate cyber defense and continuously secure software.
A step toward a future where security teams can move at the speed defense demands.
Find and fix vulnerabilities earlier with Daybreak
Introducing workspace agents in ChatGPT—shared agents that can handle complex tasks and long-running workflows across tools and teams.
Agents are built to help with the kind of work that takes time, context, and follow-through: coordinating across tools, tracking progress, and moving tasks forward without needing constant supervision.
ChatGPT Images 2.0 is a state-of-the-art image model that can take on complex visual tasks and produce precise, immediately usable visuals, with sharper editing, richer layouts, and thinking-level intelligence.
Video made with ChatGPT Images
ChatGPT Images 2.0 is a step change in detailed instruction following, placing and relating objects accurately, and rendering dense text, with the ability to generate across aspect ratios.
It’s also accurate across languages and uses its expanded visual and world knowledge to fill in the gaps for you, so you get smarter images with less prompting.
ChatGPT Images 2.0 can conceptualize more sophisticated images, and then actually bring that vision to life effectively.
It’s able to follow instructions, preserve requested details, and render the fine-grained elements that often break image models: small text, iconography, UI elements, dense compositions, and subtle stylistic constraints, all at up to 2K resolution.
Codex can now use apps on your Mac, connect to more of your tools, create images, learn from previous actions, remember how you like to work, and take on ongoing and repeatable tasks.
With computer use on macOS, Codex can now use any app by seeing, clicking, and typing with its own cursor.
It runs in the background without taking over your computer, working on tasks like frontend iteration, app testing, or any workflow that doesn't expose an API.
You can now generate and iterate on images with gpt-image-1.5 in Codex to create frontend designs, mockups, game assets, and more without leaving your workflow.
Usage is included with your ChatGPT account, no API key needed.