OpenAI Profile picture
OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6Lg202
24 subscribers
Oct 6 8 tweets 2 min read
Everything shipped at DevDay [2025] 🧵
Sep 25 6 tweets 3 min read
Today we’re introducing GDPval, a new evaluation that measures AI on real-world, economically valuable tasks.

Evals ground progress in evidence instead of speculation and help track how AI improves at the kind of work that matters most.
openai.com/index/gdpval-v0 GDPval spans 44 occupations selected from the top 9 sectors contributing to U.S. Gross Domestic Product (GDP). Image
Image
Image
Sep 17 7 tweets 3 min read
Today we’re releasing research with @apolloaievals.

In controlled tests, we found behaviors consistent with scheming in frontier models—and tested a way to reduce it.

While we believe these behaviors aren’t causing serious harm today, this is a future risk we’re preparing for. openai.com/index/detectin… Scheming = when an AI behaves one way on the surface while hiding its true goals.

Today’s deployed systems have little opportunity to scheme in ways that could cause serious harm. The most common failures are simple deceptions—like pretending to complete a task without doing it. We’ve studied and mitigated these issues and made meaningful improvements in GPT-5 over earlier models.

But as AIs take on more complex, long-term tasks with real-world impact, the potential for harmful scheming will grow—so our safeguards and testing must grow with it.
Aug 5 7 tweets 3 min read
We released two open-weight reasoning models—gpt-oss-120b and gpt-oss-20b—under an Apache 2.0 license.

Developed with open-source community feedback, these models deliver meaningful advancements in both reasoning capabilities & safety.

openai.com/index/introduc… gpt-oss-120b matches OpenAI o4-mini on core benchmarks and exceeds it in narrow domains like competitive math or health-related questions, all while fitting on a single 80GB GPU (or high-end laptop).

gpt-oss-20b fits on devices as small as 16GB, while matching or exceeding OpenAI o3-mini.Image
Image
Image
Jul 17 7 tweets 3 min read
ChatGPT can now do work for you using its own computer.

Introducing ChatGPT agent—a unified agentic system combining Operator’s action-taking remote browser, deep research’s web synthesis, and ChatGPT’s conversational strengths. ChatGPT agent starts rolling out today to Pro, Plus, and Team users.

Pro users will get access by the end of day, while Plus and Team users will get access over the next few days.

Enterprise and Edu users will get access in the coming weeks. openai.com/index/introduc…
Jun 4 4 tweets 1 min read
ChatGPT can now connect to more internal sources & pull in real-time context—keeping existing user-level permissions.

Connectors available in deep research for Plus & Pro users (excl. EEA, CH, UK) and Team, Enterprise & Edu users:

Outlook
Teams
Google Drive
Gmail
Linear
& more Additional connectors available in ChatGPT for Team, Enterprise, and Edu users:

SharePoint
Dropbox
Box
May 16 6 tweets 2 min read
We’re launching a research preview of Codex: a cloud-based software engineering agent that can work on many tasks in parallel.

Rolling out to Pro, Enterprise, and Team users in ChatGPT starting today.

chatgpt.com/codex Codex independently navigates your codebase, implements and tests code changes, and proposes pull requests for you to review.

It’s powered by codex-1, a version of OpenAI o3 optimized for software engineering.

openai.com/index/introduc…
Apr 28 5 tweets 2 min read
We're excited to announce we’ve launched several improvements to ChatGPT search, and today we’re starting to roll out a better shopping experience.

Search has become one of our most popular & fastest growing features, with over 1 billion web searches just in the past week 🧵 Shopping

We’re experimenting with making shopping simpler and faster to find, compare, and buy products in ChatGPT.

✅ Improved product results
✅ Visual product details, pricing, and reviews
✅ Direct links to buy

Product results are chosen independently and are not ads.

These shopping improvements are starting to roll out today to Plus, Pro, Free, and logged-out users everywhere ChatGPT is available. It will take a few days to complete the rollout.
Apr 16 5 tweets 2 min read
Introducing OpenAI o3 and o4-mini—our smartest and most capable models to date.

For the first time, our reasoning models can agentically use and combine every tool within ChatGPT, including web search, Python, image analysis, file interpretation, and image generation. OpenAI o3 is a powerful model across multiple domains, setting a new standard for coding, math, science, and visual reasoning tasks.

o4-mini is a remarkably smart model for its speed and cost-efficiency. This allows it to support significantly higher usage limits than o3, making it a strong high-volume, high-throughput option for everyone with questions that benefit from reasoning. openai.com/index/introduc…
Apr 10 4 tweets 2 min read
Starting today, memory in ChatGPT can now reference all of your past chats to provide more personalized responses, drawing on your preferences and interests to make it even more helpful for writing, getting advice, learning, and beyond. In addition to the saved memories that were there before, it can now reference your past chats to deliver responses that feel noticeably more relevant and useful.

New conversations naturally build upon what it already knows about you, making interactions feel smoother and uniquely tailored to you.
Feb 25 5 tweets 1 min read
Deep research is now rolling out to all ChatGPT Plus, Team, Edu, and Enterprise users 🍾 Since the initial launch, we’ve made some improvements to deep research:

✅Embedded images with citations in the output

✅Better at understanding and referencing uploaded files
Feb 18 6 tweets 2 min read
Today we’re launching SWE-Lancer—a new, more realistic benchmark to evaluate the coding performance of AI models. SWE-Lancer includes over 1,400 freelance software engineering tasks from Upwork, valued at $1 million USD total in real-world payouts. openai.com/index/swe-lanc… SWE-Lancer tasks span the full engineering stack, from UI/UX to systems design, and include a range of task types, from $50 bug fixes to $32,000 feature implementations. SWE-Lancer includes both independent engineering tasks and management tasks, where models choose between technical implementation proposals.Image
Dec 20, 2024 5 tweets 2 min read
Today, we shared evals for an early version of the next model in our o-model reasoning series: OpenAI o3 On several of the most challenging frontier evals, OpenAI o3 sets new milestones for what’s possible in coding, math, and scientific reasoning.

It also makes significant progress on the ARC-AGI evaluation for the first time.
Dec 19, 2024 6 tweets 2 min read
ChatGPT can now work directly with more coding and note-taking apps—through voice or text—on macOS. Work with your code in context with expanded support for coding apps like Warp, IntelliJ IDEA, PyCharm, and more.
Dec 16, 2024 4 tweets 2 min read
🌐ChatGPT search 🌐is starting to roll out to all Free users today.

Search the web in a faster, better way—available globally on and our mobile and desktop apps for all logged-in users. chatgpt.com Search with Advanced Voice in ChatGPT, rolling out over the next week.
Dec 11, 2024 4 tweets 1 min read
ChatGPT is now integrated into Apple experiences within iOS, iPadOS, and macOS, allowing users to access ChatGPT’s capabilities right within the OS. Siri with ChatGPT
Dec 5, 2024 6 tweets 2 min read
OpenAI o1 is now out of preview in ChatGPT.

What’s changed since the preview? A faster, more powerful reasoning model that’s better at coding, math & writing.

o1 now also supports image uploads, allowing it to apply reasoning to visuals for more detailed & useful responses. OpenAI o1 is more concise in its thinking, resulting in faster response times than o1-preview.

Our testing shows that o1 outperforms o1-preview, reducing major errors on difficult real-world questions by 34%.
Sep 24, 2024 6 tweets 2 min read
Advanced Voice is rolling out to all Plus and Team users in the ChatGPT app over the course of the week.

While you’ve been patiently waiting, we’ve added Custom Instructions, Memory, five new voices, and improved accents.

It can also say “Sorry I’m late” in over 50 languages. If you are a Plus or Team user, you will see a notification in the app when you have access to Advanced Voice. Image
Sep 19, 2024 11 tweets 2 min read
Some favorite posts about OpenAI o1, as selected by researchers who worked on the model 🧵
Sep 12, 2024 8 tweets 3 min read
We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond.

These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc… Rolling out today in ChatGPT to all Plus and Team users, and in the API for developers on tier 5.
Jul 30, 2024 5 tweets 1 min read
We’re starting to roll out advanced Voice Mode to a small group of ChatGPT Plus users. Advanced Voice Mode offers more natural, real-time conversations, allows you to interrupt anytime, and senses and responds to your emotions. Users in this alpha will receive an email with instructions and a message in their mobile app. We'll continue to add more people on a rolling basis and plan for everyone on Plus to have access in the fall. As previously mentioned, video and screen sharing capabilities will launch at a later date.