Greg Brockman

@gdb

Feb 6 • 1 tweet • 3 min read

Software development is undergoing a renaissance in front of our eyes.

If you haven't used the tools recently, you are likely underestimating what you're missing. Since December, there's been a step-function improvement in what tools like Codex can do. Some great engineers at OpenAI told me yesterday that their job has fundamentally changed since December. Before then, they could use Codex for unit tests; now it writes essentially all the code and does a great deal of their operations and debugging. Not everyone has made that leap yet, but that's usually because of factors other than the capability of the model.

Every company faces the same opportunity now, and navigating it well — just like with cloud computing or the Internet — requires careful thought. This post shares how OpenAI is currently approaching retooling our teams towards agentic software development. We're still learning and iterating, but here's how we're thinking about it right now:

As a first step, we're aiming for the following by March 31st:

(1) For any technical task, the tool of first resort for humans is interacting with an agent rather than using an editor or terminal.
(2) The default way humans use agents has been explicitly evaluated as safe, yet is productive enough that most workflows do not need additional permissions.

In order to get there, here's what we recommended to the team a few weeks ago:

1. Take the time to try out the tools. The tools do sell themselves: many people have had amazing experiences with 5.2 in Codex, after having churned from Codex web a few months ago. But many people are also so busy that they haven't had a chance to try Codex yet, or they got stuck thinking "is there any way it could do X?" rather than just trying.
- Designate an "agents captain" for your team: the primary person responsible for thinking about how agents can be brought into the team's workflow.
- Share experiences or questions in a few designated internal channels.
- Take a day for a company-wide Codex hackathon.

2. Create skills and AGENTS.md.
- Create and maintain an AGENTS.md for any project you work on; update the AGENTS.md whenever the agent does something wrong or struggles with a task.
- Write skills for anything that you get Codex to do, and commit them to the skills directory in a shared repository.

3. Inventory and make accessible any internal tools.
- Maintain a list of tools that your team relies on, and make sure someone takes point on making each one agent-accessible (such as via a CLI or MCP server).
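
One way to read "agent-accessible via a CLI" is a small command that takes plain arguments, prints machine-readable JSON, and signals failure through its exit code. The sketch below is only an illustration under that assumption: the `svc-status` name, the `_SERVICES` table, and the `lookup` and `main` functions are all hypothetical stand-ins, not any real internal tool.

```python
import argparse
import json
from typing import Optional, Sequence

# Hypothetical in-memory stand-in for an internal service registry.
_SERVICES = {
    "billing": {"owner": "payments-team", "status": "healthy"},
    "search": {"owner": "discovery-team", "status": "degraded"},
}


def lookup(name: str) -> dict:
    """Return a JSON-serializable record for one service."""
    if name not in _SERVICES:
        return {"ok": False, "error": f"unknown service: {name}"}
    return {"ok": True, "name": name, **_SERVICES[name]}


def build_parser() -> argparse.ArgumentParser:
    # Clear help text doubles as documentation an agent can read via --help.
    parser = argparse.ArgumentParser(
        prog="svc-status",
        description="Print the owner and status of a service as JSON.",
    )
    parser.add_argument("name", help="service name, e.g. 'billing'")
    return parser


def main(argv: Optional[Sequence[str]] = None) -> int:
    """Parse arguments, print one JSON object, return a process exit code."""
    args = build_parser().parse_args(argv)
    result = lookup(args.name)
    print(json.dumps(result, sort_keys=True))
    return 0 if result["ok"] else 1
```

Calling `main(["billing"])` prints one JSON object and returns 0, while an unknown name returns 1. Structured output and a meaningful exit code let an agent check the result without parsing free-form text.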

4. Structure codebases to be agent-first. With the models changing so fast, this is still somewhat untrodden ground, and will require some exploration.
- Write tests that are quick to run, and create high-quality interfaces between components.
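
As a rough illustration of "quick tests" plus "high-quality interfaces", the sketch below puts a dependency behind a small typed interface so that a test can swap in an in-memory fake and run without any network access. Every name here (`RateSource`, `FixedRates`, `convert`) is invented for the example, and this is one possible reading of the advice rather than a prescribed pattern.

```python
from dataclasses import dataclass
from typing import Dict, Protocol


class RateSource(Protocol):
    """Narrow interface: the only thing callers may assume about a rate provider."""

    def rate(self, currency: str) -> float:
        ...


@dataclass
class FixedRates:
    """In-memory implementation, useful for tests and local runs."""

    rates: Dict[str, float]

    def rate(self, currency: str) -> float:
        return self.rates[currency]


def convert(amount_usd: float, currency: str, source: RateSource) -> float:
    """Convert a USD amount using whichever RateSource is injected."""
    if amount_usd < 0:
        raise ValueError("amount must be non-negative")
    return round(amount_usd * source.rate(currency), 2)


def test_convert_uses_injected_rate() -> None:
    # No network and no database, so the test finishes almost instantly.
    source = FixedRates({"EUR": 0.5})
    assert convert(10.0, "EUR", source) == 5.0
```

Because `convert` depends only on the `RateSource` interface, an agent (or a human) can change either side and get feedback from the test in well under a second.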

5. Say no to slop. Managing AI-generated code at scale is an emerging problem, and will require new processes and conventions to keep code quality high.
- Ensure that some human is accountable for any code that gets merged. As a code reviewer, maintain at least the same bar as you would for human-written code, and make sure the author understands what they're submitting.

6. Work on basic infra. There's a lot of room for everyone to build basic infrastructure, which can be guided by internal user feedback. The core tools are getting a lot better and more usable, but there's a lot of infrastructure that currently goes around the tools, such as observability, tracking not just the committed code but also the agent trajectories that led to it, and central management of the tools that agents are able to use.
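
To make "tracking the agent trajectories that led to the committed code" a little more concrete, here is a minimal sketch of a record that ties a commit to the steps an agent took. The `Step` and `Trajectory` types, their field names, and the JSON-lines output are assumptions made for illustration; the post does not describe an actual schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class Step:
    kind: str    # hypothetical categories, e.g. "prompt", "tool_call", "edit"
    detail: str  # free-form description of what happened in this step


@dataclass
class Trajectory:
    commit: str  # identifier of the commit this trajectory produced
    steps: List[Step] = field(default_factory=list)

    def record(self, kind: str, detail: str) -> None:
        """Append one step in the order it happened."""
        self.steps.append(Step(kind, detail))

    def to_json_line(self) -> str:
        """Serialize to a single line, suitable for an append-only log file."""
        return json.dumps(asdict(self), sort_keys=True)


def tool_calls(trajectory: Trajectory) -> List[str]:
    """Return the details of every tool call, e.g. for an audit view."""
    return [s.detail for s in trajectory.steps if s.kind == "tool_call"]
```

A reviewer could then look up the trajectory for a commit and see which prompts and tool calls produced it, which is one possible building block for the observability described above.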

Overall, adopting tools like Codex is not just a technical but also a deep cultural change, with a lot of downstream implications to figure out. We encourage every manager to drive this with their team, and to think through other action items — for example, per item 5 above, what else can prevent a lot of "functionally-correct but poorly-maintainable code" from creeping into codebases.
